What set of python libraries should I learn for AI and Data mining

I am learning python and django. I need to make extensive material for data analysis, statistics, AI and data mining. Many libraries are available. so I want to know which SET I should learn. I currently have the following SET in My Mind.

  • Parsing: beautiful soup
  • Search Engine: Whoosh (will it crawl and index the site or not?)
  • AI: PyBrain
  • Data Mining: Orange (I'm not sure if this is anything else)

So, I want to know if there is anything else I need to know for these fields. Or some libraries that someone tried. I basically want libraries that I can use from inside Django, I don't need different structures for them.

+5
source share
2 answers

In addition to the scikit provided by @aix, you can take a look at the following libraries:

  • mlpy , ML library for preprocessing, clustering, forecasting, regression, and feature selection.
  • PyML , another ML library.
  • pandas (Pythonic cross-section, time series and statistical analysis) is a set of optimized NumPy-optimized data structures for panels, time series and cross-section data analysis with an emphasis on econometric applications.
  • scikits.statsmodels, (OLS/GLS, GLM, M- ..). , , , R.

, , , ., , Orange, .

+1
+2

All Articles