halyph / mind-flow

blog and/or personal knowledge base
https://halyph.github.io/blog/
Creative Commons Attribution Share Alike 4.0 International
25 stars 6 forks source link

Popular python libs #115

Open halyph opened 3 years ago

halyph commented 3 years ago
  1. NumPy
  2. Pandas
  3. Scipy
  4. matplotlib
  5. Patsy
  6. Sympy
  7. Plotly
  8. statsmodels
  9. ADiPy
  10. matalg27
  11. mplstyler
  12. Scikit-image
  13. Scikit-learn
  14. Gnuplot.py
  15. NetworkX
  16. Seaborn
halyph commented 3 years ago

From https://www.edureka.co/blog/python-libraries/

  1. TensorFlow
  2. Scikit-Learn
  3. Numpy
  4. Keras
  5. PyTorch
  6. LightGBM
  7. Eli5
  8. SciPy
  9. Theano
  10. Pandas
halyph commented 3 years ago

From https://www.stxnext.com/blog/most-popular-python-scientific-libraries/

30+ essential Python libraries for data science, machine learning, and more

  1. Astropy http://www.astropy.org/ Astropy is a collection of packages designed for use in astronomy. The core Astropy package contains functionality aimed at professional astronomers and astrophysicists, but may be useful to anyone developing software for astronomy.

  2. Biopython http://biopython.org/ Biopython is a collection of non-commercial Python tools for computational biology and bioinformatics. It contains classes to represent biological sequences and sequence annotations. The library can also read and write to a variety of file formats.

  3. Bokeh https://docs.bokeh.org/en/latest/ Bokeh is a Python interactive visualization library that targets modern web browsers for presentation. It can help anyone who wishes to quickly and easily create interactive plots, dashboards, and data applications. The purpose of Bokeh is to provide elegant, concise construction of novel graphics in the style of D3.js, but also deliver this capability with high-performance interactivity over very large or streaming datasets.

  4. Cubes http://cubes.databrewery.org/ Cubes is a light-weight Python framework and set of tools for the development of reporting and analytical applications, Online Analytical Processing (OLAP), multidimensional analysis, and browsing of aggregated data.

  5. Dask http://dask.pydata.org/en/latest/ Dask is a flexible parallel computing library for analytic computing, composed of two components: dynamic task scheduling optimized for computation and interactive computational workloads; Big Data collections like parallel arrays, dataframes, and lists that extend common interfaces such as NumPy, Pandas, or Python iterators to larger-than-memory or distributed environments.

  6. DEAP https://github.com/deap DEAP is an evolutionary computation framework for rapid prototyping and testing of ideas. It incorporates the data structures and tools required to implement the most common evolutionary computation techniques, such as genetic algorithms, genetic programming, evolution strategies, particle swarm optimization, differential evolution, and estimation of distribution algorithms.

  7. DMelt http://jwork.org/dmelt/ DataMelt, or DMelt, is a software for numeric computation, statistics, analysis of large data volumes (Big Data), and scientific visualization. It can be used with several scripting languages, including Python/Jython, BeanShell, Groovy, Ruby, and Java. The library has numerous applications, such as natural sciences, engineering, modeling, and analysis of financial markets.

  8. graph-tool http://graph-tool.skewed.de/ Graph-tool is a module for the manipulation and statistical analysis of graphs.

  9. matplotlib https://github.com/matplotlib/matplotlib Matplotlib is a Python 2D plotting library that produces publication-quality figures in a variety of hard-copy formats and interactive cross-platform environments. It allows you to generate plots, histograms, power spectra, bar charts, error charts, scatter plots, and more.

  10. Mlpy http://mlpy.sourceforge.net/ Mlpy is a machine learning library built on top of NumPy/SciPy, the GNU Scientific Libraries. It provides a wide range of machine learning methods for supervised and unsupervised problems, and is aimed at finding a reasonable compromise between modularity, maintainability, reproducibility, usability, and efficiency.

  11. NetworkX http://networkx.github.io/ NetworkX is a library for studying graphs which helps you create, manipulate, and study the structure, dynamics, and functions of complex networks.

  12. Nilearn http://nilearn.github.io/ Nilearn is a Python module for fast and easy statistical learning on neuroimaging data. This library makes it easy to use many advanced machine learning, pattern recognition, and multivariate statistical techniques on neuroimaging data for applications such as MVPA (Multi-Voxel Pattern Analysis), decoding, predictive modelling, functional connectivity, brain parcellations, or connectomes.

  13. NumPy http://www.numpy.org/ NumPy is the fundamental package for scientific computing with Python, adding support for large, multidimensional arrays and matrices, along with a large library of high-level mathematical functions to operate on these arrays.

  14. Pandas http://pandas.pydata.org/ Pandas is a library for data manipulation and analysis, providing data structures and operations for manipulating numerical tables and time series.

  15. Pipenv https://github.com/pypa/pipenv Pipenv is a tool designed to bring the best of all packaging worlds to the Python world. It automatically creates and manages a virtualenv for your projects, along with adding or removing packages from your Pipfile as you install or uninstall packages. Pipenv is primarily meant to provide users and developers of applications with an easy method to set up a working environment.

  16. PsychoPy http://www.psychopy.org/ PsychoPy is a package for the generation of experiments for neuroscience and experimental psychology. It is designed to allow the presentation of stimuli and collection of data for a wide range of neuroscience, psychology, and psychophysical experiments.

  17. PySpark http://spark.apache.org/docs/2.1.0/api/python/pyspark.html PySpark is the Python API for Apache Spark. Spark is a distributed computing framework for big data processing. It serves as a unified analytics engine, built with speed, ease of use, and generality in mind. Spark offers modules for streaming, machine learning, and graph processing. It’s also completely open-source.

  18. python-weka-wrapper https://pypi.python.org/pypi/python-weka-wrapper Weka is a suite of machine learning software written in Java, developed at the University of Waikato, New Zealand. It contains a collection of visualization tools and algorithms for data analysis and predictive modeling, together with graphical user interfaces for easy access to these functions. The python-weka-wrapper package makes it easy to run Weka algorithms and filters from within Python.

  19. PyTorch https://pytorch.org/ PyTorch is a deep learning framework for fast, flexible experimentation. This package provides two high-level features: Tensor computation with strong GPU acceleration and deep neural networks built on a tape-based autodiff system. It can be used either as a replacement for numpy to use the power of GPUs, or a deep learning research platform that provides maximum flexibility and speed.

  20. SQLAlchemy http://www.sqlalchemy.org/ SQLAlchemy is an open-source SQL toolkit and Object-Relational Mapper that gives application developers the full power and flexibility of SQL. It provides a full suite of well-known enterprise-level persistence patterns, designed for efficient and high-performing database access, adapted into a simple and Pythonic domain language. The main goal of the library is to change the way we approach databases and SQL.

  21. SageMath http://www.sagemath.org/ SageMath is a mathematical software system with features covering multiple aspects of mathematics, including algebra, combinatorics, numerical mathematics, number theory, and calculus. It uses Python to support procedural, functional, and object-oriented constructs.

  22. ScientificPython https://pypi.org/project/ScientificPython/ ScientificPython is a collection of modules for scientific computing. It contains support for geometry, mathematical functions, statistics, physical units, IO, visualization, and parallelization.

  23. scikit-image http://scikit-image.org/ Scikit-image is an image processing library. It includes algorithms for segmentation, geometric transformations, color space manipulation, analysis, filtering, morphology, feature detection, and more.

  24. scikit-learn http://scikit-learn.org/stable Scikit-learn is a machine learning library. It features various classification, regression, and clustering algorithms, including support vector machines, random forests, gradient boosting, k-means, and DBSCAN. The library is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy.

  25. SciPy http://www.scipy.org/ SciPy is a library used by scientists, analysts, and engineers doing scientific computing and technical computing. It contains modules for optimization, linear algebra, integration, interpolation, special functions, FFT, signal and image processing, ODE solvers, and other tasks common in science and engineering.

  26. SCOOP http://scoop.readthedocs.org/ SCOOP is a Python module for distributing concurrent parallel tasks on various environments, from heterogeneous grids of workstations to supercomputers.

  27. SunPy http://sunpy.org/ SunPy is a data-analysis environment specializing in providing the software necessary to analyze solar and heliospheric data in Python.

  28. SymPy http://www.sympy.org/ SymPy is a library for symbolic computation, offering features ranging from basic symbolic arithmetic to calculus, algebra, discrete mathematics, and quantum physics. It provides computer algebra capabilities either as a standalone application, a library to other applications, or live on the web.

  29. TensorFlow https://www.tensorflow.org/ TensorFlow is an open-source software library for machine learning across a range of tasks, developed by Google to meet their needs for systems capable of building and training neural networks to detect and decipher patterns and correlations, analogous to the learning and reasoning employed by humans. It is currently used for both research and production at Google products,‍ often replacing the role of its closed-source predecessor, DistBelief.

  30. Theano http://deeplearning.net/software/theano Theano is a numerical computation Python library, allowing you to define, optimize, and evaluate mathematical expressions involving multidimensional arrays efficiently.

  31. TomoPy http://tomopy.readthedocs.org/en/latest TomoPy is an open-source Python toolbox for performing tomographic data processing and image reconstruction tasks. It offers a collaborative framework for the analysis of synchrotron tomographic data, with the goal to unify the efforts of different facilities and beamlines performing similar tasks.

  32. Veusz https://veusz.github.io/ Veusz is a scientific plotting and graphing package designed to produce publication-quality plots in popular vector formats, including PDF, PostScript, and SVG.