Hand-picked awesome Python libraries and frameworks, organised by category
Interactive version: www.awesomepython.org
Updated 12 Aug 2024
Awesome Python is regularly updated. This category lists the most recently created GitHub repositories among all the repositories listed here.
naklecha/llama3-from-scratch ⭐ 11,767
llama3 implementation one matrix multiplication at a time
codium-ai/cover-agent ⭐ 4,101
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪
🔗 www.codium.ai
blealtan/efficient-kan ⭐ 3,720
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
meta-llama/llama-agentic-system ⭐ 2,909
Agentic components of the Llama Stack APIs
lm-sys/RouteLLM ⭐ 2,675
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
mistralai/mistral-finetune ⭐ 2,606
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA.
thudm/CogVLM2 ⭐ 1,738
GPT4V-level open-source multi-modal model based on Llama3-8B
facebookresearch/chameleon ⭐ 1,668
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
🔗 arxiv.org/abs/2405.09818
mlfoundations/dclm ⭐ 1,024
DataComp for Language Models
likejazz/llama3.np ⭐ 943
llama3.np is a pure NumPy implementation for Llama 3 model.
Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.
psf/black ⭐ 38,144
The uncompromising Python code formatter
🔗 black.readthedocs.io/en/stable
astral-sh/ruff ⭐ 29,753
An extremely fast Python linter and code formatter, written in Rust.
🔗 docs.astral.sh/ruff
pre-commit/pre-commit ⭐ 12,555
A framework for managing and maintaining multi-language pre-commit hooks.
🔗 pre-commit.com
sqlfluff/sqlfluff ⭐ 7,492
A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
🔗 www.sqlfluff.com
pycqa/isort ⭐ 6,418
A Python utility / library to sort imports.
🔗 pycqa.github.io/isort
davidhalter/jedi ⭐ 5,740
Awesome autocompletion, static analysis and refactoring library for python
🔗 jedi.readthedocs.io
pycqa/pylint ⭐ 5,222
It's not just a linter that annoys you!
🔗 pylint.readthedocs.io/en/latest
asottile/pyupgrade ⭐ 3,447
A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.
pycqa/flake8 ⭐ 3,377
flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
🔗 flake8.pycqa.org
jendrikseipp/vulture ⭐ 3,357
Find dead Python code
wemake-services/wemake-python-styleguide ⭐ 2,481
The strictest and most opinionated python linter ever!
🔗 wemake-python-styleguide.rtfd.io
codespell-project/codespell ⭐ 1,830
check code for common misspellings
python-lsp/python-lsp-server ⭐ 1,812
Fork of the python-language-server project, maintained by the Spyder IDE team and the community
sourcery-ai/sourcery ⭐ 1,513
Instant AI code reviews
🔗 sourcery.ai
akaihola/darker ⭐ 629
Apply black reformatting to Python files only in regions changed since a given commit. For a practical usage example, see the blog post at https://dev.to/akaihola/improving-python-code-incrementally-3f7a
🔗 pypi.org/project/darker
tconbeer/sqlfmt ⭐ 382
sqlfmt formats your dbt SQL files so you don't have to
🔗 sqlfmt.com
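Several of the linters and dead-code finders in this category work by static analysis of the syntax tree. As a rough illustration of that idea only (this is not how vulture or any listed tool is implemented, and `find_unused_functions` is a hypothetical helper), the sketch below uses the standard-library `ast` module to report functions that are defined but never referenced by name.

```python
import ast


def find_unused_functions(source: str) -> list[str]:
    """Return names of functions defined in `source` but never referenced."""
    tree = ast.parse(source)
    defined = set()
    used = set()
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            defined.add(node.name)
        elif isinstance(node, ast.Name):
            # A bare name reference, e.g. the `helper` in `helper()`.
            used.add(node.id)
        elif isinstance(node, ast.Attribute):
            # An attribute reference, e.g. the `method` in `obj.method`.
            used.add(node.attr)
    return sorted(defined - used)


sample = """
def used():
    return 1

def unused():
    return 2

print(used())
"""

print(find_unused_functions(sample))
```

A name-based check like this is deliberately crude: it cannot see dynamic lookups such as `getattr`, so real tools report a confidence level rather than a verdict.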
Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.
ccxt/ccxt ⭐ 32,194
A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
🔗 docs.ccxt.com
freqtrade/freqtrade ⭐ 27,405
Free, open source crypto trading bot
🔗 www.freqtrade.io
crytic/slither ⭐ 5,187
Static Analyzer for Solidity and Vyper
🔗 blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework
ethereum/web3.py ⭐ 4,918
A python interface for interacting with the Ethereum blockchain and ecosystem.
🔗 web3py.readthedocs.io
ethereum/consensus-specs ⭐ 3,507
Ethereum Proof-of-Stake Consensus Specifications
cyberpunkmetalhead/Binance-volatility-trading-bot ⭐ 3,387
This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins. If you like this project, consider donating through the Brave browser to allow me to continuously improve the script.
ethereum/py-evm ⭐ 2,237
A Python implementation of the Ethereum Virtual Machine
🔗 py-evm.readthedocs.io/en/latest
bmoscon/cryptofeed ⭐ 2,167
Cryptocurrency Exchange Websocket Data Feed Handler
binance/binance-public-data ⭐ 1,431
Details on how to get Binance public data
ofek/bit ⭐ 1,237
Bitcoin made easy.
🔗 ofek.dev/bit
man-c/pycoingecko ⭐ 1,038
Python wrapper for the CoinGecko API
dylanhogg/awesome-crypto ⭐ 66
A list of awesome crypto and blockchain projects
🔗 www.awesomecrypto.xyz
General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.
scrapy/scrapy ⭐ 51,973
Scrapy, a fast high-level web crawling & scraping framework for Python.
🔗 scrapy.org
apache/spark ⭐ 39,092
Apache Spark - A unified analytics engine for large-scale data processing
🔗 spark.apache.org
mindsdb/mindsdb ⭐ 25,976
The platform for building AI from enterprise data
🔗 mindsdb.com
getredash/redash ⭐ 25,759
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
🔗 redash.io
jaidedai/EasyOCR ⭐ 23,183
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
🔗 www.jaided.ai
qdrant/qdrant ⭐ 19,410
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
🔗 qdrant.tech
humansignal/label-studio ⭐ 17,813
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats.
🔗 labelstud.io
joke2k/faker ⭐ 17,463
Faker is a Python package that generates fake data for you.
🔗 faker.readthedocs.io
binux/pyspider ⭐ 16,437
A Powerful Spider (Web Crawler) System in Python.
🔗 docs.pyspider.org
twintproject/twint ⭐ 15,677
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
airbytehq/airbyte ⭐ 15,089
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
🔗 airbyte.com
chroma-core/chroma ⭐ 14,048
the AI-native open-source embedding database
🔗 www.trychroma.com
apache/arrow ⭐ 14,047
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
🔗 arrow.apache.org
tiangolo/sqlmodel ⭐ 13,782
SQL databases in Python, designed for simplicity, compatibility, and robustness.
🔗 sqlmodel.tiangolo.com
coleifer/peewee ⭐ 11,012
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
🔗 docs.peewee-orm.com
s0md3v/Photon ⭐ 10,766
Incredibly fast crawler designed for OSINT.
avaiga/taipy ⭐ 10,597
Turns Data and AI algorithms into production-ready web applications in no time.
🔗 www.taipy.io
sqlalchemy/sqlalchemy ⭐ 9,265
The Database Toolkit for Python
🔗 www.sqlalchemy.org
simonw/datasette ⭐ 9,167
An open source multi-tool for exploring and publishing data
🔗 datasette.io
bigscience-workshop/petals ⭐ 9,007
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
🔗 petals.dev
yzhao062/pyod ⭐ 8,359
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques
🔗 pyod.readthedocs.io
voxel51/fiftyone ⭐ 7,983
The open-source tool for building high-quality datasets and computer vision models
🔗 fiftyone.ai
gristlabs/grist-core ⭐ 6,758
Grist is the evolution of spreadsheets.
🔗 www.getgrist.com
tobymao/sqlglot ⭐ 6,190
Python SQL Parser and Transpiler
🔗 sqlglot.com
alirezamika/autoscraper ⭐ 6,109
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
kaggle/kaggle-api ⭐ 6,075
Official Kaggle API
madmaze/pytesseract ⭐ 5,712
A Python wrapper for Google Tesseract
vi3k6i5/flashtext ⭐ 5,570
Extract Keywords from sentence or Replace keywords in sentences.
airbnb/knowledge-repo ⭐ 5,462
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
facebookresearch/AugLy ⭐ 4,928
A data augmentations library for audio, image, text, and video.
🔗 ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models
ibis-project/ibis ⭐ 4,752
Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
🔗 ibis-project.org
superduperdb/superduper ⭐ 4,596
Superduper: Bring AI to your database! Integrate AI models and workflows with your database to implement custom AI applications, without moving your data. Including streaming inference, scalable model hosting, training and vector search.
🔗 superduper.io
jazzband/tablib ⭐ 4,582
Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
🔗 tablib.readthedocs.io
lk-geimfari/mimesis ⭐ 4,377
Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.
🔗 mimesis.name
amundsen-io/amundsen ⭐ 4,361
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
🔗 www.amundsen.io/amundsen
mongodb/mongo-python-driver ⭐ 4,099
PyMongo - the Official MongoDB Python driver
🔗 pymongo.readthedocs.io
lancedb/lancedb ⭐ 3,842
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
🔗 lancedb.github.io/lancedb
giskard-ai/giskard ⭐ 3,839
🐢 Open-Source Evaluation & Testing for LLMs and ML models
🔗 docs.giskard.ai
andialbrecht/sqlparse ⭐ 3,671
A non-validating SQL parser module for Python
jmcnamara/XlsxWriter ⭐ 3,576
A Python module for creating Excel XLSX files.
🔗 xlsxwriter.readthedocs.io
deepchecks/deepchecks ⭐ 3,520
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
🔗 docs.deepchecks.com/stable
rom1504/img2dataset ⭐ 3,515
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
run-llama/llama-hub ⭐ 3,434
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
🔗 llamahub.ai
praw-dev/praw ⭐ 3,425
PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
🔗 praw.readthedocs.io
pyeve/cerberus ⭐ 3,134
Lightweight, extensible data validation library for Python
🔗 python-cerberus.org
datafold/data-diff ⭐ 2,928
Compare tables within or across databases
🔗 docs.datafold.com
zoomeranalytics/xlwings ⭐ 2,921
xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
🔗 www.xlwings.org
pathwaycom/pathway ⭐ 2,890
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
🔗 pathway.com
docarray/docarray ⭐ 2,881
Represent, send, store and search multimodal data
🔗 docs.docarray.org
pallets/itsdangerous ⭐ 2,875
Safely pass trusted data to untrusted environments and back.
🔗 itsdangerous.palletsprojects.com
goldsmith/Wikipedia ⭐ 2,859
A Pythonic wrapper for the Wikipedia API
🔗 wikipedia.readthedocs.org
awslabs/amazon-redshift-utils ⭐ 2,739
Amazon Redshift Utils contains utilities, scripts and views that are useful in a Redshift environment
sqlalchemy/alembic ⭐ 2,656
A database migrations tool for SQLAlchemy.
rapidai/RapidOCR ⭐ 2,612
Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO and PaddlePaddle. (The PaddleOCR models have been converted and run with ONNXRuntime inference, which is very fast.)
🔗 rapidai.github.io/rapidocrdocs
kayak/pypika ⭐ 2,453
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
🔗 pypika.readthedocs.io/en/latest
pynamodb/PynamoDB ⭐ 2,413
A pythonic interface to Amazon's DynamoDB
🔗 pynamodb.readthedocs.io
emirozer/fake2db ⭐ 2,276
Generate fake but valid data filled databases for test purposes using most popular patterns (AFAIK). Current support is sqlite, mysql, postgresql, mongodb, redis, couchdb.
sdv-dev/SDV ⭐ 2,253
Synthetic data generation for tabular data
🔗 docs.sdv.dev/sdv
uqfoundation/dill ⭐ 2,227
serialize all of Python
🔗 dill.rtfd.io
accenture/AmpliGraph ⭐ 2,127
Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org
dlt-hub/dlt ⭐ 2,123
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
🔗 dlthub.com/docs
graphistry/pygraphistry ⭐ 2,106
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
samuelcolvin/arq ⭐ 2,059
Fast job queuing and RPC in python with asyncio and redis.
🔗 arq-docs.helpmanual.io
sfu-db/connector-x ⭐ 1,906
Fastest library to load data from DB to DataFrames in Rust and Python
🔗 sfu-db.github.io/connector-x
huggingface/datatrove ⭐ 1,838
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
agronholm/sqlacodegen ⭐ 1,821
Automatic model code generator for SQLAlchemy
uber/petastorm ⭐ 1,773
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
milvus-io/bootcamp ⭐ 1,761
Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
🔗 milvus.io
aminalaee/sqladmin ⭐ 1,745
SQLAlchemy Admin for FastAPI and Starlette
🔗 aminalaee.dev/sqladmin
aio-libs/aiomysql ⭐ 1,734
aiomysql is a library for accessing a MySQL database from asyncio
🔗 aiomysql.rtfd.io
simple-salesforce/simple-salesforce ⭐ 1,638
A very simple Salesforce.com REST API client for Python
collerek/ormar ⭐ 1,630
python async orm with fastapi in mind and pydantic validation
🔗 collerek.github.io/ormar
simonw/sqlite-utils ⭐ 1,602
Python CLI utility and library for manipulating SQLite databases
🔗 sqlite-utils.datasette.io
eleutherai/the-pile ⭐ 1,451
The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together.
sdispater/orator ⭐ 1,429
The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
🔗 orator-orm.com
zarr-developers/zarr-python ⭐ 1,425
An implementation of chunked, compressed, N-dimensional arrays for Python.
🔗 zarr.readthedocs.io
mchong6/JoJoGAN ⭐ 1,416
Official PyTorch repo for JoJoGAN: One Shot Face Stylization
ydataai/ydata-synthetic ⭐ 1,396
Synthetic data generators for tabular and time-series data
🔗 docs.synthetic.ydata.ai
aio-libs/aiopg ⭐ 1,387
aiopg is a library for accessing a PostgreSQL database from asyncio
🔗 aiopg.readthedocs.io
google/tensorstore ⭐ 1,328
Library for reading and writing large multi-dimensional arrays.
🔗 google.github.io/tensorstore
scholarly-python-package/scholarly ⭐ 1,314
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
🔗 scholarly.readthedocs.io
eliasdabbas/advertools ⭐ 1,109
advertools - online marketing productivity and analysis tools
🔗 advertools.readthedocs.io
pytorch/data ⭐ 1,104
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
aio-libs/aiocache ⭐ 1,082
Asyncio cache manager for redis, memcached and memory
🔗 aiocache.readthedocs.io
brettkromkamp/contextualise ⭐ 1,043
Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
🔗 contextualise.dev
uber/fiber ⭐ 1,039
Distributed Computing for AI Made Simple
🔗 uber.github.io/fiber
intake/intake ⭐ 998
Intake is a lightweight package for finding, investigating, loading and disseminating data.
🔗 intake.readthedocs.io
duckdb/dbt-duckdb ⭐ 828
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
scikit-hep/awkward ⭐ 816
Manipulate JSON-like data with NumPy-like idioms.
🔗 awkward-array.org
koaning/human-learn ⭐ 791
Natural Intelligence is still a pretty good idea.
🔗 koaning.github.io/human-learn
goccy/bigquery-emulator ⭐ 779
BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development.
macbre/sql-metadata ⭐ 773
Uses tokenized query returned by python-sqlparse and generates query metadata
🔗 pypi.python.org/pypi/sql-metadata
hyperqueryhq/whale ⭐ 724
🐳 The stupidly simple CLI workspace for your data warehouse.
🔗 rsyi.gitbook.io/whale
googleapis/python-bigquery ⭐ 723
Python Client for Google BigQuery
dgarnitz/vectorflow ⭐ 660
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
🔗 www.getvectorflow.com
kagisearch/vectordb ⭐ 604
A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
🔗 vectordb.com
igorbenav/fastcrud ⭐ 592
FastCRUD is a Python package for FastAPI, offering robust async CRUD operations and flexible endpoint creation utilities.
jina-ai/vectordb ⭐ 517
A Python vector database you just need - no more, no less.
koaning/doubtlab ⭐ 503
Doubt your data, find bad labels.
🔗 koaning.github.io/doubtlab
koaning/bulk ⭐ 485
Bulk is a quick UI developer tool to apply some bulk labels.
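One entry in this category, itsdangerous, is described as a way to "safely pass trusted data to untrusted environments and back". The general idea behind that kind of library is to attach a keyed signature so tampering can be detected on the way back. The sketch below shows the idea with the standard-library `hmac` module only. The `sign` and `unsign` helpers and the `value.signature` layout are assumptions made for illustration, not the itsdangerous API or wire format.

```python
import hashlib
import hmac


def sign(value: bytes, key: bytes) -> bytes:
    """Append a hex HMAC-SHA256 signature to `value`, separated by a dot."""
    signature = hmac.new(key, value, hashlib.sha256).hexdigest().encode()
    return value + b"." + signature


def unsign(signed: bytes, key: bytes) -> bytes:
    """Return the original value, or raise ValueError if it was tampered with."""
    value, _, signature = signed.rpartition(b".")
    expected = hmac.new(key, value, hashlib.sha256).hexdigest().encode()
    # compare_digest avoids leaking information through timing differences.
    if not hmac.compare_digest(signature, expected):
        raise ValueError("signature does not match")
    return value


secret = b"example-secret-key"  # hypothetical key, for illustration only
token = sign(b"user_id=42", secret)
print(unsign(token, secret))
```

Signing only proves integrity. The payload itself stays readable by whoever holds the token, so it should not contain secrets.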
Debugging and tracing tools.
cool-rr/PySnooper ⭐ 16,310
Never use print for debugging again
gruns/icecream ⭐ 8,799
🍦 Never use print() to debug again.
shobrook/rebound ⭐ 4,095
Command-line tool that instantly fetches Stack Overflow results when an exception is thrown
inducer/pudb ⭐ 2,931
Full-screen console debugger for Python
🔗 documen.tician.de/pudb
alexmojaki/heartrate ⭐ 1,750
Simple real time visualisation of the execution of a Python program.
alexmojaki/birdseye ⭐ 1,646
Graphical Python debugger which lets you easily view the values of all evaluated expressions
🔗 birdseye.readthedocs.io
alexmojaki/snoop ⭐ 1,243
A powerful set of Python debugging tools, based on PySnooper
samuelcolvin/python-devtools ⭐ 969
Dev tools for python
🔗 python-devtools.helpmanual.io
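Tracing tools of this kind generally build on the interpreter's trace hook. As a minimal sketch of that mechanism (not the implementation of PySnooper, snoop or any other tool listed here), the hypothetical `trace_lines` decorator below uses `sys.settrace` to record the line number and local variables each time a line of the decorated function is about to run.

```python
import functools
import sys


def trace_lines(func):
    """Record (line_number, locals) for every line executed inside `func`."""
    events = []

    def tracer(frame, event, arg):
        if frame.f_code is not func.__code__:
            return None  # Ignore frames that belong to other functions.
        if event == "line":
            events.append((frame.f_lineno, dict(frame.f_locals)))
        return tracer

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        previous = sys.gettrace()
        sys.settrace(tracer)
        try:
            return func(*args, **kwargs)
        finally:
            sys.settrace(previous)  # Always restore the earlier trace hook.

    wrapper.events = events
    return wrapper


@trace_lines
def total(n):
    s = 0
    for i in range(n):
        s += i
    return s


print(total(3))
for lineno, local_vars in total.events:
    print(lineno, local_vars)
```

Each recorded snapshot shows the locals just before the line runs, which is why a variable first appears one event after the line that assigns it.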
Text-to-image diffusion model libraries, tools and apps for generating images from natural language.
automatic1111/stable-diffusion-webui ⭐ 137,638
Stable Diffusion web UI
compvis/stable-diffusion ⭐ 67,039
A latent text-to-image diffusion model
🔗 ommer-lab.com/research/latent-diffusion-models
comfyanonymous/ComfyUI ⭐ 45,565
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
🔗 www.comfy.org
stability-ai/stablediffusion ⭐ 37,984
High-Resolution Image Synthesis with Latent Diffusion Models
lllyasviel/ControlNet ⭐ 29,344
Let us control diffusion models!
huggingface/diffusers ⭐ 24,540
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
🔗 huggingface.co/docs/diffusers
invoke-ai/InvokeAI ⭐ 22,522
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multi...
🔗 invoke-ai.github.io/invokeai
apple/ml-stable-diffusion ⭐ 16,575
Stable Diffusion with Core ML on Apple Silicon
borisdayma/dalle-mini ⭐ 14,717
DALL·E Mini - Generate images from a text prompt
🔗 www.craiyon.com
divamgupta/diffusionbee-stable-diffusion-ui ⭐ 12,209
Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
🔗 diffusionbee.com
compvis/latent-diffusion ⭐ 11,289
High-Resolution Image Synthesis with Latent Diffusion Models
lucidrains/DALLE2-pytorch ⭐ 11,003
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
instantid/InstantID ⭐ 10,672
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
🔗 instantid.github.io
openbmb/MiniCPM-V ⭐ 9,897
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
facebookresearch/dinov2 ⭐ 8,573
PyTorch code and models for the DINOv2 self-supervised learning method.
ashawkey/stable-dreamfusion ⭐ 8,083
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
carson-katri/dream-textures ⭐ 7,733
Stable Diffusion built-in to Blender
xavierxiao/Dreambooth-Stable-Diffusion ⭐ 7,545
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
timothybrooks/instruct-pix2pix ⭐ 6,180
PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo.
openai/consistency_models ⭐ 6,046
Official repo for consistency models.
idea-research/GroundingDINO ⭐ 5,942
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
🔗 arxiv.org/abs/2303.05499
opengvlab/InternVL ⭐ 4,898
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model with performance approaching GPT-4o.
🔗 internvl.readthedocs.io/en/latest
salesforce/BLIP ⭐ 4,543
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
nateraw/stable-diffusion-videos ⭐ 4,377
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
lkwq007/stablediffusion-infinity ⭐ 3,840
Outpainting with Stable Diffusion on an infinite canvas
jina-ai/discoart ⭐ 3,837
🪩 Create Disco Diffusion artworks in one line
mlc-ai/web-stable-diffusion ⭐ 3,522
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
🔗 mlc.ai/web-stable-diffusion
openai/glide-text2im ⭐ 3,515
GLIDE: a diffusion-based text-conditional image synthesis model
openai/improved-diffusion ⭐ 3,086
Release for Improved Denoising Diffusion Probabilistic Models
saharmor/dalle-playground ⭐ 2,764
A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)
stability-ai/stability-sdk ⭐ 2,416
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
🔗 platform.stability.ai
google-research/big_vision ⭐ 2,098
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
thudm/CogVLM2 ⭐ 1,738
GPT4V-level open-source multi-modal model based on Llama3-8B
coyote-a/ultimate-upscale-for-automatic1111 ⭐ 1,581
Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI
divamgupta/stable-diffusion-tensorflow ⭐ 1,576
Stable Diffusion in TensorFlow / Keras
nvlabs/prismer ⭐ 1,291
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
🔗 shikun.io/projects/prismer
chenyangqiqi/FateZero ⭐ 1,083
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
🔗 fate-zero-edit.github.io
open-compass/VLMEvalKit ⭐ 836
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks
🔗 huggingface.co/spaces/opencompass/open_vlm_leaderboard
thereforegames/unprompted ⭐ 770
Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.
sharonzhou/long_stable_diffusion ⭐ 680
Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)
tanelp/tiny-diffusion ⭐ 596
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
laion-ai/dalle2-laion ⭐ 500
Pretrained Dalle2 from laion
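Most entries in this category build on denoising diffusion models. For readers new to the idea, here is a small pure-Python sketch of the forward (noising) process from the DDPM formulation, where a clean sample x0 is mixed with Gaussian noise as x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps. The linear schedule from 1e-4 to 0.02 over 1000 steps is a commonly used default, and the function names are illustrative assumptions. None of this is taken from any repository listed above.

```python
import math
import random


def linear_beta_schedule(steps, beta_start=1e-4, beta_end=0.02):
    """Noise variances that grow linearly from beta_start to beta_end."""
    if steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (steps - 1)
    return [beta_start + step * t for t in range(steps)]


def cumulative_alphas(betas):
    """alpha_bar_t: running product of (1 - beta) up to each step."""
    alpha_bars = []
    product = 1.0
    for beta in betas:
        product *= 1.0 - beta
        alpha_bars.append(product)
    return alpha_bars


def noise_sample(x0, alpha_bar_t, rng):
    """Draw x_t given x0 in a single step, without iterating over time."""
    signal_scale = math.sqrt(alpha_bar_t)
    noise_scale = math.sqrt(1.0 - alpha_bar_t)
    return [signal_scale * x + noise_scale * rng.gauss(0.0, 1.0) for x in x0]


betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)
rng = random.Random(0)
point = [1.0, -1.0]  # a toy 2D data point

print(noise_sample(point, alpha_bars[0], rng))   # still close to the data
print(noise_sample(point, alpha_bars[-1], rng))  # close to pure noise
```

A trained model learns to reverse this process one step at a time, which is the part the libraries above implement.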
Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.
openbb-finance/OpenBB ⭐ 26,908
Investment Research for Everyone, Everywhere.
🔗 openbb.co
quantopian/zipline ⭐ 17,388
Zipline, a Pythonic Algorithmic Trading Library
🔗 www.zipline.io
microsoft/qlib ⭐ 14,989
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, ...
🔗 qlib.readthedocs.io/en/latest
mementum/backtrader ⭐ 13,741
Python Backtesting library for trading strategies
🔗 www.backtrader.com
ai4finance-foundation/FinGPT ⭐ 12,846
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
🔗 ai4finance.org
ranaroussi/yfinance ⭐ 12,714
Download market data from Yahoo! Finance's API
🔗 aroussi.com/post/python-yahoo-finance
ai4finance-foundation/FinRL ⭐ 9,520
FinRL: Financial Reinforcement Learning. 🔥
🔗 ai4finance.org
ta-lib/ta-lib-python ⭐ 9,354
Python wrapper for TA-Lib (http://ta-lib.org/).
🔗 ta-lib.github.io/ta-lib-python
quantconnect/Lean ⭐ 9,260
Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
🔗 lean.io
goldmansachs/gs-quant ⭐ 7,375
Python toolkit for quantitative finance
🔗 developer.gs.com/discover/products/gs-quant
quantopian/pyfolio ⭐ 5,569
Portfolio and risk analytics in Python
🔗 quantopian.github.io/pyfolio
kernc/backtesting.py ⭐ 5,201
🔎 📈 💰 Backtest trading strategies in Python.
🔗 kernc.github.io/backtesting.py
twopirllc/pandas-ta ⭐ 5,108
Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
🔗 twopirllc.github.io/pandas-ta
ranaroussi/quantstats ⭐ 4,639
Portfolio analytics for quants, written in Python
google/tf-quant-finance ⭐ 4,408
High-performance TensorFlow library for quantitative finance.
gbeced/pyalgotrade ⭐ 4,358
Python Algorithmic Trading Library
🔗 gbeced.github.io/pyalgotrade
polakowo/vectorbt ⭐ 4,067
Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
🔗 vectorbt.dev
borisbanushev/stockpredictionai ⭐ 4,040
In this notebook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ...
matplotlib/mplfinance ⭐ 3,544
Financial Markets Data Visualization using Matplotlib
🔗 pypi.org/project/mplfinance
cuemacro/finmarketpy ⭐ 3,403
Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
🔗 www.cuemacro.com
quantopian/alphalens ⭐ 3,220
Performance analysis of predictive (alpha) stock factors
🔗 quantopian.github.io/alphalens
zvtvz/zvt ⭐ 3,099
modular quant framework.
🔗 zvt.readthedocs.io/en/latest
robcarver17/pysystemtrade ⭐ 2,525
Systematic Trading in python
quantopian/research_public ⭐ 2,375
Quantitative research and educational materials
🔗 www.quantopian.com/lectures
pmorissette/bt ⭐ 2,137
bt - flexible backtesting for Python
🔗 pmorissette.github.io/bt
blankly-finance/blankly ⭐ 2,052
💸 Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
🔗 package.blankly.finance
domokane/FinancePy ⭐ 2,024
A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives.
pmorissette/ffn ⭐ 1,878
ffn - a financial function library for Python
🔗 pmorissette.github.io/ffn
cuemacro/findatapy ⭐ 1,630
Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc.
quantopian/empyrical ⭐ 1,262
Common financial risk and performance metrics. Used by zipline and pyfolio.
🔗 quantopian.github.io/empyrical
idanya/algo-trader ⭐ 765
Trading bot with support for realtime trading, backtesting, custom strategies and much more.
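Several libraries in this category report risk and performance metrics for a price or equity series. To make that concrete, the sketch below computes two of the most common ones, total return and maximum drawdown, in plain Python. The function names and the sample prices are made up for illustration and do not reflect the API of empyrical, pyfolio or any other library listed here.

```python
def total_return(prices):
    """Fractional gain from the first price to the last."""
    return prices[-1] / prices[0] - 1.0


def max_drawdown(prices):
    """Largest peak-to-trough decline, as a negative fraction (0.0 if none)."""
    peak = prices[0]
    worst = 0.0
    for price in prices:
        peak = max(peak, price)  # highest price seen so far
        worst = min(worst, price / peak - 1.0)
    return worst


equity_curve = [100.0, 120.0, 90.0, 110.0, 80.0, 130.0]  # hypothetical data
print(f"total return: {total_return(equity_curve):.2%}")
print(f"max drawdown: {max_drawdown(equity_curve):.2%}")
```

Here the series ends 30% up, yet at its worst it sat a third below its earlier peak of 120, which is why backtesting tools usually report both numbers together.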
Game development tools, engines and libraries.
pygame/pygame ⭐ 7,235
🎮 pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
🔗 www.pygame.org
panda3d/panda3d ⭐ 4,405
Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
🔗 www.panda3d.org
pokepetter/ursina ⭐ 2,155
A game engine powered by python and panda3d.
🔗 pokepetter.github.io/ursina
pyglet/pyglet ⭐ 1,830
pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
🔗 pyglet.org
pythonarcade/arcade ⭐ 1,669
Easy to use Python library for creating 2D arcade games.
🔗 arcade.academy
Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.
domlysz/BlenderGIS โญ 7,539
Blender addons to make the bridge between Blender and geographic data
python-visualization/folium โญ 6,803
Python Data. Leaflet.js Maps.
๐ python-visualization.github.io/folium
gboeing/osmnx โญ 4,783
OSMnx is a Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
๐ osmnx.readthedocs.io
osgeo/gdal โญ 4,714
GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
๐ gdal.org
geopandas/geopandas โญ 4,392
Python tools for geographic data
๐ geopandas.org
shapely/shapely โญ 3,788
Manipulation and analysis of geometric objects
๐ shapely.readthedocs.io/en/stable
giswqs/geemap โญ 3,352
A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
๐ geemap.org
holoviz/datashader โญ 3,283
Quickly and accurately render even the largest data.
๐ datashader.org
opengeos/leafmap โญ 3,120
A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
๐ leafmap.org
opengeos/segment-geospatial โญ 2,804
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
๐ samgeo.gishub.org
google/earthengine-api โญ 2,610
Python and JavaScript bindings for calling the Earth Engine API.
microsoft/torchgeo โญ 2,387
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
๐ www.osgeo.org/projects/torchgeo
rasterio/rasterio โญ 2,197
Rasterio reads and writes geospatial raster datasets
๐ rasterio.readthedocs.io
mcordts/cityscapesScripts โญ 2,132
README and scripts for the Cityscapes Dataset
azavea/raster-vision โญ 2,032
An open source library and framework for deep learning on satellite and aerial imagery.
๐ docs.rastervision.io
apache/sedona โญ 1,839
A cluster computing framework for processing large-scale geospatial data
๐ sedona.apache.org
gboeing/osmnx-examples โญ 1,523
Gallery of OSMnx tutorials, usage examples, and feature demonstrations.
๐ osmnx.readthedocs.io
jupyter-widgets/ipyleaflet โญ 1,476
A Jupyter - Leaflet.js bridge
๐ ipyleaflet.readthedocs.io
microsoft/GlobalMLBuildingFootprints โญ 1,342
Worldwide building footprints derived from satellite imagery
pysal/pysal โญ 1,300
PySAL: Python Spatial Analysis Library Meta-Package
๐ pysal.org/pysal
anitagraser/movingpandas โญ 1,183
Movement trajectory classes and functions built on top of GeoPandas
๐ movingpandas.org
residentmario/geoplot โญ 1,141
High-level geospatial data visualization library for Python.
๐ residentmario.github.io/geoplot/index.html
sentinel-hub/eo-learn โญ 1,103
Earth observation processing framework for machine learning in Python
๐ eo-learn.readthedocs.io/en/latest
opengeos/streamlit-geospatial โญ 845
A multi-page streamlit app for geospatial
๐ huggingface.co/spaces/giswqs/streamlit
osgeo/grass โญ 810
GRASS GIS - free and open-source geospatial processing engine
๐ grass.osgeo.org
makepath/xarray-spatial โญ 808
Raster-based Spatial Analytics for Python
๐ xarray-spatial.readthedocs.io
developmentseed/titiler โญ 745
Build your own Raster dynamic map tile services
๐ developmentseed.org/titiler
scikit-mobility/scikit-mobility โญ 703
scikit-mobility: mobility analysis in Python
๐ scikit-mobility.github.io/scikit-mobility
Graphs and network libraries: network analysis, graph machine learning, visualisation.
networkx/networkx โญ 14,575
Network Analysis in Python
๐ networkx.org
stellargraph/stellargraph โญ 2,925
StellarGraph - Machine Learning on Graphs
๐ stellargraph.readthedocs.io
westhealth/pyvis โญ 960
Python package for creating and visualizing interactive network graphs.
๐ pyvis.readthedocs.io/en/latest
microsoft/graspologic โญ 737
graspologic is a package for graph statistical algorithms
๐ graspologic-org.github.io/graspologic
rampasek/GraphGPS โญ 636
Recipe for a General, Powerful, Scalable Graph Transformer
dylanhogg/llmgraph โญ 256
Create knowledge graphs with LLMs
Graphical user interface libraries and toolkits.
pysimplegui/PySimpleGUI โญ 13,304
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
๐ www.pysimplegui.com
hoffstadt/DearPyGui โญ 12,795
Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
๐ dearpygui.readthedocs.io/en/latest
parthjadhav/Tkinter-Designer โญ 8,785
An easy and fast way to create a Python GUI
samuelcolvin/FastUI โญ 7,960
FastUI is a new way to build web application user interfaces defined by declarative Python code.
๐ fastui-demo.onrender.com
r0x0r/pywebview โญ 4,556
Build GUI for your Python program with JavaScript, HTML, and CSS
๐ pywebview.flowrl.com
beeware/toga โญ 4,243
A Python native, OS native GUI toolkit.
๐ toga.readthedocs.io/en/latest
dddomodossola/remi โญ 3,489
Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet.
wxwidgets/Phoenix โญ 2,266
wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
๐ wxpython.org
Jupyter, JupyterLab and Notebook tools, libraries and plugins.
jupyterlab/jupyterlab โญ 14,006
JupyterLab computational environment.
๐ jupyterlab.readthedocs.io
jupyter/notebook โญ 11,472
Jupyter Interactive Notebook
๐ jupyter-notebook.readthedocs.io
mwouts/jupytext โญ 6,544
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
๐ jupytext.readthedocs.io
nteract/papermill โญ 5,743
Parameterize, execute, and analyze notebooks
๐ papermill.readthedocs.io/en/latest
connorferster/handcalcs โญ 5,383
Python library for converting Python calculations into rendered latex.
voila-dashboards/voila โญ 5,351
Voilà turns Jupyter notebooks into standalone web applications
๐ voila.readthedocs.io
executablebooks/jupyter-book โญ 3,792
Create beautiful, publication-quality books and documents from computational content.
๐ jupyterbook.org
jupyterlite/jupyterlite โญ 3,769
Wasm powered Jupyter running in the browser
๐ jupyterlite.rtfd.io/en/stable/try/lab
jupyterlab/jupyterlab-desktop โญ 3,585
JupyterLab desktop application, based on Electron.
jupyter-widgets/ipywidgets โญ 3,116
Interactive Widgets for the Jupyter Notebook
๐ ipywidgets.readthedocs.io
jupyterlab/jupyter-ai โญ 3,052
A generative AI extension for JupyterLab
๐ jupyter-ai.readthedocs.io
quantopian/qgrid โญ 3,034
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
jupyter/nbdime โญ 2,640
Tools for diffing and merging of Jupyter notebooks.
๐ nbdime.readthedocs.io
mito-ds/mito โญ 2,266
The mitosheet package, trymito.io, and other public Mito code.
๐ trymito.io
jupyter/nbviewer โญ 2,196
nbconvert as a web service: Render Jupyter Notebooks as static web pages
๐ nbviewer.jupyter.org
maartenbreddels/ipyvolume โญ 1,928
3d plotting for Python in the Jupyter notebook based on IPython widgets using WebGL
jupyter-lsp/jupyterlab-lsp โญ 1,773
Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol
๐ jupyterlab-lsp.readthedocs.io
jupyter/nbconvert โญ 1,703
Jupyter Notebook Conversion
๐ nbconvert.readthedocs.io
nbqa-dev/nbQA โญ 1,009
Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
๐ nbqa.readthedocs.io/en/latest/index.html
vizzuhq/ipyvizzu โญ 941
Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
๐ ipyvizzu.vizzuhq.com
koaning/drawdata โญ 749
Draw datasets from within Jupyter.
๐ calmcode.io/labs/drawdata.html
aws/graph-notebook โญ 705
Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
๐ github.com/aws/graph-notebook
linealabs/lineapy โญ 664
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
๐ lineapy.org
xiaohk/stickyland โญ 509
Break the linear presentation of Jupyter Notebooks with sticky cells!
๐ xiaohk.github.io/stickyland
Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integrations. Also see the Natural Language Processing category for crossover.
significant-gravitas/AutoGPT โญ 165,614
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
๐ agpt.co
hwchase17/langchain โญ 90,483
Build context-aware reasoning applications
๐ python.langchain.com
nomic-ai/gpt4all โญ 68,427
GPT4All: Chat with Local LLMs on Any Device
๐ nomic.ai/gpt4all
ggerganov/llama.cpp โญ 63,308
LLM inference in C/C++
xtekky/gpt4free โญ 59,588
The official gpt4free repository | a collection of various powerful language models
๐ g4f.ai
facebookresearch/llama โญ 54,977
Inference code for Llama models
imartinez/private-gpt โญ 53,239
Interact with your documents using the power of GPT, 100% privately, no data leaks
๐ docs.privategpt.dev
gpt-engineer-org/gpt-engineer โญ 51,581
Specify what you want it to build, the AI asks for clarification, and then builds it. Completely separate team and codebase from the AI Web App builder https://gptengineer.app
killianlucas/open-interpreter โญ 51,412
A natural language interface for computers
๐ openinterpreter.com
xai-org/grok-1 โญ 49,250
This repository contains JAX example code for loading and running the Grok-1 open-weights model.
geekan/MetaGPT โญ 42,242
The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
๐ deepwisdom.ai
thudm/ChatGLM-6B โญ 40,217
ChatGLM-6B: An Open Bilingual Dialogue Language Model
oobabooga/text-generation-webui โญ 38,906
A Gradio web UI for Large Language Models.
hpcaitech/ColossalAI โญ 38,474
Making large AI models cheaper, faster and more accessible
๐ www.colossalai.org
laion-ai/Open-Assistant โญ 36,889
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
๐ open-assistant.io
lm-sys/FastChat โญ 36,085
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
karpathy/nanoGPT โญ 35,504
The simplest, fastest repository for training/finetuning medium-sized GPTs.
moymix/TaskMatrix โญ 34,532
Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.
quivrhq/quivr โญ 34,528
Open-source RAG framework for building GenAI Second Brains. Build a productivity assistant (RAG) and chat with your docs (PDF, CSV, ...) and apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq, that you can share with users. Efficient retrieval augmented generation framework.
๐ quivr.com
jerryjliu/llama_index โญ 34,335
LlamaIndex is a data framework for your LLM applications
๐ docs.llamaindex.ai
microsoft/autogen โญ 29,377
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
๐ microsoft.github.io/autogen
pythagora-io/gpt-pilot โญ 29,284
The first real AI developer
tatsu-lab/stanford_alpaca โญ 29,252
Code and documentation to train Stanford's Alpaca models, and generate the data.
๐ crfm.stanford.edu/2023/03/13/alpaca.html
hiyouga/LLaMA-Factory โญ 28,756
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
๐ arxiv.org/abs/2403.13372
meta-llama/llama3 โญ 25,401
The official Meta Llama 3 GitHub site
vision-cair/MiniGPT-4 โญ 25,215
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
๐ minigpt-4.github.io
vllm-project/vllm โญ 24,549
A high-throughput and memory-efficient inference and serving engine for LLMs
๐ docs.vllm.ai
logspace-ai/langflow โญ 23,973
Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.
๐ www.langflow.org
microsoft/JARVIS โญ 23,457
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
karpathy/llm.c โญ 22,562
LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython
openai/gpt-2 โญ 22,097
Code for the paper "Language Models are Unsupervised Multitask Learners"
๐ openai.com/blog/better-language-models
openai/chatgpt-retrieval-plugin โญ 20,994
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
microsoft/semantic-kernel โญ 20,975
Integrate cutting-edge LLM technology quickly and easily into your apps
๐ aka.ms/semantic-kernel
danielmiessler/fabric โญ 20,228
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
๐ danielmiessler.com/p/fabric-origin-story
yoheinakajima/babyagi โญ 19,806
GPT-4 powered task-driven autonomous agent
karpathy/minGPT โญ 19,633
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
embedchain/mem0 โญ 19,322
The memory layer for Personalized AI
๐ mem0.ai
haotian-liu/LLaVA โญ 18,643
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
๐ llava.hliu.cc
tloen/alpaca-lora โญ 18,475
Instruct-tune LLaMA on consumer hardware
rasahq/rasa โญ 18,441
Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
๐ rasa.com/docs/rasa
guidance-ai/guidance โญ 18,437
A guidance language for controlling large language models.
mlc-ai/mlc-llm โญ 18,321
Universal LLM Deployment Engine with ML Compilation
๐ llm.mlc.ai
joaomdmoura/crewAI โญ 18,134
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
๐ crewai.com
stitionai/devika โญ 18,092
Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective.
karpathy/llama2.c โญ 16,997
Inference Llama 2 in one file of pure C
facebookresearch/codellama โญ 15,754
Inference code for CodeLlama models
thudm/ChatGLM2-6B โญ 15,668
ChatGLM2-6B: An Open Bilingual Chat LLM
huggingface/peft โญ 15,409
PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
๐ huggingface.co/docs/peft
stanfordnlp/dspy โญ 15,391
DSPy: The framework for programming, not prompting, foundation models
๐ dspy-docs.vercel.app
transformeroptimus/SuperAGI โญ 15,108
SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
๐ superagi.com
deepset-ai/haystack โญ 15,085
LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conv...
๐ haystack.deepset.ai
mayooear/gpt4-pdf-chatbot-langchain โญ 14,801
GPT4 & LangChain Chatbot for large PDF docs
๐ www.youtube.com/watch?v=ih9pbgvvoo4
openai/evals โญ 14,513
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
microsoft/graphrag โญ 14,488
A modular graph-based Retrieval-Augmented Generation (RAG) system
๐ microsoft.github.io/graphrag
fauxpilot/fauxpilot โญ 14,478
FauxPilot - an open-source alternative to GitHub Copilot server
idea-research/Grounded-Segment-Anything โญ 14,474
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
๐ arxiv.org/abs/2401.14159
unslothai/unsloth โญ 14,043
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
๐ unsloth.ai
infiniflow/ragflow โญ 13,868
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
๐ ragflow.io
assafelovic/gpt-researcher โญ 13,533
GPT based autonomous agent that does online comprehensive research on any given topic
๐ gptr.dev
qwenlm/Qwen โญ 12,961
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
dao-ailab/flash-attention โญ 12,854
Fast and memory-efficient exact attention
blinkdl/RWKV-LM โญ 12,101
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
mlc-ai/web-llm โญ 11,984
High-performance In-browser LLM Inference Engine
๐ webllm.mlc.ai
openlmlab/MOSS โญ 11,896
An open-source tool-augmented conversational language model from Fudan University
๐ txsun1997.github.io/blogs/moss.html
paddlepaddle/PaddleNLP โญ 11,836
Easy-to-use and powerful NLP and LLM library with an awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including Text Classification, Neural Search, Question Answering, Information Extraction, Document Intelligence, Sentiment Analysis, etc.
๐ paddlenlp.readthedocs.io
smol-ai/developer โญ 11,768
the first library to let you embed a developer agent in your own app!
๐ twitter.com/smolmodels
facebookresearch/llama-recipes โญ 11,245
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta ...
berriai/litellm โญ 11,123
LLM Proxy to call 100+ LLM APIs using the OpenAI format - Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq
๐ docs.litellm.ai/docs
h2oai/h2ogpt โญ 11,112
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
๐ h2o.ai
shishirpatil/gorilla โญ 11,026
Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
๐ gorilla.cs.berkeley.edu
phidatahq/phidata โญ 10,923
Phidata is a toolkit for building AI Assistants using function calling.
๐ docs.phidata.com
databrickslabs/dolly โญ 10,808
Databricks' Dolly, a large language model trained on the Databricks Machine Learning Platform
๐ www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html
vanna-ai/vanna โญ 10,277
Chat with your SQL database. Accurate Text-to-SQL Generation via LLMs using RAG.
๐ vanna.ai/docs
microsoft/LoRA โญ 10,033
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
๐ arxiv.org/abs/2106.09685
google-research/vision_transformer โญ 9,884
Vision Transformer and MLP-Mixer Architectures
artidoro/qlora โญ 9,798
QLoRA: Efficient Finetuning of Quantized LLMs
๐ arxiv.org/abs/2305.14314
nvidia/Megatron-LM โญ 9,643
Ongoing research training transformer models at scale
๐ docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start
mistralai/mistral-inference โญ 9,407
Official inference library for Mistral models
๐ mistral.ai
blinkdl/ChatRWKV โญ 9,351
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
lightning-ai/litgpt โญ 9,325
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
๐ lightning.ai
mshumer/gpt-prompt-engineer โญ 9,221
Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best.
microsoft/promptflow โญ 9,002
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
๐ microsoft.github.io/promptflow
lvwerra/trl โญ 8,964
Train transformer language models with reinforcement learning.
๐ hf.co/docs/trl
karpathy/minbpe โญ 8,858
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
apple/ml-ferret โญ 8,259
Ferret: Refer and Ground Anything Anywhere at Any Granularity
eleutherai/gpt-neo โญ 8,189
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
๐ www.eleuther.ai
optimalscale/LMFlow โญ 8,169
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
๐ optimalscale.github.io/lmflow
thudm/CodeGeeX โญ 8,049
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
๐ codegeex.cn
outlines-dev/outlines โญ 7,793
Structured Text Generation from LLMs
๐ outlines-dev.github.io/outlines
lianjiatech/BELLE โญ 7,765
BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue large language model)
sjtu-ipads/PowerInfer โญ 7,764
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
thudm/GLM-130B โญ 7,650
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
plachtaa/VALL-E-X โญ 7,487
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
jzhang38/TinyLlama โญ 7,478
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
abetlen/llama-cpp-python โญ 7,433
Simple Python bindings for @ggerganov's llama.cpp library.
๐ llama-cpp-python.readthedocs.io
sweepai/sweep โญ 7,354
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
๐ sweep.dev
openlm-research/open_llama โญ 7,322
OpenLLaMA: An Open Reproduction of LLaMA
bigcode-project/starcoder โญ 7,227
Home of StarCoder: fine-tuning & inference!
vaibhavs10/insanely-fast-whisper โญ 7,104
An opinionated CLI to transcribe Audio files w/ Whisper on-device! Powered by Transformers, Optimum & flash-attn
jxnl/instructor โญ 7,069
Instructor is a Python library that makes it a breeze to work with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API to manage validation, retries, and streaming responses.
๐ python.useinstructor.com
zilliztech/GPTCache โญ 6,992
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
๐ gptcache.readthedocs.io
apple/corenet โญ 6,874
CoreNet is a deep neural network toolkit that allows researchers and engineers to train standard and novel small and large-scale models for a variety of tasks, including foundation models (e.g., CLIP and LLM), object classification, object detection, and semantic segmentation.
bhaskatripathi/pdfGPT โญ 6,867
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
๐ huggingface.co/spaces/bhaskartripathi/pdfgpt_turbo
eleutherai/gpt-neox โญ 6,746
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
๐ www.eleuther.ai
chainlit/chainlit โญ 6,480
Build Conversational AI in minutes
๐ docs.chainlit.io
mit-han-lab/streaming-llm โญ 6,413
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
๐ arxiv.org/abs/2309.17453
skypilot-org/skypilot โญ 6,380
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
๐ skypilot.readthedocs.io
langchain-ai/opengpts โญ 6,375
An open source effort to create a similar experience to OpenAI's GPTs and Assistants API.
nat/openplayground โญ 6,187
An LLM playground you can run on your laptop
run-llama/rags โญ 6,139
Build ChatGPT over your data, all with natural language
eleutherai/lm-evaluation-harness โญ 6,107
A framework for few-shot evaluation of language models.
๐ www.eleuther.ai
explodinggradients/ragas โญ 6,085
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
๐ docs.ragas.io
internlm/InternLM โญ 6,010
Official release of InternLM2.5 base and chat models. 1M context support
๐ internlm.intern-ai.org.cn
lightning-ai/lit-llama โญ 5,917
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
skyvern-ai/skyvern โญ 5,599
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
๐ www.skyvern.com
pytorch-labs/gpt-fast โญ 5,439
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
minedojo/Voyager โญ 5,423
An Open-Ended Embodied Agent with Large Language Models
๐ voyager.minedojo.org
weaviate/Verba โญ 5,414
Retrieval Augmented Generation (RAG) chatbot powered by Weaviate
dsdanielpark/Bard-API โญ 5,348
The unofficial Python package that returns responses from Google Bard using a cookie value.
๐ pypi.org/project/bardapi
microsoft/promptbase โญ 5,295
promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models.
langchain-ai/chat-langchain โญ 5,057
Locally hosted chatbot specifically focused on question answering over the LangChain documentation
๐ chat.langchain.com
langchain-ai/langgraph โญ 5,022
LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.
๐ langchain-ai.github.io/langgraph
mnotgod96/AppAgent โญ 4,768
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
๐ appagent-official.github.io
vikhyat/moondream โญ 4,723
A tiny open-source computer-vision language model designed to run efficiently on edge devices
๐ moondream.ai
openbmb/ToolBench โญ 4,667
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.
๐ openbmb.github.io/toolbench
togethercomputer/RedPajama-Data โญ 4,482
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
llmware-ai/llmware โญ 4,342
Unified framework for building enterprise RAG pipelines with small, specialized models
๐ llmware-ai.github.io/llmware
microsoft/LLMLingua โญ 4,321
Compresses the prompt and KV-Cache to speed up LLM inference and improve the model's perception of key information, achieving up to 20x compression with minimal performance loss.
๐ llmlingua.com
arcee-ai/mergekit โญ 4,296
Tools for merging pretrained large language models.
cg123/mergekit โญ 4,294
Tools for merging pretrained large language models.
allenai/OLMo โญ 4,284
OLMo is a repository for training and using AI2's state-of-the-art open language models. It is built by scientists, for scientists.
๐ allenai.org/olmo
microsoft/BioGPT โญ 4,273
Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining
1rgs/jsonformer โญ 4,235
A Bulletproof Way to Generate Structured JSON from Language Models
kyegomez/tree-of-thoughts โญ 4,207
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
๐ discord.gg/qutxnk2nmf
instruction-tuning-with-gpt-4/GPT-4-LLM โญ 4,140
Instruction Tuning with GPT-4
๐ instruction-tuning-with-gpt-4.github.io
sgl-project/sglang โญ 4,033
SGLang is yet another fast serving framework for large language models and vision language models.
yizhongw/self-instruct โญ 4,005
Aligning pretrained language models with instruction data generated by themselves.
mshumer/gpt-llm-trainer โญ 3,894
Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you
nvidia/NeMo-Guardrails โญ 3,872
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
promptfoo/promptfoo โญ 3,854
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
๐ www.promptfoo.dev
ravenscroftj/turbopilot โญ 3,832
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU
h2oai/h2o-llmstudio โญ 3,831
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
๐ gpt-gm.h2o.ai
whitead/paper-qa โญ 3,815
LLM Chain for answering questions from documents with citations
guardrails-ai/guardrails โญ 3,738
Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
๐ www.guardrailsai.com/docs
simonw/llm โญ 3,648
Access large language models from the command-line
๐ llm.datasette.io
mmabrouk/llm-workflow-engine โญ 3,631
Power CLI and Workflow manager for LLMs (core package)
eth-sri/lmql โญ 3,551
A language for constraint-guided and efficient LLM programming.
๐ lmql.ai
luodian/Otter โญ 3,537
Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
๐ otter-ntu.github.io
open-compass/opencompass โญ 3,524
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
๐ opencompass.org.cn
microsoft/LMOps โญ 3,486
General technology for enabling AI capabilities w/ LLMs and MLLMs
๐ aka.ms/generalai
minimaxir/simpleaichat โญ 3,456
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
pathwaycom/llm-app โญ 3,419
Dynamic RAG for enterprise. Ready to run with Docker, in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
๐ pathway.com/developers/templates
minimaxir/gpt-2-simple โญ 3,402
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
turboderp/exllamav2 โญ 3,348
A fast inference library for running LLMs locally on modern consumer-class GPUs
deep-diver/LLM-As-Chatbot โญ 3,270
LLM as a Chatbot Service
defog-ai/sqlcoder โญ 3,200
SoTA LLM for converting natural language questions to SQL queries
jaymody/picoGPT โญ 3,141
An unnecessarily tiny implementation of GPT-2 in NumPy.
next-gpt/NExT-GPT โญ 3,138
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
๐ next-gpt.github.io
agiresearch/AIOS โญ 3,116
AIOS, a Large Language Model (LLM) Agent operating system, embeds large language model into Operating Systems (OS) as the brain of the OS, enabling an operating system "with soul" -- an important step towards AGI.
๐ aios.foundation
truefoundry/cognita โญ 3,076
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
๐ cognita.truefoundry.com
iryna-kondr/scikit-llm โญ 3,019
Seamlessly integrate LLMs into scikit-learn.
๐ beastbyte.ai
microsoft/torchscale โญ 2,986
Foundation Architecture for (M)LLMs
๐ aka.ms/generalai
baichuan-inc/Baichuan-13B โญ 2,973
A 13B large language model developed by Baichuan Intelligent Technology
๐ huggingface.co/baichuan-inc/baichuan-13b-chat
meta-llama/llama-agentic-system โญ 2,909
Agentic components of the Llama Stack APIs
freedomintelligence/LLMZoo โญ 2,909
LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.
li-plus/chatglm.cpp โญ 2,862
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4
modelscope/ms-swift โญ 2,810
Use PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
๐ swift.readthedocs.io/zh-cn/latest/llm/index.html
juncongmoo/pyllama โญ 2,800
LLaMA: Open and Efficient Foundation Language Models
lm-sys/RouteLLM โญ 2,675
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
pipecat-ai/pipecat โญ 2,674
Open Source framework for voice and multimodal conversational AI
paperswithcode/galai โญ 2,673
Model API for GALACTICA
alpha-vllm/LLaMA2-Accessory โญ 2,656
An Open-source Toolkit for LLM Development
๐ llama2-accessory.readthedocs.io
cohere-ai/cohere-toolkit โญ 2,651
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
mistralai/mistral-finetune โญ 2,606
A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA.
bclavie/RAGatouille โญ 2,592
Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices.
hegelai/prompttools โญ 2,584
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
๐ prompttools.readthedocs.io
databricks/dbrx โญ 2,488
Code examples and resources for DBRX, a large language model developed by Databricks
๐ www.databricks.com
huggingface/text-embeddings-inference โญ 2,400
A blazing fast inference solution for text embeddings models
๐ huggingface.co/docs/text-embeddings-inference/quick_tour
ofa-sys/OFA โญ 2,379
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
young-geng/EasyLM โญ 2,344
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
civitai/sd_civitai_extension โญ 2,307
All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI
noahshinn/reflexion โญ 2,212
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
cheshire-cat-ai/core โญ 2,200
Production ready AI agent framework
๐ cheshirecat.ai
openai/finetune-transformer-lm โญ 2,127
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
๐ s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf
langroid/langroid โญ 2,121
Harness LLMs with Multi-Agent Programming
๐ langroid.github.io/langroid
intel/neural-compressor โญ 2,108
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
๐ intel.github.io/neural-compressor
tairov/llama2.mojo โญ 2,087
Inference Llama 2 in one file of pure 🔥
๐ www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov
openai/image-gpt โญ 2,025
Archived. Code and models from the paper "Generative Pretraining from Pixels"
spcl/graph-of-thoughts โญ 2,021
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
๐ arxiv.org/pdf/2308.09687.pdf
predibase/lorax โญ 1,989
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
๐ loraexchange.ai
lucidrains/toolformer-pytorch โญ 1,933
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
neulab/prompt2model โญ 1,932
prompt2model - Generate Deployable Models from Natural Language Instructions
truera/trulens โญ 1,923
Evaluation and Tracking for LLM Experiments
๐ www.trulens.org
openai/gpt-2-output-dataset โญ 1,921
Dataset of GPT-2 outputs for research in detection, biases, and more
minimaxir/aitextgen โญ 1,833
A robust Python tool for text-based AI training and generation using GPT-2.
๐ docs.aitextgen.io
ist-daslab/gptq โญ 1,829
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
๐ arxiv.org/abs/2210.17323
epfllm/meditron โญ 1,797
Meditron is a suite of open-source medical Large Language Models (LLMs).
๐ huggingface.co/epfl-llm
microsoft/Megatron-DeepSpeed โญ 1,789
Ongoing research training transformer language models at scale, including: BERT & GPT-2
openai/gpt-discord-bot โญ 1,747
Example Discord bot written in Python that uses the completions API to have conversations with the text-davinci-003 model, and the moderations API to filter the messages.
akariasai/self-rag โญ 1,676
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
๐ selfrag.github.io
facebookresearch/chameleon โญ 1,668
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
๐ arxiv.org/abs/2405.09818
ray-project/llm-applications โญ 1,645
A comprehensive guide to building RAG-based LLM applications for production.
casper-hansen/AutoAWQ โญ 1,542
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
๐ casper-hansen.github.io/autoawq
jina-ai/thinkgpt โญ 1,513
Agent techniques to augment your LLM and push it beyond its limits
openai/simple-evals โญ 1,449
Lightweight library for evaluating language models
cstankonrad/long_llama โญ 1,444
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
azure-samples/graphrag-accelerator โญ 1,430
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
๐ github.com/microsoft/graphrag
farizrahman4u/loopgpt โญ 1,424
Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind.
run-llama/llama-lab โญ 1,374
Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex
agentops-ai/tokencost โญ 1,334
Easy token price estimates for 400+ LLMs. TokenOps.
๐ agentops.ai
explosion/spacy-transformers โญ 1,331
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
๐ spacy.io/usage/embeddings-transformers
chatarena/chatarena โญ 1,310
ChatArena (or Chat Arena) is a set of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
bigscience-workshop/Megatron-DeepSpeed โญ 1,296
Ongoing research training transformer language models at scale, including: BERT & GPT-2
meetkai/functionary โญ 1,279
Chat language model that can use tools and interpret the results
ray-project/ray-llm โญ 1,213
RayLLM - LLMs on Ray
๐ aviary.anyscale.com
evolvinglmms-lab/lmms-eval โญ 1,199
Accelerating the development of large multimodal models (LMMs) with lmms-eval
๐ lmms-lab.github.io
srush/MiniChain โญ 1,198
A tiny library for coding with large language models.
๐ srush-minichain.hf.space
linksoul-ai/AutoAgents โญ 1,126
[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.
๐ huggingface.co/spaces/linksoul/autoagents
ibm/Dromedary โญ 1,104
Dromedary: towards helpful, ethical and reliable LLMs.
agenta-ai/agenta โญ 1,090
The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
๐ www.agenta.ai
keirp/automatic_prompt_engineer โญ 1,078
Large Language Models Are Human-Level Prompt Engineers
hao-ai-lab/LookaheadDecoding โญ 1,065
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
lupantech/chameleon-llm โญ 1,051
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
๐ chameleon-llm.github.io
explosion/spacy-llm โญ 1,041
🦙 Integrating LLMs into structured NLP pipelines
๐ spacy.io/usage/large-language-models
rlancemartin/auto-evaluator โญ 1,040
Evaluation tool for LLM QA chains
๐ autoevaluator.langchain.com
nomic-ai/pygpt4all โญ 1,026
Officially supported Python bindings for llama.cpp + gpt4all
๐ nomic-ai.github.io/pygpt4all
mlfoundations/dclm โญ 1,024
DataComp for Language Models
huggingface/nanotron โญ 1,021
Minimalistic large language model 3D-parallelism training
ctlllll/LLM-ToolMaker โญ 1,013
Large Language Models as Tool Makers
microsoft/Llama-2-Onnx โญ 1,003
A Microsoft optimized version of the Llama 2 model, available from Meta
ajndkr/lanarky โญ 965
The web framework for building LLM microservices
๐ lanarky.ajndkr.com
likejazz/llama3.np โญ 943
llama3.np is a pure NumPy implementation for Llama 3 model.
pinecone-io/canopy โญ 934
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
๐ www.pinecone.io
cerebras/modelzoo โญ 872
Examples of common deep learning models that can be trained on Cerebras hardware
muennighoff/sgpt โญ 830
SGPT: GPT Sentence Embeddings for Semantic Search
๐ arxiv.org/abs/2202.08904
topoteretes/cognee โญ 786
Deterministic LLM Outputs for AI Applications and AI Agents
๐ www.cognee.ai
oliveirabruno01/babyagi-asi โญ 773
BabyAGI: an Autonomous and Self-Improving agent, or BASI
datadreamer-dev/DataDreamer โญ 760
DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
๐ datadreamer.dev
opengenerativeai/GenossGPT โญ 748
One API for all LLMs, private or public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...). Replace OpenAI GPT with any LLM in your app with one line.
๐ genoss.ai
langchain-ai/langsmith-cookbook โญ 726
LangSmith is a platform for building production-grade LLM applications.
๐ langsmith-cookbook.vercel.app
prometheus-eval/prometheus-eval โญ 719
Evaluate your LLM's response with Prometheus and GPT4
salesforce/xgen โญ 713
Salesforce open-source LLMs with 8k sequence length.
junruxiong/IncarnaMind โญ 707
Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMs
๐ www.incarnamind.com
opengvlab/OmniQuant โญ 643
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
wandb/weave โญ 630
Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.
๐ wandb.me/weave
squeezeailab/SqueezeLLM โญ 619
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
๐ arxiv.org/abs/2306.07629
lupantech/ScienceQA โญ 580
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
zhudotexe/kani โญ 543
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
๐ kani.readthedocs.io
hazyresearch/ama_prompting โญ 535
Ask Me Anything language model prompting
vahe1994/SpQR โญ 517
Quantization algorithm and the model evaluation code for SpQR method for LLM compression
huggingface/lighteval โญ 516
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.
continuum-llms/chatgpt-memory โญ 511
Scales the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory, powered by GPT and a Redis datastore.
tsinghuadatabasegroup/DB-GPT โญ 507
LLM As Database Administrator
๐ dbgpt.dbmind.cn
hazyresearch/H3 โญ 506
Language Modeling with the H3 State Space Model
modal-labs/llm-finetuning โญ 505
Guide for fine-tuning Llama/Mistral/CodeLlama models and more
nousresearch/Hermes-Function-Calling โญ 504
Code for the Hermes Pro Large Language Model to perform function calling based on the provided schema. It allows users to query the model and retrieve information related to stock prices, company fundamentals, financial statements
judahpaul16/gpt-home โญ 383
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
๐ hub.docker.com/r/judahpaul/gpt-home
stanford-oval/suql โญ 178
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
๐ arxiv.org/abs/2311.09818
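Several entries above (GPTQ, AutoAWQ, OmniQuant, SqueezeLLM, SpQR, neural-compressor) deal with low-bit weight quantization. As a rough illustration of the basic idea only, the sketch below applies plain symmetric round-to-nearest quantization to signed 4-bit integers using the standard library. It is not the algorithm of any listed project, and the names `quantize_4bit` and `dequantize` are hypothetical helpers introduced for this example.

```python
def quantize_4bit(weights):
    """Symmetric round-to-nearest quantization of floats to signed 4-bit codes.

    Illustrative baseline only; AWQ, GPTQ and similar methods are more elaborate.
    """
    max_abs = max(abs(w) for w in weights)
    # Signed 4-bit integers cover -8..7; dividing by 7 keeps the range symmetric.
    scale = max_abs / 7 if max_abs > 0 else 1.0
    # Round each weight to the nearest code and clamp to the 4-bit range.
    codes = [max(-8, min(7, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float weights."""
    return [c * scale for c in codes]


weights = [0.7, -0.3, 0.1, 0.0]
codes, scale = quantize_4bit(weights)
restored = dequantize(codes, scale)
# Each restored weight differs from the original by at most half a step.
max_error = max(abs(r - w) for r, w in zip(restored, weights))
```

Storing the integer codes plus one scale per group of weights is what reduces the memory footprint; the listed projects differ mainly in how they choose scales and which weights they protect from rounding error.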
Mathematical, numerical and scientific libraries.
numpy/numpy โญ 27,305
The fundamental package for scientific computing with Python.
๐ numpy.org
taichi-dev/taichi โญ 25,179
Productive, portable, and performant GPU programming in Python: Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation.
๐ taichi-lang.org
scipy/scipy โญ 12,806
SciPy library main repository
๐ scipy.org
sympy/sympy โญ 12,680
A computer algebra system written in pure Python
๐ sympy.org
google/or-tools โญ 10,882
Google Optimization Tools (a.k.a., OR-Tools) is an open-source, fast and portable software suite for solving combinatorial optimization problems.
๐ developers.google.com/optimization
z3prover/z3 โญ 10,052
Z3 is a theorem prover from Microsoft Research with a Python language binding.
google-deepmind/alphageometry โญ 3,970
Solving Olympiad Geometry without Human Demonstrations
mikedh/trimesh โญ 2,895
Python library for loading and using triangular meshes.
๐ trimesh.org
mckinsey/causalnex โญ 2,202
A Python library that helps data scientists to infer causation rather than observing correlation.
๐ causalnex.readthedocs.io
pyomo/pyomo โญ 1,939
An object-oriented algebraic modeling language in Python for structured optimization problems.
๐ www.pyomo.org
facebookresearch/theseus โญ 1,690
A library for differentiable nonlinear optimization
google-research/torchsde โญ 1,539
Differentiable SDE solvers with GPU support and efficient sensitivity analysis.
dynamicslab/pysindy โญ 1,383
A package for the sparse identification of nonlinear dynamical systems from data
๐ pysindy.readthedocs.io/en/latest
geomstats/geomstats โญ 1,187
Computations and statistics on manifolds with geometric structures.
๐ geomstats.ai
cma-es/pycma โญ 1,064
pycma is a Python implementation of CMA-ES and a few related numerical optimization tools.
sj001/AI-Feynman โญ 616
Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression
willianfuks/tfcausalimpact โญ 600
Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.
lean-dojo/LeanDojo โญ 530
Tool for data extraction and interacting with Lean programmatically.
๐ leandojo.org
brandondube/prysm โญ 249
Prysm is an open-source library for physical and first-order modeling of optical systems and analysis of related data: numerical and physical optics, integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing.
๐ prysm.readthedocs.io/en/stable
lean-dojo/ReProver โญ 201
Retrieval-Augmented Theorem Provers for Lean
๐ leandojo.org
albahnsen/pycircular โญ 92
pycircular is a Python module for circular data analysis
gbillotey/Fractalshades โญ 27
Arbitrary-precision fractal explorer - Python package
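Circular data, the subject of the pycircular entry above, needs different statistics from ordinary numbers because angles wrap around. The sketch below is a standard-library illustration of the circular mean, not pycircular's API; `circular_mean_degrees` is a hypothetical helper name.

```python
import math


def circular_mean_degrees(angles):
    """Mean direction of angles in degrees, normalized to the 0-360 range."""
    # Average the unit vectors rather than the raw angle values.
    sin_sum = sum(math.sin(math.radians(a)) for a in angles)
    cos_sum = sum(math.cos(math.radians(a)) for a in angles)
    return math.degrees(math.atan2(sin_sum, cos_sum)) % 360.0


angles = [350.0, 10.0]
# The arithmetic mean is 180.0, which points the opposite way.
arithmetic = sum(angles) / len(angles)
# The circular mean lands next to 0 degrees (equivalently 360).
circular = circular_mean_degrees(angles)
```

The same approach applies to any periodic quantity, such as hour of day or day of week, after mapping it onto a circle.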
General and classical machine learning libraries. See below for other sections covering specialised ML areas.
scikit-learn/scikit-learn โญ 59,148
scikit-learn: machine learning in Python
๐ scikit-learn.org
openai/openai-cookbook โญ 58,036
Examples and guides for using the OpenAI API
๐ cookbook.openai.com
tencentarc/GFPGAN โญ 35,263
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
suno-ai/bark โญ 34,153
Text-Prompted Generative Audio Model
google-research/google-research โญ 33,588
This repository contains code released by Google Research
๐ research.google
facebookresearch/faiss โญ 29,909
A library for efficient similarity search and clustering of dense vectors.
๐ faiss.ai
google/jax โญ 29,440
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
๐ jax.readthedocs.io
open-mmlab/mmdetection โญ 28,801
OpenMMLab Detection Toolbox and Benchmark
๐ mmdetection.readthedocs.io
ageron/handson-ml2 โญ 27,516
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
lutzroeder/netron โญ 27,115
Visualizer for neural network, deep learning and machine learning models
๐ netron.app
google/mediapipe โญ 26,467
Cross-platform, customizable ML solutions for live and streaming media.
๐ mediapipe.dev
dmlc/xgboost โญ 25,951
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
๐ xgboost.readthedocs.io/en/stable
harisiqbal88/PlotNeuralNet โญ 21,692
Latex code for making neural networks diagrams
jina-ai/jina โญ 20,737
☁️ Build multimodal AI applications with cloud-native stack
๐ docs.jina.ai
roboflow/supervision โญ 18,298
We write your reusable computer vision tools.
๐ supervision.roboflow.com
onnx/onnx โญ 17,422
Open standard for machine learning interoperability
๐ onnx.ai
microsoft/LightGBM โญ 16,449
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
๐ lightgbm.readthedocs.io/en/latest
ml-explore/mlx โญ 16,118
MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
๐ ml-explore.github.io/mlx
tensorflow/tensor2tensor โญ 15,230
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
ddbourgin/numpy-ml โญ 15,129
Machine learning, in numpy
๐ numpy-ml.readthedocs.io
aleju/imgaug โญ 14,301
Image augmentation for machine learning experiments.
๐ imgaug.readthedocs.io
microsoft/nni โญ 13,929
An open source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
๐ nni.readthedocs.io
microsoft/onnxruntime โญ 13,821
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
๐ onnxruntime.ai
jindongwang/transferlearning โญ 13,165
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, code, datasets, applications, tutorials.
๐ transferlearning.xyz
deepmind/deepmind-research โญ 13,006
This repository contains implementations and illustrative code to accompany DeepMind publications
spotify/annoy โญ 13,005
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
neonbjb/tortoise-tts โญ 12,648
A multi-voice TTS system trained with an emphasis on quality
deepmind/alphafold โญ 12,192
Implementation of the inference pipeline of AlphaFold v2
ggerganov/ggml โญ 10,525
Tensor library for machine learning
facebookresearch/AnimatedDrawings โญ 10,404
Code to accompany "A Method for Animating Children's Drawings of the Human Figure"
optuna/optuna โญ 10,337
A hyperparameter optimization framework
๐ optuna.org
twitter/the-algorithm-ml โญ 9,990
Source code for Twitter's Recommendation Algorithm
๐ blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm
statsmodels/statsmodels โญ 9,884
Statsmodels: statistical modeling and econometrics in Python
๐ www.statsmodels.org/devel
epistasislab/tpot โญ 9,626
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
๐ epistasislab.github.io/tpot
megvii-basedetection/YOLOX โญ 9,232
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
cleanlab/cleanlab โญ 9,229
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
๐ cleanlab.ai
wandb/wandb โญ 8,745
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
๐ wandb.ai
pycaret/pycaret โญ 8,733
An open-source, low-code machine learning library in Python
๐ www.pycaret.org
pymc-devs/pymc โญ 8,551
Bayesian Modeling and Probabilistic Programming in Python
๐ docs.pymc.io
uberi/speech_recognition โญ 8,238
Speech recognition module for Python, supporting several engines and APIs, online and offline.
๐ pypi.python.org/pypi/speechrecognition
facebookresearch/xformers โญ 8,177
Hackable and optimized Transformers building blocks, supporting a composable construction.
๐ facebookresearch.github.io/xformers
catboost/catboost โญ 7,952
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
๐ catboost.ai
open-mmlab/mmsegmentation โญ 7,849
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
๐ mmsegmentation.readthedocs.io/en/main
awslabs/autogluon โญ 7,517
Fast and Accurate ML in 3 Lines of Code
๐ auto.gluon.ai
automl/auto-sklearn โญ 7,512
Automated Machine Learning with scikit-learn
๐ automl.github.io/auto-sklearn
huggingface/accelerate โญ 7,486
A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
๐ huggingface.co/docs/accelerate
lmcinnes/umap โญ 7,282
Uniform Manifold Approximation and Projection
hyperopt/hyperopt โญ 7,175
Distributed Asynchronous Hyperparameter Optimization in Python
๐ hyperopt.github.io/hyperopt
featurelabs/featuretools โญ 7,152
An open source python library for automated feature engineering
๐ www.featuretools.com
py-why/dowhy โญ 6,945
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
๐ www.pywhy.org/dowhy
hips/autograd โญ 6,888
Efficiently computes derivatives of numpy code.
open-mmlab/mmagic โญ 6,795
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
๐ mmagic.readthedocs.io/en/latest
scikit-learn-contrib/imbalanced-learn โญ 6,772
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
๐ imbalanced-learn.org
probml/pyprobml โญ 6,402
Python code for "Probabilistic Machine learning" book by Kevin Murphy
nicolashug/Surprise โญ 6,301
A Python scikit for building and analyzing recommender systems
๐ surpriselib.com
cleverhans-lab/cleverhans โญ 6,137
An adversarial example library for constructing attacks, building defenses, and benchmarking both
kevinmusgrave/pytorch-metric-learning โญ 5,908
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
๐ kevinmusgrave.github.io/pytorch-metric-learning
open-mmlab/mmcv โญ 5,783
OpenMMLab Computer Vision Foundation
๐ mmcv.readthedocs.io/en/latest
ml-explore/mlx-examples โญ 5,662
Examples in the MLX framework
project-monai/MONAI โญ 5,592
AI Toolkit for Healthcare Imaging
๐ monai.io
thudm/CogVideo โญ 5,294
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
mdbloice/Augmentor โญ 5,049
Image augmentation library in Python for machine learning.
๐ augmentor.readthedocs.io/en/stable
online-ml/river โญ 4,927
Online machine learning in Python
๐ riverml.xyz
uber/causalml โญ 4,926
Uplift modeling and causal inference with machine learning algorithms
rasbt/mlxtend โญ 4,843
A library of extension and helper modules for Python's data analysis and machine learning libraries.
๐ rasbt.github.io/mlxtend
google-gemini/cookbook โญ 4,483
A collection of guides and examples for the Gemini API, including quickstart tutorials for writing prompts.
๐ ai.google.dev/gemini-api/docs
google-deepmind/graphcast โญ 4,462
GraphCast: Learning skillful medium-range global weather forecasting
marqo-ai/marqo โญ 4,393
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
๐ www.marqo.ai
lucidrains/deep-daze โญ 4,376
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
skvark/opencv-python โญ 4,372
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
๐ pypi.org/project/opencv-python
sanchit-gandhi/whisper-jax โญ 4,300
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
apple/coremltools โญ 4,277
Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
๐ coremltools.readme.io
districtdatalabs/yellowbrick โญ 4,253
Visual analysis and diagnostic tools to facilitate machine learning model selection.
๐ www.scikit-yb.org
nmslib/hnswlib โญ 4,203
Header-only C++/python library for fast approximate nearest neighbors
๐ github.com/nmslib/hnswlib
nv-tlabs/GET3D โญ 4,169
Generative Model of High Quality 3D Textured Shapes Learned from Images
cmusphinx/pocketsphinx โญ 3,863
A small speech recognizer
microsoft/FLAML โญ 3,802
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
๐ microsoft.github.io/flaml
ourownstory/neural_prophet โญ 3,770
NeuralProphet: A simple forecasting package
๐ neuralprophet.com
py-why/EconML โญ 3,700
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
๐ www.microsoft.com/en-us/research/project/alice
huggingface/autotrain-advanced โญ 3,695
AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
๐ huggingface.co/autotrain
huggingface/notebooks โญ 3,503
Notebooks using the Hugging Face libraries 🤗
zjunlp/DeepKE โญ 3,306
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
๐ deepke.zjukg.cn
rucaibox/RecBole โญ 3,297
A unified, comprehensive and efficient recommendation library
๐ recbole.io
yoheinakajima/instagraph โญ 3,262
Converts text input or URL into knowledge graph and displays
facebookresearch/vissl โญ 3,244
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
๐ vissl.ai
pytorch/glow โญ 3,193
Compiler for Neural Network hardware accelerators
lucidrains/musiclm-pytorch โญ 3,111
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
hrnet/HRNet-Semantic-Segmentation โญ 3,107
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
mljar/mljar-supervised โญ 2,986
Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
๐ mljar.com
lightly-ai/lightly โญ 2,877
A python library for self-supervised learning on images.
๐ docs.lightly.ai/self-supervised-learning
shankarpandala/lazypredict โญ 2,819
Lazy Predict helps build a lot of basic models without much code and helps understand which models work better without any parameter tuning
scikit-learn-contrib/hdbscan โญ 2,741
A high performance implementation of HDBSCAN clustering.
๐ hdbscan.readthedocs.io/en/latest
scikit-optimize/scikit-optimize โญ 2,737
Sequential model-based optimization with a scipy.optimize interface
๐ scikit-optimize.github.io
huggingface/safetensors โญ 2,679
Implements a new simple format for storing tensors safely (as opposed to pickle) and that is still fast (zero-copy).
๐ huggingface.co/docs/safetensors
google-research/t5x โญ 2,602
T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales.
apple/ml-ane-transformers โญ 2,530
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
freedmand/semantra โญ 2,456
Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text.
scikit-learn-contrib/category_encoders โญ 2,392
A library of sklearn compatible categorical variable encoders
๐ contrib.scikit-learn.org/category_encoders
huggingface/optimum โญ 2,377
Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
๐ huggingface.co/docs/optimum/main
rom1504/clip-retrieval โญ 2,305
Easily compute clip embeddings and build a clip retrieval system with them
๐ rom1504.github.io/clip-retrieval
aws/sagemaker-python-sdk โญ 2,082
A library for training and deploying machine learning models on Amazon SageMaker
๐ sagemaker.readthedocs.io
eric-mitchell/direct-preference-optimization โญ 1,940
Reference implementation for DPO (Direct Preference Optimization)
huggingface/evaluate โญ 1,925
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
๐ huggingface.co/docs/evaluate
huggingface/huggingface_hub โญ 1,903
The official Python client for the Hugging Face Hub.
๐ huggingface.co/docs/huggingface_hub
rentruewang/koila โญ 1,816
Prevent PyTorch's CUDA error: out of memory in just 1 line of code.
๐ rentruewang.com/koila
contextlab/hypertools โญ 1,812
A Python toolbox for gaining geometric insights into high-dimensional data
๐ hypertools.readthedocs.io/en/latest
linkedin/greykite โญ 1,804
A flexible, intuitive and fast forecasting library
bmabey/pyLDAvis โญ 1,801
Python library for interactive topic model visualization. Port of the R LDAvis package.
scikit-learn-contrib/lightning โญ 1,719
Large-scale linear classification, regression and ranking in Python
๐ contrib.scikit-learn.org/lightning
tensorflow/addons โญ 1,688
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons
microsoft/i-Code โญ 1,658
The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning.
castorini/pyserini โญ 1,587
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
๐ pyserini.io
googlecloudplatform/vertex-ai-samples โญ 1,579
Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative AI workflows using Google Cloud Vertex AI.
๐ cloud.google.com/vertex-ai
visual-layer/fastdup โญ 1,547
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
laekov/fastmoe โญ 1,498
A fast MoE impl for PyTorch
๐ fastmoe.ai
microsoft/Olive โญ 1,476
Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.
๐ microsoft.github.io/olive
kubeflow/katib โญ 1,472
Automated Machine Learning on Kubernetes
๐ www.kubeflow.org/docs/components/katib
jina-ai/finetuner โญ 1,455
Task-oriented embedding tuning for BERT, CLIP, etc.
๐ finetuner.jina.ai
csinva/imodels โญ 1,351
Interpretable ML package for concise, transparent, and accurate predictive modeling (sklearn-compatible).
๐ csinva.io/imodels
microsoft/Semi-supervised-learning โญ 1,287
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
๐ usb.readthedocs.io
borealisai/advertorch โญ 1,285
A Toolbox for Adversarial Robustness Research
patchy631/machine-learning โญ 1,267
Machine Learning Tutorials Repository
spotify/voyager โญ 1,249
🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
๐ spotify.github.io/voyager
awslabs/dgl-ke โญ 1,249
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
๐ dglke.dgl.ai/doc
qdrant/fastembed โญ 1,240
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
๐ qdrant.github.io/fastembed
koaning/scikit-lego โญ 1,228
Extra blocks for scikit-learn pipelines.
๐ koaning.github.io/scikit-lego
google/vizier โญ 1,201
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
๐ oss-vizier.readthedocs.io
automl/TabPFN โญ 1,153
Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
๐ priorlabs.ai
pytorch/FBGEMM โญ 1,146
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
lightning-ai/lightning-thunder โญ 1,104
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
google-research/deeplab2 โญ 992
DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.
davidmrau/mixture-of-experts โญ 918
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
lmcinnes/pynndescent โญ 875
A Python implementation of nearest neighbor descent for approximate nearest-neighbor search.
hazyresearch/safari โญ 858
Convolutions for Sequence Modeling
oml-team/open-metric-learning โญ 843
OML is a PyTorch-based framework to train and validate models that produce high-quality embeddings.
๐ open-metric-learning.readthedocs.io/en/latest/index.html
opentensor/bittensor โญ 842
Internet-scale Neural Networks
๐ www.bittensor.com
nvidia/cuda-python โญ 838
CUDA Python Low-level Bindings
๐ nvidia.github.io/cuda-python
criteo/autofaiss โญ 783
Automatically create Faiss knn indices with optimal similarity search parameters.
๐ criteo.github.io/autofaiss
huggingface/optimum-quanto โญ 713
A pytorch quantization backend for optimum
replicate/replicate-python โญ 705
Python client for Replicate
๐ replicate.com
awslabs/python-deequ โญ 685
Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets
facebookresearch/balance โญ 678
The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
๐ import-balance.org
nicolas-hbt/pygraft โญ 659
Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
๐ pygraft.readthedocs.io/en/latest
qdrant/quaterion โญ 630
Blazing fast framework for fine-tuning similarity learning models
๐ quaterion.qdrant.tech
hpcaitech/EnergonAI โญ 629
Large-scale model inference.
pymc-labs/pymc-marketing โญ 622
Bayesian marketing toolbox in PyMC. Media Mix (MMM), customer lifetime value (CLV), buy-till-you-die (BTYD) models and more.
๐ www.pymc-marketing.io
googleapis/python-aiplatform โญ 598
A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning.
huggingface/exporters โญ 595
Export Hugging Face models to Core ML and TensorFlow Lite
microsoft/Focal-Transformer โญ 543
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
cvxgrp/pymde โญ 531
Minimum-distortion embedding with PyTorch
๐ pymde.org
nevronai/MetisFL โญ 530
The first open Federated Learning framework implemented in C++ and Python.
๐ metisfl.org
mrdbourke/m1-machine-learning-test โญ 516
Code for testing various M1 Chip benchmarks with TensorFlow.
linkedin/FastTreeSHAP โญ 504
Fast SHAP value computation for interpreting tree-based models
intellabs/bayesian-torch โญ 503
A library for Bayesian neural network layers and uncertainty estimation in Deep Learning extending the core of PyTorch
lightning-ai/litdata โญ 281
Transform datasets at scale. Optimize datasets for fast AI model training.
dylanhogg/gptauthor โญ 54
GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.
Machine learning libraries that cross over with deep learning in some way.
tensorflow/tensorflow โญ 184,632
An Open Source Machine Learning Framework for Everyone
๐ tensorflow.org
pytorch/pytorch โญ 81,182
Tensors and Dynamic neural networks in Python with strong GPU acceleration
๐ pytorch.org
openai/whisper โญ 65,802
Robust Speech Recognition via Large-Scale Weak Supervision
keras-team/keras โญ 61,447
Deep Learning for humans
๐ keras.io
deepfakes/faceswap โญ 50,041
Deepfakes Software For All
๐ www.faceswap.dev
iperov/DeepFaceLab โญ 46,547
DeepFaceLab is the leading software for creating deepfakes.
facebookresearch/segment-anything โญ 46,137
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
microsoft/DeepSpeed โญ 34,254
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
๐ www.deepspeed.ai
rwightman/pytorch-image-models โญ 31,059
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
๐ huggingface.co/docs/timm
facebookresearch/detectron2 โญ 29,682
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
๐ detectron2.readthedocs.io/en/latest
lightning-ai/pytorch-lightning โญ 27,709
The deep learning framework to pretrain, finetune and deploy AI models. PyTorch Lightning is just organized PyTorch - Lightning disentangles PyTorch code to decouple the science from the engineering.
๐ lightning.ai
xinntao/Real-ESRGAN โญ 27,295
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
facebookresearch/Detectron โญ 26,227
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
google-research/tuning_playbook โญ 26,112
A playbook for systematically maximizing the performance of deep learning models.
matterport/Mask_RCNN โญ 24,464
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
openai/CLIP โญ 24,117
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
paddlepaddle/Paddle โญ 21,963
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)
๐ www.paddlepaddle.org
pyg-team/pytorch_geometric โญ 20,760
Graph Neural Network Library for PyTorch
๐ pyg.org
apache/mxnet โญ 20,740
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
๐ mxnet.apache.org
lucidrains/vit-pytorch โญ 19,239
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
sanster/IOPaint โญ 18,447
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
๐ www.iopaint.com
rasbt/deeplearning-models โญ 16,511
A collection of various deep learning architectures, models, and tips
danielgatis/rembg โญ 15,796
Rembg is a tool to remove image backgrounds.
albumentations-team/albumentations โญ 13,872
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
๐ albumentations.ai
microsoft/Swin-Transformer โญ 13,443
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
๐ arxiv.org/abs/2103.14030
dmlc/dgl โญ 13,283
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
๐ dgl.ai
facebookresearch/detr โญ 13,229
End-to-End Object Detection with Transformers
nvidia/DeepLearningExamples โญ 13,101
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
kornia/kornia โญ 9,692
Geometric Computer Vision Library for Spatial AI
๐ kornia.readthedocs.io
mlfoundations/open_clip โญ 9,491
An open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training)
keras-team/autokeras โญ 9,103
AutoML library for deep learning
๐ autokeras.com
modelscope/facechain โญ 8,799
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
facebookresearch/pytorch3d โญ 8,570
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
๐ pytorch3d.org
pyro-ppl/pyro โญ 8,465
Deep universal probabilistic programming with Python and PyTorch
๐ pyro.ai
arogozhnikov/einops โญ 8,263
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
๐ einops.rocks
nvidia/apex โญ 8,238
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
facebookresearch/ImageBind โญ 8,146
ImageBind: One Embedding Space to Bind Them All
google/trax โญ 8,024
Trax: Deep Learning with Clear Code and Speed
lucidrains/imagen-pytorch โญ 7,937
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
xpixelgroup/BasicSR โญ 6,584
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also supports StyleGAN2 and DFDNet.
๐ basicsr.readthedocs.io/en/latest
google/flax โญ 5,871
Flax is a neural network library for JAX that is designed for flexibility.
๐ flax.readthedocs.io
skorch-dev/skorch โญ 5,725
A scikit-learn compatible neural network library that wraps PyTorch
facebookresearch/mmf โญ 5,464
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
๐ mmf.sh
mosaicml/composer โญ 5,096
Supercharge Your Model Training
๐ docs.mosaicml.com
facebookincubator/AITemplate โญ 4,507
AITemplate is a Python framework which renders neural networks into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
pytorch/ignite โญ 4,505
High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
๐ pytorch-ignite.ai
deci-ai/super-gradients โญ 4,488
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
๐ www.supergradients.com
nvidiagameworks/kaolin โญ 4,374
A PyTorch Library for Accelerating 3D Deep Learning Research
williamyang1991/VToonify โญ 3,520
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
facebookresearch/PyTorch-BigGraph โญ 3,362
Generate embeddings from large-scale graph-structured data.
๐ torchbiggraph.readthedocs.io
cvg/LightGlue โญ 3,224
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
google-research/scenic โญ 3,189
Scenic: A Jax Library for Computer Vision Research and Beyond
alpa-projects/alpa โญ 3,034
Training and serving large-scale neural networks with auto parallelization.
๐ alpa.ai
pytorch/botorch โญ 3,028
Bayesian optimization in PyTorch
๐ botorch.org
deepmind/dm-haiku โญ 2,850
JAX-based neural network library
๐ dm-haiku.readthedocs.io
explosion/thinc โญ 2,811
A refreshing functional take on deep learning, compatible with your favorite libraries
๐ thinc.ai
nerdyrodent/VQGAN-CLIP โญ 2,596
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
danielegrattarola/spektral โญ 2,357
Graph Neural Networks with Keras and Tensorflow 2.
๐ graphneural.network
google-research/electra โญ 2,317
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
neuralmagic/sparseml โญ 2,026
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
fepegar/torchio โญ 2,026
Medical imaging toolkit for deep learning
๐ www.torchio.org
pytorch/torchrec โญ 1,835
Pytorch domain library for recommendation systems
tensorflow/mesh โญ 1,573
Mesh TensorFlow: Model Parallelism Made Easier
vt-vl-lab/FGVC โญ 1,549
[ECCV 2020] Flow-edge Guided Video Completion
tensorly/tensorly โญ 1,528
TensorLy: Tensor Learning in Python.
๐ tensorly.org
calculatedcontent/WeightWatcher โญ 1,424
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
hysts/pytorch_image_classification โญ 1,338
PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet
jeshraghian/snntorch โญ 1,214
Deep and online learning with spiking neural networks in Python
๐ snntorch.readthedocs.io/en/latest
xl0/lovely-tensors โญ 1,080
Tensors, for human consumption
๐ xl0.github.io/lovely-tensors
tensorflow/similarity โญ 1,007
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
keras-team/keras-cv โญ 987
Industry-strength Computer Vision workflows with Keras
deepmind/android_env โญ 983
RL research on Android devices.
deepmind/chex โญ 752
Chex is a library of utilities for helping to write reliable JAX code
๐ chex.readthedocs.io
kakaobrain/rq-vae-transformer โญ 741
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
mlfoundations/datacomp โญ 626
DataComp: In search of the next generation of multimodal datasets
๐ datacomp.ai
whitead/dmol-book โญ 602
Deep learning for molecules and materials book
๐ dmol.pub
Machine learning interpretability libraries. Covers explainability, prediction explanations, dashboards, understanding knowledge development in training.
slundberg/shap โญ 22,280
A game theoretic approach to explain the output of any machine learning model.
๐ shap.readthedocs.io
marcotcr/lime โญ 11,451
Lime: Explaining the predictions of any machine learning classifier
interpretml/interpret โญ 6,162
Fit interpretable models. Explain blackbox machine learning.
๐ interpret.ml/docs
pytorch/captum โญ 4,748
Model interpretability and understanding for PyTorch
๐ captum.ai
tensorflow/lucid โญ 4,640
A collection of infrastructure and tools for research in neural network interpretability.
pair-code/lit โญ 3,447
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
๐ pair-code.github.io/lit
arize-ai/phoenix โญ 3,242
AI Observability & Evaluation
๐ docs.arize.com/phoenix
teamhg-memex/eli5 โญ 2,750
A library for debugging/inspecting machine learning classifiers and explaining their predictions
๐ eli5.readthedocs.io
maif/shapash โญ 2,701
Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
๐ maif.github.io/shapash
seldonio/alibi โญ 2,356
Algorithms for explaining machine learning models
๐ docs.seldon.io/projects/alibi/en/stable
oegedijk/explainerdashboard โญ 2,276
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
๐ explainerdashboard.readthedocs.io
eleutherai/pythia โญ 2,173
Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers
jalammar/ecco โญ 1,948
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
๐ ecco.readthedocs.io
google-deepmind/penzai โญ 1,601
A JAX library for writing models as legible, functional pytree data structures, along with tools for visualizing, modifying, and analyzing them. Penzai focuses on making it easy to do stuff with models after they have been trained
๐ penzai.readthedocs.io
trusted-ai/AIX360 โญ 1,579
Interpretability and explainability of data and machine learning models
๐ aix360.res.ibm.com
cdpierse/transformers-interpret โญ 1,242
Model explainability that works seamlessly with transformers. Explain your transformers model in just 2 lines of code.
ethicalml/xai โญ 1,088
XAI is a Machine Learning library that is designed with AI explainability at its core. XAI contains various tools that enable analysis and evaluation of data and models
๐ ethical.institute/principles.html#commitment-3
selfexplainml/PiML-Toolbox โญ 916
PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
๐ selfexplainml.github.io/piml-toolbox
salesforce/OmniXAI โญ 853
OmniXAI: A Library for eXplainable AI
andyzoujm/representation-engineering โญ 668
Representation Engineering: A Top-Down Approach to AI Transparency
๐ www.ai-transparency.org
MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.
apache/airflow โญ 35,799
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
๐ airflow.apache.org
ray-project/ray โญ 32,591
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
๐ ray.io
mlflow/mlflow โญ 18,118
Open source platform for the machine learning lifecycle
๐ mlflow.org
spotify/luigi โญ 17,604
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
prefecthq/prefect โญ 15,590
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
๐ prefect.io
horovod/horovod โญ 14,095
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
๐ horovod.ai
iterative/dvc โญ 13,496
ML Experiments and Data Management with Git
๐ dvc.org
ludwig-ai/ludwig โญ 11,035
Low-code framework for building custom LLMs, neural networks, and other AI models
๐ ludwig.ai
dagster-io/dagster โญ 10,942
An orchestration platform for the development, production, and observation of data assets.
๐ dagster.io
great-expectations/great_expectations โญ 9,739
Always know what to expect from your data.
๐ docs.greatexpectations.io
kedro-org/kedro โญ 9,582
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
๐ kedro.org
bentoml/OpenLLM โญ 9,548
Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.
๐ bentoml.com
dbt-labs/dbt-core โญ 9,438
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
๐ getdbt.com
huggingface/text-generation-inference โญ 8,558
A Rust, Python and gRPC server for text generation inference. Used in production at HuggingFace to power Hugging Chat, the Inference API and Inference Endpoint.
๐ hf.co/docs/text-generation-inference
activeloopai/deeplake โญ 7,968
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
๐ activeloop.ai
netflix/metaflow โญ 7,923
Build and manage real-life ML, AI, and data science projects with ease!
๐ metaflow.org
mage-ai/mage-ai โญ 7,570
Build, run, and manage data pipelines for integrating and transforming data.
๐ www.mage.ai
kestra-io/kestra โญ 7,321
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
๐ kestra.io
bentoml/BentoML โญ 6,886
The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!
๐ bentoml.com
allegroai/clearml โญ 5,504
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
๐ clear.ml/docs
feast-dev/feast โญ 5,423
The Open Source Feature Store for Machine Learning
๐ feast.dev
flyteorg/flyte โญ 5,322
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
๐ flyte.org
langfuse/langfuse โญ 5,126
Open source LLM engineering platform: Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. YC W23
๐ langfuse.com/docs
aimhubio/aim โญ 5,069
Aim: An easy-to-use & supercharged open-source experiment tracker.
๐ aimstack.io
evidentlyai/evidently โญ 4,980
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
๐ www.evidentlyai.com/evidently-oss
adap/flower โญ 4,699
Flower: A Friendly Federated Learning Framework
๐ flower.ai
orchest/orchest โญ 4,037
Build data pipelines, the easy way
๐ orchest.readthedocs.io/en/stable
zenml-io/zenml โญ 3,875
ZenML: The bridge between ML and Ops. https://zenml.io.
๐ zenml.io
internlm/lmdeploy โญ 3,805
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
๐ lmdeploy.readthedocs.io/en/latest
internlm/xtuner โญ 3,545
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
๐ xtuner.readthedocs.io/zh-cn/latest
polyaxon/polyaxon โญ 3,533
MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
๐ polyaxon.com
kubeflow/pipelines โญ 3,530
Machine Learning Pipelines for Kubeflow
๐ www.kubeflow.org/docs/components/pipelines
ploomber/ploomber โญ 3,458
The fastest way to build data pipelines. Develop iteratively, deploy anywhere.
๐ docs.ploomber.io
towhee-io/towhee โญ 3,112
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
๐ towhee.io
determined-ai/determined โญ 2,958
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
๐ determined.ai
leptonai/leptonai โญ 2,608
A Pythonic framework to simplify AI service building
๐ lepton.ai
meltano/meltano โญ 1,736
Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
๐ meltano.com
dagworks-inc/hamilton โญ 1,655
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
๐ hamilton.dagworks.io/en/latest
azure/PyRIT โญ 1,626
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.
hi-primus/optimus โญ 1,466
Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
๐ hi-optimus.com
kubeflow/examples โญ 1,389
A repository to host extended examples and tutorials
dstackai/dstack โญ 1,265
A lightweight alternative to Kubernetes for AI, simplifying container orchestration on any cloud or on-premises and accelerating AI development, training, and deployment.
๐ dstack.ai/docs
dagworks-inc/burr โญ 1,010
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
๐ burr.dagworks.io
nccr-itmo/FEDOT โญ 627
Automated modeling and machine learning framework FEDOT
๐ fedot.readthedocs.io
Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environments, RLHF.
openai/gym โญ 34,357
A toolkit for developing and comparing reinforcement learning algorithms.
๐ www.gymlibrary.dev
unity-technologies/ml-agents โญ 16,750
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
๐ unity.com/products/machine-learning-agents
openai/baselines โญ 15,564
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
google/dopamine โญ 10,429
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
๐ github.com/google/dopamine
thu-ml/tianshou โญ 7,683
An elegant PyTorch deep reinforcement learning library.
๐ tianshou.org
lucidrains/PaLM-rlhf-pytorch โญ 7,662
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
tensorlayer/TensorLayer โญ 7,312
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
๐ tensorlayerx.com
farama-foundation/Gymnasium โญ 6,506
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
๐ gymnasium.farama.org
keras-rl/keras-rl โญ 5,507
Deep Reinforcement Learning for Keras.
๐ keras-rl.readthedocs.io
deepmind/dm_control โญ 3,680
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
ai4finance-foundation/ElegantRL โญ 3,590
Massively Parallel Deep Reinforcement Learning.
๐ ai4finance.org
facebookresearch/ReAgent โญ 3,547
A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
๐ reagent.ai
deepmind/acme โญ 3,449
A library of reinforcement learning components and agents
eureka-research/Eureka โญ 2,756
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
๐ eureka-research.github.io
pettingzoo-team/PettingZoo โญ 2,499
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
๐ pettingzoo.farama.org
kzl/decision-transformer โญ 2,294
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
pytorch/rl โญ 2,123
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
๐ pytorch.org/rl
anthropics/hh-rlhf โญ 1,533
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
๐ arxiv.org/abs/2204.05862
humancompatibleai/imitation โญ 1,235
Clean PyTorch implementations of imitation and reward learning algorithms
๐ imitation.readthedocs.io
arise-initiative/robosuite โญ 1,198
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
๐ robosuite.ai
denys88/rl_games โญ 826
RL Games: High performance RL library
Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.
huggingface/transformers โญ 130,399
Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
๐ huggingface.co/transformers
pytorch/fairseq โญ 29,976
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
explosion/spaCy โญ 29,462
Industrial-strength Natural Language Processing (NLP) in Python
๐ spacy.io
myshell-ai/OpenVoice โญ 27,880
Instant voice cloning by MyShell.
๐ research.myshell.ai/open-voice
microsoft/unilm โญ 19,343
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
๐ aka.ms/generalai
huggingface/datasets โญ 18,856
The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
๐ huggingface.co/docs/datasets
rare-technologies/gensim โญ 15,502
Topic Modelling for Humans
๐ radimrehurek.com/gensim
vikparuchuri/marker โญ 15,331
Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.
๐ www.datalab.to
ukplab/sentence-transformers โญ 14,624
Multilingual Sentence & Image Embeddings with BERT
๐ www.sbert.net
gunthercox/ChatterBot โญ 13,985
ChatterBot is a machine learning, conversational dialog engine for creating chat bots
๐ chatterbot.readthedocs.io
flairnlp/flair โญ 13,775
A very simple framework for state-of-the-art Natural Language Processing (NLP)
๐ flairnlp.github.io/flair
nltk/nltk โญ 13,331
NLTK Source
๐ www.nltk.org
jina-ai/clip-as-service โญ 12,326
Scalable embedding, reasoning, ranking for images and sentences with CLIP
๐ clip-as-service.jina.ai
allenai/allennlp โญ 11,727
An open-source NLP research library, built on PyTorch.
๐ www.allennlp.org
openai/tiktoken โญ 11,450
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
nvidia/NeMo โญ 11,201
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
๐ docs.nvidia.com/nemo-framework/user-guide/latest/overview.html
facebookresearch/seamless_communication โญ 10,647
Foundational Models for State-of-the-Art Speech and Text Translation
facebookresearch/ParlAI โญ 10,457
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
๐ parl.ai
m-bain/whisperX โญ 10,430
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
google/sentencepiece โญ 9,940
Unsupervised text tokenizer for Neural Network-based text generation.
vikparuchuri/surya โญ 9,448
OCR, layout analysis, reading order, line detection in 90+ languages
๐ www.datalab.to
doccano/doccano โญ 9,310
Open source annotation tool for machine learning practitioners.
sloria/TextBlob โญ 9,058
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
๐ textblob.readthedocs.io
togethercomputer/OpenChatKit โญ 9,011
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots
clips/pattern โญ 8,718
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
๐ github.com/clips/pattern/wiki
facebookresearch/nougat โญ 8,595
Implementation of Nougat: Neural Optical Understanding for Academic Documents
๐ facebookresearch.github.io/nougat
neuml/txtai โญ 8,447
All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
๐ neuml.github.io/txtai
speechbrain/speechbrain โญ 8,374
A PyTorch-based Speech Toolkit
๐ speechbrain.github.io
espnet/espnet โญ 8,195
End-to-End Speech Processing Toolkit
๐ espnet.github.io/espnet
deeppavlov/DeepPavlov โญ 6,634
An open source library for deep learning end-to-end dialog systems and chatbots.
๐ deeppavlov.ai
facebookresearch/metaseq โญ 6,446
A codebase for working with Open Pre-trained Transformers, originally forked from fairseq.
kingoflolz/mesh-transformer-jax โญ 6,255
Model parallel transformers in JAX and Haiku
maartengr/BERTopic โญ 5,902
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
๐ maartengr.github.io/bertopic
prefecthq/marvin โญ 5,075
Build AI interfaces that spark joy
๐ askmarvin.ai
aiwaves-cn/agents โญ 5,064
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
minimaxir/textgenrnn โญ 4,944
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
salesforce/CodeGen โญ 4,861
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
layout-parser/layout-parser โญ 4,709
A Unified Toolkit for Deep Learning Based Document Image Analysis
๐ layout-parser.github.io
facebookresearch/DrQA โญ 4,470
Reading Wikipedia to Answer Open-Domain Questions
makcedward/nlpaug โญ 4,378
Data augmentation for NLP
๐ makcedward.github.io
thilinarajapakse/simpletransformers โญ 4,049
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
๐ simpletransformers.ai
argilla-io/argilla โญ 3,719
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
๐ docs.argilla.io
maartengr/KeyBERT โญ 3,379
Minimal keyword extraction with BERT
๐ maartengr.github.io/keybert
life4/textdistance โญ 3,344
Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional external libs usage.
jsvine/markovify โญ 3,290
A simple, extensible Markov chain generator.
promptslab/Promptify โญ 3,166
Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
๐ discord.gg/m88xfymbk6
bytedance/lightseq โญ 3,149
LightSeq: A High Performance Library for Sequence Processing and Generation
errbotio/errbot โญ 3,103
Errbot is a chatbot, a daemon that connects to your favorite chat service and brings your tools and some fun into the conversation.
๐ errbot.io
huawei-noah/Pretrained-Language-Model โญ 2,997
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
neuralmagic/deepsparse โญ 2,952
Sparsity-aware deep learning inference runtime for CPUs
๐ neuralmagic.com/deepsparse
ddangelov/Top2Vec โญ 2,906
Top2Vec learns jointly embedded topic, document and word vectors.
jbesomi/texthero โญ 2,880
Text preprocessing, representation and visualization from zero to hero.
๐ texthero.org
huggingface/neuralcoref โญ 2,842
✨ Fast Coreference Resolution in spaCy with Neural Networks
๐ huggingface.co/coref
salesforce/CodeT5 โญ 2,680
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
๐ arxiv.org/abs/2305.07922
bigscience-workshop/promptsource โญ 2,618
Toolkit for creating, sharing and using natural language prompts.
huggingface/setfit โญ 2,107
SetFit is an efficient and prompt-free framework for few-shot fine-tuning of Sentence Transformers.
๐ hf.co/docs/setfit
jamesturk/jellyfish โญ 2,027
🪼 A Python library for doing approximate and phonetic matching of strings.
๐ jamesturk.github.io/jellyfish
alibaba/EasyNLP โญ 2,013
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
thudm/P-tuning-v2 โญ 1,953
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
featureform/featureform โญ 1,778
The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
๐ www.featureform.com
marella/ctransformers โญ 1,768
Python bindings for the Transformer models implemented in C/C++ using GGML library.
deepset-ai/FARM โญ 1,735
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
๐ farm.deepset.ai
franck-dernoncourt/NeuroNER โญ 1,689
Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
๐ neuroner.com
plasticityai/magnitude โญ 1,620
A fast, efficient universal vector embedding utility package.
arxiv-vanity/arxiv-vanity โญ 1,599
Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
๐ www.arxiv-vanity.com
google-research/language โญ 1,589
Shared repository for open-sourced projects from the Google AI Language team.
๐ ai.google/research/teams/language
explosion/spacy-models โญ 1,577
Models for the spaCy Natural Language Processing (NLP) library
๐ spacy.io
chrismattmann/tika-python โญ 1,467
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
dmmiller612/bert-extractive-summarizer โญ 1,374
Easy to use extractive text summarization with BERT
gunthercox/chatterbot-corpus โญ 1,357
A multilingual dialog corpus
๐ chatterbot-corpus.readthedocs.io
jonasgeiping/cramming โญ 1,276
Cramming the training of a (BERT-type) language model into limited compute.
intellabs/fastRAG โญ 1,187
Efficient Retrieval Augmentation and Generation Framework
nomic-ai/nomic โญ 1,187
Interact, analyze and structure massive text, image, embedding, audio and video datasets
๐ atlas.nomic.ai
urchade/GLiNER โญ 1,115
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
๐ arxiv.org/abs/2311.08526
pemistahl/lingua-py โญ 1,065
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
abertsch72/unlimiformer โญ 1,047
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
openai/grade-school-math โญ 966
GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems
norskregnesentral/skweak โญ 915
skweak: A software toolkit for weak supervision applied to NLP tasks
explosion/spacy-streamlit โญ 782
spaCy building blocks and visualizers for Streamlit apps
๐ share.streamlit.io/ines/spacy-streamlit-demo/master/app.py
paddlepaddle/RocketQA โญ 761
RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
keras-team/keras-nlp โญ 748
Modular Natural Language Processing workflows with Keras
explosion/spacy-stanza โญ 722
Use the latest Stanza (StanfordNLP) research models directly in spaCy
Python packaging, dependency management and bundling.
pyenv/pyenv โญ 38,177
pyenv lets you easily switch between multiple versions of Python.
python-poetry/poetry โญ 30,755
Python packaging and dependency management made easy
๐ python-poetry.org
pypa/pipenv โญ 24,762
A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
๐ pipenv.pypa.io
astral-sh/uv โญ 15,817
An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
๐ astral.sh
mitsuhiko/rye โญ 13,078
A Hassle-Free Python Experience
๐ rye.astral.sh
pyinstaller/pyinstaller โญ 11,601
Freeze (package) Python programs into stand-alone executables
๐ www.pyinstaller.org
pypa/pipx โญ 9,800
Install and Run Python Applications in Isolated Environments
๐ pipx.pypa.io
pdm-project/pdm โญ 7,678
A modern Python package and dependency manager supporting the latest PEP standards
๐ pdm-project.org
jazzband/pip-tools โญ 7,653
A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
๐ pip-tools.rtfd.io
mamba-org/mamba โญ 6,633
The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
๐ mamba.readthedocs.io
conda/conda โญ 6,298
A system-level, binary package and environment manager running on all major operating systems and platforms.
๐ docs.conda.io/projects/conda
conda-forge/miniforge โญ 5,945
A conda-forge distribution.
๐ conda-forge.org/miniforge
pypa/hatch โญ 5,782
Modern, extensible Python project management
๐ hatch.pypa.io/latest
indygreg/PyOxidizer โญ 5,348
A modern Python application packaging and distribution tool
pypa/virtualenv โญ 4,775
A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
๐ virtualenv.pypa.io
spack/spack โญ 4,153
A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
๐ spack.io
prefix-dev/pixi โญ 2,629
pixi is a cross-platform, multi-language package manager and workflow tool built on the foundation of the conda ecosystem.
๐ pixi.sh
beeware/briefcase โญ 2,546
Tools to support converting a Python project into a standalone native application.
๐ briefcase.readthedocs.io
pantsbuild/pex โญ 2,501
A tool for generating .pex (Python EXecutable) files, lock files and venvs.
๐ docs.pex-tool.org
pypa/flit โญ 2,144
Simplified packaging of Python modules
๐ flit.pypa.io
linkedin/shiv โญ 1,716
shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.
marcelotduarte/cx_Freeze โญ 1,299
cx_Freeze creates standalone executables from Python scripts with the same performance; it is cross-platform and should work on any platform that Python itself works on.
๐ marcelotduarte.github.io/cx_freeze
ofek/pyapp โญ 1,130
Runtime installer for Python applications
๐ ofek.dev/pyapp
pypa/gh-action-pypi-publish โญ 881
The blessed :octocat: GitHub Action, for publishing your distribution files to PyPI: https://github.com/marketplace/actions/pypi-publish
๐ packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows
py2exe/py2exe โญ 813
Create standalone Windows programs from Python code
๐ www.py2exe.org
prefix-dev/rip โญ 640
RIP is a library that allows the resolving and installing of Python PyPI packages from Rust into a virtual environment. It's based on our experience with building Rattler and aims to provide the same experience but for PyPI instead of Conda.
๐ prefix.dev
snok/install-poetry โญ 565
GitHub Action for installing and configuring Poetry
python-poetry/install.python-poetry.org โญ 192
The official Poetry installation script
๐ install.python-poetry.org
Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.
pandas-dev/pandas โญ 42,948
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
๐ pandas.pydata.org
pola-rs/polars โญ 28,538
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
๐ docs.pola.rs
duckdb/duckdb โญ 21,590
DuckDB is an analytical in-process SQL database management system
๐ www.duckdb.org
gventuri/pandas-ai โญ 12,307
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
๐ pandas-ai.com
ydataai/ydata-profiling โญ 12,304
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
๐ docs.profiling.ydata.ai
kanaries/pygwalker โญ 10,897
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
๐ kanaries.net/pygwalker
rapidsai/cudf โญ 8,133
cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data
๐ docs.rapids.ai/api/cudf/stable
aws/aws-sdk-pandas โญ 3,862
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
๐ aws-sdk-pandas.readthedocs.io
nalepae/pandarallel โญ 3,611
A simple and efficient tool to parallelize Pandas operations on all available CPUs
๐ nalepae.github.io/pandarallel
unionai-oss/pandera โญ 3,184
A light-weight, flexible, and expressive statistical data testing library
๐ www.union.ai/pandera
blaze/blaze โญ 3,181
NumPy and Pandas interface to Big Data
๐ blaze.pydata.org
adamerose/PandasGUI โญ 3,161
A GUI for Pandas DataFrames
pydata/pandas-datareader โญ 2,893
Extract data from a wide range of Internet sources into a pandas DataFrame.
๐ pydata.github.io/pandas-datareader/stable/index.html
scikit-learn-contrib/sklearn-pandas โญ 2,798
Pandas integration with sklearn
jmcarpenter2/swifter โญ 2,500
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
delta-io/delta-rs โญ 2,049
A native Rust library for Delta Lake, with bindings into Python
๐ delta-io.github.io/delta-rs
eventual-inc/Daft โญ 1,963
Distributed DataFrame for Python designed for the cloud, powered by Rust
๐ getdaft.io
fugue-project/fugue โญ 1,940
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
๐ fugue-tutorials.readthedocs.io
pyjanitor-devs/pyjanitor โญ 1,335
Clean APIs for data cleaning. Python implementation of R package Janitor
๐ pyjanitor-devs.github.io/pyjanitor
machow/siuba โญ 1,138
Python library for using dplyr like syntax with pandas and SQL
๐ siuba.org
renumics/spotlight โญ 1,084
Interactively explore unstructured datasets from your dataframe.
๐ renumics.com
holoviz/hvplot โญ 1,046
A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
๐ hvplot.holoviz.org
tkrabel/bamboolib โญ 935
bamboolib - a GUI for pandas DataFrames
๐ bamboolib.com
mwouts/itables โญ 745
This package changes how Pandas and Polars DataFrames are rendered in Jupyter Notebooks. With itables you can display your tables as interactive DataTables that you can sort, paginate, scroll or filter.
๐ mwouts.github.io/itables
Performance, parallelisation and low level libraries.
celery/celery โญ 24,216
Distributed Task Queue (development branch)
๐ docs.celeryq.dev
google/flatbuffers โญ 22,687
FlatBuffers: Memory Efficient Serialization Library
๐ flatbuffers.dev
pybind/pybind11 โญ 15,296
Seamless operability between C++11 and Python
๐ pybind11.readthedocs.io
exaloop/codon โญ 14,021
A high-performance, zero-overhead, extensible Python compiler using LLVM
๐ docs.exaloop.io/codon
dask/dask โญ 12,310
Parallel computing with task scheduling
๐ dask.org
numba/numba โญ 9,701
NumPy aware dynamic Python compiler using LLVM
๐ numba.pydata.org
modin-project/modin โญ 9,652
Modin: Scale your Pandas workflows by changing a single line of code
๐ modin.readthedocs.io
nebuly-ai/optimate โญ 8,366
A collection of libraries to optimise AI model performance
๐ www.nebuly.com
vaexio/vaex โญ 8,243
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
๐ vaex.io
mher/flower โญ 6,336
Real-time monitor and web admin for Celery distributed task queue
๐ flower.readthedocs.io
python-trio/trio โญ 6,027
Trio: a friendly Python library for async concurrency and I/O
๐ trio.readthedocs.io
ultrajson/ultrajson โญ 4,294
Ultra fast JSON decoder and encoder written in C with Python bindings
๐ pypi.org/project/ujson
facebookincubator/cinder โญ 3,446
Cinder is Meta's internal performance-oriented production version of CPython.
๐ trycinder.com
tlkh/asitop โญ 3,251
Perf monitoring CLI tool for Apple Silicon
๐ tlkh.github.io/asitop
ipython/ipyparallel โญ 2,570
IPython Parallel: Interactive Parallel Computing in Python
๐ ipyparallel.readthedocs.io
airtai/faststream โญ 2,267
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
๐ faststream.airt.ai/latest
intel/intel-extension-for-transformers โญ 2,085
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms ⚡
h5py/h5py โญ 2,062
HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
๐ www.h5py.org
agronholm/anyio โญ 1,708
High level asynchronous concurrency and networking framework that works on top of either trio or asyncio
faster-cpython/ideas โญ 1,668
Discussion and work tracker for Faster CPython project.
dask/distributed โญ 1,561
A distributed task scheduler for Dask
๐ distributed.dask.org
tiangolo/asyncer โญ 1,531
Asyncer, async and await, focused on developer experience.
๐ asyncer.tiangolo.com
intel/intel-extension-for-pytorch โญ 1,510
A Python package that extends the official PyTorch to easily improve performance on Intel platforms
nschloe/perfplot โญ 1,330
📈 Performance analysis for Python snippets
intel/scikit-learn-intelex โญ 1,199
Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
๐ intel.github.io/scikit-learn-intelex
markshannon/faster-cpython โญ 938
How to make CPython faster.
zerointensity/pointers.py โญ 913
Bringing the hell of pointers to Python.
๐ pointers.zintensity.dev
brandtbucher/specialist โญ 621
Visualize CPython's specializing, adaptive interpreter.
Memory and CPU/GPU profiling tools and libraries.
bloomberg/memray โญ 12,981
Memray is a memory profiler for Python
๐ bloomberg.github.io/memray
benfred/py-spy โญ 12,302
Sampling profiler for Python programs
plasma-umass/scalene โญ 11,477
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
joerick/pyinstrument โญ 6,374
Call stack profiler for Python. Shows you why your code is slow!
๐ pyinstrument.readthedocs.io
gaogaotiantian/viztracer โญ 4,750
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your Python code execution.
๐ viztracer.readthedocs.io
pythonprofilers/memory_profiler โญ 4,294
Monitor Memory usage of Python code
๐ pypi.python.org/pypi/memory_profiler
reloadware/reloadium โญ 2,765
Hot Reloading and Profiling for Python
๐ reloadium.io
pyutils/line_profiler โญ 2,612
Line-by-line profiling for Python
jiffyclub/snakeviz โญ 2,302
An in-browser Python profile viewer
๐ jiffyclub.github.io/snakeviz
p403n1x87/austin โญ 1,418
Python frame stack sampler for CPython
๐ pypi.org/project/austin-dist
pythonspeed/filprofiler โญ 825
A Python memory profiler for data processing and scientific computing applications
๐ pythonspeed.com/products/filmemoryprofiler
Security related libraries: vulnerability discovery, SQL injection, environment auditing.
swisskyrepo/PayloadsAllTheThings โญ 59,026
A list of useful payloads and bypasses for Web Application Security and Pentest/CTF
๐ swisskyrepo.github.io/payloadsallthethings
sqlmapproject/sqlmap โญ 31,594
Automatic SQL injection and database takeover tool
๐ sqlmap.org
certbot/certbot โญ 31,205
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
aquasecurity/trivy โญ 22,488
Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
๐ aquasecurity.github.io/trivy
bridgecrewio/checkov โญ 6,806
Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
๐ www.checkov.io
nccgroup/ScoutSuite โญ 6,396
Multi-Cloud Security Auditing Tool
pycqa/bandit โญ 6,213
Bandit is a tool designed to find common security issues in Python code.
๐ bandit.readthedocs.io
stamparm/maltrail โญ 6,200
Malicious traffic detection system
rhinosecuritylabs/pacu โญ 4,242
The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
๐ rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework
dashingsoft/pyarmor โญ 3,163
A tool used to obfuscate Python scripts, bind obfuscated scripts to a fixed machine, or expire obfuscated scripts.
๐ pyarmor.dashingsoft.com
luijait/DarkGPT โญ 1,925
DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes.
pyupio/safety โญ 1,675
Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
๐ safetycli.com/product/safety-cli
trailofbits/pip-audit โญ 952
Audits Python environments, requirements files and dependency trees for known security vulnerabilities, and can automatically fix them
๐ pypi.org/project/pip-audit
fadi002/de4py โญ 832
Toolkit for Python reverse engineering
๐ de4py.000.pe
Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Maths and Science category for crossover.
atsushisakai/PythonRobotics โญ 22,489
Python sample code for robotics algorithms.
๐ atsushisakai.github.io/pythonrobotics
bulletphysics/bullet3 โญ 12,313
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
๐ bulletphysics.org
isl-org/Open3D โญ 10,976
Open3D: A Modern Library for 3D Data Processing
๐ www.open3d.org
qiskit/qiskit โญ 4,945
Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
๐ www.ibm.com/quantum/qiskit
astropy/astropy โญ 4,335
Astronomy and astrophysics core library
๐ www.astropy.org
quantumlib/Cirq โญ 4,206
A Python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits.
nvidia/warp โญ 3,972
A Python framework for high performance GPU simulation and graphics
๐ nvidia.github.io/warp
openai/mujoco-py โญ 2,810
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
rdkit/rdkit โญ 2,556
The official sources for the RDKit library
taichi-dev/difftaichi โญ 2,438
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)
projectmesa/mesa โญ 2,338
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
๐ mesa.readthedocs.io
google/brax โญ 2,180
Massively parallel rigid body physics simulation on accelerator hardware.
quantecon/QuantEcon.py โญ 1,908
A community based Python library for quantitative economics
๐ quantecon.org/quantecon-py
facebookresearch/habitat-lab โญ 1,859
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
๐ aihabitat.org
nvidia-omniverse/IsaacGymEnvs โญ 1,836
Example RL environments for the NVIDIA Isaac Gym high performance environments
microsoft/PromptCraft-Robotics โญ 1,812
Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
๐ aka.ms/chatgpt-robotics
nvidia-omniverse/IsaacLab โญ 1,726
Unified framework for robot learning built on NVIDIA Isaac Sim
๐ isaac-sim.github.io/isaaclab
deepmodeling/deepmd-kit โญ 1,426
A deep learning package for many-body potential energy representation and molecular dynamics
๐ docs.deepmodeling.com/projects/deepmd
sail-sg/envpool โญ 1,057
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
๐ envpool.readthedocs.io
a-r-j/graphein โญ 1,012
Protein Graph Library
๐ graphein.ai
bowang-lab/scGPT โญ 958
scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
๐ scgpt.readthedocs.io/en/latest
viblo/pymunk โญ 909
Pymunk is an easy-to-use Pythonic 2D physics library that can be used whenever you need 2D rigid body physics from Python
๐ www.pymunk.org
google-deepmind/materials_discovery โญ 853
Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science.
facebookresearch/fairo โญ 842
A modular embodied agent architecture and platform for building embodied agents
google/evojax โญ 818
EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library
nvidia-omniverse/OmniIsaacGymEnvs โญ 797
Reinforcement Learning Environments for Omniverse Isaac Gym
eureka-research/DrEureka โญ 767
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
๐ eureka-research.github.io/dr-eureka
Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.
thealgorithms/Python โญ 183,083
All Algorithms implemented in Python
๐ the-algorithms.com
microsoft/generative-ai-for-beginners โญ 58,095
18 Lessons, Get Started Building with Generative AI
๐ microsoft.github.io/generative-ai-for-beginners
jakevdp/PythonDataScienceHandbook โญ 42,426
Python Data Science Handbook: full text in Jupyter Notebooks
๐ jakevdp.github.io/pythondatasciencehandbook
mlabonne/llm-course โญ 35,660
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
๐ mlabonne.github.io/blog
realpython/python-guide โญ 28,069
Python best practices guidebook, written for humans.
๐ docs.python-guide.org
rasbt/LLMs-from-scratch โญ 24,602
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
๐ www.manning.com/books/build-a-large-language-model-from-scratch
christoschristofidis/awesome-deep-learning โญ 23,425
A curated list of awesome Deep Learning tutorials, projects and communities.
d2l-ai/d2l-en โญ 22,812
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
๐ d2l.ai
wesm/pydata-book โญ 21,837
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
microsoft/recommenders โญ 18,596
Best Practices on Recommendation Systems
๐ recommenders-team.github.io/recommenders/intro.html
fchollet/deep-learning-with-python-notebooks โญ 18,018
Jupyter notebooks for the code samples of the book "Deep Learning with Python"
hannibal046/Awesome-LLM โญ 16,650
Awesome-LLM: a curated list of Large Language Models
graykode/nlp-tutorial โญ 13,977
Natural Language Processing Tutorial for Deep Learning Researchers
๐ www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying
shangtongzhang/reinforcement-learning-an-introduction โญ 13,393
Python Implementation of Reinforcement Learning: An Introduction
naklecha/llama3-from-scratch โญ 11,767
llama3 implementation one matrix multiplication at a time
karpathy/nn-zero-to-hero โญ 11,281
Neural Networks: Zero to Hero
eugeneyan/open-llms โญ 10,759
A list of open LLMs available for commercial use.
openai/spinningup โญ 9,886
An educational resource to help anyone learn deep reinforcement learning.
๐ spinningup.openai.com
rucaibox/LLMSurvey โญ 9,770
The official GitHub page for the survey paper "A Survey of Large Language Models".
๐ arxiv.org/abs/2303.18223
karpathy/micrograd โญ 9,760
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
mrdbourke/pytorch-deep-learning โญ 9,750
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
๐ learnpytorch.io
mooler0410/LLMsPracticalGuide โญ 9,139
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
๐ arxiv.org/abs/2304.13712v2
nielsrogge/Transformers-Tutorials โญ 8,776
This repository contains demos I made with the Transformers library by HuggingFace.
zhanymkanov/fastapi-best-practices โญ 8,162
FastAPI Best Practices and Conventions we used at our startup
firmai/industry-machine-learning โญ 7,171
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
๐ www.linkedin.com/company/firmai
gkamradt/langchain-tutorials โญ 6,577
Overview and tutorial of the LangChain Library
udlbook/udlbook โญ 5,851
Understanding Deep Learning - Simon J.D. Prince
srush/GPU-Puzzles โญ 5,498
Teaching beginner GPU programming in a completely interactive fashion
neetcode-gh/leetcode โญ 5,414
Leetcode solutions for NeetCode.io
udacity/deep-learning-v2-pytorch โญ 5,250
Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101
mrdbourke/tensorflow-deep-learning โญ 5,090
All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
๐ dbourke.link/ztmtfcourse
roboflow/notebooks โญ 4,968
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
๐ roboflow.com/models
timofurrer/awesome-asyncio โญ 4,520
A curated list of awesome Python asyncio frameworks, libraries, software and resources
zotroneneis/machine_learning_basics โญ 4,257
Plain python implementations of basic machine learning algorithms
alirezadir/Machine-Learning-Interviews โญ 4,082
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
huggingface/deep-rl-class โญ 3,766
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
promptslab/Awesome-Prompt-Engineering โญ 3,619
This repository contains hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
๐ discord.gg/m88xfymbk6
huggingface/diffusion-models-class โญ 3,465
Materials for the Hugging Face Diffusion Models Course
cosmicpython/book โญ 3,332
A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
๐ www.cosmicpython.com
rasbt/machine-learning-book โญ 3,204
Code Repository for Machine Learning with PyTorch and Scikit-Learn
๐ sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn
fluentpython/example-code-2e โญ 3,120
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
๐ amzn.to/3j48u2j
amanchadha/coursera-deep-learning-specialization โญ 2,943
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv...
mrdbourke/zero-to-mastery-ml โญ 2,795
All course materials for the Zero to Mastery Machine Learning and Data Science course.
๐ dbourke.link/ztmmlcourse
krzjoa/awesome-python-data-science โญ 2,466
Probably the best curated list of data science software in Python.
๐ krzjoa.github.io/awesome-python-data-science
cgpotts/cs224u โญ 2,087
Code for CS224u: Natural Language Understanding
cerlymarco/MEDIUM_NoteBook โญ 2,060
Repository containing notebooks of my posts on Medium
gerdm/prml โญ 2,035
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
trananhkma/fucking-awesome-python โญ 1,982
awesome-python with :octocat: ⭐ and 🍴
huggingface/cookbook โญ 1,553
Community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models.
๐ huggingface.co/learn/cookbook
chandlerbang/awesome-self-supervised-gnn โญ 1,527
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
atcold/NYU-DLSP21 โญ 1,518
NYU Deep Learning Spring 2021
๐ atcold.github.io/nyu-dlsp21
patrickloeber/MLfromscratch โญ 1,235
Machine Learning algorithm implementations from scratch.
davidadsp/Generative_Deep_Learning_2nd_Edition โญ 1,000
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
๐ www.oreilly.com/library/view/generative-deep-learning/9781098134174
jackhidary/quantumcomputingbook โญ 782
Companion site for the textbook Quantum Computing: An Applied Approach
dylanhogg/awesome-python โญ 285
Hand-picked awesome Python libraries and frameworks, organised by category
๐ www.awesomepython.org
Template tools and libraries: cookiecutter repos, generators, quick-starts.
tiangolo/full-stack-fastapi-template โญ 25,264
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
cookiecutter/cookiecutter โญ 22,137
A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
๐ pypi.org/project/cookiecutter
drivendata/cookiecutter-data-science โญ 8,084
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
๐ cookiecutter-data-science.drivendata.org
buuntu/fastapi-react โญ 2,158
Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker
pyscaffold/pyscaffold โญ 2,062
Python project template generator with batteries included
๐ pyscaffold.org
cjolowicz/cookiecutter-hypermodern-python โญ 1,774
Cookiecutter template for a Python package based on the Hypermodern Python article series.
๐ cookiecutter-hypermodern-python.readthedocs.io
tezromach/python-package-template โญ 1,087
Your next Python package needs a bleeding-edge project structure.
martinheinz/python-project-blueprint โญ 951
Blueprint/Boilerplate For Python Projects
fmind/mlops-python-package โญ 595
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
๐ fmind.github.io/mlops-python-package
Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.
willmcgugan/rich โญ 48,563
Rich is a Python library for rich text and beautiful formatting in the terminal.
๐ rich.readthedocs.io/en/latest
tqdm/tqdm โญ 28,119
⚡ A Fast, Extensible Progress Bar for Python and CLI
๐ tqdm.github.io
google/python-fire โญ 26,732
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
willmcgugan/textual โญ 24,547
The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
๐ textual.textualize.io
pallets/click โญ 15,394
Python composable command line interface toolkit
๐ click.palletsprojects.com
tiangolo/typer โญ 15,124
Typer, build great CLIs. Easy to code. Based on Python type hints.
๐ typer.tiangolo.com
saulpw/visidata โญ 7,729
A terminal spreadsheet multitool for discovering and arranging data
๐ visidata.org
tconbeer/harlequin โญ 3,441
The SQL IDE for Your Terminal.
๐ harlequin.sh
manrajgrover/halo โญ 2,873
Beautiful spinners for terminal, IPython and Jupyter
urwid/urwid โญ 2,775
Console user interface library for Python (official repo)
๐ urwid.org
textualize/trogon โญ 2,419
Easily turn your Click CLI into a powerful terminal application
darrenburns/elia โญ 1,681
A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.
tmbo/questionary โญ 1,499
Python library to build pretty command line user prompts ✨ Easy to use multi-select lists, confirmations, free text prompts ...
jazzband/prettytable โญ 1,327
Display tabular data in a visually appealing ASCII table format
๐ pypi.org/project/prettytable
1j01/textual-paint โญ 930
🎨 MS Paint in your terminal.
๐ pypi.org/project/textual-paint
Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.
mitmproxy/mitmproxy โญ 35,534
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
๐ mitmproxy.org
locustio/locust โญ 24,372
Write scalable load tests in plain Python
pytest-dev/pytest โญ 11,756
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
๐ pytest.org
microsoft/playwright-python โญ 11,327
Python version of the Playwright testing and automation library.
๐ playwright.dev/python
robotframework/robotframework โญ 9,513
Generic automation framework for acceptance testing and RPA
๐ robotframework.org
getmoto/moto โญ 7,536
A library that allows you to easily mock out tests based on AWS infrastructure.
๐ docs.getmoto.org/en/latest
hypothesisworks/hypothesis โญ 7,447
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
๐ hypothesis.works
newsapps/beeswithmachineguns โญ 6,430
A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
๐ apps.chicagotribune.com
seleniumbase/SeleniumBase โญ 4,606
Python's all-in-one framework for web crawling, scraping, testing, and reporting. Supports pytest. UC Mode provides stealth. Includes many tools.
๐ seleniumbase.io
getsentry/responses โญ 4,105
A utility for mocking out the Python Requests library.
spulec/freezegun โญ 4,105
Let your Python tests travel through time
codium-ai/cover-agent โญ 4,101
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement!
๐ www.codium.ai
tox-dev/tox โญ 3,601
Command line driven CI frontend and development task automation tool.
๐ tox.wiki
behave/behave โญ 3,117
BDD, Python style.
๐ behave.readthedocs.io/en/latest
nedbat/coveragepy โญ 2,935
The code coverage tool for Python
๐ coverage.readthedocs.io
cobrateam/splinter โญ 2,708
splinter - python test framework for web applications
๐ splinter.readthedocs.org/en/stable/index.html
kevin1024/vcrpy โญ 2,665
Automatically mock your HTTP interactions to simplify and speed up testing
confident-ai/deepeval โญ 2,659
The LLM Evaluation Framework
๐ docs.confident-ai.com
pytest-dev/pytest-testinfra โญ 2,357
With Testinfra you can write unit tests in Python to test actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
๐ testinfra.readthedocs.io
pytest-dev/pytest-mock โญ 1,805
Thin-wrapper around the mock package for easier use with pytest
๐ pytest-mock.readthedocs.io/en/latest
pytest-dev/pytest-cov โญ 1,711
Coverage plugin for pytest.
pytest-dev/pytest-xdist โญ 1,422
pytest plugin for distributed testing and loop-on-failures testing modes.
๐ pytest-xdist.readthedocs.io
pytest-dev/pytest-asyncio โญ 1,379
Asyncio support for pytest
๐ pytest-asyncio.readthedocs.io
taverntesting/tavern โญ 1,013
A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
๐ taverntesting.github.io
Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics.
facebook/prophet โญ 18,155
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
๐ facebook.github.io/prophet
blue-yonder/tsfresh โญ 8,264
Automatic extraction of relevant features from time series
๐ tsfresh.readthedocs.io
unit8co/darts โญ 7,798
A python library for user-friendly forecasting and anomaly detection on time series.
๐ unit8co.github.io/darts
sktime/sktime โญ 7,668
A unified framework for machine learning with time series
๐ www.sktime.net
facebookresearch/Kats โญ 4,858
Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.
awslabs/gluonts โญ 4,495
Probabilistic time series modeling in Python
๐ ts.gluon.ai
nixtla/statsforecast โญ 3,798
Lightning ⚡️ fast forecasting with statistical and econometric models.
๐ nixtlaverse.nixtla.io/statsforecast
tdameritrade/stumpy โญ 3,553
STUMPY is a powerful and scalable Python library for modern time series analysis
๐ stumpy.readthedocs.io/en/latest
google-research/timesfm โญ 3,371
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
๐ research.google/blog/a-decoder-only-foundation-model-for-time-series-forecasting
salesforce/Merlion โญ 3,322
Merlion: A Machine Learning Framework for Time Series Intelligence
amazon-science/chronos-forecasting โญ 2,205
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting
๐ arxiv.org/abs/2403.07815
rjt1990/pyflux โญ 2,103
Open source time series library for Python
aistream-peelout/flow-forecast โญ 1,984
Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
๐ flow-forecast.atlassian.net/wiki/spaces/ff/overview
uber/orbit โญ 1,843
A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
๐ orbit-ml.readthedocs.io/en/stable
alkaline-ml/pmdarima โญ 1,560
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
๐ www.alkaline-ml.com/pmdarima
time-series-foundation-models/lag-llama โญ 1,149
Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting
winedarksea/AutoTS โญ 1,083
Automated Time Series Forecasting
autoviml/Auto_TS โญ 713
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome.
google/temporian โญ 658
Temporian is an open-source Python library for preprocessing ⚡ and feature engineering temporal data for machine learning applications
๐ temporian.readthedocs.io
Typing libraries: static and run-time type checking, annotations.
python/mypy โญ 18,045
Optional static typing for Python
๐ www.mypy-lang.org
microsoft/pyright โญ 12,819
Static Type Checker for Python
facebook/pyre-check โญ 6,779
Performant type-checking for python.
๐ pyre-check.org
python-attrs/attrs โญ 5,183
Python Classes Without Boilerplate
๐ www.attrs.org
instagram/MonkeyType โญ 4,700
A Python library that generates static type annotations by collecting runtime types
google/pytype โญ 4,692
A static type analyzer for Python code
๐ google.github.io/pytype
python/typeshed โญ 4,210
Collection of library stubs for Python, with static types
mtshiba/pylyzer โญ 2,242
A fast static code analyzer & language server for Python
๐ mtshiba.github.io/pylyzer
microsoft/pylance-release โญ 1,691
Fast, feature-rich language support for Python. Documentation and issues for Pylance.
agronholm/typeguard โญ 1,484
Run-time type checker for Python
patrick-kidger/torchtyping โญ 1,369
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.
robertcraigie/pyright-python โญ 156
Python command line wrapper for pyright, a static type checker
๐ pypi.org/project/pyright
General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.
yt-dlp/yt-dlp โญ 79,494
A feature-rich command-line audio/video downloader
๐ discord.gg/h5mncfw63r
home-assistant/core โญ 70,322
Open source home automation that puts local control and privacy first.
๐ www.home-assistant.io
python/cpython โญ 61,703
The Python programming language
๐ www.python.org
localstack/localstack โญ 53,304
A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
๐ localstack.cloud
faif/python-patterns โญ 40,010
A collection of design patterns/idioms in Python
mingrammer/diagrams โญ 36,470
🎨 Diagram as Code for prototyping cloud system architectures
๐ diagrams.mingrammer.com
ggerganov/whisper.cpp โญ 33,626
Port of OpenAI's Whisper model in C/C++
keon/algorithms โญ 23,853
Minimal examples of data structures and algorithms in Python
norvig/pytudes โญ 22,556
Python programs, usually short, of considerable difficulty, to perfect particular skills.
modularml/mojo โญ 22,543
The Mojo Programming Language
๐ docs.modular.com/mojo/manual
openai/openai-python โญ 21,559
The official Python library for the OpenAI API
๐ pypi.org/project/openai
chriskiehl/Gooey โญ 20,525
Turn (almost) any Python command line program into a full GUI application with one line
facebookresearch/audiocraft โญ 20,393
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
pydantic/pydantic โญ 20,041
Data validation using Python type hints
๐ docs.pydantic.dev
squidfunk/mkdocs-material โญ 19,562
Documentation that simply works
๐ squidfunk.github.io/mkdocs-material
delgan/loguru โญ 19,140
Python logging made (stupidly) simple
micropython/micropython โญ 18,897
MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
๐ micropython.org
mkdocs/mkdocs โญ 18,864
Project documentation with Markdown.
๐ www.mkdocs.org
rustpython/RustPython โญ 18,494
A Python Interpreter written in Rust
๐ rustpython.github.io
kivy/kivy โญ 17,401
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
๐ kivy.org
higherorderco/Bend โญ 17,017
A massively parallel, high-level programming language
๐ higherorderco.com
blakeblackshear/frigate โญ 16,816
NVR with realtime local object detection for IP cameras
๐ frigate.video
ipython/ipython โญ 16,215
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
๐ ipython.readthedocs.org
alievk/avatarify-python โญ 16,200
Avatars for Zoom, Skype and other video-conferencing apps.
paul-gauthier/aider โญ 15,759
Aider is a command line tool that lets you pair program with LLMs, to edit code stored in your local git repository
๐ aider.chat
zulko/moviepy โญ 12,233
Video editing with Python
๐ zulko.github.io/moviepy
openai/triton โญ 12,204
Development repository for the Triton language and compiler
๐ triton-lang.org
python-pillow/Pillow โญ 11,992
The Python Imaging Library adds image processing capabilities to Python (Pillow is the friendly PIL fork)
๐ python-pillow.org
pytube/pytube โญ 11,853
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
๐ pytube.io
pyodide/pyodide โญ 11,824
Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
๐ pyodide.org/en/stable
pyo3/pyo3 โญ 11,719
Rust bindings for the Python interpreter
๐ pyo3.rs
dbader/schedule โญ 11,672
Python job scheduling for humans.
๐ schedule.readthedocs.io
nuitka/Nuitka โญ 11,568
Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4-3.12. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
๐ nuitka.net
caronc/apprise โญ 11,154
Apprise - Push Notifications that work with just about every platform!
๐ hub.docker.com/r/caronc/apprise
ninja-build/ninja โญ 10,903
Ninja is a small build system with a focus on speed.
๐ ninja-build.org
secdev/scapy โญ 10,441
Scapy: the Python-based interactive packet manipulation program & library.
๐ scapy.net
magicstack/uvloop โญ 10,203
Ultra fast asyncio event loop.
pallets/jinja โญ 10,176
A very fast and expressive template engine.
๐ jinja.palletsprojects.com
asweigart/pyautogui โญ 10,022
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
aristocratos/bpytop โญ 9,994
Linux/OSX/FreeBSD resource monitor
aws/serverless-application-model โญ 9,306
The AWS Serverless Application Model (AWS SAM) transform is an AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
๐ aws.amazon.com/serverless/sam
cython/cython โญ 9,231
The most widely used Python to C compiler
๐ cython.org
paramiko/paramiko โญ 8,984
The leading native Python SSHv2 protocol library.
๐ paramiko.org
boto/boto3 โญ 8,890
AWS SDK for Python
๐ aws.amazon.com/sdk-for-python
arrow-py/arrow โญ 8,651
Better dates & times for Python
๐ arrow.readthedocs.io
facebookresearch/hydra โญ 8,508
Hydra is a framework for elegantly configuring complex applications
๐ hydra.cc
xonsh/xonsh โญ 8,240
🐚 Python-powered shell. Full-featured and cross-platform.
๐ xon.sh
eternnoir/pyTelegramBotAPI โญ 7,953
Python Telegram bot api.
py-pdf/pypdf โญ 7,913
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
๐ pypdf.readthedocs.io/en/latest
kellyjonbrazil/jc โญ 7,753
CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts.
googleapis/google-api-python-client โญ 7,610
The official Python client library for Google's discovery based APIs.
๐ googleapis.github.io/google-api-python-client/docs
theskumar/python-dotenv โญ 7,390
Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
๐ saurabh-kumar.com/python-dotenv
jasonppy/VoiceCraft โญ 7,310
Zero-Shot Speech Editing and Text-to-Speech in the Wild
googlecloudplatform/python-docs-samples โญ 7,124
Code samples used on cloud.google.com
google/latexify_py โญ 7,122
A library to generate LaTeX expression from Python code.
marshmallow-code/marshmallow โญ 6,959
A lightweight library for converting complex objects to and from simple Python datatypes.
๐ marshmallow.readthedocs.io
hugapi/hug โญ 6,857
Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.
pygithub/PyGithub โญ 6,855
Typed interactions with the GitHub API v3
๐ pygithub.readthedocs.io
pyca/cryptography โญ 6,471
cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
๐ cryptography.io
openai/point-e โญ 6,440
Point cloud diffusion for 3D model synthesis
gorakhargosh/watchdog โญ 6,435
Python library and shell utilities to monitor filesystem events.
๐ packages.python.org/watchdog
icloud-photos-downloader/icloud_photos_downloader โญ 6,422
A command-line tool to download photos from iCloud
jd/tenacity โญ 6,412
Retrying library for Python
๐ tenacity.readthedocs.io
sphinx-doc/sphinx โญ 6,311
The Sphinx documentation generator
๐ www.sphinx-doc.org
sdispater/pendulum โญ 6,145
Python datetimes made easy
๐ pendulum.eustace.io
agronholm/apscheduler โญ 6,041
Task scheduling library for Python
bndr/pipreqs โญ 6,035
pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.
scikit-image/scikit-image โญ 5,995
Image processing in Python
๐ scikit-image.org
ijl/orjson โญ 5,954
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
wireservice/csvkit โญ 5,926
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
๐ csvkit.readthedocs.io
timdettmers/bitsandbytes โญ 5,869
Accessible large language models via k-bit quantization for PyTorch.
๐ huggingface.co/docs/bitsandbytes/main/en/index
pdfminer/pdfminer.six โญ 5,723
Community maintained fork of pdfminer - we fathom PDF
๐ pdfminersix.readthedocs.io
pytransitions/transitions โญ 5,562
A lightweight, object-oriented finite state machine implementation in Python with many extensions
rsalmei/alive-progress โญ 5,311
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
buildbot/buildbot โญ 5,210
Python-based continuous integration testing framework; your pull requests are more than welcome!
๐ www.buildbot.net
prompt-toolkit/ptpython โญ 5,152
A better Python REPL
spotify/pedalboard โญ 5,056
A Python library for audio.
๐ spotify.github.io/pedalboard
pycqa/pycodestyle โญ 5,006
Simple Python style checker in one Python file
๐ pycodestyle.pycqa.org
pywinauto/pywinauto โญ 4,826
Windows GUI Automation with Python (based on text properties)
๐ pywinauto.github.io
jorgebastida/awslogs โญ 4,817
AWS CloudWatch logs for Humans™
tebelorg/RPA-Python โญ 4,773
Python package for doing RPA
pytoolz/toolz โญ 4,619
A functional standard library for Python.
๐ toolz.readthedocs.org
hhatto/autopep8 โญ 4,553
A tool that automatically formats Python code to conform to the PEP 8 style guide.
๐ pypi.org/project/autopep8
pyinvoke/invoke โญ 4,340
Pythonic task management & command execution.
๐ pyinvoke.org
bogdanp/dramatiq โญ 4,191
A fast and reliable background task processing library for Python 3.
๐ dramatiq.io
evhub/coconut โญ 4,022
Coconut (coconut-lang.org) is a variant of Python that adds on top of Python syntax new features for simple, elegant, Pythonic functional programming.
๐ coconut-lang.org
adafruit/circuitpython โญ 3,984
CircuitPython - a Python implementation for teaching coding with microcontrollers
๐ circuitpython.org
ashleve/lightning-hydra-template โญ 3,955
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
miguelgrinberg/python-socketio โญ 3,897
Python Socket.IO server and client
joblib/joblib โญ 3,788
Computing with Python functions.
๐ joblib.readthedocs.org
pyinfra-dev/pyinfra โญ 3,769
pyinfra turns Python code into shell commands and runs them on your servers. Execute ad-hoc commands and write declarative operations. Target SSH servers, local machine and Docker containers. Fast and scales from one server to thousands.
๐ pyinfra.com
ets-labs/python-dependency-injector โญ 3,751
Dependency injection framework for Python
๐ python-dependency-injector.ets-labs.org
rspeer/python-ftfy โญ 3,743
Fixes mojibake and other glitches in Unicode text, after the fact.
๐ ftfy.readthedocs.org
blealtan/efficient-kan โญ 3,720
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
python-markdown/markdown โญ 3,692
A Python implementation of John Gruber's Markdown with Extension support.
๐ python-markdown.github.io
more-itertools/more-itertools โญ 3,642
More routines for operating on iterables, beyond itertools
๐ more-itertools.rtfd.io
zeromq/pyzmq โญ 3,638
PyZMQ: Python bindings for zeromq
๐ zguide.zeromq.org/py:all
pyo3/maturin โญ 3,589
Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
๐ maturin.rs
pydata/xarray โญ 3,530
N-D labeled arrays and datasets in Python
๐ xarray.dev
pypi/warehouse โญ 3,529
The Python Package Index
๐ pypi.org
tartley/colorama โญ 3,491
Simple cross-platform colored terminal text in Python
osohq/oso โญ 3,462
Oso is a batteries-included framework for building authorization in your application.
๐ docs.osohq.com
jorisschellekens/borb โญ 3,346
borb is a library for reading, creating and manipulating PDF files in python.
๐ borbpdf.com
spotify/basic-pitch โญ 3,199
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
๐ basicpitch.io
pyserial/pyserial โญ 3,171
Python serial port access library
camelot-dev/camelot โญ 2,865
A Python library to extract tabular data from PDFs
๐ camelot-py.readthedocs.io
legrandin/pycryptodome โญ 2,767
A self-contained cryptographic library for Python
๐ www.pycryptodome.org
tox-dev/pipdeptree โญ 2,747
A command line utility to display dependency tree of the installed Python packages
๐ pypi.python.org/pypi/pipdeptree
lxml/lxml โญ 2,635
The lxml XML toolkit for Python
๐ lxml.de
liiight/notifiers โญ 2,633
The easy way to send notifications
๐ notifiers.readthedocs.io
whylabs/whylogs โญ 2,606
An open-source data logging library for machine learning models and data pipelines. Provides visibility into data quality & model performance over time. Supports privacy-preserving data collection, ensuring safety & robustness.
๐ whylogs.readthedocs.io
pexpect/pexpect โญ 2,573
A Python module for controlling interactive programs in a pseudo-terminal
๐ pexpect.readthedocs.io
litl/backoff โญ 2,556
Python library providing function decorators for configurable backoff and retry
scrapinghub/dateparser โญ 2,511
python parser for human readable dates
pyston/pyston โญ 2,503
A faster and highly-compatible implementation of the Python programming language.
๐ www.pyston.org
yaml/pyyaml โญ 2,497
Canonical source repository for PyYAML
dosisod/refurb โญ 2,472
A tool for refurbishing and modernizing Python codebases
cdgriffith/Box โญ 2,424
Python dictionaries with advanced dot notation access
๐ github.com/cdgriffith/box/wiki
pypa/setuptools โญ 2,420
Official project repository for the Setuptools build system
๐ pypi.org/project/setuptools
nschloe/tikzplotlib โญ 2,365
Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX.
hgrecco/pint โญ 2,342
Operate and manipulate physical quantities in Python
๐ pint.readthedocs.org
dateutil/dateutil โญ 2,296
Useful extensions to the standard Python datetime features
grantjenks/python-diskcache โญ 2,287
Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
๐ www.grantjenks.com/docs/diskcache
pndurette/gTTS โญ 2,223
Python library and CLI tool to interface with Google Translate's text-to-speech API
๐ gtts.readthedocs.org
kiminewt/pyshark โญ 2,188
Python wrapper for tshark, allowing python packet parsing using wireshark dissectors
jcrist/msgspec โญ 2,167
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
๐ jcristharif.com/msgspec
pyparsing/pyparsing โญ 2,164
Python library for creating PEG parsers
ianmiell/shutit โญ 2,150
Automation framework for programmers
๐ ianmiell.github.io/shutit
libaudioflux/audioFlux โญ 2,115
A library for audio and music analysis, feature extraction.
๐ audioflux.top
astanin/python-tabulate โญ 2,074
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
๐ pypi.org/project/tabulate
nateshmbhat/pyttsx3 โญ 2,034
Offline Text To Speech synthesis for python
google/gin-config โญ 2,024
Gin provides a lightweight configuration framework for Python
grahamdumpleton/wrapt โญ 2,021
A Python module for decorators, wrappers and monkey patching.
pyfilesystem/pyfilesystem2 โญ 1,973
Python's Filesystem abstraction layer
๐ www.pyfilesystem.org
landscapeio/prospector โญ 1,927
Inspects Python source files and provides information about type and location of classes, methods etc
rhettbull/osxphotos โญ 1,913
Python app to work with pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata.
omry/omegaconf โญ 1,907
Flexible Python configuration system. The last one you will ever need.
julienpalard/Pipe โญ 1,907
A Python library to use infix notation in Python
python-rope/rope โญ 1,901
a python refactoring library
mitmproxy/pdoc โญ 1,898
API Documentation for Python Projects
๐ pdoc.dev
numba/llvmlite โญ 1,879
A lightweight LLVM python binding for writing JIT compilers
๐ llvmlite.pydata.org
chaostoolkit/chaostoolkit โญ 1,868
Chaos Engineering Toolkit & Orchestration for Developers
๐ chaostoolkit.org
carpedm20/emoji โญ 1,864
emoji terminal output for Python
pydoit/doit โญ 1,825
task management & automation tool
๐ pydoit.org
joowani/binarytree โญ 1,810
Python Library for Studying Binary Trees
๐ binarytree.readthedocs.io
pydantic/logfire โญ 1,803
Uncomplicated Observability for Python and beyond! 🪵🔥
๐ docs.pydantic.dev/logfire
pygments/pygments โญ 1,771
Pygments is a generic syntax highlighter written in Python
๐ pygments.org
kalliope-project/kalliope โญ 1,714
Kalliope is a framework that will help you to create your own personal assistant.
๐ kalliope-project.github.io
konradhalas/dacite โญ 1,708
Simple creation of data classes from dictionaries.
samuelcolvin/watchfiles โญ 1,701
Simple, modern and fast file watching and code reload in python.
๐ watchfiles.helpmanual.io
p0dalirius/Coercer โญ 1,699
A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
๐ podalirius.net
home-assistant/supervisor โญ 1,696
Home Assistant Supervisor
๐ home-assistant.io/hassio
open-telemetry/opentelemetry-python โญ 1,692
OpenTelemetry Python API and SDK
๐ opentelemetry.io
rubik/radon โญ 1,692
Various code metrics for Python code
๐ radon.readthedocs.org
hbldh/bleak โญ 1,674
A cross platform Bluetooth Low Energy Client for Python using asyncio
mkdocstrings/mkdocstrings โญ 1,648
Automatic documentation from sources, for MkDocs.
๐ mkdocstrings.github.io
facebookincubator/Bowler โญ 1,532
Safe code refactoring for modern Python.
๐ pybowler.io
quodlibet/mutagen โญ 1,502
Python module for handling audio metadata
๐ mutagen.readthedocs.io
nficano/python-lambda โญ 1,493
A toolkit for developing and deploying serverless Python code in AWS Lambda.
instagram/LibCST โญ 1,490
A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
๐ libcst.readthedocs.io
lcompilers/lpython โญ 1,489
Python compiler
๐ lpython.org
fabiocaccamo/python-benedict โญ 1,474
dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.
aws-samples/aws-glue-samples โญ 1,414
AWS Glue code samples
pycqa/pyflakes โญ 1,355
A simple program which checks Python source files for errors
๐ pypi.org/project/pyflakes
lidatong/dataclasses-json โญ 1,349
Easily serialize Data Classes to and from JSON
ossf/criticality_score โญ 1,307
Gives criticality score for an open source project
brandon-rhodes/python-patterns โญ 1,300
Source code behind the python-patterns.guide site by Brandon Rhodes
aio-libs/yarl โญ 1,275
Yet another URL library
๐ yarl.aio-libs.org
anthropics/anthropic-sdk-python โญ 1,238
SDK providing access to Anthropic's safety-first language model APIs
oracle/graalpython โญ 1,187
A Python 3 implementation built on GraalVM
pdoc3/pdoc โญ 1,119
Auto-generate API documentation for Python projects
๐ pdoc3.github.io/pdoc
pyo3/rust-numpy โญ 1,080
PyO3-based Rust bindings of the NumPy C-API
c4urself/bump2version โญ 1,052
Version-bump your software with a single command
๐ pypi.python.org/pypi/bump2version
metachris/logzero โญ 1,028
Robust and effective logging for Python 2 and 3.
๐ logzero.readthedocs.io
pyfpdf/fpdf2 โญ 1,025
Simple PDF generation for Python
๐ py-pdf.github.io/fpdf2
fsspec/filesystem_spec โญ 980
A specification that python filesystems should adhere to.
milvus-io/pymilvus โญ 959
Python SDK for Milvus.
fastai/fastcore โญ 930
Python supercharged for the fastai library
๐ fastcore.fast.ai
lastmile-ai/aiconfig โญ 915
AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
๐ aiconfig.lastmileai.dev
pypy/pypy โญ 881
PyPy is a very fast and compliant implementation of the Python language.
๐ pypy.org
alex-sherman/unsync โญ 876
Unsynchronize asyncio
barracuda-fsh/pyobd โญ 794
An OBD-II compliant car diagnostic tool
samuelcolvin/dirty-equals โญ 788
Doing dirty (but extremely useful) things with equals.
๐ dirty-equals.helpmanual.io
ariebovenberg/whenever โญ 774
⏰ Modern datetime library for Python, available in Rust or pure Python
๐ whenever.rtfd.io
pypa/build โญ 710
A simple, correct Python build frontend
๐ build.pypa.io
qdrant/qdrant-client โญ 706
Python client for Qdrant vector search engine
๐ qdrant.tech
open-telemetry/opentelemetry-python-contrib โญ 677
OpenTelemetry instrumentation for Python modules
๐ opentelemetry.io
modal-labs/modal-examples โญ 668
Examples of programs built using Modal
๐ modal.com/docs
instagram/Fixit โญ 663
Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
๐ fixit.rtfd.io/en/latest
gefyrahq/gefyra โญ 662
Blazingly-fast, rock-solid, local application development ➡️ with Kubernetes.
๐ gefyra.dev
methexis-inc/terminal-copilot โญ 569
A smart terminal assistant that helps you find the right command.
platformdirs/platformdirs โญ 549
A small Python module for determining appropriate platform-specific dirs, e.g. a "user data dir".
๐ platformdirs.readthedocs.io
argoproj-labs/hera โญ 538
Hera is an Argo Python SDK. Hera aims to make construction and submission of various Argo Project resources easy and accessible to everyone! Hera abstracts away low-level setup details while still maintaining a consistent vocabulary with Argo. ⭐️ Remember to star!
๐ hera.rtfd.io
fastai/ghapi โญ 529
A delightful and complete interface to GitHub's amazing API
๐ ghapi.fast.ai
steamship-core/steamship-langchain โญ 510
steamship-langchain
Visualisation tools and libraries: application frameworks, 2D/3D plotting, dashboards, WebGL.
apache/superset โญ 61,072
Apache Superset is a Data Visualization and Data Exploration Platform
๐ superset.apache.org
streamlit/streamlit โญ 33,921
Streamlit: a faster way to build and share data apps.
๐ streamlit.io
gradio-app/gradio โญ 31,492
Build and share delightful machine learning apps, all in Python. Star to support our work!
๐ www.gradio.app
plotly/dash โญ 21,031
Data Apps & Dashboards for Python. No JavaScript Required.
๐ plotly.com/dash
matplotlib/matplotlib โญ 19,794
matplotlib: plotting with Python
๐ matplotlib.org/stable
bokeh/bokeh โญ 19,138
Interactive Data Visualization in the browser, from Python
๐ bokeh.org
plotly/plotly.py โญ 15,850
The interactive graphing library for Python ✨ This project now includes Plotly Express!
๐ plotly.com/python
mwaskom/seaborn โญ 12,288
Statistical data visualization in Python
๐ seaborn.pydata.org
visgl/deck.gl ⭐ 11,984
WebGL2 powered visualization framework
🔗 deck.gl
marceloprates/prettymaps ⭐ 11,068
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
altair-viz/altair ⭐ 9,132
Declarative statistical visualization library for Python
🔗 altair-viz.github.io
nvidia/TensorRT-LLM ⭐ 7,872
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
🔗 nvidia.github.io/tensorrt-llm
lux-org/lux ⭐ 5,104
Automatically visualize your pandas dataframe via a single print!
renpy/renpy ⭐ 4,818
The Ren'Py Visual Novel Engine
🔗 www.renpy.org
man-group/dtale ⭐ 4,654
Visualizer for pandas data structures
🔗 alphatechadmin.pythonanywhere.com
holoviz/panel ⭐ 4,539
Panel: The powerful data exploration & web app framework for Python
🔗 panel.holoviz.org
has2k1/plotnine ⭐ 3,931
A Grammar of Graphics for Python
🔗 plotnine.org
residentmario/missingno ⭐ 3,875
missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset.
pyqtgraph/pyqtgraph ⭐ 3,814
Fast data visualization and GUI tools for scientific / engineering applications
🔗 www.pyqtgraph.org
vispy/vispy ⭐ 3,276
Main repository for Vispy
🔗 vispy.org
ml-tooling/opyrator ⭐ 3,054
Turns your machine learning code into microservices with web API, interactive GUI, and more.
🔗 opyrator-playground.mltooling.org
netflix/flamescope ⭐ 2,995
FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.
facebookresearch/hiplot ⭐ 2,731
HiPlot makes understanding high dimensional data easy
🔗 facebookresearch.github.io/hiplot
holoviz/holoviews ⭐ 2,676
With Holoviews, your data visualizes itself.
🔗 holoviews.org
kozea/pygal ⭐ 2,629
pygal is a dynamic SVG charting library written in Python.
🔗 www.pygal.org
mckinsey/vizro ⭐ 2,586
Vizro is a toolkit for creating modular data visualization applications.
🔗 vizro.readthedocs.io/en/stable
pyvista/pyvista ⭐ 2,533
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
🔗 docs.pyvista.org
marcomusy/vedo ⭐ 1,997
A Python module for scientific analysis of 3D data based on VTK and NumPy
🔗 vedo.embl.es
datapane/datapane ⭐ 1,363
Build and share data reports in 100% Python
🔗 datapane.com
facultyai/dash-bootstrap-components ⭐ 1,094
Bootstrap components for Plotly Dash
🔗 dash-bootstrap-components.opensource.faculty.ai
nomic-ai/deepscatter ⭐ 1,007
Zoomable, animated scatterplots in the browser that scale to over a billion points
holoviz/holoviz ⭐ 827
High-level tools to simplify visualization in Python.
🔗 holoviz.org
hazyresearch/meerkat ⭐ 820
Creative interactive views of any dataset.
Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.
django/django ⭐ 78,410
The Web framework for perfectionists with deadlines.
🔗 www.djangoproject.com
tiangolo/fastapi ⭐ 74,488
FastAPI framework, high performance, easy to learn, fast to code, ready for production
🔗 fastapi.tiangolo.com
pallets/flask ⭐ 67,280
The Python micro framework for building web applications.
🔗 flask.palletsprojects.com
sherlock-project/sherlock ⭐ 53,273
Hunt down social media accounts by username across social networks
🔗 sherlockproject.xyz
psf/requests ⭐ 51,854
A simple, yet elegant, HTTP library.
🔗 requests.readthedocs.io/en/latest
tornadoweb/tornado ⭐ 21,641
Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
🔗 www.tornadoweb.org
reflex-dev/reflex ⭐ 18,547
Web apps in pure Python
🔗 reflex.dev
huge-success/sanic ⭐ 17,945
Accelerate your web app development | Build fast. Run fast.
🔗 sanic.dev
wagtail/wagtail ⭐ 17,713
A Django content management system focused on flexibility and user experience
🔗 wagtail.org
pyscript/pyscript ⭐ 17,581
A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
🔗 pyscript.net
aio-libs/aiohttp ⭐ 14,854
Asynchronous HTTP client/server framework for asyncio and Python
🔗 docs.aiohttp.org
vincigit00/Scrapegraph-ai ⭐ 13,769
ScrapeGraphAI is a web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents
🔗 scrapegraphai.com
encode/httpx ⭐ 12,835
A next generation HTTP client for Python.
🔗 www.python-httpx.org
getpelican/pelican ⭐ 12,450
Static site generator that supports Markdown and reST syntax. Powered by Python.
🔗 getpelican.com
aws/chalice ⭐ 10,564
Python Serverless Microframework for AWS
flet-dev/flet ⭐ 10,534
Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
🔗 flet.dev
encode/starlette ⭐ 9,895
The little ASGI framework that shines.
🔗 www.starlette.io
benoitc/gunicorn ⭐ 9,673
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
🔗 www.gunicorn.org
falconry/falcon ⭐ 9,469
The no-magic web data plane API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.
🔗 falcon.readthedocs.io/en/stable
zauberzeug/nicegui ⭐ 8,509
Create web-based user interfaces with Python. The nice way.
🔗 nicegui.io
bottlepy/bottle ⭐ 8,356
bottle.py is a fast and simple micro-framework for Python web applications.
🔗 bottlepy.org
encode/uvicorn ⭐ 8,231
An ASGI web server, for Python.
🔗 www.uvicorn.org
graphql-python/graphene ⭐ 8,038
GraphQL framework for Python
🔗 graphene-python.org
reactive-python/reactpy ⭐ 7,791
ReactPy is a library for building user interfaces in Python without JavaScript
🔗 reactpy.dev
vitalik/django-ninja ⭐ 6,781
Fast, async-ready, OpenAPI, type-hints-based framework for building APIs
🔗 django-ninja.dev
pyeve/eve ⭐ 6,678
REST API framework designed for human beings
🔗 python-eve.org
pallets/werkzeug ⭐ 6,609
The comprehensive WSGI web application library.
🔗 werkzeug.palletsprojects.com
webpy/webpy ⭐ 5,884
web.py is a web framework for Python that is as simple as it is powerful.
🔗 webpy.org
starlite-api/litestar ⭐ 5,247
Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
🔗 litestar.dev
stephenmcd/mezzanine ⭐ 4,749
CMS framework for Django
🔗 mezzanine.jupo.org
nameko/nameko ⭐ 4,688
A microservices framework for Python that lets service developers concentrate on application logic and encourages testability.
🔗 www.nameko.io
pywebio/PyWebIO ⭐ 4,459
Write interactive web apps in a script-like way.
🔗 pywebio.readthedocs.io
fastapi-users/fastapi-users ⭐ 4,418
Ready-to-use and customizable user management for FastAPI
🔗 fastapi-users.github.io/fastapi-users
pylons/pyramid ⭐ 3,929
Pyramid - A Python web framework
🔗 trypyramid.com
h2oai/wave ⭐ 3,913
H2O Wave is a software stack for building beautiful, low-latency, realtime, browser-based applications and dashboards entirely in Python/R without using HTML, JavaScript, or CSS.
🔗 wave.h2o.ai
strawberry-graphql/strawberry ⭐ 3,882
A GraphQL library for Python that leverages type annotations
🔗 strawberry.rocks
websocket-client/websocket-client ⭐ 3,540
WebSocket client for Python
🔗 github.com/websocket-client/websocket-client
unbit/uwsgi ⭐ 3,441
uWSGI application server container
🔗 projects.unbit.it/uwsgi
pallets/quart ⭐ 2,834
An async Python micro framework for building web applications.
🔗 quart.palletsprojects.com
fastapi-admin/fastapi-admin ⭐ 2,758
A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
🔗 fastapi-admin-docs.long2ice.io
flipkart-incubator/Astra ⭐ 2,475
Automated Security Testing For REST APIs
masoniteframework/masonite ⭐ 2,174
The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
🔗 docs.masoniteproject.com
dot-agent/nextpy ⭐ 2,171
Self-Modifying Framework from the Future. World's First AMS
🔗 dotagent.ai
python-restx/flask-restx ⭐ 2,123
Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
🔗 flask-restx.readthedocs.io/en/latest
s3rius/FastAPI-template ⭐ 1,859
Feature rich robust FastAPI template.
dmontagu/fastapi-utils ⭐ 1,852
Reusable utilities for FastAPI: a number of utilities to help reduce boilerplate and reuse common functionality across projects
neoteroi/BlackSheep ⭐ 1,822
Fast ASGI web framework for Python
🔗 www.neoteroi.dev/blacksheep
cherrypy/cherrypy ⭐ 1,818
CherryPy is a Pythonic, object-oriented HTTP framework. https://cherrypy.dev
🔗 docs.cherrypy.dev
indico/indico ⭐ 1,709
Indico - A feature-rich event management system, made @ CERN, the place where the Web was born.
🔗 getindico.io
jordaneremieff/mangum ⭐ 1,662
AWS Lambda support for ASGI applications
🔗 mangum.io
wtforms/wtforms ⭐ 1,497
A flexible forms validation and rendering library for Python.
🔗 wtforms.readthedocs.io
awtkns/fastapi-crudrouter ⭐ 1,352
A dynamic FastAPI router that automatically creates CRUD routes for your models
🔗 fastapi-crudrouter.awtkns.com
long2ice/fastapi-cache ⭐ 1,230
fastapi-cache is a tool to cache FastAPI responses and function results, with backend support for Redis and Memcached.
🔗 github.com/long2ice/fastapi-cache
magicstack/httptools ⭐ 1,187
Fast HTTP parser
rstudio/py-shiny ⭐ 1,169
Shiny for Python
🔗 shiny.posit.co/py
whitphx/stlite ⭐ 1,114
A port of Streamlit to WebAssembly, powered by Pyodide.
🔗 edit.share.stlite.net
koxudaxi/fastapi-code-generator ⭐ 987
This code generator creates a FastAPI app from an OpenAPI file.
aeternalis-ingenium/FastAPI-Backend-Template ⭐ 600
A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.
Interactive version: www.awesomepython.org, Hugging Face Dataset: awesome-python
Please raise a new issue to suggest a Python repo that you would like to see added.
1,534 hand-picked awesome Python libraries and frameworks, updated 12 Aug 2024