Showing content from https://github.com/ml-tooling/best-of-ml-python below:

Best-of Machine Learning with Python

🏆  A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 920 awesome open-source projects with a total of 5M stars, grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
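Each entry in the lists below follows the same pattern: project name, a medal with the project-quality score, an abbreviated GitHub star count, a truncated description, and the license. As a minimal sketch (not part of the best-of tooling), the following Python parses such a line and expands the star count. The names `Entry`, `parse_stars`, and `parse_entry` are hypothetical, and the regular expression only assumes the `Name (medal score, separator, stars)` layout seen in this list.

```python
import re
from typing import NamedTuple


class Entry(NamedTuple):
    name: str
    score: int  # project-quality score shown next to the medal
    stars: int  # GitHub stars, expanded from the abbreviated form


# Assumed layout: "<name> (<medal><score> <separator> <stars> ...)".
# \D* and \D+ skip the medal and separator symbols without naming them.
ENTRY_RE = re.compile(
    r"^(?P<name>.+?) \(\D*(?P<score>\d+)\D+(?P<stars>\d+(?:\.\d+)?[KM]?)"
)

MULTIPLIERS = {"K": 1_000, "M": 1_000_000}


def parse_stars(text: str) -> int:
    """Expand an abbreviated count such as '8.5K' or '5M' into an integer."""
    suffix = text[-1]
    if suffix in MULTIPLIERS:
        return round(float(text[:-1]) * MULTIPLIERS[suffix])
    return int(text)


def parse_entry(line: str) -> Entry:
    """Extract name, score, and stars from one list entry."""
    match = ENTRY_RE.match(line)
    if match is None:
        raise ValueError(f"unrecognized entry: {line!r}")
    return Entry(match["name"], int(match["score"]), parse_stars(match["stars"]))


entries = [parse_entry(line) for line in [
    "Catboost (🥈40 · ⭐ 8.5K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2",
    "Tensorflow (🥇54 · ⭐ 190K) - An Open Source Machine Learning Framework for Everyone. Apache-2",
    "tensorflow-upstream (🥉27 · ⭐ 690) - TensorFlow ROCm port. Apache-2",
]]

# Highest score first; stars break ties (an assumption about tie ordering).
ranked = sorted(entries, key=lambda e: (e.score, e.stars), reverse=True)
```

Sorting by `(score, stars)` is a guess based on how entries with equal scores appear in the lists; the actual ranking logic lives in the list's generator and may differ.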

🧙‍♂️  Discover other best-of lists or create your own.
📫  Subscribe to our newsletter for updates and trending projects.


Machine Learning Frameworks

General-purpose machine learning and deep learning frameworks.

Tensorflow (🥇54 · ⭐ 190K) - An Open Source Machine Learning Framework for Everyone. Apache-2
PyTorch (🥇53 · ⭐ 92K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
scikit-learn (🥇51 · ⭐ 63K) - scikit-learn: machine learning in Python. BSD-3
Keras (🥇48 · ⭐ 63K · 📈) - Deep Learning for humans. Apache-2
XGBoost (🥇43 · ⭐ 27K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
PySpark (🥇42 · ⭐ 42K · 📈) - Apache Spark Python API. Apache-2
jax (🥇42 · ⭐ 33K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
StatsModels (🥇42 · ⭐ 11K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
pytorch-lightning (🥈41 · ⭐ 30K) - Pretrain, finetune ANY AI model of ANY size on.. Apache-2
PaddlePaddle (🥈41 · ⭐ 23K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
Catboost (🥈40 · ⭐ 8.5K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
LightGBM (🥈39 · ⭐ 17K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
Fastai (🥈35 · ⭐ 27K) - The fastai deep learning library. Apache-2
Flax (🥈34 · ⭐ 6.7K) - Flax is a neural network library for JAX that is designed for.. Apache-2
Ignite (🥈34 · ⭐ 4.7K) - High-level library to help with training and evaluating neural.. BSD-3
PyFlink (🥈33 · ⭐ 25K) - Apache Flink Python API. Apache-2
Jina (🥈33 · ⭐ 22K) - Build multimodal AI applications with cloud-native stack. Apache-2
ivy (🥈33 · ⭐ 14K) - Convert Machine Learning Code Between Frameworks. Apache-2
mlpack (🥈32 · ⭐ 5.4K) - mlpack: a fast, header-only C++ machine learning library. BSD-3
Thinc (🥈32 · ⭐ 2.9K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
einops (🥈31 · ⭐ 9.1K · 📉) - Flexible and powerful tensor operations for readable and reliable code.. MIT
Vowpal Wabbit (🥈31 · ⭐ 8.6K · 💀) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
skorch (🥈31 · ⭐ 6.1K) - A scikit-learn compatible neural network library that wraps.. BSD-3
Sonnet (🥉30 · ⭐ 9.9K) - TensorFlow-based neural network library. Apache-2
Ludwig (🥉29 · ⭐ 12K · 💀) - Low-code framework for building custom LLMs, neural networks,.. Apache-2
Haiku (🥉29 · ⭐ 3.1K) - JAX-based neural network library. Apache-2
tensorflow-upstream (🥉27 · ⭐ 690) - TensorFlow ROCm port. Apache-2
Geomstats (🥉26 · ⭐ 1.4K) - Computations and statistics on manifolds with geometric structures. MIT
Determined (🥉25 · ⭐ 3.2K) - Determined is an open-source machine learning platform.. Apache-2
NuPIC (🥉24 · ⭐ 6.3K · 💀) - Numenta Platform for Intelligent Computing is an implementation of.. MIT
pyRiemann (🥉24 · ⭐ 690) - Machine learning for multivariate data through the Riemannian.. BSD-3
Neural Network Libraries (🥉23 · ⭐ 2.8K) - Neural Network Libraries. Apache-2
Towhee (🥉21 · ⭐ 3.4K · 💀) - Towhee is a framework that is dedicated to making neural data.. Apache-2
fklearn (🥉20 · ⭐ 1.5K) - fklearn: Functional Machine Learning. Apache-2
NeoML (🥉18 · ⭐ 790) - Machine learning framework for both deep learning and traditional.. Apache-2
chefboost (🥉17 · ⭐ 480) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
Runhouse (🥉16 · ⭐ 1K) - Distribute and run AI workloads magically in Python, like PyTorch for.. Apache-2
ThunderGBM (🥉16 · ⭐ 700) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
Show 25 hidden projects...

Data Visualization

General-purpose and task-specific data visualization libraries.

Matplotlib (🥇47 · ⭐ 22K) - matplotlib: plotting with Python. ❗Unlicensed
Plotly (🥇45 · ⭐ 18K · 📈) - The interactive graphing library for Python. MIT
dash (🥇43 · ⭐ 24K) - Data Apps & Dashboards for Python. No JavaScript Required. MIT
Bokeh (🥇43 · ⭐ 20K · 📈) - Interactive Data Visualization in the browser, from Python. BSD-3
Seaborn (🥇39 · ⭐ 13K) - Statistical data visualization in Python. BSD-3
Altair (🥇38 · ⭐ 9.9K) - Declarative visualization library for Python. BSD-3
HoloViews (🥈37 · ⭐ 2.8K) - With Holoviews, your data visualizes itself. BSD-3
PyVista (🥈36 · ⭐ 3.2K) - 3D plotting and mesh analysis through a streamlined interface for.. MIT
PyQtGraph (🥈35 · ⭐ 4.1K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
cartopy (🥈35 · ⭐ 1.5K) - Cartopy - a cartographic python library with matplotlib support. BSD-3
pandas-profiling (🥈34 · ⭐ 13K) - 1 Line of code data quality profiling & exploratory.. MIT
FiftyOne (🥈34 · ⭐ 9.8K) - Visualize, create, and debug image and video datasets.. Apache-2
Perspective (🥈34 · ⭐ 9.4K) - A data visualization and analytics component, especially.. Apache-2
plotnine (🥈34 · ⭐ 4.3K) - A Grammar of Graphics for Python. MIT
VisPy (🥈34 · ⭐ 3.5K) - High-performance interactive 2D/3D data visualization library. BSD-3
datashader (🥈33 · ⭐ 3.4K) - Quickly and accurately render even the largest data. BSD-3
Graphviz (🥈32 · ⭐ 1.7K) - Simple Python interface for Graphviz. MIT
pyecharts (🥈30 · ⭐ 15K · 💀) - Python Echarts Plotting Library. MIT
wordcloud (🥈30 · ⭐ 10K) - A little word cloud generator in Python. MIT
UMAP (🥈30 · ⭐ 7.9K · 📉) - Uniform Manifold Approximation and Projection. BSD-3
mpld3 (🥈30 · ⭐ 2.4K) - An interactive data visualization tool which brings matplotlib graphics to.. BSD-3
lets-plot (🥈30 · ⭐ 1.7K) - Multiplatform plotting library based on the Grammar of Graphics. MIT
hvPlot (🥈30 · ⭐ 1.2K) - A high-level plotting API for pandas, dask, xarray, and networkx built.. BSD-3
D-Tale (🥉29 · ⭐ 5K) - Visualizer for pandas data structures. ❗️LGPL-2.1
bqplot (🥉28 · ⭐ 3.7K · 💀) - Plotting library for IPython/Jupyter notebooks. Apache-2
openTSNE (🥉26 · ⭐ 1.6K · 📈) - Extensible, parallel implementations of t-SNE. BSD-3
HyperTools (🥉25 · ⭐ 1.9K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
Chartify (🥉23 · ⭐ 3.6K · 💀) - Python library that makes it easy for data scientists to create.. Apache-2
Plotly-Resampler (🥉23 · ⭐ 1.1K) - Visualize large time series data with plotly.py. MIT
data-validation (🥉21 · ⭐ 770) - Library for exploring and validating machine learning.. Apache-2
vega (🥉21 · ⭐ 380 · 💀) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
vegafusion (🥉20 · ⭐ 380) - Serverside scaling for Vega and Altair visualizations. BSD-3
animatplot (🥉18 · ⭐ 420 · 💀) - A python package for animating plots build on matplotlib. MIT
Popmon (🥉16 · ⭐ 500 · 💀) - Monitor the stability of a Pandas or Spark dataframe. MIT
ivis (🥉16 · ⭐ 340 · 💀) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
Show 20 hidden projects...

Text Data & NLP

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

transformers (🥇50 · ⭐ 150K) - Transformers: the model-definition framework for.. Apache-2
spaCy (🥇42 · ⭐ 32K) - Industrial-strength Natural Language Processing (NLP) in Python. MIT
nltk (🥇42 · ⭐ 14K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
sentence-transformers (🥇41 · ⭐ 17K) - State-of-the-Art Text Embeddings. Apache-2
litellm (🥇39 · ⭐ 27K) - Python SDK, Proxy Server (LLM Gateway) to call 100+.. MIT others
gensim (🥇37 · ⭐ 16K) - Topic Modelling for Humans. ❗️LGPL-2.1
sentencepiece (🥇37 · ⭐ 11K · 📈) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
NeMo (🥇36 · ⭐ 15K) - A scalable generative AI framework built for researchers and.. Apache-2
Tokenizers (🥇36 · ⭐ 10K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
qdrant (🥇35 · ⭐ 25K) - Qdrant - High-performance, massive-scale Vector Database and Vector.. Apache-2
flair (🥇35 · ⭐ 14K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT
fairseq (🥈34 · ⭐ 32K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
haystack (🥈34 · ⭐ 22K) - AI orchestration framework to build customizable, production-ready.. Apache-2
TextBlob (🥈33 · ⭐ 9.4K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
ChatterBot (🥈32 · ⭐ 14K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
rubrix (🥈31 · ⭐ 4.6K) - Argilla is a collaboration tool for AI engineers and domain experts.. Apache-2
snowballstemmer (🥈31 · ⭐ 800) - Snowball compiler and stemming algorithms. BSD-3
Rasa (🥈30 · ⭐ 21K · 💀) - Open source machine learning framework to automate text- and.. Apache-2
spark-nlp (🥈29 · ⭐ 4K) - State of the Art Natural Language Processing. Apache-2
stanza (🥈28 · ⭐ 7.6K · 💀) - Stanford NLP Python library for tokenization, sentence.. Apache-2
jellyfish (🥈28 · ⭐ 2.2K) - a python library for doing approximate and phonetic matching of strings. MIT
TensorFlow Text (🥈28 · ⭐ 1.3K) - Making text a first-class citizen in TensorFlow. Apache-2
Opik (🥈27 · ⭐ 13K) - Debug, evaluate, and monitor your LLM applications, RAG systems, and.. Apache-2
DeepPavlov (🥈27 · ⭐ 6.9K · 💀) - An open source library for deep learning end-to-end.. Apache-2
Dedupe (🥈27 · ⭐ 4.4K) - A python library for accurate and scalable fuzzy matching, record.. MIT
ftfy (🥈27 · ⭐ 3.9K · 💀) - Fixes mojibake and other glitches in Unicode text, after the fact. Apache-2
torchtext (🥈27 · ⭐ 3.6K) - Models, data loaders and abstractions for language processing,.. BSD-3
TextDistance (🥈26 · ⭐ 3.5K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
spacy-transformers (🥈25 · ⭐ 1.4K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
scattertext (🥈24 · ⭐ 2.3K) - Beautiful visualizations of how language differs among document.. Apache-2
SciSpacy (🥈24 · ⭐ 1.9K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
sense2vec (🥈24 · ⭐ 1.7K) - Contextually-keyed word vectors. MIT
DeepKE (🥉23 · ⭐ 4.1K) - [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and.. MIT
detoxify (🥉23 · ⭐ 1.1K) - Trained models & code to predict toxic comments on all 3 Jigsaw.. Apache-2
CLTK (🥉23 · ⭐ 860) - The Classical Language Toolkit. MIT
english-words (🥉22 · ⭐ 11K · 💀) - A text file containing 479k English words for all your.. Unlicense
small-text (🥉20 · ⭐ 620) - Active Learning for Text Classification in Python. MIT
T5 (🥉19 · ⭐ 6.4K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
UForm (🥉19 · ⭐ 1.2K) - Pocket-Sized Multimodal AI for content understanding and.. Apache-2
finetune (🥉19 · ⭐ 710) - Scikit-learn style model finetuning for NLP. MPL-2.0
happy-transformer (🥉19 · ⭐ 540) - Happy Transformer makes it easy to fine-tune and.. Apache-2 huggingface
fast-bert (🥉18 · ⭐ 1.9K · 💀) - Super easy library for BERT based NLP models. Apache-2
Sockeye (🥉18 · ⭐ 1.2K · 💀) - Sequence-to-sequence framework with a focus on Neural.. Apache-2
VizSeq (🥉15 · ⭐ 440) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
textaugment (🥉14 · ⭐ 420) - TextAugment: Text Augmentation Library. MIT
Show 58 hidden projects...

Image Data

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.

Pillow (🥇46 · ⭐ 13K) - Python Imaging Library (Fork). ❗️PIL
PyTorch Image Models (🥇40 · ⭐ 35K) - The largest collection of PyTorch image encoders /.. Apache-2
torchvision (🥇39 · ⭐ 17K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
MoviePy (🥇39 · ⭐ 14K) - Video editing with Python. MIT
InsightFace (🥇37 · ⭐ 26K) - State-of-the-art 2D and 3D Face Analysis Project. MIT
Kornia (🥈35 · ⭐ 11K) - Geometric Computer Vision Library for Spatial AI. Apache-2
detectron2 (🥈34 · ⭐ 33K) - Detectron2 is a platform for object detection, segmentation.. Apache-2
deepface (🥈34 · ⭐ 20K) - A Lightweight Face Recognition and Facial Attribute Analysis (Age,.. MIT
Albumentations (🥈34 · ⭐ 15K) - Fast and flexible image augmentation library. Paper about.. MIT
imageio (🥈33 · ⭐ 1.6K) - Python library for reading and writing image data. BSD-2
opencv-python (🥈30 · ⭐ 4.9K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
Wand (🥈30 · ⭐ 1.5K) - The ctypes-based simple ImageMagick binding for Python. MIT
PaddleSeg (🥈29 · ⭐ 9.1K · 💀) - Easy-to-use image segmentation library with awesome pre-.. Apache-2
doctr (🥈29 · ⭐ 5.1K) - docTR (Document Text Recognition) - a seamless, high-.. Apache-2
sahi (🥈29 · ⭐ 4.8K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
ImageHash (🥈29 · ⭐ 3.7K) - A Python Perceptual Image Hashing Module. BSD-2
lightly (🥈27 · ⭐ 3.5K) - A python library for self-supervised learning on images. MIT
CellProfiler (🥈27 · ⭐ 1K) - An open-source application for biological image analysis. BSD-3
mahotas (🥈27 · ⭐ 870) - Computer Vision in Python. MIT
vit-pytorch (🥉26 · ⭐ 24K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
PaddleDetection (🥉26 · ⭐ 14K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2
pyvips (🥉25 · ⭐ 720) - python binding for libvips using cffi. MIT
facenet-pytorch (🥉24 · ⭐ 5K · 💀) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
mtcnn (🥉24 · ⭐ 2.4K · 💀) - MTCNN face detection implementation for TensorFlow, as a PIP.. MIT
Face Alignment (🥉23 · ⭐ 7.4K · 💀) - 2D and 3D Face alignment library build using pytorch. BSD-3
Norfair (🥉23 · ⭐ 2.5K) - Lightweight Python library for adding real-time multi-object tracking.. BSD-3
MMF (🥉22 · ⭐ 5.6K) - A modular framework for vision & language multimodal research from.. BSD-3
Image Deduplicator (🥉22 · ⭐ 5.5K) - Finding duplicate images made easy!. Apache-2
pytorchvideo (🥉21 · ⭐ 3.5K · 💀) - A deep learning library for video understanding.. Apache-2
tensorflow-graphics (🥉21 · ⭐ 2.8K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
PySlowFast (🥉20 · ⭐ 7.1K · 💀) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
segmentation_models (🥉20 · ⭐ 4.9K · 💀) - Segmentation models with pretrained backbones. Keras.. MIT
kubric (🥉19 · ⭐ 2.5K) - A data generation pipeline for creating semi-realistic synthetic.. Apache-2
icevision (🥉19 · ⭐ 860 · 💀) - An Agnostic Computer Vision Framework - Pluggable to any.. Apache-2
Image Super-Resolution (🥉18 · ⭐ 4.8K · 💀) - Super-scale your images and run experiments with.. Apache-2
scenic (🥉17 · ⭐ 3.6K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2
Caer (🥉17 · ⭐ 800 · 📈) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
Show 27 hidden projects...

Graph Data

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (🥇42 · ⭐ 16K) - Network Analysis in Python. BSD-3
PyTorch Geometric (🥇37 · ⭐ 23K) - Graph Neural Network Library for PyTorch. MIT
dgl (🥇31 · ⭐ 14K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
ogb (🥈26 · ⭐ 2K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
pytorch_geometric_temporal (🥈25 · ⭐ 2.9K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT
pygraphistry (🥈25 · ⭐ 2.3K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
PyKEEN (🥈25 · ⭐ 1.9K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
torch-cluster (🥈24 · ⭐ 890) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
Node2Vec (🥈22 · ⭐ 1.3K · 💀) - Implementation of the node2vec algorithm. MIT
Show 27 hidden projects...

Audio Data

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

spleeter (🥇33 · ⭐ 27K) - Deezer source separation library including pretrained models. MIT
speechbrain (🥇33 · ⭐ 10K) - A PyTorch-based Speech Toolkit. Apache-2
espnet (🥇33 · ⭐ 9.4K) - End-to-End Speech Processing Toolkit. Apache-2
SpeechRecognition (🥈31 · ⭐ 8.8K) - Speech recognition module for Python, supporting several.. BSD-3
librosa (🥈31 · ⭐ 7.8K) - Python library for audio and music analysis. ISC
torchaudio (🥈30 · ⭐ 2.7K) - Data manipulation and transformation for audio signal.. BSD-2
Coqui TTS (🥈30 · ⭐ 1.7K) - a deep learning toolkit for Text-to-Speech, battle-.. MPL-2.0
Magenta (🥈28 · ⭐ 20K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
audiomentations (🥈27 · ⭐ 2.1K) - A Python library for audio data augmentation. Useful for.. MIT
Porcupine (🥉25 · ⭐ 4.3K) - On-device wake word detection powered by deep learning. Apache-2
pyAudioAnalysis (🥉24 · ⭐ 6.1K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
Madmom (🥉24 · ⭐ 1.5K · 💀) - Python audio and music signal processing library. BSD-3
tinytag (🥉24 · ⭐ 770) - Python library for reading audio file metadata. MIT
DDSP (🥉23 · ⭐ 3.1K · 💀) - DDSP: Differentiable Digital Signal Processing. Apache-2
python-soundfile (🥉22 · ⭐ 790) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
DeepSpeech (🥉20 · ⭐ 27K) - DeepSpeech is an open source embedded (offline, on-device).. MPL-2.0
nnAudio (🥉18 · ⭐ 1.1K) - Audio processing by using pytorch 1D convolution network. MIT
Julius (🥉15 · ⭐ 450) - Fast PyTorch based DSP for audio and 1D signals. MIT
Show 11 hidden projects...

Geospatial Data

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

pydeck (🥇41 · ⭐ 13K) - WebGL2 powered visualization framework. MIT
folium (🥇40 · ⭐ 7.2K · 📈) - Python Data. Leaflet.js Maps. MIT
GeoPandas (🥇37 · ⭐ 4.9K) - Python tools for geographic data. BSD-3
Shapely (🥇37 · ⭐ 4.2K) - Manipulation and analysis of geometric objects. BSD-3
Rasterio (🥈36 · ⭐ 2.4K) - Rasterio reads and writes geospatial raster datasets. BSD-3
pyproj (🥈35 · ⭐ 1.1K · 📈) - Python interface to PROJ (cartographic projections and coordinate.. MIT
ipyleaflet (🥈34 · ⭐ 1.5K) - A Jupyter - Leaflet.js bridge. MIT
ArcGIS API (🥈32 · ⭐ 2K) - Documentation and samples for ArcGIS API for Python. Apache-2
Fiona (🥈32 · ⭐ 1.2K) - Fiona reads and writes geographic data files. BSD-3
PySAL (🥉29 · ⭐ 1.4K · 📉) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
geojson (🥉28 · ⭐ 970 · 💀) - Python bindings and utilities for GeoJSON. BSD-3
EarthPy (🥉28 · ⭐ 520) - A package built to support working with spatial data using open source.. BSD-3
GeoViews (🥉24 · ⭐ 620) - Simple, concise geographical visualization in Python. BSD-3
pymap3d (🥉24 · ⭐ 420) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
Mapbox GL (🥉18 · ⭐ 680) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
Show 7 hidden projects...

Financial Data

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

yfinance (🥇39 · ⭐ 19K) - Download market data from Yahoo! Finance's API. Apache-2
bt (🥇29 · ⭐ 2.6K) - bt - flexible backtesting for Python. MIT
Qlib (🥈27 · ⭐ 28K) - Qlib is an AI-oriented Quant investment platform that aims to use AI tech.. MIT
ffn (🥈25 · ⭐ 2.3K) - ffn - a financial function library for Python. MIT
Alpha Vantage (🥉24 · ⭐ 4.6K) - A python wrapper for Alpha Vantage API for financial data. MIT
stockstats (🥉21 · ⭐ 1.4K) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
tf-quant-finance (🥉19 · ⭐ 4.9K) - High-performance TensorFlow library for quantitative.. Apache-2
finmarketpy (🥉19 · ⭐ 3.6K) - Python library for backtesting trading strategies & analyzing.. Apache-2
Show 17 hidden projects...

Time Series Data

Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.

sktime (🥇39 · ⭐ 9.2K) - A unified framework for machine learning with time series. BSD-3
Prophet (🥇33 · ⭐ 20K · 📉) - Tool for producing high quality forecasts for time series data that.. MIT
tslearn (🥇32 · ⭐ 3K) - The machine learning toolkit for time series analysis in Python. BSD-2
pytorch-forecasting (🥈31 · ⭐ 4.4K) - Time series forecasting with PyTorch. MIT
Darts (🥈30 · ⭐ 8.8K) - A python library for user-friendly forecasting and anomaly detection.. Apache-2
StatsForecast (🥈30 · ⭐ 4.5K) - Lightning fast forecasting with statistical and econometric.. Apache-2
STUMPY (🥈30 · ⭐ 4K) - STUMPY is a powerful and scalable Python library for modern time series.. BSD-3
tsfresh (🥈29 · ⭐ 8.9K) - Automatic extraction of relevant features from time series:. MIT
NeuralForecast (🥈29 · ⭐ 3.7K) - Scalable and user friendly neural forecasting algorithms. Apache-2
pmdarima (🥈29 · ⭐ 1.7K · 💀) - A statistical library designed to fill the void in Python's time.. MIT
skforecast (🥈28 · ⭐ 1.4K) - Time series forecasting with machine learning models. BSD-3
Streamz (🥈26 · ⭐ 1.3K · 💀) - Real-time stream processing for python. BSD-3
GluonTS (🥉25 · ⭐ 5K) - Probabilistic time series modeling in Python. Apache-2
pyts (🥉24 · ⭐ 1.8K) - A Python package for time series classification. BSD-3
NeuralProphet (🥉21 · ⭐ 4.2K · 💀) - NeuralProphet: A simple forecasting package. MIT
TSFEL (🥉19 · ⭐ 1K · 💀) - An intuitive library to extract features from time series. BSD-3
tsflex (🥉18 · ⭐ 430 · 💀) - Flexible time series feature extraction & processing. MIT
greykite (🥉17 · ⭐ 1.8K) - A flexible, intuitive and fast forecasting library. BSD-2
pydlm (🥉16 · ⭐ 480 · 💀) - A python library for Bayesian time series modeling. BSD-3
Show 10 hidden projects...

Medical Data

Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.

MNE (🥇36 · ⭐ 3K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
Nilearn (🥇36 · ⭐ 1.3K) - Machine learning for NeuroImaging in Python. BSD-3
MONAI (🥈35 · ⭐ 6.7K) - AI Toolkit for Healthcare Imaging. Apache-2
NIPYPE (🥈33 · ⭐ 790) - Workflows and interfaces for neuroimaging packages. Apache-2
Lifelines (🥈29 · ⭐ 2.5K · 💀) - Survival analysis in Python. MIT
Hail (🥈29 · ⭐ 1K) - Cloud-native genomic dataframes and batch computing. MIT
NiBabel (🥈29 · ⭐ 720 · 📉) - Python package to access a cacophony of neuro-imaging file formats. MIT
DeepVariant (🥉25 · ⭐ 3.5K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
Brainiak (🥉15 · ⭐ 360 · 💀) - Brain Imaging Analysis Kit. Apache-2
Show 10 hidden projects...

Tabular Data

Libraries for processing tabular and structured data.

skrub (🥇26 · ⭐ 1.4K) - Machine learning with dataframes. BSD-3
pytorch_tabular (🥈19 · ⭐ 1.6K) - A standard framework for modelling Deep Learning Models.. MIT
miceforest (🥈17 · ⭐ 390 · 💀) - Multiple Imputation with LightGBM in Python. MIT
upgini (🥈17 · ⭐ 340) - Data search & enrichment library for Machine Learning Easily find and add.. BSD-3
Show 2 hidden projects...
Optical Character Recognition

Libraries for optical character recognition (OCR) and text extraction from images or videos.

PaddleOCR (🥇42 · ⭐ 53K) - Awesome multilingual OCR and Document Parsing toolkits based.. Apache-2
OCRmyPDF (🥇35 · ⭐ 31K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
EasyOCR (🥈33 · ⭐ 28K · 💀) - Ready-to-use OCR with 80+ supported languages and all popular.. Apache-2
Tesseract (🥈29 · ⭐ 6.2K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
tesserocr (🥈28 · ⭐ 2.1K) - A Python wrapper for the tesseract-ocr API. MIT
MMOCR (🥉24 · ⭐ 4.6K · 💀) - OpenMMLab Text Detection, Recognition and Understanding Toolbox. Apache-2
Show 6 hidden projects...
Data Containers & Structures

General-purpose data containers & structures as well as utilities & extensions for pandas.

🔗 best-of-python - Data Containers (⭐ 4.1K) - Collection of data-container, dataframe, and pandas-..


Data Loading & Extraction

Libraries for loading, collecting, and extracting data from a variety of data sources and formats.

🔗 best-of-python - Data Extraction (⭐ 4.1K) - Collection of data-loading and -extraction libraries.

Web Scraping & Crawling

Libraries for web scraping, crawling, downloading, and mining.

🔗 best-of-web-python - Web Scraping (⭐ 2.6K) - Collection of web-scraping and crawling libraries.


Data Pipelines & Streaming

Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.

🔗 best-of-python - Data Pipelines (⭐ 4.1K) - Libraries for data batch- and stream-processing,..

Show 1 hidden projects...
Distributed Machine Learning

Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.

Ray (🥇45 · ⭐ 38K) - Ray is an AI compute engine. Ray consists of a core distributed runtime.. Apache-2
dask (🥇42 · ⭐ 13K) - Parallel computing with task scheduling. BSD-3
DeepSpeed (🥇37 · ⭐ 40K) - DeepSpeed is a deep learning optimization library that makes.. Apache-2
dask.distributed (🥇37 · ⭐ 1.6K) - A distributed task scheduler for Dask. BSD-3
metrics (🥈36 · ⭐ 2.3K) - Machine learning metrics for distributed, scalable PyTorch.. Apache-2
ColossalAI (🥈32 · ⭐ 41K) - Making large AI models cheaper, faster and more accessible. Apache-2
horovod (🥈31 · ⭐ 15K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
mpi4py (🥈31 · ⭐ 870) - Python bindings for MPI. BSD-3
H2O-3 (🥈29 · ⭐ 7.3K) - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning.. Apache-2
FairScale (🥈29 · ⭐ 3.4K) - PyTorch extensions for high performance and large scale training. BSD-3
BigDL (🥈28 · ⭐ 8.2K) - Accelerate local LLM inference and finetuning (LLaMA, Mistral,.. Apache-2
dask-ml (🥈27 · ⭐ 940) - Scalable Machine Learning with Dask. BSD-3
Submit it (🥈25 · ⭐ 1.5K) - Python 3.8+ toolbox for submitting jobs to Slurm. MIT
SynapseML (🥉24 · ⭐ 5.2K) - Simple and Distributed Machine Learning. MIT
petastorm (🥉24 · ⭐ 1.9K) - Petastorm library enables single machine or distributed training.. Apache-2
MMLSpark (🥉23 · ⭐ 5.2K) - Simple and Distributed Machine Learning. MIT
Apache Singa (🥉23 · ⭐ 3.5K) - a distributed deep learning platform. Apache-2
Hivemind (🥉22 · ⭐ 2.2K) - Decentralized deep learning in PyTorch. Built to train models on.. MIT
analytics-zoo (🥉20 · ⭐ 2.6K · 💀) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
Show 17 hidden projects...
Hyperparameter Optimization & AutoML

Libraries for hyperparameter optimization, AutoML, and neural architecture search.

Optuna (🥇41 · ⭐ 12K) - A hyperparameter optimization framework. MIT
AutoGluon (🥇34 · ⭐ 9.3K) - Fast and Accurate ML in 3 Lines of Code. Apache-2
Ax (🥇33 · ⭐ 2.6K) - Adaptive Experimentation Platform. MIT
BoTorch (🥇32 · ⭐ 3.3K) - Bayesian optimization in PyTorch. MIT
Hyperopt (🥇31 · ⭐ 7.5K · 💀) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
AutoKeras (🥈30 · ⭐ 9.3K · 💀) - AutoML library for deep learning. Apache-2
featuretools (🥈30 · ⭐ 7.5K · 💀) - An open source python library for automated feature.. BSD-3
Bayesian Optimization (🥈29 · ⭐ 8.4K) - A Python implementation of global optimization with.. MIT
nevergrad (🥈28 · ⭐ 4.1K) - A Python toolbox for performing gradient-free optimization. MIT
mljar-supervised (🥈27 · ⭐ 3.2K) - Python package for AutoML on Tabular Data with Feature.. MIT
lazypredict (🥈24 · ⭐ 3.2K) - Lazy Predict help build a lot of basic models without much code.. MIT
FEDOT (🥉21 · ⭐ 680) - Automated modeling and machine learning framework FEDOT. BSD-3
Hyperactive (🥉21 · ⭐ 530) - An optimization and data collection toolbox for convenient and fast.. MIT
AlphaPy (🥉19 · ⭐ 1.6K · 💀) - Python AutoML for Trading Systems and Sports Betting. Apache-2
Auto ViML (🥉17 · ⭐ 540 · 💀) - Automatically Build Multiple ML Models with a Single Line of.. Apache-2
opytimizer (🥉16 · ⭐ 620 · 💀) - Opytimizer is a Python library consisting of meta-heuristic.. Apache-2
featurewiz (🥉15 · ⭐ 660) - Use advanced feature engineering strategies and select best.. Apache-2
Show 35 hidden projects...

Reinforcement Learning

Libraries for building and evaluating reinforcement learning & agent-based systems.

FinRL (🥇28 · ⭐ 12K) - FinRL: Financial Reinforcement Learning. MIT
ViZDoom (🥇28 · ⭐ 1.9K) - Reinforcement Learning environments based on the 1993 game Doom. MIT
Acme (🥈27 · ⭐ 3.8K) - A library of reinforcement learning components and agents. Apache-2
TF-Agents (🥈23 · ⭐ 2.9K) - TF-Agents: A reliable, scalable and easy to use TensorFlow.. Apache-2
Dopamine (🥉22 · ⭐ 11K · 💀) - Dopamine is a research framework for fast prototyping of.. Apache-2
PARL (🥉22 · ⭐ 3.4K · 💀) - A high-performance distributed training framework for.. Apache-2
RLax (🥉21 · ⭐ 1.3K) - A library of reinforcement learning building blocks in JAX. Apache-2
ReAgent (🥉20 · ⭐ 3.7K) - A platform for Reasoning systems (Reinforcement Learning,.. BSD-3
PFRL (🥉20 · ⭐ 1.2K · 💀) - PFRL: a PyTorch-based deep reinforcement learning library. MIT
rliable (🥉14 · ⭐ 840 · 💀) - [NeurIPS21 Outstanding Paper] Library for reliable evaluation on.. Apache-2
Show 13 hidden projects...

Recommender Systems

Libraries for building and evaluating recommendation systems.

Recommenders (🥇30 · ⭐ 21K) - Best Practices on Recommendation Systems. MIT
torchrec (🥇30 · ⭐ 2.3K) - Pytorch domain library for recommendation systems. BSD-3
Cornac (🥈28 · ⭐ 970) - A Comparative Framework for Multimodal Recommender Systems. Apache-2
RecBole (🥉22 · ⭐ 3.9K) - A unified, comprehensive and efficient recommendation library. MIT
TF Recommenders (🥉19 · ⭐ 2K) - TensorFlow Recommenders is a library for building.. Apache-2
Show 12 hidden projects...

Privacy Machine Learning

Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.

PySyft (🥇29 · ⭐ 9.8K) - Perform data science on data that remains in someone else's server. Apache-2
Opacus (🥇29 · ⭐ 1.8K) - Training PyTorch models with differential privacy. Apache-2
FATE (🥈23 · ⭐ 5.9K · 💀) - An Industrial Grade Federated Learning Framework. Apache-2
TFEncrypted (🥉22 · ⭐ 1.2K · 💀) - A Framework for Encrypted Machine Learning in.. Apache-2
TensorFlow Privacy (🥉19 · ⭐ 2K) - Library for training machine learning models with.. Apache-2
CrypTen (🥉19 · ⭐ 1.6K · 💀) - A framework for Privacy Preserving Machine Learning. MIT
Show 1 hidden projects...
Workflow & Experiment Tracking

Libraries to organize, track, and visualize machine learning experiments.

mlflow (🥇42 · ⭐ 22K) - The open source developer platform to build AI/LLM applications and.. Apache-2
wandb client (🥇41 · ⭐ 10K) - The AI developer platform. Use Weights & Biases to train and fine-.. MIT
DVC (🥇40 · ⭐ 15K) - Data Versioning and ML Experiments. Apache-2
Tensorboard (🥇40 · ⭐ 7K) - TensorFlow's Visualization Toolkit. Apache-2
SageMaker SDK (🥈37 · ⭐ 2.2K) - A library for training and deploying machine learning.. Apache-2
Metaflow (🥈34 · ⭐ 9.4K) - Build, Manage and Deploy AI/ML Systems. Apache-2
snakemake (🥈33 · ⭐ 2.6K) - This is the development home of the workflow management system.. MIT
PyCaret (🥈32 · ⭐ 9.5K) - An open-source, low-code machine learning library in Python. MIT
tensorboardX (🥈31 · ⭐ 8K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
kaggle (🥈31 · ⭐ 6.8K) - Official Kaggle API. Apache-2
ClearML (🥈30 · ⭐ 6.2K) - ClearML - Auto-Magical CI/CD to streamline your AI workload... Apache-2
aim (🥈29 · ⭐ 5.8K) - Aim: an easy-to-use & supercharged open-source experiment tracker. Apache-2
sacred (🥈27 · ⭐ 4.3K · 💀) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
Neptune.ai (🥈27 · ⭐ 620) - The experiment tracker for foundation model training. Apache-2
AzureML SDK (🥈25 · ⭐ 4.2K) - Python notebooks with ML and deep learning examples with Azure.. MIT
VisualDL (🥉24 · ⭐ 4.8K · 💀) - Deep Learning Visualization Toolkit. Apache-2
TNT (🥉24 · ⭐ 1.7K) - A lightweight library for PyTorch training tools and utilities. BSD-3
ml-metadata (🥉23 · ⭐ 650) - For recording and retrieving metadata associated with ML.. Apache-2
Labml (🥉21 · ⭐ 2.2K) - Monitor deep learning model training and hardware usage from your mobile.. MIT
livelossplot (🥉21 · ⭐ 1.3K · 💀) - Live training loss plot in Jupyter Notebook for Keras,.. MIT
gokart (🥉21 · ⭐ 330) - Gokart solves reproducibility, task dependencies, constraints of good code,.. MIT
Guild AI (🥉19 · ⭐ 890) - Experiment tracking, ML developer tools. Apache-2
quinn (🥉17 · ⭐ 680 · 💀) - pyspark methods to enhance developer productivity. Apache-2
datmo (🥉17 · ⭐ 340) - Open source production model management tool for data scientists. MIT
keepsake (🥉16 · ⭐ 1.7K · 💀) - Version control for machine learning. Apache-2
CometML (🥉13) - Supercharging Machine Learning. MIT
Show 14 hidden projects...
Model Serialization & Deployment

Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.

onnx (🥇40 · ⭐ 19K) - Open standard for machine learning interoperability. Apache-2
triton (🥇38 · ⭐ 17K) - Development repository for the Triton language and compiler. MIT
huggingface_hub (🥈35 · ⭐ 2.8K) - The official Python client for the Huggingface Hub. Apache-2
Core ML Tools (🥈33 · ⭐ 4.9K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
TorchServe (🥈32 · ⭐ 4.4K) - Serve, optimize and scale PyTorch models in production. Apache-2
BentoML (🥈30 · ⭐ 8K) - The easiest way to serve AI apps and models - Build Model Inference.. Apache-2
hls4ml (🥈29 · ⭐ 1.6K) - Machine learning on FPGAs using HLS. Apache-2
mmdnn (🥈24 · ⭐ 5.8K) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
Hummingbird (🥉23 · ⭐ 3.5K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
tfdeploy (🥉15 · ⭐ 350 · 💀) - Deploy tensorflow graphs for fast evaluation and export to.. BSD-3
Show 10 hidden projects...
Model Interpretability

Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.

shap (🥇40 · ⭐ 24K) - A game theoretic approach to explain the output of any machine learning model. MIT
Netron (🥇36 · ⭐ 31K) - Visualizer for neural network, deep learning and machine learning.. MIT
arviz (🥇34 · ⭐ 1.7K) - Exploratory analysis of Bayesian models with Python. Apache-2
Captum (🥇33 · ⭐ 5.4K) - Model interpretability and understanding for PyTorch. BSD-3
explainerdashboard (🥇30 · ⭐ 2.4K) - Quickly build Explainable AI dashboards that show the inner.. MIT
InterpretML (🥈29 · ⭐ 6.6K) - Fit interpretable models. Explain blackbox machine learning. MIT
evaluate (🥈28 · ⭐ 2.3K) - Evaluate: A library for easily evaluating machine learning models.. Apache-2
DoWhy (🥈27 · ⭐ 7.7K) - DoWhy is a Python library for causal inference that supports explicit.. MIT
dtreeviz (🥈26 · ⭐ 3.1K) - A python library for decision tree visualization and model interpretation. MIT
shapash (🥈26 · ⭐ 2.9K) - Shapash: User-friendly Explainability and Interpretability to.. Apache-2
fairlearn (🥈26 · ⭐ 2.1K) - A Python package to assess and improve fairness of machine.. MIT
LIT (🥈24 · ⭐ 3.6K · 💀) - The Learning Interpretability Tool: Interactively analyze ML models.. Apache-2
Fairness 360 (🥈24 · ⭐ 2.6K · 💀) - A comprehensive set of fairness metrics for datasets and.. Apache-2
Model Analysis (🥈23 · ⭐ 1.3K) - Model analysis tools for TensorFlow. Apache-2
Explainability 360 (🥉21 · ⭐ 1.7K) - Interpretability and explainability of data and.. Apache-2
responsible-ai-widgets (🥉21 · ⭐ 1.6K) - Responsible AI Toolbox is a suite of tools providing.. MIT
imodels (🥉21 · ⭐ 1.5K) - Interpretable ML package for concise, transparent, and accurate.. MIT
DiCE (🥉20 · ⭐ 1.4K) - Generate Diverse Counterfactual Explanations for any machine.. MIT
keract (🥉20 · ⭐ 1.1K) - Layers Outputs and Gradients in Keras. Made easy. MIT
aequitas (🥉20 · ⭐ 730) - Bias Auditing & Fair ML Toolkit. MIT
ecco (🥉19 · ⭐ 2K · 💀) - Explain, analyze, and visualize NLP language models. Ecco creates.. BSD-3
fairness-indicators (🥉18 · ⭐ 350) - Tensorflow's Fairness Evaluation and Visualization.. Apache-2
LOFO (🥉16 · ⭐ 840) - Leave One Feature Out Importance. MIT
random-forest-importances (🥉16 · ⭐ 620) - Code to compute permutation and drop-column.. MIT
ExplainX.ai (🥉12 · ⭐ 440 · 💀) - Explainable AI framework for data scientists. Explain & debug any.. MIT
Show 30 hidden projects...
Vector Similarity Search (ANN)

Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.

🔗 ANN Benchmarks (⭐ 5.4K) - Benchmarks of approximate nearest neighbor libraries in Python.

Milvus (🥇41 · ⭐ 37K · 📉) - Milvus is a high-performance, cloud-native vector database built.. Apache-2
Faiss (🥇39 · ⭐ 37K) - A library for efficient similarity search and clustering of dense vectors. MIT
USearch (🥈32 · ⭐ 3K) - Fast Open-Source Search & Clustering engine for Vectors & Arbitrary.. Apache-2
NMSLIB (🥉28 · ⭐ 3.5K · 💀) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
PyNNDescent (🥉25 · ⭐ 940 · 💀) - A Python nearest neighbor descent for approximate nearest.. BSD-2
NGT (🥉21 · ⭐ 1.3K) - Nearest Neighbor Search with Neighborhood Graph and Tree for High-.. Apache-2
Show 6 hidden projects...
Probabilistics & Statistics

Libraries providing capabilities for probabilistic programming/reasoning, Bayesian inference, Gaussian processes, or statistics.

PyMC3 (🥇40 · ⭐ 9.2K) - Bayesian Modeling and Probabilistic Programming in Python. Apache-2
tensorflow-probability (🥇31 · ⭐ 4.4K · 📉) - Probabilistic reasoning and statistical analysis in.. Apache-2
pgmpy (🥇30 · ⭐ 3K) - Python Library for Causal and Probabilistic Modeling using Bayesian Networks. MIT
patsy (🥇30 · ⭐ 970) - Describing statistical models in Python using symbolic formulas. BSD-2
Pyro (🥈29 · ⭐ 8.8K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
GPyTorch (🥈29 · ⭐ 3.8K) - A highly efficient implementation of Gaussian Processes in PyTorch. MIT
emcee (🥈28 · ⭐ 1.5K) - The Python ensemble sampling toolkit for affine-invariant MCMC. MIT
hmmlearn (🥈27 · ⭐ 3.2K · 💀) - Hidden Markov Models in Python, with scikit-learn like API. BSD-3
GPflow (🥈27 · ⭐ 1.9K) - Gaussian processes in TensorFlow. Apache-2
SALib (🥉26 · ⭐ 940) - Sensitivity Analysis Library in Python. Contains Sobol, Morris, FAST, and.. MIT
pomegranate (🥉25 · ⭐ 3.5K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
bambi (🥉25 · ⭐ 1.2K) - BAyesian Model-Building Interface (Bambi) in Python. MIT
scikit-posthocs (🥉24 · ⭐ 370) - Multiple Pairwise Comparisons (Post Hoc) Tests in Python. MIT
Baal (🥉21 · ⭐ 900) - Bayesian active learning library for research and industrial usecases. Apache-2
TorchUncertainty (🥉20 · ⭐ 420) - Open-source framework for uncertainty and deep.. Apache-2
pandas-ta (🥉19 · ⭐ 5.5K) - Technical Analysis Indicators - Pandas TA is an easy to use.. MIT
Orbit (🥉19 · ⭐ 2K) - A Python package for Bayesian forecasting with object-oriented design.. Apache-2
pyhsmm (🥉19 · ⭐ 560 · 💀) - Bayesian inference in HSMMs and HMMs. MIT
Show 6 hidden projects...
Adversarial Robustness

Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.

ART (🥇34 · ⭐ 5.5K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
TextAttack (🥈26 · ⭐ 3.2K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
Show 7 hidden projects...
GPU & Accelerator Utilities

Libraries that require and make use of CUDA/GPU or other accelerator hardware capabilities to optimize machine learning tasks.

optimum (🥇33 · ⭐ 3K · 📉) - Accelerate inference and training of Transformers, Diffusers, TIMM.. Apache-2
cuDF (🥇32 · ⭐ 9.1K) - cuDF - GPU DataFrame Library. Apache-2
Apex (🥇32 · ⭐ 8.8K) - A PyTorch Extension: Tools for easy mixed precision and distributed.. BSD-3
PyCUDA (🥈31 · ⭐ 2K) - CUDA integration for Python, plus shiny features. MIT
cuML (🥈29 · ⭐ 4.9K) - cuML - RAPIDS Machine Learning Library. Apache-2
cuGraph (🥈28 · ⭐ 2K) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
CuPy (🥈27 · ⭐ 10K) - NumPy & SciPy for GPU. MIT
ArrayFire (🥈26 · ⭐ 4.8K) - ArrayFire: a general purpose GPU library. BSD-3
gpustat (🥈26 · ⭐ 4.2K) - A simple command-line utility for querying and monitoring GPU status. MIT
DALI (🥉25 · ⭐ 5.5K) - A GPU-accelerated library containing highly optimized building blocks.. Apache-2
Vulkan Kompute (🥉21 · ⭐ 2.3K) - General purpose GPU compute framework built on Vulkan to.. Apache-2
Show 9 hidden projects...
Tensorflow Utilities

Libraries that extend TensorFlow with additional capabilities.

TensorFlow Datasets (🥇35 · ⭐ 4.5K) - TFDS is a collection of datasets ready to use with.. Apache-2
tensorflow-hub (🥇29 · ⭐ 3.5K · 💀) - A library for transfer learning by reusing parts of.. Apache-2
TFX (🥈27 · ⭐ 2.2K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
TensorFlow Transform (🥉23 · ⭐ 990) - Input pipeline framework. Apache-2
TensorFlow I/O (🥉23 · ⭐ 730) - Dataset, streaming, and file system extensions.. Apache-2
TF Model Optimization (🥉22 · ⭐ 1.5K) - A toolkit to optimize ML models for deployment for.. Apache-2
Neural Structured Learning (🥉21 · ⭐ 1K · 💀) - Training neural models with structured signals. Apache-2
TensorFlow Cloud (🥉18 · ⭐ 380 · 💀) - The TensorFlow Cloud repository provides APIs that.. Apache-2
TF Compression (🥉17 · ⭐ 900) - Data compression in TensorFlow. Apache-2
Show 7 hidden projects...
Jax Utilities

Libraries that extend Jax with additional capabilities.

equinox (🥇27 · ⭐ 2.5K) - Elegant easy-to-use neural networks + scientific computing in.. Apache-2
Show 2 hidden projects...
Sklearn Utilities

Libraries that extend scikit-learn with additional capabilities.

scikit-learn-intelex (🥇34 · ⭐ 1.3K) - Extension for Scikit-learn is a seamless way to speed.. Apache-2
imbalanced-learn (🥈30 · ⭐ 7K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
MLxtend (🥈30 · ⭐ 5.1K) - A library of extension and helper modules for Python's data.. BSD-3
category_encoders (🥈30 · ⭐ 2.5K) - A library of sklearn compatible categorical variable.. BSD-3
scikit-lego (🥈26 · ⭐ 1.4K) - Extra blocks for scikit-learn pipelines. MIT
iterative-stratification (🥉17 · ⭐ 880 · 💀) - scikit-learn cross validators for iterative.. BSD-3
dabl (🥉15 · ⭐ 730 · 💀) - Data Analysis Baseline Library. BSD-3
Show 12 hidden projects...
Pytorch Utilities

Libraries that extend Pytorch with additional capabilities.

accelerate (🥇40 · ⭐ 9K) - A simple way to launch, train, and use PyTorch models on.. Apache-2
tinygrad (🥇33 · ⭐ 30K) - You like pytorch? You like micrograd? You love tinygrad!. MIT
PML (🥇27 · ⭐ 6.2K · 💀) - The easiest way to use deep metric learning in your application... MIT
torchdiffeq (🥇27 · ⭐ 6.1K) - Differentiable ODE solvers with full GPU support and.. MIT
torch-scatter (🥈26 · ⭐ 1.7K) - PyTorch Extension Library of Optimized Scatter Operations. MIT
torchsde (🥈25 · ⭐ 1.7K · 💀) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
PyTorch Sparse (🥈25 · ⭐ 1.1K) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
Pytorch Toolbelt (🥉19 · ⭐ 1.6K) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
pytorchviz (🥉14 · ⭐ 3.4K · 💀) - A small package to create visualizations of PyTorch execution.. MIT
madgrad (🥉14 · ⭐ 800 · 💀) - MADGRAD Optimization Method. MIT
Show 22 hidden projects...
Database Clients

Libraries for connecting to, operating, and querying databases.

🔗 best-of-python - DB Clients (⭐ 4.1K) - Collection of database clients for python.
Others

scipy (🥇48 · ⭐ 14K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
SymPy (🥇47 · ⭐ 14K) - A computer algebra system written in pure Python. BSD-3
Streamlit (🥇40 · ⭐ 41K · 📈) - Streamlit: a faster way to build and share data apps. Apache-2
Gradio (🥇38 · ⭐ 39K) - Wrap UIs around any model, share with anyone. Apache-2
PennyLane (🥇35 · ⭐ 2.8K) - PennyLane is a cross-platform Python library for quantum.. Apache-2
PyOD (🥇34 · ⭐ 9.4K) - A Python Library for Outlier and Anomaly Detection, Integrating Classical.. BSD-2
Autograd (🥇34 · ⭐ 7.3K) - Efficiently computes derivatives of NumPy code. MIT
DeepChem (🥈33 · ⭐ 6.1K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry,.. MIT
carla (🥈32 · ⭐ 13K) - Open-source simulator for autonomous driving research. MIT
Pythran (🥈32 · ⭐ 2.1K) - Ahead of Time compiler for numeric kernels. BSD-3
Datasette (🥈31 · ⭐ 10K) - An open source multi-tool for exploring and publishing data. Apache-2
datalad (🥈31 · ⭐ 590) - Keep code, data, containers under control with git and git-annex. MIT
pyjanitor (🥈30 · ⭐ 1.4K) - Clean APIs for data cleaning. Python implementation of R package.. MIT
agate (🥈30 · ⭐ 1.2K) - A Python data analysis library that is optimized for humans instead of.. MIT
anomalib (🥈29 · ⭐ 4.8K) - An anomaly detection library comprising state-of-the-art algorithms.. Apache-2
hdbscan (🥈29 · ⭐ 3K · 📉) - A high performance implementation of HDBSCAN clustering. BSD-3
River (🥈28 · ⭐ 5.5K) - Online machine learning in Python. BSD-3
tensorly (🥈28 · ⭐ 1.6K) - TensorLy: Tensor Learning in Python. BSD-2
dstack (🥈27 · ⭐ 1.9K) - dstack is an open-source control plane for running development,.. MPL-2.0
Prince (🥈26 · ⭐ 1.4K) - Multivariate exploratory data analysis in Python: PCA, CA, MCA, MFA,.. MIT
causalml (🥈25 · ⭐ 5.5K) - Uplift modeling and causal inference with machine learning.. Apache-2
Trax (🥉24 · ⭐ 8.3K) - Trax: Deep Learning with Clear Code and Speed. Apache-2
adapter-transformers (🥉24 · ⭐ 2.8K) - A Unified Library for Parameter-Efficient and Modular.. Apache-2 huggingface
gplearn (🥉24 · ⭐ 1.7K) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
TabPy (🥉24 · ⭐ 1.6K · 💀) - Execute Python code on the fly and display results in Tableau.. MIT
metric-learn (🥉24 · ⭐ 1.4K · 💀) - Metric learning algorithms in Python. MIT
AugLy (🥉23 · ⭐ 5K) - A data augmentations library for audio, image, text, and video. MIT
pycm (🥉23 · ⭐ 1.5K) - Multi-class confusion matrix library in Python. MIT
avalanche (🥉22 · ⭐ 1.9K) - Avalanche: an End-to-End Library for Continual Learning based on.. MIT
MONAILabel (🥉22 · ⭐ 750) - MONAI Label is an intelligent open source image labeling and.. Apache-2
findspark (🥉22 · ⭐ 520) - Find pyspark to make it importable. BSD-3
Feature Engine (🥉21 · ⭐ 2K · 💀) - Feature engineering package with sklearn like functionality. BSD-3
BioPandas (🥉21 · ⭐ 730 · 💀) - Working with molecular structures in pandas DataFrames. BSD-3
pykale (🥉21 · ⭐ 470) - Knowledge-Aware machine LEarning (KALE): accessible machine learning.. MIT
PaddleHub (🥉19 · ⭐ 13K · 💀) - 400+ AI Models: Rich, high-quality AI models, including.. Apache-2
apricot (🥉19 · ⭐ 510) - apricot implements submodular optimization for the purpose of selecting.. MIT
SUOD (🥉18 · ⭐ 390) - (MLSys'21) An Acceleration System for Large-scale Unsupervised Heterogeneous.. BSD-2
pymdp (🥉14 · ⭐ 550) - A Python implementation of active inference for Markov Decision Processes. MIT
NeuralCompression (🥉13 · ⭐ 570 · 💀) - A collection of tools for neural compression enthusiasts. MIT
Show 27 hidden projects...
Contribution

Contributions are encouraged and always welcome! If you would like to add or update projects, you can open an issue, submit a pull request, or directly edit the projects.yaml.

If you would like to contribute to, or share suggestions regarding, the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you would like to create your own best-of list, we recommend following this guide.

For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.

