Stars
A curated list of awesome Machine Learning frameworks, libraries and software.
Streamlit — A faster way to build and share data apps.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Code for the paper "Language Models are Unsupervised Multitask Learners"
Open source platform for the machine learning lifecycle
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Advanced Python Mastery (course by @dabeaz)
Statsmodels: statistical modeling and econometrics in Python
Practical Python Programming (course by @dabeaz)
Open source annotation tool for machine learning practitioners.
Natural Language Processing Best Practices & Examples
A curated list of data science blogs
A system for quickly generating training data with weak supervision
The Open Source Feature Store for Machine Learning
Probabilistic time series modeling in Python
🛁 Clean Code concepts adapted for Python
Visual analysis and diagnostic tools to facilitate machine learning model selection.
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
Code samples from the "Python Cookbook, 3rd Edition", published by O'Reilly & Associates, May, 2013.
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch…