Stars
Voilà turns Jupyter notebooks into standalone web applications
Scalable Python DS & ML, in an API-compatible and lightning-fast way.
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Full Notes of Andrew Ng's Coursera Machine Learning.
This repository contains my personal notes and summaries on the DeepLearning.ai specialization courses. I've enjoyed every little bit of the course and hope you enjoy my notes too.
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Modin: Scale your Pandas workflows by changing a single line of code
Google Research
PyTorch implementation of the TabNet paper: https://arxiv.org/pdf/1908.07442.pdf
An open-source, low-code machine learning library in Python
Makes interactive chart widgets, cleans raw data, runs baseline models, and supports interactive hyperparameter tuning and tracking
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
A very simple framework for state-of-the-art Natural Language Processing (NLP)
One line for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A drop-in replacement for Scikit-Learn’s GridSearchCV / RandomizedSearchCV, but with cutting-edge hyperparameter tuning techniques.
A library for debugging/inspecting machine learning classifiers and explaining their predictions
Feature selector is a tool for dimensionality reduction of machine learning datasets
scikit-learn: machine learning in Python
Pandas integration with sklearn
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Automated Machine Learning with scikit-learn
Python implementations of the Boruta all-relevant feature selection method.
Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more
Python package for publishing Jupyter Notebooks as Medium blogposts
Automatic extraction of relevant features from time series
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀