- Chile
Stars
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
All Algorithms implemented in Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Turn (almost) any Python command line program into a full GUI application with one line
Best Practices on Recommendation Systems
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
nannyml: post-deployment data science in python
The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks
A curated list of gradient boosting research papers with implementations.
GRU4Rec is the original Theano implementation of the algorithm in "Session-based Recommendations with Recurrent Neural Networks" paper, published at ICLR 2016 and its follow-up "Recurrent Neural Ne…
Exploring word2vec embeddings as a graph of nearest neighbors
Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc
Repository for Project Insight: NLP as a Service
Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low…
manipulate pandas dataframes from the comfort of your browser
Build and deploy a serverless data pipeline on AWS with no effort.
Willump Is a Low-Latency Useful Machine learning Platform.
Charla de web scraping sobre datos públicos de Chile
Template for python project with continuous integration in Azure