Skip to content
View iair's full-sized avatar
  • Chile

Block or report iair

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
29 stars written in Python
Clear filter

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 290,710 48,337 Updated Dec 2, 2024

All Algorithms implemented in Python

Python 197,828 46,329 Updated Mar 3, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 38,992 14,754 Updated Mar 3, 2025

Turn (almost) any Python command line program into a full GUI application with one line

Python 20,970 1,025 Updated Feb 21, 2024

Best Practices on Recommendation Systems

Python 19,847 3,165 Updated Feb 12, 2025

The Data Engineering Cookbook

Python 14,081 2,569 Updated Dec 11, 2024

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,194 929 Updated Mar 3, 2025

Python SQL Parser and Transpiler

Python 7,223 787 Updated Mar 3, 2025

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Python 3,550 239 Updated Sep 18, 2024

nannyml: post-deployment data science in python

Python 2,035 152 Updated Jan 14, 2025

The Python code to reproduce the illustrations from The Hundred-Page Machine Learning Book.

Python 1,889 569 Updated Jun 27, 2024

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,722 199 Updated Feb 25, 2025

DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks

Python 1,264 135 Updated Mar 2, 2023

Predictive Power Score (PPS) in Python

Python 1,124 170 Updated Jan 7, 2025

A curated list of gradient boosting research papers with implementations.

Python 1,013 157 Updated Mar 16, 2024

GRU4Rec is the original Theano implementation of the algorithm in "Session-based Recommendations with Recurrent Neural Networks" paper, published at ICLR 2016 and its follow-up "Recurrent Neural Ne…

Python 769 223 Updated Aug 24, 2023

Exploring word2vec embeddings as a graph of nearest neighbors

Python 707 93 Updated Dec 3, 2020

Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc

Python 385 22 Updated Jun 30, 2024

Repository for Project Insight: NLP as a Service

Python 303 46 Updated Feb 14, 2023

Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)

Python 285 24 Updated Feb 14, 2025
Python 268 32 Updated Nov 26, 2024

A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low…

Python 242 37 Updated May 12, 2024

manipulate pandas dataframes from the comfort of your browser

Python 171 24 Updated Aug 30, 2021

Build and deploy a serverless data pipeline on AWS with no effort.

Python 111 19 Updated Feb 8, 2023

Willump Is a Low-Latency Useful Machine learning Platform.

Python 44 8 Updated Mar 24, 2023
Python 41 6 Updated Jan 29, 2022
Python 19 2 Updated Oct 10, 2020

Charla de web scraping sobre datos públicos de Chile

Python 8 3 Updated Dec 8, 2022

Template for python project with continuous integration in Azure

Python 4 2 Updated Feb 18, 2023