- London, United Kingdon
Stars
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environment, using datasources and alerting logic you define.
Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Simple and powerful factories for mock data generation
Turbine: the bare metals that gets you Airflow
Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (including HDFS, Hive, Presto, MySQL, etc).
Augment Beancount importers with machine learning functionality.
SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features