Skip to content
View jarobe42's full-sized avatar
  • London, United Kingdon

Block or report jarobe42

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
14 stars written in Python
Clear filter

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,129 914 Updated Jan 23, 2025

A next-generation curated knowledge sharing platform for data scientists and other technical professions.

Python 5,497 690 Updated Sep 4, 2024

StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environment, using datasources and alerting logic you define.

Python 2,861 334 Updated Oct 23, 2023

Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment

Python 2,787 1,253 Updated Nov 15, 2024

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

Python 2,081 102 Updated Dec 15, 2023

Fava - web interface for Beancount

Python 2,045 295 Updated Jan 25, 2025

Simple and powerful factories for mock data generation

Python 1,108 88 Updated Jan 24, 2025

Manage AWS MFA Security Credentials

Python 1,034 169 Updated Aug 8, 2024

Turbine: the bare metals that gets you Airflow

Python 377 69 Updated Oct 10, 2021

Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform

Python 261 58 Updated Jul 19, 2023

A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (including HDFS, Hive, Presto, MySQL, etc).

Python 256 46 Updated Jun 1, 2024

Augment Beancount importers with machine learning functionality.

Python 256 31 Updated Jan 6, 2025

SQL upsert using pandas DataFrames for PostgreSQL, SQlite and MySQL with extra features

Python 228 15 Updated Dec 6, 2023