Skip to content
View sauravsp21's full-sized avatar

Highlights

  • Pro

Block or report sauravsp21

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 1 17 Updated Oct 14, 2016

Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extracts data from S3, transform data using spark, load transforme…

Python 9 4 Updated Jul 12, 2021

Apache Spark notebook to perform ETL processes on OpenMRS data

Python 5 2 Updated Apr 10, 2018

🚚 ETL for Spark and Airflow

Python 24 6 Updated Mar 19, 2018

Spark data pipeline that processes movie ratings data.

Python 27 12 Updated Jan 6, 2025

This is a simple ETL using Airflow. First, we fetch data from API (extract). Then, we drop unused columns, convert to CSV, and validate (transform). Finally, we load the transformed data to databas…

Python 23 9 Updated Oct 12, 2019

Airflow DAGs for the Stellar ETL project

Python 35 19 Updated Jan 6, 2025

Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3

Python 90 58 Updated Nov 22, 2021

Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.

Python 304 53 Updated Jan 12, 2022

Real-time ETL pipeline for financial data (kafka, pyspark) .

Python 8 1 Updated Dec 31, 2022

Blockchain ETL Architecture

47 15 Updated Oct 10, 2022

Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery https://towardsdatascience.com/how-to-get-any-ethereum-smart-cont…

Python 408 194 Updated Dec 27, 2024

A docker image and kubernetes config files to run Airflow on Kubernetes

Python 655 204 Updated Jul 19, 2019

PySpark functions and utilities with examples. Assists ETL process of data modeling

Jupyter Notebook 99 76 Updated Dec 3, 2020
PowerShell 27 15 Updated Mar 7, 2022

Power Pop Health is a collection of content intended to simplify the process of ingesting and prepping Healthcare Open Data using Azure data tools and Power BI. Moving forward the overarching theme…

18 4 Updated May 23, 2022

Udacity Data Engineering Nano Degree (DEND)

Jupyter Notebook 186 168 Updated Jan 20, 2020

Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in…

Python 2,983 863 Updated Dec 19, 2024

This project covers the implementation of dockerizing a python flask based credit risk assessment calculator web application integrated with two different deep learning and transfer learning based …

HTML 3 1 Updated May 19, 2022
Jupyter Notebook 11 8 Updated Sep 30, 2022

A pipeline to CI/CD of a machine learning model on Google Cloud Run

Python 31 7 Updated May 1, 2023

This repo provides step to step guide to build CI/CD Pipeline on Azure ML

Jupyter Notebook 6 23 Updated Mar 1, 2021

This is an AWS MLE and MLOps Bank Customers Churn Prediction Project.

Jupyter Notebook 3 1 Updated May 1, 2023

ZenML 🙏: The bridge between ML and Ops. https://zenml.io.

Python 4,300 463 Updated Jan 7, 2025

A collection of full time roles in SWE, Quant, and PM for new grads.

11,893 1,070 Updated Jan 7, 2025

Model to segment 3D MRI images using a 3D UNET based FCN architecture and convert it to a surface mesh. Please see the link below for the full paper.

Python 1 Updated Jul 23, 2021

Computer vision deep learning with medical images, MSc DS final project

Python 1 1 Updated Sep 12, 2021