Lists (2)
Sort Name ascending (A-Z)
Starred repositories
A quick guide (especially) for trending instruction finetuning datasets
A list of developer portfolios for your inspiration
This is a public repository to go over all the LLM-driven data engineering concepts.
📨 The ultimate social media scheduling tool, with a bunch of AI 🤖
Open-source AI video dubbing studio that costs $0.1/min(~20x cheaper than alternatives like Elevenlabs, Rask or Speechify)
Notes on books I read, talks I watch, articles I study, and papers I love
step by step guide for aws mini labs. Currently maintained on : https://github.com/Cloud-Yeti/aws-labs Youtube playlist for labs:
A complete computer science study plan to become a software engineer.
Extract Keywords from sentence or Replace keywords in sentences.
A guide to available tools and platforms for developing on Ethereum.
Data Engineering Project to Extract and Process Solana Reddit Data
Preparation links and resources for system design questions
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of hetero…
Simple UI for LLM Model Finetuning
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggregates Twitter and US stock market data for user sentiment anal…
Troubleshooting Apache Spark [Video] Published by Packt
Practical Data Engineering: A Hands-On Real-Estate Project Guide
Screener for Stocks with Relative Strength New Highs & New Highs before Price, sorted by RS Score
Enterprise-grade, production-hardened, serverless data lake on AWS
This provides the contents for AWS Data Lake Handson in both Japanese and English.
A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically configures the core AWS services necessary to easily tag, sea…
Data Engineering with Spark and Delta Lake