Skip to content
View risarora's full-sized avatar

Block or report risarora

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 37 26 Updated Dec 8, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,710 174 Updated Nov 28, 2023

A list of developer portfolios for your inspiration

7,794 1,715 Updated Dec 17, 2024

This is a public repository to go over all the LLM-driven data engineering concepts.

Python 912 148 Updated Oct 26, 2024
Python 3 Updated Sep 19, 2023

📨 The ultimate social media scheduling tool, with a bunch of AI 🤖

TypeScript 14,302 2,454 Updated Dec 18, 2024

Open-source AI video dubbing studio that costs $0.1/min(~20x cheaper than alternatives like Elevenlabs, Rask or Speechify)

TypeScript 161 13 Updated Nov 5, 2024

Notes on books I read, talks I watch, articles I study, and papers I love

SCSS 5,549 1,200 Updated Jan 2, 2024

step by step guide for aws mini labs. Currently maintained on : https://github.com/Cloud-Yeti/aws-labs Youtube playlist for labs:

HCL 204 842 Updated Aug 21, 2019

Code to download and process SF housing sales data

R 32 12 Updated Sep 29, 2009

A complete computer science study plan to become a software engineer.

307,935 77,158 Updated Dec 5, 2024

LLM training in simple, raw C/CUDA

Cuda 24,738 2,803 Updated Oct 2, 2024

Extract Keywords from sentence or Replace keywords in sentences.

Python 5,601 601 Updated Jul 3, 2024

A guide to available tools and platforms for developing on Ethereum.

5,371 1,339 Updated Dec 12, 2024

Data Engineering Project to Extract and Process Solana Reddit Data

Python 23 2 Updated Feb 3, 2024

Preparation links and resources for system design questions

8,873 2,476 Updated May 10, 2024

WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of hetero…

Scala 30 11 Updated Jul 4, 2024

Simple UI for LLM Model Finetuning

Jupyter Notebook 2,046 132 Updated Dec 21, 2023

My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on ​lambda architecture​, that aggregates Twitter and US stock market data for user sentiment anal…

Scala 492 128 Updated Aug 24, 2022

Troubleshooting Apache Spark [Video] Published by Packt

Scala 6 12 Updated Jan 30, 2023

Practical Data Engineering: A Hands-On Real-Estate Project Guide

Jupyter Notebook 565 92 Updated Sep 3, 2024

Screener for Stocks with Relative Strength New Highs & New Highs before Price, sorted by RS Score

18 8 Updated Mar 19, 2021

Data Engineering Take Home

7 13 Updated Jun 6, 2020

Enterprise-grade, production-hardened, serverless data lake on AWS

Python 424 140 Updated Dec 6, 2024

explaining sql levels based on one meme

98 9 Updated Aug 27, 2023

This provides the contents for AWS Data Lake Handson in both Japanese and English.

115 59 Updated May 26, 2023

A deployable reference implementation intended to address pain points around conceptualizing data lake architectures that automatically configures the core AWS services necessary to easily tag, sea…

JavaScript 401 160 Updated Jun 3, 2024

Data Engineering with Spark and Delta Lake

TSQL 92 76 Updated Jan 18, 2023
Next