Skip to content
View MayankAgarwal's full-sized avatar

Organizations

@IBM

Block or report MayankAgarwal

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reference implementation for DPO (Direct Preference Optimization)

Python 2,267 187 Updated Aug 11, 2024
Python 2,165 244 Updated Dec 20, 2024

A bibliography and survey of the papers surrounding o1

TeX 937 37 Updated Nov 16, 2024

Efficient Triton Kernels for LLM Training

Python 3,892 230 Updated Dec 20, 2024

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,699 2,270 Updated Dec 19, 2024

Matplotlib style sheets to nicely format figures for scientific papers, thesis and presentations while keeping them fully editable in Adobe Illustrator.

Python 854 36 Updated Mar 25, 2024

Matplotlib styles for scientific plotting

Python 7,258 714 Updated Oct 12, 2024

A collaborative catalog of NLP resources for Indic languages

565 80 Updated Dec 14, 2024

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

Python 659 52 Updated Sep 4, 2024

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

688 91 Updated Jul 9, 2024

🌐 Wikipedia for Web APIs. Directory of REST API definitions in OpenAPI 2.0/3.x format

3,907 582 Updated Jul 28, 2024

LLM inference in C/C++

C++ 69,519 10,026 Updated Dec 21, 2024

Minimalistic large language model 3D-parallelism training

Python 1,343 133 Updated Dec 19, 2024

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

1,848 115 Updated Dec 20, 2024

Machine Learning Engineering Open Book

Python 12,034 732 Updated Dec 20, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 35,629 4,419 Updated Nov 18, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,088 881 Updated Sep 1, 2024

Structured Text Generation

Python 10,106 526 Updated Dec 20, 2024

Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and Biology

Python 5,605 1,702 Updated Dec 18, 2024

PyTorch package to train and audit ML models for Individual Fairness

Python 63 7 Updated Sep 14, 2023

Publicly available structured COVID-19 data from India, extracted automatically from daily health bulletins published by state governments.

Python 21 8 Updated Apr 14, 2022

Latex code for making neural networks diagrams

TeX 22,409 2,889 Updated Aug 21, 2023

C++ Implementation of PyTorch Tutorials for Everyone

C++ 1,978 263 Updated May 6, 2024

A library for federated learning (a distributed machine learning process) in an enterprise environment.

Python 500 137 Updated Aug 1, 2023

resources about federated learning and privacy in machine learning

526 95 Updated Jun 26, 2024

Save Jupyter Notebooks as PDF

Jupyter Notebook 371 73 Updated May 27, 2024

Command Line Artificial Intelligence or CLAI is an open-sourced project from IBM Research aimed to bring the power of AI to the command line interface.

Python 479 75 Updated Mar 17, 2023

Various tutorials given for welcoming new students at MILA.

Jupyter Notebook 986 210 Updated Jun 27, 2018