Skip to content
View romitjain's full-sized avatar

Organizations

@cmeraki

Block or report romitjain

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GPU programming related news and material links

1,275 75 Updated Sep 23, 2024

[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)

1,197 139 Updated Dec 16, 2024

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 5,052 884 Updated Mar 5, 2024

Experiments with Model Training, Deployment & Monitoring

Python 38 4 Updated Nov 29, 2024

A fast multimodal LLM for real-time voice

Python 1,627 111 Updated Dec 12, 2024

first base model for full-duplex conversational audio

Python 1,656 106 Updated Nov 12, 2024

A list of Free Software network services and web applications which can be hosted on your own servers

208,001 9,892 Updated Dec 18, 2024

A bibliography and survey of the papers surrounding o1

TeX 935 37 Updated Nov 16, 2024

Making Long-Context LLM Inference 10x Faster and 10x Cheaper

Python 294 35 Updated Dec 17, 2024

Build real-time multimodal AI applications 🤖🎙️📹

Python 4,240 474 Updated Dec 19, 2024
Python 6,999 549 Updated Dec 19, 2024

A list of awesome resources for tmux

7,893 304 Updated Nov 19, 2024

Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.

Python 689 65 Updated Aug 22, 2024

Efficient Triton Kernels for LLM Training

Python 3,881 230 Updated Dec 19, 2024

Audio tokenization, in the fastest way possible!

Python 46 3 Updated Aug 26, 2024
Python 60 18 Updated Nov 7, 2024

A minimal implementation of vllm.

Cuda 30 Updated Jul 27, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,572 160 Updated Dec 19, 2024

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,425 305 Updated Oct 19, 2024

Applied AI experiments and examples for PyTorch

Python 188 16 Updated Dec 17, 2024

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,774 176 Updated Nov 18, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,874 443 Updated Dec 18, 2024

MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,190 67 Updated Nov 27, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,055 223 Updated Dec 12, 2024

A network filesystem client to connect to SSH servers

C 6,094 501 Updated Nov 29, 2024

Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on infer…

185 7 Updated Dec 14, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,206 4,441 Updated Aug 16, 2024

Instant voice cloning by MIT and MyShell.

Python 30,138 2,977 Updated Dec 12, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,404 57 Updated Aug 15, 2024

A simple tutorial of Variational AutoEncoders with Pytorch

Jupyter Notebook 341 77 Updated Feb 15, 2024
Next