soran-ghaderi

Highlights

  • Pro

Organizations

@appheap @bi-graph @tensorops

Train transformer language models with reinforcement learning.

Python 10,894 1,436 Updated Jan 31, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,091 1,051 Updated Jan 28, 2025

PyTorch library for solving imaging inverse problems using deep learning

Python 363 78 Updated Jan 28, 2025
Python 20 3 Updated Jan 28, 2025

Boltzmann Generators and Normalizing Flows in PyTorch

Jupyter Notebook 155 39 Updated Jan 30, 2024

PyTorch implementation of normalizing flow models

Python 775 114 Updated Aug 25, 2024

POT: Python Optimal Transport

Python 2,477 510 Updated Jan 27, 2025

A simple implementation of Bayesian Flow Networks (BFN)

Jupyter Notebook 240 15 Updated Jan 4, 2024

TorchCFM: a Conditional Flow Matching library

Python 1,433 121 Updated Jan 24, 2025

Reference implementation for Deep Unsupervised Learning using Nonequilibrium Thermodynamics

Python 456 70 Updated Feb 28, 2017
Python 50 5 Updated Jun 14, 2024

Bridging deep learning and logical reasoning using a differentiable satisfiability solver.

Python 409 52 Updated Nov 22, 2022

A PyTorch native library for large model training

Python 3,220 258 Updated Jan 31, 2025

[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, this project compresses the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Python 4,835 271 Updated Jan 26, 2025

[ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

Python 105 6 Updated Dec 4, 2024

awesome synthetic (text) datasets

Jupyter Notebook 256 11 Updated Oct 29, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,111 69 Updated Jul 14, 2024

Tools for merging pretrained large language models.

Python 5,170 485 Updated Jan 25, 2025

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 227 8 Updated Jan 27, 2025

A framework for few-shot evaluation of language models.

Python 7,606 2,047 Updated Jan 30, 2025

Efficient Triton Kernels for LLM Training

Python 4,261 249 Updated Jan 30, 2025

A throughput-oriented high-performance serving framework for LLMs

Cuda 714 29 Updated Sep 21, 2024

[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Python 505 28 Updated Mar 7, 2024

AIOS: AI Agent Operating System

Python 3,746 460 Updated Jan 28, 2025

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Python 1,412 89 Updated Sep 7, 2023

Official inference repo for FLUX.1 models

Python 19,828 1,390 Updated Jan 9, 2025

Accelerated First Order Parallel Associative Scan

Python 170 8 Updated Aug 20, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,840 1,382 Updated Dec 25, 2024

The Memory layer for AI Agents

Python 24,238 2,249 Updated Jan 31, 2025