jbegaint

Jean Bégaint jbegaint

ML for video

50 followers · 127 following

Achievements

x3 x2

Achievements

x3 x2

Stars

kyutai-labs / hibiki

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 789 60 Updated Feb 9, 2025

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

437 6 Updated Feb 15, 2025

Saiyan-World / goku

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,363 234 Updated Feb 19, 2025

NVlabs / flip

A tool for visualizing and communicating the errors in rendered images.

C++ 541 43 Updated Jan 9, 2025

ziqihuangg / Awesome-Evaluation-of-Visual-Generation

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

245 13 Updated Jan 25, 2025

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,549 1,076 Updated Feb 20, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,499 1,353 Updated Feb 1, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 74,947 10,828 Updated Feb 21, 2025

ggml-org / llama.vim

Vim plugin for LLM-assisted code/text completion

Vim Script 1,193 27 Updated Feb 20, 2025

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,270 149 Updated Sep 3, 2024

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,548 479 Updated Feb 12, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 789 57 Updated Feb 17, 2025

pytorch / torchcodec

PyTorch video decoding

Python 247 22 Updated Feb 21, 2025

gluonfield / enchanted

Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.

Swift 4,891 304 Updated Jan 27, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 128,417 10,460 Updated Feb 22, 2025

gergap / vim-ollama

Vim plugin for integrating Ollama based LLM (large language models)

Vim Script 123 16 Updated Feb 18, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,000 96 Updated Jan 2, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 2,303 246 Updated Feb 21, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,678 702 Updated Feb 20, 2025

LadybirdBrowser / ladybird

Truly independent web browser

C++ 28,147 1,220 Updated Feb 21, 2025

apple / ml-mobileclip

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 832 59 Updated Nov 22, 2024

apple / ml-aim

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,185 57 Updated Nov 22, 2024

facebookresearch / lingua

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,437 241 Updated Feb 20, 2025

Standard-Intelligence / hertz-dev

first base model for full-duplex conversational audio

Python 1,707 112 Updated Jan 5, 2025

filipstrand / mflux

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Python 1,216 75 Updated Feb 19, 2025

argmaxinc / WhisperKitAndroid

On-device Speech Recognition for Android

C++ 59 3 Updated Feb 20, 2025

xinntao / Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,716 3,717 Updated Aug 6, 2024

HasnainRaz / Fast-SRGAN

A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps

Python 676 118 Updated Jan 23, 2025

XPixelGroup / BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,176 1,242 Updated Jul 21, 2024

sihyun-yu / REPA

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 832 40 Updated Jan 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jean Bégaint jbegaint

Achievements

Achievements

Block or report jbegaint

Stars

kyutai-labs / hibiki

ML-GSAI / LLaDA

Saiyan-World / goku

NVlabs / flip

ziqihuangg / Awesome-Evaluation-of-Visual-Generation

QwenLM / Qwen2.5

Jiayi-Pan / TinyZero

ggml-org / llama.cpp

ggml-org / llama.vim

THUDM / CogVLM2

NVIDIA / Cosmos

huggingface / picotron

pytorch / torchcodec

gluonfield / enchanted

ollama / ollama

gergap / vim-ollama

facebookresearch / flow_matching

KellerJordan / modded-nanogpt

Tencent / HunyuanVideo

LadybirdBrowser / ladybird

apple / ml-mobileclip

apple / ml-aim

facebookresearch / lingua

Standard-Intelligence / hertz-dev

filipstrand / mflux

argmaxinc / WhisperKitAndroid

xinntao / Real-ESRGAN

HasnainRaz / Fast-SRGAN

XPixelGroup / BasicSR

sihyun-yu / REPA