Skip to content
View jbegaint's full-sized avatar

Block or report jbegaint

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 789 60 Updated Feb 9, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

437 6 Updated Feb 15, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,363 234 Updated Feb 19, 2025

A tool for visualizing and communicating the errors in rendered images.

C++ 541 43 Updated Jan 9, 2025

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

245 13 Updated Jan 25, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,549 1,076 Updated Feb 20, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,499 1,353 Updated Feb 1, 2025

LLM inference in C/C++

C++ 74,947 10,828 Updated Feb 21, 2025

Vim plugin for LLM-assisted code/text completion

Vim Script 1,193 27 Updated Feb 20, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,270 149 Updated Sep 3, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,548 479 Updated Feb 12, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 789 57 Updated Feb 17, 2025

PyTorch video decoding

Python 247 22 Updated Feb 21, 2025

Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.

Swift 4,891 304 Updated Jan 27, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 128,417 10,460 Updated Feb 22, 2025

Vim plugin for integrating Ollama based LLM (large language models)

Vim Script 123 16 Updated Feb 18, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,000 96 Updated Jan 2, 2025

NanoGPT (124M) in 3 minutes

Python 2,303 246 Updated Feb 21, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 8,678 702 Updated Feb 20, 2025

Truly independent web browser

C++ 28,147 1,220 Updated Feb 21, 2025

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 832 59 Updated Nov 22, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,185 57 Updated Nov 22, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,437 241 Updated Feb 20, 2025

first base model for full-duplex conversational audio

Python 1,707 112 Updated Jan 5, 2025

A MLX port of FLUX based on the Huggingface Diffusers implementation.

Python 1,216 75 Updated Feb 19, 2025

On-device Speech Recognition for Android

C++ 59 3 Updated Feb 20, 2025

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,716 3,717 Updated Aug 6, 2024

A Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps

Python 676 118 Updated Jan 23, 2025

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 7,176 1,242 Updated Jul 21, 2024

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICLR 2025)

Python 832 40 Updated Jan 28, 2025
Next