- Shanghai Jiao Tong University
- Shanghai
- https://scholar.google.com/citations?user=6aARLhMAAAAJ&hl=zh-CN
Starred repositories
Fully open reproduction of DeepSeek-R1
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Mixture-of-Experts for Large Vision-Language Models
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal AI, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Ongoing research on training transformer models at scale
Example models using DeepSpeed
Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)
An open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
Accelerating the development of large multimodal models (LMMs) with a one-click evaluation module, lmms-eval.
A collection of papers on autoregressive models in vision.
This repository contains a reading list of papers on Time Series Forecasting/Prediction (TSF) and Spatio-Temporal Forecasting/Prediction (STF). These papers are mainly categorized according to the …
[ICLR 2025 Spotlight] Official implementation of "Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts"
[TMLR] Public code repo for the paper "A Single Transformer for Scalable Vision-Language Modeling"
This repository is for the paper "From News to Forecast: Integrating Event Analysis in LLM-based Time Series Forecasting with Reflection" (NeurIPS 2024)
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Official PyTorch implementation of our CVPR 2023 paper "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"
PyTorch implementation of MAE: https://arxiv.org/abs/2111.06377
CVPR 2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
Unified Training of Universal Time Series Forecasting Transformers
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".