Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,980 260 Updated Dec 14, 2024

poloclub / diffusiondb

A large-scale text-to-image prompt gallery dataset based on Stable Diffusion

Python 1,221 68 Updated Jul 11, 2024

CLAY-3D / OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

849 11 Updated Jun 21, 2024

j-min / DSG

Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

Jupyter Notebook 78 5 Updated Dec 9, 2024

Yushi-Hu / tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Python 140 10 Updated Apr 29, 2024

donahowe / AutoStudio

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook 417 31 Updated Oct 31, 2024

layer6ai-labs / fusemix

Data-Efficient Multimodal Fusion on a Single GPU

Python 48 7 Updated May 7, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,108 88 Updated Aug 6, 2024

Kwai-Kolors / Kolors

Kolors Team

Python 4,000 289 Updated Nov 13, 2024

AiuniAI / Unique3D

[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 3,134 252 Updated Sep 18, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,796 117 Updated Oct 30, 2024

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,165 147 Updated Sep 3, 2024

buaacyw / MeshAnything

From anything to mesh like human artists. Official implementation of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Python 2,068 91 Updated Aug 5, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,635 5,488 Updated Dec 14, 2024

McGill-NLP / llm2vec

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,359 101 Updated Oct 8, 2024

apple / ml-veclip

The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"

Jupyter Notebook 235 13 Updated Aug 24, 2024

facebookresearch / DCI

Densely Captioned Images (DCI) dataset repository.

Python 162 5 Updated Jul 1, 2024

google / imageinwords

Data release for the ImageInWords (IIW) paper.

JavaScript 202 9 Updated Nov 17, 2024

Chunyu Wang CHUNYUWANG

Stars

Tencent / HunyuanVideo

urchade / GLiNER

ifzhang / FairMOT

Zeqiang-Lai / Mini-DALLE3

prs-eth / Marigold

tencent-ailab / Frequency_Aug_VAE_MoESR

openai / consistencydecoder

ContextualAI / gritlm

tgxs002 / HPSv2

google-research / syn-rep-learn

TencentQQGYLab / ELLA

kohya-ss / sd-scripts

stanford-crfm / helm