FoundationVision

Hi there 👋

This is the official FoundationVision website repo.

Popular repositories

  1. VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

    Python 6.1k 410

  2. LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1.4k 57

  3. GLEE Public

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1.1k 86

  4. Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 577 61

  5. Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    305

  6. OmniTokenizer Public

    [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python 271 7

Repositories

Showing 10 of 11 repositories
  • Infinity Public

    Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

    305 MIT 0 3 0 Updated Dec 11, 2024
  • infinity.project
    HTML 0 0 0 0 Updated Dec 11, 2024
  • VAR Public

    [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

    Python 6,080 MIT 410 37 1 Updated Dec 6, 2024
  • GLEE Public

    [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

    Python 1,102 MIT 86 39 2 Updated Oct 21, 2024
  • LlamaGen Public

    Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

    Python 1,392 MIT 57 50 0 Updated Aug 15, 2024
  • OmniTokenizer Public

    [NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.

    Python 271 MIT 7 8 0 Updated Jul 10, 2024
  • vaex Public

    🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook

    Python 54 MIT 3 1 0 Updated Jun 23, 2024
  • Groma Public

    [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

    Python 577 Apache-2.0 61 8 1 Updated Jun 7, 2024
  • GenerateU Public

    [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

    Python 148 6 13 0 Updated Mar 25, 2024
  • UniRef Public

    [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

    Python 235 MIT 15 4 0 Updated Jan 10, 2024

Top languages

Python HTML