-
MMSearch Public
The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
-
CoMat Public
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
-
VLMEvalKit Public
Forked from open-compass/VLMEvalKitOpen-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
Python Apache License 2.0 UpdatedSep 26, 2024 -
-
lmms-eval Public
Forked from EvolvingLMMs-Lab/lmms-evalAccelerating the development of large multimodal models (LMMs) with lmms-eval
Python Other UpdatedSep 24, 2024