Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model whose performance approaches GPT-4o.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Fast inference from large language models via speculative decoding
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
③ [ICML 2024] [IQA, IAA, VQA] All-in-one foundation model for visual scoring; can be efficiently fine-tuned on downstream datasets.
[ICML 2024] AlphaZero-like tree search can guide large language model decoding and training
E5-V: Universal Embeddings with Multimodal Large Language Models
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs
② [CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo of fine-tuned checkpoints.
AnchorAttention: Improved attention for LLM long-context training
Official repository for the paper "MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning" (https://arxiv.org/abs/2406.17770).
[NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"
Official implementation for the CVPR 2023 paper "Re-IQA: Unsupervised Learning for Image Quality Assessment in the Wild"
DepictQA: Depicted Image Quality Assessment with Vision Language Models
[EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"
[NeurIPS 2024 D&B] Official dataloader and evaluation scripts for LongVideoBench.
Code for the ICML 2024 paper "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
A collection of utilities that are not polished implementations but may be useful to users
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
A lightweight, flexible Video-MLLM developed by the Tencent QQ Multimedia Research Team.
[ECCV 2024] Official PyTorch implementation of "A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment"
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
[preprint] We propose Separate Memory and Reasoning, a novel fine-tuning method that combines prompt tuning with LoRA.
FocusLLM: Scaling LLM’s Context by Parallel Decoding
Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"