Stars
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs
(ICML 2024) AlphaZero-like Tree-Search can guide large language model decoding and training
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
[preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context"
AnchorAttention: Improved attention for LLMs long-context training
Fast inference from large language models via speculative decoding
[ICCV 2023, Official Code] for the paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official weights and demos provided.
A lightweight, flexible Video-MLLM developed by the TencentQQ Multimedia Research Team.
[NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"
FocusLLM: Scaling LLM’s Context by Parallel Decoding
[EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference"
Official codes for "Q-Ground: Image Quality Grounding with Large Multi-modality Models", ACM MM2024 (Oral)
[NeurIPS '24 D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
DepictQA: Depicted Image Quality Assessment with Vision Language Models
E5-V: Universal Embeddings with Multimodal Large Language Models
Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)
Official repository for the paper "MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning" (https://arxiv.org/abs/2406.17770).
[CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
[ACM MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
④ [ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model, and a benchmark.
teowu / lmms-eval
Forked from EvolvingLMMs-Lab/lmms-eval. Q-Bench, Q-Bench+ and LongVideoBench for LMMs-Eval.