TruemanV5

TruemanV5

1 follower · 3 following

Highlights

Lists (1)

Sort

LLM

A starter for my PhD in LLM

16 repositories

Stars

Ruiyang-061X / Awesome-MLLM-Uncertainty

✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).

24 Updated Dec 25, 2024

OpenGVLab / Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,122 253 Updated Nov 26, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,987 1,209 Updated Dec 12, 2024

facebookresearch / LaViLa

Code release for "Learning Video Representations from Large Language Models"

Python 498 46 Updated Oct 1, 2023

Azure-Samples / graphrag-accelerator

One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure

Python 2,010 332 Updated Dec 19, 2024

doc-doc / NExT-QA

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)

Python 135 13 Updated Jul 25, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,726 87 Updated Dec 12, 2024

Ziyang412 / VideoTree

Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Python 88 3 Updated Aug 6, 2024

YueFan1014 / VideoAgent

This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)

Python 151 7 Updated Dec 5, 2024

wxh1996 / VideoAgent

Python 60 9 Updated Dec 16, 2024

showlab / Show-o

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,109 46 Updated Dec 26, 2024

bytedance / tarsier

Tarsier -- a family of large-scale video-language models, which is designed to generate high-quality video descriptions , together with good capability of general video understanding.

Python 167 10 Updated Dec 25, 2024

xiaobai1217 / Awesome-Video-Datasets

Video datasets

1,259 96 Updated Mar 8, 2023

cilinyan / VISA

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 143 4 Updated Aug 5, 2024

hasancaslan / BeautifulPointCloud

Transform your point cloud data into beautifully rendered 3D images.

Python 12 Updated Aug 21, 2023

NExT-ChatV / NExT-Chat

The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".

Python 227 9 Updated Feb 5, 2024

VCL3D / Pano3D

Code and models for "Pano3D: A Holistic Benchmark and a Solid Baseline for 360 Depth Estimation", OmniCV Workshop @ CVPR21.

Python 82 7 Updated Nov 12, 2022

ryanbgriffiths / ICRA2024PaperList

ICRA2024 Paper List

471 30 Updated Sep 17, 2024

yl3800 / LASO

Python 20 1 Updated Aug 8, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,070 868 Updated Jul 6, 2024

chidiwilliams / buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 12,906 957 Updated Dec 15, 2024

ActiveVisionLab / Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

1,346 82 Updated Dec 16, 2024

hzxie / CityDreamer

The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)

Python 622 42 Updated Aug 31, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,804 4,530 Updated Dec 25, 2024

aisingapore / sealion

South-East Asia Large Language Models

Shell 282 21 Updated Dec 18, 2024

MultimodalGeo / GeoText-1652

An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching

Python 56 1 Updated Oct 17, 2024

Ruiyang-061X / LiSe

Official repo for our ECCV'24 paper: Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.

Jupyter Notebook 31 2 Updated Sep 3, 2024

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,658 831 Updated Aug 20, 2024

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,484 91 Updated Dec 11, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,239 461 Updated Nov 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly