Skip to content
View douyh's full-sized avatar

Block or report douyh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

1,067 53 Updated Sep 25, 2024

[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving

245 12 Updated Mar 14, 2024

A lightweight framework for building LLM-based agents

Python 1,923 200 Updated Dec 3, 2024

LlamaIndex is a data framework for your LLM applications

Python 37,365 5,359 Updated Dec 16, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,521 460 Updated Dec 15, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 31,948 3,673 Updated Dec 15, 2024

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 11,811 819 Updated Dec 16, 2024

This is a demo of multimodal RAG solution

Python 17 3 Updated May 31, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,864 900 Updated Oct 22, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,847 438 Updated Dec 16, 2024

LLM inference in C/C++

C++ 69,293 9,975 Updated Dec 16, 2024

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,345 345 Updated Dec 13, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31,959 4,858 Updated Dec 16, 2024

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Python 2,779 320 Updated Dec 4, 2024

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 717 53 Updated Feb 1, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 169,171 44,560 Updated Dec 16, 2024
Jupyter Notebook 761 71 Updated Aug 7, 2024

YOLO-World + EfficientViT SAM

Python 80 9 Updated Feb 18, 2024
Python 362 14 Updated Jul 29, 2024

Ultralytics YOLO11 🚀

Python 34,061 6,545 Updated Dec 16, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,571 461 Updated Nov 21, 2024

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,076 69 Updated Apr 15, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,712 2,279 Updated Aug 12, 2024

Collection of AWESOME vision-language models for vision tasks

2,619 225 Updated Dec 3, 2024

[ECCV 2024] 3DGazeNet: Generalizing Gaze Estimation with Weak-Supervision from Synthetic Views

Python 62 6 Updated Aug 28, 2024

Make human motion capture easier.

Python 3,756 463 Updated May 6, 2024

本人的科研经验

6,076 362 Updated Dec 8, 2024

Collecting papers about new view synthesis

696 53 Updated Aug 26, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,072 1,045 Updated Dec 5, 2024

A collaboration friendly studio for NeRFs

Python 9,654 1,325 Updated Dec 5, 2024
Next