Skip to content
View Jiawei-Guo's full-sized avatar
  • China
  • 23:00 (UTC +08:00)

Block or report Jiawei-Guo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Python 16 Updated Dec 9, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,035 712 Updated Aug 12, 2024

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 36,509 4,497 Updated Dec 21, 2024

official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"

Jupyter Notebook 177 18 Updated Nov 26, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,802 118 Updated Oct 30, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,592 1,015 Updated Dec 21, 2024

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

6,994 415 Updated Jul 28, 2024

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript 4,310 418 Updated Sep 9, 2024

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Python 2,277 167 Updated Nov 13, 2024

ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large conversational datasets along with scripts exemplifying the u…

Jupyter Notebook 561 131 Updated Dec 2, 2024

Source code for end-to-end dialogue model from the MultiWOZ paper (Budzianowski et al. 2018, EMNLP)

Python 872 199 Updated Dec 15, 2024

🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

Python 707 44 Updated Oct 14, 2024

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Python 52,686 6,845 Updated Nov 17, 2024

🤖 Awesome list of AGI Agents. Agents 精选资源合集.

326 24 Updated Oct 31, 2023

A Python library to extract tabular data from PDFs

Python 3,068 476 Updated Aug 19, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,224 461 Updated Nov 6, 2024

PDF解析(文字,章节,表格,图片,参考),基于大模型(ChatGLM2-6B, RWKV)+langchain+streamlit的PDF问答,摘要,信息抽取

Python 170 29 Updated Oct 17, 2023

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,782 2,292 Updated Aug 12, 2024

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 16,129 1,942 Updated Nov 7, 2024
Python 224 31 Updated May 27, 2024

[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records

Python 66 10 Updated Sep 22, 2024

An incremental parsing system for programming tools

Rust 19,039 1,527 Updated Dec 20, 2024

Evolutionary algorithm toolbox and framework with high performance for Python

Python 2,033 727 Updated Sep 20, 2024

The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".

Python 33 1 Updated Feb 10, 2024

A hard gym for programming

Python 142 15 Updated Jul 7, 2024

A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research

Perl 755 306 Updated Nov 27, 2024
Next