Skip to content
View zehuichen123's full-sized avatar
🐶
where is my job?
🐶
where is my job?

Organizations

@apachecn

Block or report zehuichen123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OO for LLMs

Python 551 39 Updated Dec 15, 2024

[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents

Python 48 5 Updated Nov 6, 2024

Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

Python 179 7 Updated Dec 15, 2024

Amodal Depth Anything: Amodal Depth Estimation in the Wild

Python 14 2 Updated Dec 1, 2024

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,595 61 Updated Dec 12, 2024

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 785 126 Updated Dec 5, 2024

Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)

Python 42 2 Updated Oct 16, 2024

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,464 161 Updated Dec 9, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 256 16 Updated Dec 12, 2024
Python 19 1 Updated May 2, 2024

The official pytorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting

Python 23 Updated Mar 29, 2024

田柯宇 (Tian Keyu)恶意攻击集群事件的证据揭露

662 42 Updated Oct 20, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 16,903 1,714 Updated Oct 15, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,675 50 Updated Nov 30, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 665 87 Updated Nov 13, 2024

The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents"

Python 22 Updated Mar 14, 2024

official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"

Python 55 2 Updated Dec 20, 2023

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 512 50 Updated Nov 20, 2024

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 279 21 Updated Nov 26, 2024

The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 401 29 Updated Dec 11, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,637 308 Updated Dec 15, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 60,097 6,394 Updated Dec 15, 2024

Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]

Python 100 11 Updated Nov 26, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,794 1,979 Updated Sep 26, 2024

🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.

Python 126 8 Updated Dec 3, 2024

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

5,585 524 Updated Dec 1, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,995 1,022 Updated Dec 10, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,595 224 Updated Dec 4, 2024

Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api

Python 916 131 Updated Dec 10, 2024

Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 438 52 Updated Mar 4, 2024
Next