zehuichen123
🐶 where is my job?

Organizations

@apachecn

OO for LLMs

Python 553 39 Updated Dec 15, 2024

[NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents

Python 48 5 Updated Nov 6, 2024

Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

Python 182 7 Updated Dec 15, 2024

Amodal Depth Anything: Amodal Depth Estimation in the Wild

Python 14 2 Updated Dec 1, 2024

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,603 61 Updated Dec 12, 2024

Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"

Python 788 126 Updated Dec 5, 2024

Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)

Python 42 2 Updated Oct 16, 2024

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,466 161 Updated Dec 9, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 257 16 Updated Dec 12, 2024
Python 19 1 Updated May 2, 2024

The official PyTorch implementation of Exploring the Interactive Guidance for Unified and Effective Image Matting

Python 23 Updated Mar 29, 2024

Disclosure of evidence in the 田柯宇 (Tian Keyu) malicious cluster attack incident

662 42 Updated Oct 20, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 16,931 1,715 Updated Oct 15, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,681 50 Updated Nov 30, 2024

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 666 88 Updated Nov 13, 2024

The official implementation of the paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents"

Python 22 Updated Mar 14, 2024

Official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"

Python 57 2 Updated Dec 20, 2023

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 512 50 Updated Nov 20, 2024

Official repo for the paper "DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning".

Python 279 21 Updated Nov 26, 2024

The First Multimodal Search Engine Pipeline and Benchmark for LMMs

Python 401 29 Updated Dec 16, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,656 310 Updated Dec 16, 2024

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 60,172 6,403 Updated Dec 15, 2024

Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]

Python 101 11 Updated Nov 26, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,801 1,980 Updated Sep 26, 2024

🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.

Python 125 8 Updated Dec 3, 2024

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

5,586 524 Updated Dec 1, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,997 1,022 Updated Dec 10, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,616 224 Updated Dec 4, 2024

Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api

Python 917 131 Updated Dec 10, 2024

Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"

Python 441 52 Updated Mar 4, 2024