heroding77
🏫
Working from AI Lab

JavaScript 31 Updated Mar 23, 2024

A GUI agent application based on UI-TARS (a Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 2,360 163 Updated Feb 4, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 3,357 442 Updated Jan 26, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.

Python 25,229 1,912 Updated Jan 27, 2025

This repository is used to collect papers and code in the field of AI.

48 5 Updated Feb 5, 2025

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

1,807 147 Updated Jan 3, 2025

Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 90 4 Updated Jan 24, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 199 11 Updated Jan 17, 2025
Python 5 Updated Nov 19, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 23,583 1,998 Updated Feb 5, 2025

[NAACL 2025 Main Conference] PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization

Python 8 Updated Oct 17, 2024

Convert PDF to markdown + JSON quickly with high accuracy

Python 20,287 1,214 Updated Feb 5, 2025

OS-ATLAS: A Foundation Action Model For Generalist GUI Agents

263 12 Updated Feb 1, 2025

[NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training

Python 11 Updated Oct 25, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 6,610 474 Updated Feb 1, 2025
Python 66 6 Updated Dec 6, 2024

AndroidWorld is an environment and benchmark for autonomous agents

Python 200 21 Updated Jan 22, 2025

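A small 0.2B Chinese chat model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.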

Python 1,375 162 Updated Apr 20, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Python 6,930 529 Updated Dec 25, 2024

Complete collection of solutions to 卡码网 problems

389 128 Updated Jan 10, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 8,838 1,284 Updated Jan 28, 2025

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 4,979 575 Updated Oct 22, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 5,855 596 Updated Jan 8, 2025

Demo page of SEA

JavaScript 2 Updated Dec 30, 2024

SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the qual…

Python 58 7 Updated Nov 25, 2024

😎 Awesome lists about all kinds of interesting topics

345,380 28,450 Updated Feb 5, 2025

[Preprint] A Neural-Symbolic Self-Training Framework

C 102 3 Updated Jul 23, 2024