Task-oriented AI agent framework for digital workers and vertical AI agents
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
An open-source, end-to-end, VLM-based GUI Agent
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Desktop app powered by Claude’s computer use capability to control your computer
A framework to enable autonomous Android and computer use with any LLM (local or remote)
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".
A general AI agent framework that can be adapted to various tasks and environments.
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
✨ Use natural language to control your browser, powered by LLM and Playwright
Mark web pages for use with vision-language models
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Try Computer Use on your Mac with a few clicks
Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.