Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
-
Updated
Dec 3, 2024 - JavaScript
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Desktop app powered by Claude’s computer use capability to control your computer
✨ Use natural language to control your browser, powered by LLM and playwright
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
try Computer Use on your Mac with a few clicks
Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
Claude Computer Use API with Ubuntu that enables Claude to interact with and automate desktop environments. It allows seamless command execution through VNC or noVNC, enhancing productivity with secure, containerized workflows with Github Codespaces.
Give a Multi-Modal LLM full access of your linux computer
🤖 LLM-powered computer control through local and Docker environments. Features VNC integration, automated interactions, and a chat interface for natural language system control.
Anthropic's Computer use implementation in Nodejs
Anthropic's computer use controlling a Macbook
A streamlined setup script for Anthropic's Computer-Use Demo environment. This script automates the entire setup process, handling all dependencies and configuration automatically.
Anthropic's Computer Use tools within VSCode
Effortless Deployment and Integration for SOTA Screenshot Parsing and Action Models
Add a description, image, and links to the computer-use topic page so that developers can more easily learn about it.
To associate your repository with the computer-use topic, visit your repo's landing page and select "manage topics."