The Composio SWEBench-Agent-v2 is a cutting-edge open-source software engineering assistant that achieves OSS state-of-the-art accuracy on the SWE Bench. This agent is built using Composio's SWEKit framework, which allows for the creation of various specialized software engineering agents. SWEKit, integrated with the Langgraph agentic platform, provides a powerful and flexible foundation for developing task-specific software engineering assistants.
Our SWEBench-Agent-v2 utilizes Composio's tools to tackle SWE Bench challenges:
- File Editing and Reading Tools: Enables seamless manipulation and analysis of code files.
- Git Tools: Facilitates version control operations and code management.
- Code Indexing Tools: Allows for semantic code queries, enhancing code understanding and navigation.
- LSP Tools: Provides function and class signatures, improving code comprehension and modification capabilities.
Our system employs a multi-agent architecture to efficiently handle complex software engineering tasks:
- SWE ENGINEER: Acts as the manager agent, orchestrating the overall process.
- CODE ANALYSIS: Specialized agent for in-depth code analysis.
- EDITOR: Dedicated agent for code modification and execution.
To get started with the Composio SWEBench-Agent-v2, follow these steps:
-
Install the required packages:
pip install -U swekit composio-core
-
Clone the agent code repository:
git clone https://github.com/ComposioHQ/composio.git cd composio/python/swe/agent
-
Run the benchmark with a specific test instance:
python benchmark.py --test-instance-ids "django__django-14434"
Replace "django__django-14434" with the required instance ID for your test.
For more detailed usage examples and advanced configurations, please refer to our documentation or contact our support team.
Our Composio SWEBench-Agent-v2 has demonstrated state-of-the-art OSS performance on the SWE Bench, showcasing its capabilities in handling diverse software engineering tasks.
The agent achieved an impressive 48.6% accuracy on the SWE Bench Verified dataset, successfully solving 243 out of 500 issues. This performance underscores the agent's proficiency in tackling a wide range of software engineering challenges and positions it as a leading solution in the field of automated software engineering assistance.
These results highlight the effectiveness of our multi-agent architecture and the power of Composio's SWEKit toolset in creating highly capable software engineering agents.
Note:
- Is a pass@1 submission (does not attempt the same task instance more than once)
- Does not use SWE-bench test knowledge (
PASS_TO_PASS
,FAIL_TO_PASS
) - Does not use the
hints
field in SWE-bench - Does not have web-browsing OR has taken steps to prevent lookup of SWE-bench solutions via web-browsing
While this README focuses on the SWEBench-Agent-v2, it's important to note that the SWEKit framework enables the creation of various specialized agents. Some potential applications include:
- Refactoring-focused agents
- Language-specific agents (e.g., Golang, Python)
- Frontend development agents
- And more, tailored to specific software engineering needs
We welcome contributions from the community! Please see our CONTRIBUTING.md for guidelines on how to get involved.
- Composio for providing the SWEKit framework and essential tools.
- Langgraph for the agentic platform.
- SWE Bench for providing the benchmark and dataset.
For more information about SWEKit and our various agents, please contact us at
[email protected]
or visit our project page.