- Emergent Behavior
- Posts
- 2024-04-01: Coding Agents Tackle Github Issues
2024-04-01: Coding Agents Tackle Github Issues
Fast Approaching Clff for Computer Science
🔷 Subscribe to get breakdowns of the most important developments in AI in your inbox every morning.
Here’s today at a glance:
🗃️ Coding Agents Tackle Github Issues
Who: Fudan University et al.
What: An AI agent that can automatically resolve Github issues that require code changes across an entire repository
How:
MAGIS consists of four types of AI agents that collaborate:
Manager: Plans the resolution and assembles a team of developer agents
Repository Custodian: Locates code files relevant to the issue
Developers: Modify the code based on the plan
Quality Assurance (QA): Reviews and validates the code changes
The agents work together to analyze issues, find relevant files, create a plan, implement code changes, and verify the modifications. The inspiration comes from Github Flow, a commonly used workflow paradigm.
MAGIS
The research group:
Evaluated MAGIS on the SWE-bench dataset of real GitHub issues and compared it to LLMs like GPT-3.5, GPT-4, and Claude-2
MAGIS successfully resolved 13.94% of issues, an 8-fold improvement over applying GPT-4 directly, demonstrating the effectiveness of the multi-agent approach
Ablation studies showed the QA agent and ability to utilize information from comments also boosted performance
MAGIS marks a significant step towards using AI to automatically resolve complex software issues that span multiple files and require understanding the relationships between code components.
If we are at 14% now... we will probably fully solve coding within 24 months at this rate. Ridiculous.
🌠 Enjoying this edition of Emergent Behavior? Send this web link with a friend to help spread the word of technological progress and positive AI to the world!
Or send them the below subscription link:
🗞️ Things Happen
Cryptic post from the head of Tesla’s self-driving team
It is the beginning of the end!
— Ashok Elluswamy (@aelluswamy)
5:41 AM • Apr 1, 2024
🖼️ AI Artwork Of The Day
Rooms in the style of planets of the solar system - u/requiredpaper from r/midjourney
That’s it for today! Become a subscriber for daily breakdowns of what’s happening in the AI world:
Reply