2024-04-01: Coding Agents Tackle Github Issues

Fast Approaching Clff for Computer Science

🔷 Subscribe to get breakdowns of the most important developments in AI in your inbox every morning.

Here’s today at a glance:

🗃️ Coding Agents Tackle Github Issues

Who: Fudan University et al.

What: An AI agent that can automatically resolve Github issues that require code changes across an entire repository

How:

MAGIS consists of four types of AI agents that collaborate:

  • Manager: Plans the resolution and assembles a team of developer agents

  • Repository Custodian: Locates code files relevant to the issue

  • Developers: Modify the code based on the plan

  • Quality Assurance (QA): Reviews and validates the code changes

The agents work together to analyze issues, find relevant files, create a plan, implement code changes, and verify the modifications. The inspiration comes from Github Flow, a commonly used workflow paradigm.

MAGIS

The research group:

  • Evaluated MAGIS on the SWE-bench dataset of real GitHub issues and compared it to LLMs like GPT-3.5, GPT-4, and Claude-2

  • MAGIS successfully resolved 13.94% of issues, an 8-fold improvement over applying GPT-4 directly, demonstrating the effectiveness of the multi-agent approach

  • Ablation studies showed the QA agent and ability to utilize information from comments also boosted performance

MAGIS marks a significant step towards using AI to automatically resolve complex software issues that span multiple files and require understanding the relationships between code components.

If we are at 14% now... we will probably fully solve coding within 24 months at this rate. Ridiculous.

🌠 Enjoying this edition of Emergent Behavior? Send this web link with a friend to help spread the word of technological progress and positive AI to the world!

Or send them the below subscription link:

🗞️ Things Happen

  • Cryptic post from the head of Tesla’s self-driving team

🖼️ AI Artwork Of The Day

Rooms in the style of planets of the solar system - u/requiredpaper from r/midjourney

That’s it for today! Become a subscriber for daily breakdowns of what’s happening in the AI world:

Reply

or to participate.