Emergent Behavior

Of AI Agents and OpenAI SuperAgents

Building something greater than ourselves

Prakash Ate-A-Pi
February 14, 2024

Here’s today at a glance:

The AI Operating System
OpenAI You Only Live Once Method
Things happen
AI artwork of the day

💻 The AI Operating System

Gist: An AI Agent that uses your computer, including the browser, Excel, and PowerPoint, to do tasks.

Paper Title: OS-Copilot: Towards Generalist Computer Agents With Self-Improvement

Who: Shanghai AI Lab plus others

What did they do:

Built an agent called Friday, using a mix of Python code and GPT-4 prompts, that:
- controls a Linux or macOS computer, including the browser, Excel, and PowerPoint, to perform tasks; and
- self-improves

How did they do it?

Created a set of sequential prompts and code, grouped into agents such as:
- Planner - decompose user requests into smaller tasks
- Configurator - middleware that takes each task and configures it with data from memory or how-tos from the tool repository before passing it to the Executor
- Declarative memory - user profile and history of previous actions
- Tool repository - tools available
- Working memory - where the next steps for tasks and previous history are kept
- Executor - generates executable command
- Critic - assesses whether a task has been completed successfully or whether iteration is needed
GPT-4 was the underlying AI model
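
To make the flow between those components concrete, here is a minimal Python sketch of a plan / configure / execute / critique loop. It is not the paper's code: every name in it (Memory, planner, configurator, executor, critic, run_agent) is hypothetical, and each function that would call GPT-4 in the real system is replaced with a trivial stand-in so the example runs offline.

```python
from dataclasses import dataclass, field


@dataclass
class Memory:
    # Hypothetical container for the three stores described above.
    declarative: dict = field(default_factory=dict)  # user profile, past actions
    tools: dict = field(default_factory=dict)        # tool repository: name -> how-to
    working: list = field(default_factory=list)      # history for the current request


def planner(request):
    # Stand-in for an LLM call: split the request into subtasks on " then ".
    return [step.strip() for step in request.split(" then ") if step.strip()]


def configurator(task, memory):
    # Attach any how-tos from the tool repository whose name appears in the task.
    hints = [howto for name, howto in memory.tools.items() if name in task]
    return {"task": task, "hints": hints, "profile": memory.declarative}


def executor(configured):
    # Stand-in for code generation: emit a harmless shell command string.
    return f"echo {configured['task']!r}"


def critic(task, command):
    # Stand-in for an LLM judgment: accept if the command mentions the task.
    return task in command


def run_agent(request, memory, critic=critic, max_attempts=3):
    # Plan once, then configure, execute and critique each subtask,
    # retrying a subtask until the critic accepts it or attempts run out.
    accepted = []
    for task in planner(request):
        for _ in range(max_attempts):
            command = executor(configurator(task, memory))
            ok = critic(task, command)
            memory.working.append((task, command, ok))
            if ok:
                accepted.append(command)
                break
    return accepted


if __name__ == "__main__":
    mem = Memory(tools={"dark mode": "toggle it through the app's settings"})
    for cmd in run_agent("open the app then set dark mode", mem):
        print(cmd)
```

The critic is passed in as a parameter so that the retry path can be exercised with a stricter or flakier judge without touching the rest of the loop.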

Generating Python code to set dark mode on an app

What did they find?

Friday (their agent framework) outperformed GPT-4 with Plugins on a benchmark for general agents
It could perform tasks in both Excel and PowerPoint

Comparison of FRIDAY agent on the GAIA agent benchmark

What are the implications?

LLM OS. Bear with me I'm still cooking.
Specs:
- LLM: OpenAI GPT-4 Turbo 256 core (batch size) processor @ 20Hz (tok/s)
- RAM: 128Ktok
- Filesystem: Ada002
— Andrej Karpathy (@karpathy)
12:48 AM • Nov 11, 2023

This is actually a working demonstration of Andrej Karpathy’s proposal for an AI Operating System
Ideas have been circulating for a while now
These systems will get better

🤙 OpenAI You Only Live Once Method

I found this explanation of a method OpenAI allows its researchers to use amazing.

In effect, the method is:

Hire 10x-100x AI researchers
Allow them to speculatively jump to a solution
Then provide them the GPUs to test it
Rather than making them justify each step before allowing GPU scaling

It is an interesting time to be in a field with so much low-hanging fruit that so many things work spectacularly well. It feels like the lifting of a veil of ignorance that has occluded human sight through the ages.

🗞️ Things Happen

There are lots of AI projects, and Jeremiah Owyang has some good advice on how to differentiate them.

Ukraine is using thousands of networked microphones to detect Russian drones. This seems to be just grabbing background noise from across the country, using AI to classify “drone present” and mapping it. Simple and effective, with the main constraints being bandwidth and distribution of mics. Just another story of a seemingly useless data pipeline being utilized for something it was never intended for.
Google Gemini reviews are in!

Gemini Ultra subscription is FIRE
Same price as ChatGPT except you get
Similar model but better data recency
Integrated in Gmail, big time savings
Google calander scheduling
2TB Google Photos/Drive, was already paying for 500GB so cost of that can be subtracted
A fantastic VPN
— Dylan Patel (@dylan522p)
1:16 AM • Feb 10, 2024

🖼️ AI Artwork Of The Day

Harry Potter, Except It’s in Soviet Russia and Everyone is Having a Terrible Time - u/WhyTryAI r/MidJourney
