Making History

on remaking what we know of the past

đź”· Subscribe to get breakdowns of the most important developments in AI in your inbox every morning.

Here’s today at a glance:

đź’ˇ From an Idea to Rewriting Ancient History in Under a Year

(This post was guest-written by @sporadicalia)

This week, former Github CEO, Nat Friedman announced “The scrolls have been read”, as the Vesuvius Challenge - an ML & computer vision contest he’s sponsored, solved a massive archaeological mystery.

So what really happened?

"What was really achieved? And why is it such a massive deal?" - you may be wondering.

It starts in 79 AD. Mount Vesuvius erupted, burying the nearby cities - most notably, Pompeii.

The eruption

But another town, Herculaneum was also buried; and with it, a treasure trove of ancient papyri.

This now ancient library of papyrus scrolls was completely buried and lost to time -- that is, until 1709 when Herculaneum was accidentally rediscovered.

And in 1750, a tunnel reached the ruins of an enormous seaside mansion now known as the Villa of the Papyri.

An artist’s rendering of the villa where the scrolls were found. Source: Rocío Espín

The Villa was likely built by one Lucius Calpurnius Piso - the father-in-law of Julius Caesar. And it was found to contain "lumps of what appeared to be charcoal. It was gradually realized that these were papyrus scrolls, carbonized by the heat of the eruption."

Lucius Calpurnius Piso

You see, the scrolls are carbonized by the heat of the volcanic debris and ash, but in a stroke of historical luck, their contents are also preserved.

Whereas virtually every ancient text exposed to the air decays and disappears, the library of the Villa of the Papyri remains.

The current condition of the scrolls

There's just one problem - they're nearly impossible to unroll and read. Early attempts resulted in a few unrolled scrolls but also unfortunately destroyed many of them.

A prior unrolling machine which destroyed many scrolls

So how do you read them? Let me introduce you to Dr. Brent Seales.

Dr. Brent Seales

In 2015, using X-ray tomography and computer vision, a team led by Seales at the University of Kentucky pioneered the technique of "virtual unwrapping" to read one of the Dead Sea scrolls from Israel.

The Dead Sea Scrolls

Inspired by the research, Nat Friedman reached out. Progress in unwrapping the Herculaneum Papyri was slow--Seales had revealed bits and pieces, but they were mostly unreadable.

But Friedman had an idea. In March of 2023, the Vesuvius Challenge was launched, offering $1 million in prizes to anyone who could deliver a breakthrough.

Just 10 months later, the Vesuvius Challenge has yielded results beyond what anyone would have expected so soon.

So let's quickly take a few tweets to explain what exactly is going on in "unraveling" the Herculaneum Papyri and why it had proved so difficult - until now.

the raw data - like rings of a tree

It starts with a scan: Each scroll is placed within a particle accelerator and is peppered with high-energy X-rays, resulting in high-resolution CT scans that are carved into tiny slices and ultimately, combined into a 3D volume reconstruction.

The scan

Once scanned, the next step is identifying and capturing segments from the tightly-rolled layers, as well as flattening the 3D volume in and around the segment. These methods allow researchers to potentially capture ink that lays on top of or has seeped into the papyrus.

3D reconstruction

Finally, it's time to read - but to do so, machine learning models have to be used to detect the ink. Trained on a small collection of ground truth data from intact fragments, sufficiently advanced algorithms can detect not just letters, but potentially even entire passages!

Reading the scroll

Enter the brilliant minds who took on the challenge--and eventually earned the Grand Prize of $700,000 by meeting the criteria of "4 passages of 140 characters each, with at least 85% of characters recoverable."

Congrats to Luke Farritor, Youssef Nader, and Julian Schilliger!

Luke Farritor is a 21-year-old college student and SpaceX intern from Nebraska. He received the first-place First Letters Prize in late 2023, as the "first person in history to read an entire word from the inside of a Herculaneum scroll."

The First Letters Prize - (ΠΟΡΦΥΡΑϹ, “purple”)

Farritor's work was a massive win for the challenge, convincing everyone that the Grand Prize was "definitely achievable."

Then in October, Youssef Nader, an Egyptian PhD student in Berlin, was able to read a few columns of text, winning the second-place First Letters Prize.

His results were "particularly clear and readable, which made him the natural lead for the team that formed."

With the help of Julian Schilliger, a student at ETH Zurich who figured out ways to segment the text on the scrolls, winning three Segmentation Tooling subprizes and enabling the 3D-mapping of the papyrus areas - a super team was formed.

That brings us to this week. 275 years after the discovery of the Herculaneum Papyri, we can finally begin to read the scrolls!

The submission is frankly incredible - more than a dozen columns of text with entire paragraphs visible.

the completed submission

Scholars are already busy translating the ancient text - which comprises just 5% of the scroll it was pulled from - but it's the larger-than-life possibilities for what comes next that have historians and technologists alike incredibly hopeful.

Multiple scanned scrolls remain to be segmented and read. Meanwhile, hundreds, maybe thousands, of preserved scrolls remain buried in the Villa of the Papyri.

It's a treasure trove of ancient documents, the likes of which simply don't exist anywhere else. So what comes next?

The Vesuvius Challenge and Nat Friedman have already laid out "Stage Two" of the Master Plan:

  • Use a particle accelerator to scan the rest of the scrolls

  • $40k per scroll x 800 scrolls = $30 million cost

Scanning and segmentation at scale are still a major challenge. Even with proven techniques, the task of scaling up to scan, segment, and read the "800 extant scrolls" remains.

To do so, however, requires more money (as all endeavors do), of course!

And so, if you're a wealthy donor who wants to support and make possible some of the greatest historical and technical research out there today -- well now's your chance!

It's also important to recognize everyone who made the Vesuvius Challenge possible in the first place - from Nat Friedman himself, but also fellow donors like Alex Gerko, Daniel Gross, and Joseph Jacks.

Such groundbreaking research would not have been possible without them.

The sponsors

🌠 Enjoying this edition of Emergent Behavior? Send this web link with a friend to help spread the word of technological progress and positive AI to the world!

Or send them the below subscription link:

🗞️ Things Happen

  • Boston Dynamics, the robot company that became well known for 30 years of demo videos of robots but little revenue, has finally revealed a robot that does useful heavy work. It’s springtime in robotics, but who will get to mass production first?

  • Google's Lyria model, which they said "set the standard for the responsible development and deployment of music generation tools", was trained on copyrighted music without permission, according to Billboard. This is not surprising given how much leakage there is from copyrighted to non-copyrighted data.. with the AI model able to recreate original tunes from covers. We need a big Supreme Court case to finally put to rest these issues.

  • A fake ID site, OnlyFake offers fake California drivers’ licenses that are good enough to pass KYC onboarding at most crypto exchanges. Supposedly “neural nets“ are used. This sure beats handing 20 bucks to your high school buddy so that you can buy a forty yourself instead of paying 5 bucks per can to the homeless guy on the corner. But seriously, where there is a will, there is a way, neural nets or not.

🖼️ AI Artwork Of The Day

Sydney by Vincent Van Gogh, from the series Famous Cities by Famous Painters - u/MisterFurniture on r/MidJourney

That’s it for today! Become a subscriber for daily breakdowns of what’s happening in the AI world:

Reply

or to participate.