- Emergent Behavior
- Posts
- Sora Update
Sora Update
It was a little cherry-picked after all
đź”· Subscribe to get breakdowns of the most important developments in AI in your inbox every morning.
Here’s today at a glance:
🛸 Sora Update
OpenAI CTO Mira Murati gives the most detailed update on Sora in an interview with Joanna Stern. Joanna gave several prompts for them to generate. This was the response to “Female video producer in New York City holding a high-end video camera. Suddenly a robot yanks the camera out of her hand.”
The female video producer comes out ok, but then, morphs into a robot:
This is the first time I’ve seen a Sora video with a serious non-intentional morphing problem. So..
Joanna’s prompt request forced the team to generate exactly what she’d asked for (they couldn’t take the risk of changing the prompt or making it more specific because you never want a whistleblower issue when you fundraise next)
This confirms that they cherry-picked the videos posted on Sora to date. I always found the dribble of videos suspicious. I mean, they’ve posted literally less than a hundred videos. If they had a video generation machine that anyone in the company can use, there should be thousands.
Other Notes
Hand and finger issues - similar to early days of images, hands are a work in progress, with morphing, wrong number of fingers, etc. Motion makes this more complex than images. This is being worked on.
Sound for video, not actively being worked on, but they'll eventually get there
Training data was either licensed or publicly available data (undefined and unconfirmed where from) - Mira was uncomfortable responding to this, as there now seems to be uncertainty whether publicly available videos can be used for training
generation time "a few minutes," she confirms, heard from other sources 10, maximum 15 minutes
Sora is not yet optimized, so it is super expensive
Hopes to make it available at the "same pricing" as Dall-E "this year."
Red teaming, refinement, and optimization are underway
Not that concerned with creator job losses as these tools will extend all creators’ ability to create
Nudity may be allowed! In talks with creators about use cases and guardrails
🌠Enjoying this edition of Emergent Behavior? Send this web link with a friend to help spread the word of technological progress and positive AI to the world!
Or send them the below subscription link:
🗞️ Things Happen
Anthropic’s Claude, doesn’t show a time to first token issue slowdown for long context, which is pretty amazing. Looks like they implemented an optimization somewhere
The following plot is time-to-1st-token generation for @AnthropicAI's Claude models vs input context length (5 runs).
It's kinda nuts that the quadratic cost of attention doesn't seem to kick in at all (i.e., it's not a first order term)
Some observations:
- Sonnet becomes as… twitter.com/i/web/status/1…— Dimitris Papailiopoulos (@DimitrisPapail)
9:06 PM • Mar 12, 2024
🖼️ AI Artwork Of The Day
Which TED talk are you listening to? u/broncobama_ from r/midjourney
That’s it for today! Become a subscriber for daily breakdowns of what’s happening in the AI world:
Reply