What Is Sora For? | Commentary

OpenAI’s new video generator is high on cool factor but low on utility, at least for now.

Sora

On the “featured” tab of its new Sora video generator, OpenAI highlights a bunch of standout AI video clips. There’s a panda riding the subway, an alien smoking a cigarette, a paper boat navigating a stormy sea, and a golden statue winking at you.

The videos are stunning outputs of a marvelous new technology, but who knows what they’re for. Eye catching and creative but too low quality to insert into a commercial production, Sora clips exist in a state of liminality. Yes, the product understands at least some physics, and AI video may be the tech breakthrough of the year. But like many generative AI products so far, it’s not entirely clear what we’re supposed to do with it.

AI video will certainly improve, but Sora and its counterparts seem high on cool factor and low on utility, at least for now. The problem: There is no natural user. With ChatGPT, coders and students saw immediate value, and AI text generation has since expanded to more use cases. Image generators like Dall-E haven’t broken through in the same way though, struggling to find natural applications of their richer media format. Sora, similarly, isn’t good enough to generate clips for feature films, or even commercials, and is a bit too intense to be useful to regular people. So its use case remains fuzzy.

In a blog post announcing Sora’s general release this week, OpenAI said it hopes it “will enable people everywhere to explore new forms of creativity, tell their stories, and push the boundaries of what’s possible with video storytelling.” But as someone who just learned how to edit video, I can attest that doing anything with video is hard. Even with Sora’s unbelievable power in everyone’s hands, it’s hard to imagine reworking the internet’s 90-9-1 rule, where 90% of people consume, 9% distribute, and 1% create.

Looking through Sora’s “recent” tab shows some interest but bafflement over what to do with the service. One user put a dog in a driver seat, another put a cat in a sailor hat, another showed a horse strolling through a graveyard at night. There are lots of animals. And lots of women, some prompted with a creepy amount of detail. The videos seem to let users escape to other worlds, or “push boundaries,” as OpenAI suggests. But once you prompt a few times, the compelling reason to come back — and pay — becomes harder to find. How many puppies driving cars do you need to see?

Sora will, of course, find some valuable applications. It will allow movie directors to plot out scenes before shooting them. It will enable fashion brands to see models wear their work on a runway before creating it. And it will help brand managers cook up funky posts for Instagram. Yes, these AI videos will likely fill our social media feeds like Shrimp Jesus has filled Facebook.

But Sora is also debuting at a time when determining what’s real is harder than ever, and the service and its peers will add to the confusion. This past week, I found the United Healthcare shooting story harder to follow than almost any other previous major story. There was a fake Substack and loads of fake information online. But the fake videos of the shooter were most puzzling. Multiple users generated fake AI videos of the shooter from surveillance video. And while some explicitly said it was AI, others shared videos they insisted were not. It all contributes to a sense of reality apathy, where telling true from false is so hard you just give up.

To OpenAI’s credit, Sora’s safeguards are pretty good. The service wouldn’t let me create videos from images of people, and it blocked my prompts after I tried to get it to generate videos of Trump dancing and the UHC shooter getting arrested at McDonalds. OpenAI declined to make a member of the Sora team available for interview.

Perhaps by focusing so much on the video generation aspect I’m missing the point. Sora can create cool videos, but the basic premise of the product is to enhance artificial intelligence’s understanding of the real world beyond what’s depicted in text. “Sora serves as a foundation for AI that understands and simulates reality,” OpenAI wrote in its announcement, “an important step towards developing models that can interact with the physical world.”

That interaction with the real world could mean applying Sora’s intelligence in robotics, or perhaps helping models that understand the planet they’re communicating about. If that turns out the case, it may be exactly what Sora is for.

This article is from Big Technology, a newsletter by Alex Kantrowitz.

Comments