Sora

Updated Jun 20, 2026

Sora was OpenAI's text-to-video generative AI model: a system that generated short, high-fidelity video clips from text prompts, still images, and existing video. It was first previewed as a research demonstration on February 15, 2024, launched publicly as Sora Turbo on December 9, 2024, and succeeded on September 30, 2025, by Sora 2, which added native audio, user-likeness "cameos," and a TikTok-style social app; an experimental Sora 2 Pro variant followed in October 2025 ^[1]^[2]^[3]. In its February 2024 technical report, OpenAI described Sora as "capable of generating a minute of high fidelity video" and argued that "scaling video generation models is a promising path towards building general purpose simulators of the physical world" ^[4]. The model used a diffusion transformer (DiT) architecture that operated on "spacetime patches" of video latent codes, enabling generation across varying resolutions, durations, and aspect ratios ^[4]. OpenAI announced on March 24, 2026, that it was discontinuing the Sora app; the web and mobile experiences shut down on April 26, 2026, after roughly sixteen months of public availability, and the developer API is scheduled to end on September 24, 2026 ^[5]^[6].

Sora was developed by OpenAI's World Simulation team, headed by Aditya Ramesh as VP of Research, with Tim Brooks and Bill Peebles as co-leads. Brooks departed for Google DeepMind in October 2024, and Peebles left OpenAI in April 2026, shortly after the shutdown announcement ^[7]^[8]. The model's hyperreal output prompted reactions ranging from Tyler Perry pausing an $800 million Atlanta studio expansion to a wave of public concern about deepfakes, copyright, and the future of professional video work.

What was the background behind Sora?

OpenAI's path to text-to-video built directly on the lineage of its image generation work. The DALL-E family (DALL-E in January 2021, DALL-E 2 in April 2022, and DALL-E 3 integrated into ChatGPT in October 2023) had established the company's expertise in text-conditioned image diffusion. Aditya Ramesh, who later led the Sora team, had previously led the DALL-E 3 effort and integrated it into ChatGPT ^[9].

The architectural foundation for Sora was published in late 2022 by William ("Bill") Peebles and Saining Xie in the paper "Scalable Diffusion Models with Transformers," presented at the IEEE/CVF International Conference on Computer Vision (ICCV) in 2023 ^[10]. The paper introduced the Diffusion Transformer (DiT) architecture, which replaced the U-Net backbone common in earlier image diffusion models with a pure transformer operating on latent image patches. Peebles joined OpenAI in 2023 and brought that recipe into the video domain alongside Tim Brooks. By the time Sora was unveiled, the DiT-on-patches recipe had become the dominant pattern for state-of-the-art generative video.

How did Sora work?

Sora was a diffusion model built on a transformer backbone, a design known as a diffusion transformer or DiT. The core pipeline consisted of three stages: a video compressor (encoder), the transformer-based denoiser, and a video decompressor (decoder) ^[4]. The architecture extended Peebles and Xie's ICCV 2023 paper, originally used for image generation, into the spatiotemporal domain ^[10]. In OpenAI's own description, the team "train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios," using "a transformer architecture that operates on spacetime patches of video and image latent codes" ^[4].

How does the video compression work?

Sora used a spatiotemporal autoencoder trained from scratch to compress raw video into a lower-dimensional latent space. This compression reduced both spatial resolution and temporal length, so a one-minute video became a much shorter sequence of latent frames. The compression step is what enabled Sora to handle long-duration video generation without an unmanageable number of tokens ^[4].
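The effect of compression on sequence size can be illustrated with simple shape arithmetic. The sketch below is not Sora's encoder: OpenAI did not publish its compression factors, so the 4x temporal and 8x spatial factors, the 24 fps frame rate, and the names `VideoShape` and `compress` are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoShape:
    frames: int
    height: int
    width: int


def compress(shape: VideoShape, t_factor: int, s_factor: int) -> VideoShape:
    """Latent grid size after downsampling time by t_factor and space by s_factor.

    Ceiling division keeps a partial trailing block instead of dropping it.
    """
    def ceil_div(a: int, b: int) -> int:
        return -(-a // b)

    return VideoShape(
        frames=ceil_div(shape.frames, t_factor),
        height=ceil_div(shape.height, s_factor),
        width=ceil_div(shape.width, s_factor),
    )


# Hypothetical numbers: one minute at 24 fps, 1080p, 4x temporal and 8x spatial.
raw = VideoShape(frames=60 * 24, height=1080, width=1920)
latent = compress(raw, t_factor=4, s_factor=8)
print(latent)
```

Under these assumed factors the latent grid has 4 × 8 × 8 = 256 times fewer positions than the raw pixel grid, which is the kind of reduction that keeps a minute-long clip within a tractable token budget.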

What are spacetime patches?

The compressed video was then decomposed into "spacetime patches," three-dimensional chunks spanning portions of both the spatial frame and the temporal sequence. These patches served as the equivalent of tokens in a large language model: the transformer processed them as a sequence. Because patches could be extracted from videos of any resolution, duration, or aspect ratio, the same architecture handled a wide variety of input and output formats without requiring fixed dimensions ^[4]^[11].

OpenAI's technical report drew an explicit analogy: just as text tokens represent word fragments that can be assembled into any sentence, spacetime patches represent "visual phrases" that can be assembled into any video ^[4].
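A minimal sketch of the idea, assuming non-overlapping 2 × 2 × 2 patches over a latent grid (the patch size, the grid sizes, and the function names are hypothetical; the technical report does not give them), shows how clips of different shapes simply become token sequences of different lengths:

```python
from itertools import product


def spacetime_patches(frames, height, width, pt=2, ph=2, pw=2):
    """Return an iterator over the (t, y, x) origins of non-overlapping patches.

    Assumes each latent dimension is divisible by its patch size; a real
    system would pad or crop, which is not modelled here.
    """
    if frames % pt or height % ph or width % pw:
        raise ValueError("latent dimensions must be divisible by the patch size")
    return product(range(0, frames, pt), range(0, height, ph), range(0, width, pw))


def token_count(frames, height, width, pt=2, ph=2, pw=2):
    """Number of spacetime patches, i.e. the transformer's sequence length."""
    return sum(1 for _ in spacetime_patches(frames, height, width, pt, ph, pw))


# Two hypothetical latent grids with different aspect ratios and durations.
landscape = token_count(frames=16, height=18, width=32)  # widescreen clip
portrait = token_count(frames=8, height=32, width=18)    # shorter vertical clip
print(landscape, portrait)  # 1152 576
```

Nothing in the patching step depends on a fixed frame size, so the same transformer can consume either sequence; only the number of tokens changes.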

How does the diffusion process generate video?

Video generation began with a latent representation filled with random noise. Over many denoising steps, the transformer predicted and removed noise to reveal the final video. The model was trained to predict the original "clean" patches from their noisy versions, conditioned on a text prompt processed by a text encoder (similar to those used in DALL-E 3). The result was decoded back into pixel space by the video decompressor ^[11]. The denoising procedure built on Jonathan Ho, Ajay Jain, and Pieter Abbeel's work on "Denoising Diffusion Probabilistic Models" (NeurIPS 2020), cited explicitly in OpenAI's technical report ^[4].
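The loop described above can be sketched in a few lines. This is a toy illustration, not Sora's sampler: OpenAI did not publish its noise schedule or sampling rule, so the cosine-shaped schedule and the deterministic DDIM-style update used here are assumptions, text conditioning is omitted, and the "model" is a stand-in oracle that always returns a fixed clean latent.

```python
import math
import random


def alpha_bar(step: int, total: int) -> float:
    """Assumed signal level: 1.0 at step 0 (clean), about 0 at step == total."""
    return math.cos(0.5 * math.pi * step / total) ** 2


def sample(denoise, length: int, steps: int = 50, seed: int = 0) -> list[float]:
    """Start from Gaussian noise and step toward the denoiser's clean estimate."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(length)]  # pure-noise latent
    for t in range(steps, 0, -1):
        ab_t, ab_prev = alpha_bar(t, steps), alpha_bar(t - 1, steps)
        x0_hat = denoise(x, t)  # model's guess at the clean latent
        # Noise implied by the current sample and the clean-latent guess.
        eps_hat = [(xi - math.sqrt(ab_t) * x0i) / math.sqrt(1.0 - ab_t)
                   for xi, x0i in zip(x, x0_hat)]
        # Deterministic (DDIM-style) move to the next, less noisy level.
        x = [math.sqrt(ab_prev) * x0i + math.sqrt(1.0 - ab_prev) * ei
             for x0i, ei in zip(x0_hat, eps_hat)]
    return x


# Stand-in for a trained network: an oracle that knows the clean latent.
target = [0.5, -1.0, 2.0, 0.0]
result = sample(lambda x, t: target, length=len(target))
print(result)
```

With a perfect oracle the loop lands exactly on the target; a trained network would only approximate the clean latent at each step, which is why many small denoising steps are used rather than one large jump.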

How did recaptioning improve prompt following?

Like DALL-E 3, Sora used an internal video captioning model to generate detailed text descriptions for each training clip. These long, dense captions, much richer than alt text or human-written descriptions found on the open web, helped the model learn fine-grained correspondences between language and visuals. At inference time, GPT-4 expanded short user prompts into the longer captioning style the model had been trained against ^[4].

How did Sora's quality scale with compute?

OpenAI's technical report demonstrated that Sora's output quality improved smoothly with additional training compute, mirroring scaling laws observed in large language models. At small compute, output was blurry and physically incoherent; at large compute, the model produced minute-long clips with consistent characters and convincing camera motion. OpenAI argued that this trajectory suggested video models could become "general purpose simulators of the physical world" given continued scaling ^[4].

What changed in Sora 2's architecture?

Sora 2 retained the diffusion-transformer foundation but added native audio generation, longer maximum durations, and significantly improved physics modeling. The Sora 2 system card describes a unified audio-video model that conditions both modalities on the same latent representation, allowing dialogue lip sync and ambient sound effects to emerge from a single forward pass ^[12]. OpenAI did not publish parameter counts for either Sora or Sora 2.

What was Sora trained on?

According to OpenAI's system card, Sora was trained on a combination of three data sources ^[13]:

Data Source	Description
Publicly available data	Industry-standard machine learning datasets and web crawls
Proprietary partnership data	Licensed content from partners such as Shutterstock and Pond5
Human feedback data	Input from AI trainers, red teamers, and employees

Before training, all datasets went through a filtering process that removed explicit, violent, or otherwise sensitive material, extending the methods developed for DALL-E 2 and DALL-E 3 ^[13]. OpenAI did not disclose specific video sources or the total dataset size, leaving open questions about whether YouTube content was used; CTO Mira Murati's evasive answers in a March 2024 Wall Street Journal interview fueled speculation that some scraped video material may have come from major video platforms ^[14].

Sora 1 and Sora Turbo (February to December 2024)

What was the February 2024 research preview?

On February 15, 2024, OpenAI published a technical report titled "Video generation models as world simulators" alongside several sample clips: an SUV driving down a mountain road, an animated "short fluffy monster" standing next to a candle, two people walking through snowy Tokyo, and synthetic historical footage of the California gold rush ^[4]. Sora was not available to the general public at this stage. OpenAI granted access to a small group of red teamers and visual artists in over 60 countries to test the system, identify safety weaknesses, and provide creative feedback ^[13].

The February 2024 preview was widely compared to a "GPT-1 moment" for video, marking the first time that behaviors like object permanence seemed to emerge naturally from scaling up pre-training compute for video generation ^[15]. The largest model OpenAI demonstrated could generate up to one minute of high-fidelity video at 1080p. Sora's announcement came on the same day as Google's Gemini 1.5 reveal, which had been published a few hours earlier, leading some commentators to describe February 15 as one of the most consequential single days in generative AI history.

In the weeks following the announcement, OpenAI CTO Mira Murati gave a Wall Street Journal interview in which she struggled to answer basic questions about Sora's training data. Asked by reporter Joanna Stern whether the model had been trained on YouTube, Instagram, or Facebook video, Murati said she was "not sure," before falling back on the phrase "publicly available and licensed data" ^[14]. The interview drew widespread criticism and intensified scrutiny of how OpenAI sourced training video, an issue that resurfaced throughout Sora's life.

When did Sora launch publicly?

OpenAI released Sora to the public on December 9, 2024, during its "12 Days of OpenAI" (informally called "12 Days of Shipmas") event, a 12-day live-stream series that ran from December 5 onward, unveiling a new product or feature each weekday ^[16]. The version launched publicly was called Sora Turbo, a significantly faster and more capable iteration of the model shown in February.

Sora Turbo brought several improvements over the research preview ^[1]:

Faster generation: in a live demonstration, four 10-second videos were generated simultaneously in roughly 72 seconds, whereas the original model needed about 10 seconds to generate each second of video.
Support for text-to-video, image-to-video, and video-to-video generation.
A storyboard tool for specifying inputs frame by frame.
An Explore feed for discovering other users' public generations.
Loop and blend features for infinite playback and scene merging.
Visible watermarks by default and C2PA metadata to identify AI-generated content.
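The speed figures in the first item above can be put on a common footing with a little arithmetic. The sketch below treats the live demonstration as aggregate throughput across the four parallel clips, which is an assumption; the function name is hypothetical.

```python
def seconds_per_video_second(wall_clock_s: float, clips: int, clip_length_s: float) -> float:
    """Wall-clock seconds spent per second of generated video, across all clips."""
    return wall_clock_s / (clips * clip_length_s)


turbo = seconds_per_video_second(72, clips=4, clip_length_s=10)
preview = 10.0  # original model: about 10 s per second of video
speedup = preview / turbo
print(f"{turbo:.1f} s per video-second, about {speedup:.1f}x faster")
```

On these figures Sora Turbo spent about 1.8 seconds per second of output, roughly 5.6 times less than the research preview, although a single clip generated alone would not necessarily finish proportionally faster.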

At launch, Sora was available to ChatGPT Plus and Pro subscribers in most regions where ChatGPT operated. The United Kingdom, Switzerland, and the European Economic Area were excluded from the initial rollout because of regulatory uncertainty around the EU AI Act ^[17].

Sora Turbo was hosted on a dedicated web property at sora.com rather than inside the ChatGPT interface. The Plus tier capped output at 720p and 5 seconds per clip, while the Pro tier supported 1080p and clips up to 20 seconds. Both tiers could combine clips using the storyboard tool to produce longer composed sequences.

Sora 2 (September 2025)

What was new in Sora 2?

OpenAI announced Sora 2 on September 30, 2025, alongside a dedicated iOS app and plans for an Android version (which arrived on November 4, 2025) ^[2]^[18]^[19]. In its launch post, OpenAI framed the release in historical terms: "With Sora 2, we are jumping straight to what we think may be the GPT-3.5 moment for video" ^[2]. CEO Sam Altman called it a "ChatGPT for creativity" moment in his launch post on X. Sora 2 represented a large step forward in several areas:

Physics accuracy: The model handled realistic physical interactions more reliably. For example, a basketball that missed the hoop now rebounded off the backboard rather than clipping through it. OpenAI highlighted that Sora 2 could render actions difficult or impossible for prior models, including Olympic gymnastics routines, backflips on a paddleboard with accurate buoyancy dynamics, and triple axels ^[2].
Native audio generation: Sora 2 automatically generated synchronized dialogue, sound effects, and ambient audio to match the visuals, producing videos of 10 to 25 seconds with full audio ^[2].
Cameos feature: After a short one-time video-and-audio recording for identity verification and likeness capture, users could insert themselves or friends into any Sora-generated environment with high fidelity. Each cameo carried granular permission settings (private, friends-only, mutual followers, or anyone), and the owner could revoke access at any time ^[2]^[20].
Improved controllability: The model could follow detailed multi-shot instructions while maintaining consistent world state across cuts ^[2].
Style versatility: It performed well across realistic, cinematic, and anime visual styles ^[2].

The Sora app also functioned as a TikTok-style social platform, with a vertical feed of 10-second clips, like and comment buttons, remix tools, and user profiles. It rolled out invite-only at launch, with each existing user receiving codes to share with friends ^[21]. Within five days the app had crossed one million downloads, faster than ChatGPT had reached the same milestone, driven by nearly 100,000 daily installs, and reached the top of Apple's US App Store free-app chart ^[50]. By the end of October it had been downloaded approximately 2.7 million times on iOS ^[21].

What was Sora 2 Pro?

On October 14, 2025, OpenAI rolled out Sora 2 Pro, an experimental higher-quality variant available to ChatGPT Pro subscribers. Sora 2 Pro generated 1080p output at up to 25 seconds in length, with sharper detail, cleaner text rendering, and more reliable audio synchronization than the standard Sora 2 ^[22]. ChatGPT Pro subscribers could use Sora 2 Pro on sora.com and, soon after, in the Sora app itself.

When did the Sora app reach Android?

The Sora iOS app launched on September 30, 2025, in the United States and Canada ^[2]. Following its viral first week, OpenAI shipped an Android version on November 4, 2025, in the United States, Canada, Japan, South Korea, Taiwan, Thailand, and Vietnam ^[18]^[19]. OpenAI later disclosed that the Android client had been built in 28 days with heavy use of its own Codex coding assistant ^[23].

The app emphasized social features: a TikTok-style vertical feed, video remixing, public profiles, and the Cameos likeness system. Bill Peebles announced on X that the team was working on rolling out Sora in additional regions, including Europe, where regulatory uncertainty around the EU AI Act had again slowed launch ^[18].

Storyboard, remix, and other UI features

Throughout 2024 and 2025, OpenAI iterated on the Sora product surface with a steady stream of features ^[24]:

Storyboard tool: Users specified inputs frame by frame, dragging text prompts and images onto a timeline. The tool allowed composed clips longer than the underlying model's per-generation duration.
Remix: Users could fork an existing public video and modify the prompt, characters, or style while preserving overall composition.
Video extensions: Users could seamlessly continue any video, with the model preserving characters, settings, and visual style across the extension.
Photo-to-video with people: Eligible users could upload images containing people to create videos after attesting they had consent from the individuals featured. Images including children or young-looking individuals were subject to stricter moderation and automatically stylized to clearly signal AI-generated content.
New visual styles: Preset styles included "Golden," "Handheld," "Retro," and "Festive."
Cameos: Verified likeness insertion (Sora 2 onward).
Expanded geographic availability: Sora became available in additional Latin American countries, including Argentina, Chile, Colombia, Costa Rica, the Dominican Republic, Mexico, Panama, Paraguay, Peru, and Uruguay.

How much did Sora cost?

Sora was bundled with OpenAI's ChatGPT subscription tiers rather than sold as a standalone product. The pricing structure evolved over time; through early 2026 the tiers were as follows ^[25]^[26]:

Feature	ChatGPT Plus ($20/month)	ChatGPT Pro ($200/month)
Monthly credits	1,000	10,000
Priority videos (approx.)	~50	~500
Maximum resolution	720p	1080p
Maximum video length	5 s (Sora Turbo) / 15 s (Sora 2)	20 s (Sora Turbo) / 25 s (Sora 2 Pro)
Watermarks	Yes	Removable
Relaxed mode (unlimited)	No	Yes
Sora 2 Pro access	No	Yes

Pro subscribers also had access to an unlimited "relaxed" generation mode, where videos were queued at lower priority and processed during off-peak hours at no credit cost ^[26]. Free-tier users lost access to Sora's generation features on January 10, 2026 ^[25]. With the April 26, 2026 shutdown, all consumer access ended.

What did the Sora API cost?

OpenAI offered API access to Sora 2 with pricing based on model tier and output resolution ^[27]:

Model	Resolution	Price per second
Sora 2	720p	$0.10
Sora 2 Pro	720p	$0.30
Sora 2 Pro	1024×1792 (vertical) or 1792×1024 (landscape)	$0.50

At these rates, a 10-second standard video at 720p cost $1.00, while a 10-second Sora 2 Pro clip at the higher resolution cost $5.00. Developers needed at minimum a $10 API credit top-up (Tier 2) to unlock Sora model access. Rate limits scaled with tier: Plus subscribers got 5 requests per minute, Pro users got 50 requests per minute, and Enterprise accounts could negotiate 200 or more requests per minute with dedicated support ^[27]. The API supported reusable character references, video extensions, generations up to 20 seconds, 1080p output for the sora-2-pro model, and a POST /v1/videos/edits endpoint for editing existing videos ^[24].
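The per-second pricing in the table above translates directly into a cost estimate. The helper below is a minimal sketch for budgeting purposes only: the dictionary keys are informal labels taken from the table rather than API identifiers, and `clip_cost` is a hypothetical name, not part of any OpenAI SDK.

```python
from decimal import Decimal

# Price per second of generated video, copied from the table above.
# Keys are informal (model, resolution) labels, not API parameter values.
PRICE_PER_SECOND = {
    ("Sora 2", "720p"): Decimal("0.10"),
    ("Sora 2 Pro", "720p"): Decimal("0.30"),
    ("Sora 2 Pro", "high-res"): Decimal("0.50"),
}


def clip_cost(model: str, resolution: str, seconds: int) -> Decimal:
    """Estimated cost in US dollars for one generated clip."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return PRICE_PER_SECOND[(model, resolution)] * seconds


print(clip_cost("Sora 2", "720p", 10))          # 1.00
print(clip_cost("Sora 2 Pro", "high-res", 10))  # 5.00
```

`Decimal` is used instead of floating point so that currency amounts stay exact when many clips are summed.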

With the April 26, 2026 app shutdown, only the API remained available. OpenAI announced that the API itself would be discontinued on September 24, 2026, after which Sora would no longer be accessible through any OpenAI surface ^[5]^[6].

How did Sora compare to Veo, Runway, and Kling?

Sora operated in a rapidly expanding market for AI video generation. The major competitors as of mid-2026 included:

Model	Developer	Key features
Veo 3 / 3.1	Google (DeepMind)	Native 4K output, character consistency, vertical video, native audio; Ingredients to Video, Frames to Video, Insert/Remove Object with automatic lighting; available through Gemini Advanced
Movie Gen	Meta	30B-parameter text-to-video model, 16-second videos at 1080p (16fps), synchronized audio up to 45 seconds via a 13B audio model; announced October 2024 ^[28]
Runway Gen-4 / Gen-4.5	Runway	High visual quality, cinematic focus, realistic physics, widely used in professional post-production; partnerships with Lionsgate; no native audio as of mid-2026; $95/month Unlimited tier
Pika 2.1 Turbo	Pika Labs	Fast generation (30 to 90 seconds), creative effects, social-oriented; $28/month Pro tier
Kling 3.0	Kuaishou	Native 4K, multi-shot sequences with subject consistency, simultaneous audio-visual generation
Seedance 2.0	ByteDance	Unified audio-video architecture for natural reverb and proximity effects
Hailuo 02 / 2.3	MiniMax	1080p at 24 to 30 FPS, ranked second globally on Artificial Analysis benchmark as of early 2026
Hunyuan Video	Tencent	Open-weights release in late 2024, 13B parameters, text-to-video and image-to-video
Wan 2.1 / 2.6	Alibaba	Open-source Chinese model, released early 2025, integrated into the ModelScope ecosystem

On the public Artificial Analysis text-to-video leaderboard, Sora 2 Pro consistently ranked behind ByteDance's Seedance 2.0, Runway's Gen-4.5, and Kling 3.0 by mid-2026, reflecting both quality stagnation in the wake of Sora's stalled development and the rapid release cadence of competitors. Google's Veo 3.1 was widely cited as the strongest all-rounder, particularly for prompt adherence, native audio, and 4K output. Sora 2 retained advantages in human emotion rendering and physics simulation in the period before its discontinuation.

Two earlier models had been Sora's direct early competitors: Runway Gen-3 (July 2024) and Google's Veo, which was announced in May 2024 at Google I/O and evolved through Veo 2 in December 2024 and Veo 3 in May 2025. Notably, Veo 3 beat Sora to market with high-quality synchronized audio by several months.

What were Sora's limitations?

Despite its capabilities, Sora had known shortcomings that OpenAI publicly acknowledged in its technical report and system card.

Physics errors

The original Sora frequently failed to simulate complex physical dynamics correctly. A cookie might show no bite mark after a character takes a bite; a glass might not shatter when dropped; smoke could move in physically impossible patterns ^[4]. While Sora 2 improved physics accuracy, errors still occurred in scenes involving multiple interacting forces. Users found that combining several physical actions in a single prompt (such as pouring water while stirring a spoon) increased the likelihood of artifacts ^[29].

Spatial reasoning

Sora sometimes confused spatial directions, mixing up left and right or failing to follow precise positional descriptions in the prompt. This limitation affected tasks requiring exact object placement or character orientation within a scene ^[4].

Complex motion and human anatomy

Early versions of Sora produced particularly poor results for gymnastics, generating "strange shape-shifting humans that vault through the air and sometimes land on three legs or an extra head" ^[30]. While Sora 2 specifically highlighted improved gymnastics rendering as a benchmark, complex human motion remained an area where errors could appear, especially in close-up sequences with rapid limb movement.

Text rendering

Like most diffusion models, Sora struggled to render coherent on-screen text such as signs, captions, or product labels. Sora 2 Pro improved this somewhat by raising spatial resolution, but readable embedded text remained inconsistent.

Long video coherence

Maintaining narrative and visual consistency across longer durations was challenging. While Sora 2 improved multi-shot controllability, subtle inconsistencies in character appearance, clothing, or background details could still emerge over extended sequences. Most professional users limited individual clips to 5 to 10 seconds and stitched several together using the storyboard tool.

Causal reasoning

OpenAI conceded that Sora often broke down on cause-and-effect chains. Effects sometimes occurred before their causes, and characters could respond to events that had not yet been depicted, suggesting the model treated time as a dimension to fill rather than as a strict ordering ^[4].

What controversies did Sora face?

Artist protests

In November 2024, shortly before the public launch, a group of artists who had been granted early access to Sora leaked access to the model on Hugging Face. They published a manifesto accusing OpenAI of "art washing," claiming that the company used them as "PR puppets" to lend artistic credibility to a product they believed threatened their livelihoods, all without compensation ^[31]. OpenAI revoked the leaked access within roughly three hours and stated that "hundreds of artists" had shaped Sora's development through voluntary participation.

Copyright concerns

Sora's training data drew sustained legal scrutiny. A coalition of Japanese entertainment companies, including Studio Ghibli, Bandai Namco, and Square Enix, accused OpenAI of using copyrighted animation and design styles without permission. Japan's Content Overseas Distribution Association argued that OpenAI's "opt-out" system for rights holders improperly reversed the burden of consent, urging the company to stop using Japanese works until a legal framework was in place ^[32]. The Motion Picture Association in the United States issued a similar complaint on October 6, 2025, criticizing the opt-out approach for treating copyrighted material as default-permitted ^[5].

On the user-generated content side, the Sora app initially faced problems with users creating videos featuring copyrighted characters like SpongeBob and Pikachu. OpenAI shifted from an opt-out to an opt-in model for intellectual property and increased content restrictions, though this change contributed to declining user engagement ^[33].

The Mira Murati WSJ interview from March 2024, in which the OpenAI CTO appeared unable or unwilling to confirm whether YouTube videos had been used in training, became a frequently cited piece of evidence in legal filings and journalism about the model's data provenance ^[14].

Film industry reaction

The Sora announcement prompted a strong response from parts of the entertainment industry. Filmmaker Tyler Perry announced he would pause a planned $800 million expansion of his Atlanta studio, citing concerns about the potential impact of AI video generation on traditional filmmaking. Perry said pilots that traditionally cost $15 million to $35 million could in the future be produced "at a fraction of the cost," warning that "a lot of jobs are going to be lost" ^[34].

Major talent agencies also took protective action: Creative Artists Agency and United Talent Agency opted their clients out of Sora 2. United Talent Agency described the app as "exploitation, not innovation," while Creative Artists Agency warned that it "exposes our clients and their intellectual property to significant risk" ^[35].

Deepfake risks

The Sora 2 launch in September 2025 immediately triggered deepfake concerns. Unauthorized AI-generated clips using actor Bryan Cranston's voice and likeness appeared on the platform, including a viral clip of Cranston taking a selfie with Michael Jackson. Under pressure from Cranston and SAG-AFTRA, OpenAI updated its policy on October 20, 2025, to require opt-in consent before any person's likeness could be used and to give rights-holders "more granular control" over generations involving their clients ^[32]. Families of Robin Williams, George Carlin, Martin Luther King Jr., Kobe Bryant, and Paul Walker also complained to OpenAI about the misuse of their loved ones' likenesses on the platform ^[35]^[36].

Public Citizen, a US consumer advocacy group, called on OpenAI to suspend Sora 2 in November 2025, warning that its realistic video output could be weaponized for political deepfakes or non-consensual imagery ^[37]. A separate controversy emerged around a "dead celebrity loophole": since posthumous likeness rights vary widely across jurisdictions, families of deceased public figures had limited legal recourse. OpenAI blocked videos of Martin Luther King Jr. on the platform after users created what the company called "disrespectful depictions" ^[32].

Broader concerns about disinformation emerged rapidly after the Sora 2 launch. The Sora app saw AI-generated videos depicting ballot fraud, immigration arrests, protests, and fabricated crime scenes appear on its social feed within days of release ^[35]. UC Berkeley's School of Information warned that society was "unprepared for the next wave of increasingly realistic, personalized deepfakes" ^[35]. The most-viewed Sora 2 clip in the first week of launch was reportedly a parody video depicting CEO Sam Altman shoplifting GPU cards from a Target store, which Altman himself later acknowledged on X ^[38].

Popular culture reaction

The South Park episode "Sora Not Sorry," the third episode of season 28, aired on November 12, 2025, and satirized AI deepfakes and copyright issues by showing schoolchildren weaponizing Sora 2 against each other. Online creators colloquially nicknamed the Sora app's video stream "SlopTok," reflecting concerns that the platform was promoting low-effort, novelty-driven content rather than substantive creative work ^[38].

How did Sora handle C2PA and watermarking?

Generated videos contained C2PA (Coalition for Content Provenance and Authenticity) metadata identifying them as AI-produced ^[13]. The Sora Turbo launch (December 2024) added visible watermarks by default, removable only for ChatGPT Pro subscribers ^[1]. Sora 2 changed to a visible, moving digital watermark to make removal more difficult.

Despite these safeguards, third-party programs that could strip the moving watermark appeared within a week of Sora 2's release, undermining this safety measure ^[39]. The proliferation of watermark removers contributed to the spread of unlabeled Sora-generated clips on other social platforms and was a recurring point of criticism in coverage of OpenAI's safety practices.

OpenAI also built an internal search tool that used technical attributes of generated videos to help verify whether a piece of content came from Sora, aiding internal misuse tracking ^[1]. Security researchers at Reality Defender, a firm specializing in identifying deepfakes, reported they bypassed Sora's anti-impersonation safeguards within 24 hours of the Sora 2 launch. A Washington Post journalist demonstrated that the face-sharing feature could be exploited: granting cameo access to chosen contacts allowed those contacts to create videos of the person being arrested or engaging in fabricated scenarios, all without further approval ^[35].

What was the Disney partnership?

On December 11, 2025, The Walt Disney Company announced a $1 billion equity investment in OpenAI, accompanied by a three-year licensing agreement that made Disney the first major content licensing partner on the Sora platform ^[40]^[41]. Under the deal, Sora users would gain access to more than 200 animated, masked, and creature characters from Disney, Pixar, Marvel, and Star Wars, including costumes, props, vehicles, and iconic environments. Available characters were to include Mickey Mouse, Minnie Mouse, Lilo, Stitch, Ariel, Belle, Cinderella, Black Panther, Darth Vader, and Yoda ^[41]. The agreement explicitly excluded any talent likenesses or voices.

Beyond the Sora licensing, Disney also became a major enterprise customer of OpenAI, using its APIs to build internal tools and experiences for Disney+, and deploying ChatGPT for its employees. A selection of fan-inspired Sora short-form videos became available to stream on Disney+. The character licensing on Sora and ChatGPT Images was expected to go live in early 2026 ^[40]^[41].

The partnership effectively dissolved when Sora itself was wound down. According to TechCrunch, Disney was informed of OpenAI's decision to shutter the Sora app less than an hour before the public announcement on March 24, 2026 ^[42].

Why did engagement decline in 2026?

Starting January 10, 2026, OpenAI removed free-tier access to video and image generation in Sora, restricting it to Plus and Pro subscribers ^[25]. As of March 13, 2026, Sora 1 was no longer available in the United States, and the app opened in Sora 2 by default for US users ^[43].

The standalone Sora app experienced a sharp drop in engagement after its initial launch excitement. App installs fell 32% in December 2025 and another 45% in January 2026, dropping to 1.2 million installs that month, while consumer spending fell 32% over the same period. On the US App Store, Sora fell out of the Top 100 free apps. Third-party measurement firms reported 30-day retention of approximately 1%, an industry-low figure that analysts attributed to the novelty wearing off and to OpenAI tightening intellectual-property restrictions ^[33]^[44].
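The install figures above compound, which the following back-of-the-envelope calculation makes explicit. It assumes both percentages are month-over-month declines and that the 1.2 million figure is the January 2026 total; the December and November values it derives are implied by that arithmetic, not reported numbers.

```python
january_installs_m = 1.2  # reported, in millions
dec_drop, jan_drop = 0.32, 0.45  # reported month-over-month declines

# Work backwards from January to the implied earlier months.
december_installs_m = january_installs_m / (1 - jan_drop)
november_installs_m = december_installs_m / (1 - dec_drop)

# Two successive declines multiply rather than add.
cumulative_drop = 1 - (1 - dec_drop) * (1 - jan_drop)

print(f"implied November: {november_installs_m:.2f}M")
print(f"implied December: {december_installs_m:.2f}M")
print(f"cumulative decline: {cumulative_drop:.1%}")
```

Under those assumptions, monthly installs fell by about 63% over two months, from roughly 3.2 million to 1.2 million.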

In response, OpenAI signaled plans to integrate Sora's video generation capabilities directly into ChatGPT. The Information reported this plan on March 11, 2026, noting that the move aimed to reach a broader user base and push toward OpenAI's goal of 1 billion weekly active users ^[45].

When and why was Sora shut down?

On March 24, 2026, OpenAI announced via X that it would discontinue the Sora app. The web and iOS/Android experiences shut down on April 26, 2026, while the developer API was scheduled to remain available until September 24, 2026 ^[5]^[6]. OpenAI urged users to download any saved generations before the data deletion deadlines.

The company did not give a single official reason, but reporting from the Wall Street Journal, TechCrunch, CNN, and others converged on four factors:

Compute costs: Reports placed daily inference costs between roughly $1 million and as much as $15 million during peak usage, while lifetime app revenue was estimated at only about $2.1 million ^[42]^[46].
User collapse: Active users fell from a peak of around one million to fewer than 500,000 by early 2026, with severe drop-off after the first 30 days ^[42].
Strategic shift: OpenAI was preparing for a potential IPO and reallocating compute toward enterprise and coding products, including ChatGPT Enterprise and Codex infrastructure, where rival Anthropic had been gaining share via Claude Code ^[42].
Mounting copyright pressure: Lawsuits and rights-holder complaints from Japan and the United States made the social feed increasingly costly to police ^[5]^[42].
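
The gap between the reported compute costs and revenue can be made concrete with simple division. The sketch below uses only the estimates quoted in the list above and asks how long the estimated lifetime app revenue would have covered inference at each end of the reported cost range. These are press estimates rather than audited figures, and the names used are illustrative.

```python
# How long would estimated lifetime revenue cover reported inference costs?
# All inputs are the press estimates quoted above, not audited figures.
LIFETIME_REVENUE_USD = 2_100_000   # estimated lifetime app revenue
DAILY_COST_LOW_USD = 1_000_000     # low end of reported daily inference cost
DAILY_COST_HIGH_USD = 15_000_000   # reported peak daily inference cost


def days_covered(revenue_usd: float, daily_cost_usd: float) -> float:
    """Return how many days of inference the revenue would pay for."""
    return revenue_usd / daily_cost_usd


days_at_low_cost = days_covered(LIFETIME_REVENUE_USD, DAILY_COST_LOW_USD)
days_at_high_cost = days_covered(LIFETIME_REVENUE_USD, DAILY_COST_HIGH_USD)
hours_at_high_cost = days_at_high_cost * 24

print(f"At $1M/day: {days_at_low_cost:.1f} days of inference")
print(f"At $15M/day: {hours_at_high_cost:.1f} hours of inference")
```

On these estimates, the app's entire lifetime revenue would have paid for about two days of inference at the low end of the range and only a few hours at the reported peak.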

On April 17, 2026, 24 days after the shutdown announcement and nine days before the app went dark, three senior OpenAI executives announced their departures. Sora head Bill Peebles, chief product officer Kevin Weil, and enterprise CTO Srinivas Narayanan all confirmed their exits via X ^[7]^[47]. Peebles wrote that he was "proud of all the sleepless nights before and after the launch this team endured in order to deploy the technology in a responsible way and help steer societal norms." He credited Sora with igniting a "huge amount of investment in video across the industry" ^[7].
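
The intervals between the dates in this section can be checked with Python's standard `datetime` module. This is a small sketch using only the dates stated above; the constant names are illustrative.

```python
from datetime import date

# Key dates as stated in this section.
ANNOUNCEMENT = date(2026, 3, 24)   # OpenAI announces the discontinuation
DEPARTURES = date(2026, 4, 17)     # Peebles, Weil, and Narayanan exit
APP_SHUTDOWN = date(2026, 4, 26)   # web and mobile experiences shut down
API_SUNSET = date(2026, 9, 24)     # scheduled end of the developer API


def days_between(start: date, end: date) -> int:
    """Return the number of calendar days from start to end."""
    return (end - start).days


announcement_to_departures = days_between(ANNOUNCEMENT, DEPARTURES)
departures_to_shutdown = days_between(DEPARTURES, APP_SHUTDOWN)
announcement_to_shutdown = days_between(ANNOUNCEMENT, APP_SHUTDOWN)
shutdown_to_api_sunset = days_between(APP_SHUTDOWN, API_SUNSET)

print(f"Announcement to departures: {announcement_to_departures} days")
print(f"Departures to app shutdown: {departures_to_shutdown} days")
print(f"Announcement to app shutdown: {announcement_to_shutdown} days")
print(f"App shutdown to API sunset: {shutdown_to_api_sunset} days")
```

The departures came 24 days after the announcement and 9 days before the app shutdown; users had 33 days of notice before the consumer experience closed, and the API was scheduled to outlive the app by 151 days.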

What is Sora's legacy and current status?

Sora's broader cultural impact outpaced its commercial trajectory. Within days of the February 2024 demonstration, the model became a centerpiece of the AI policy conversation in Washington, the United Kingdom, and Brussels, with regulators citing it in early discussions about provenance standards and synthetic-media disclosure.

In Hollywood, the response was sharply divided. Some independent filmmakers and effects houses adopted Sora for previsualization, mood boards, and storyboarding; others, led by Tyler Perry's high-profile pause, treated it as an existential threat. Several notable Sora-produced works circulated during its public availability:

Project	Type	Date	Details
Air Head (Shy Kids)	Short film	February 2024	One of the original early-access showcase pieces from Toronto-based collective Shy Kids; widely shown at film festivals
Worldweight (August Kamp)	Music video	March 2024	Two-minute video to a mellow electronic song; one of the first music releases tied to Sora
Abstract / The Golden Record (Paul Trillo)	Music video	May 2024	LA director Paul Trillo's first commissioned music video using Sora, stitched from 55 separate clips ^[48]
Toys R Us "The Origin of Toys R Us"	Brand film	June 2024	66-second commercial premiered at Cannes Lions Festival; depicted a young Charles Lazarus and the Geoffrey the Giraffe mascot ^[49]
Balenciaga "Escape from Neusman"	Fashion film	Fall 2024	30-second commercials for the brand's Fall 2024 collection

Despite the shutdown, Sora left durable technical and commercial legacies. The diffusion-transformer-on-spacetime-patches recipe became the dominant architectural pattern for video generation, adopted in various forms by Veo, Runway, Kling, and the Wan and Hunyuan open-source families. The cameo flow Sora pioneered became a template for likeness-controlled generation across the industry, with rivals adopting similar opt-in mechanisms following the Cranston and SAG-AFTRA episode.

As of May 2026, Sora was no longer accessible to consumers; only the API remained, with a hard sunset scheduled for September 24, 2026. OpenAI had signaled that the underlying video generation capabilities would resurface as a feature within ChatGPT rather than as a standalone product ^[45].

Where does the name Sora come from?

The name "Sora" comes from the Japanese word meaning "sky," which OpenAI chose to evoke the model's limitless creative potential ^[4].

References

1. OpenAI. "Sora is here." OpenAI, December 9, 2024. https://openai.com/index/sora-is-here/
2. OpenAI. "Sora 2 is here." OpenAI, September 30, 2025. https://openai.com/index/sora-2/
3. MindStudio. "What Is OpenAI Sora 2 Pro? The Premium AI Video Model from OpenAI." 2025. https://www.mindstudio.ai/blog/what-is-openai-sora-2-pro-video
4. OpenAI. "Video generation models as world simulators." OpenAI Research, February 15, 2024. https://openai.com/index/video-generation-models-as-world-simulators/
5. OpenAI Help Center. "What to know about the Sora discontinuation." March 24, 2026. https://help.openai.com/en/articles/20001152-what-to-know-about-the-sora-discontinuation
6. CNN Business. "OpenAI is shutting down its Sora video app just months after launch." March 24, 2026. https://edition.cnn.com/2026/03/24/tech/openai-sora-video-app-shutting-down
7. TechCrunch. "Kevin Weil and Bill Peebles exit OpenAI as company continues to shed 'side quests.'" April 17, 2026. https://techcrunch.com/2026/04/17/kevin-weil-and-bill-peebles-exit-openai-as-company-continues-to-shed-side-quests/
8. TechCrunch. "A co-lead on Sora, OpenAI's video generator, has left for Google." October 3, 2024. https://techcrunch.com/2024/10/03/a-co-lead-on-sora-openais-video-generator-has-left-for-google/
9. Aditya Ramesh. Personal website. http://adityaramesh.com/
10. Peebles, William, and Saining Xie. "Scalable Diffusion Models with Transformers." Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023. https://arxiv.org/abs/2212.09748
11. Towards Data Science. "Explaining OpenAI Sora's Spacetime Patches: The Key Ingredient." 2024. https://towardsdatascience.com/explaining-openai-soras-spacetime-patches-the-key-ingredient-e14e0703ec5b/
12. OpenAI. "Sora 2 System Card." September 30, 2025. https://openai.com/index/sora-2-system-card/
13. OpenAI. "Sora System Card." OpenAI, December 2024. https://openai.com/index/sora-system-card/
14. The Decoder. "OpenAI CTO Mira Murati doesn't know what data Sora was trained on." March 14, 2024. https://the-decoder.com/openai-cto-mira-murati-doesnt-know-what-data-sora-was-trained-on/
15. Wikipedia. "Sora (text-to-video model)." https://en.wikipedia.org/wiki/Sora_(text-to-video_model)
16. TechRadar. "12 Days of OpenAI: Everything that was announced." December 2024. https://www.techradar.com/news/live/12-days-of-open-ai-live-blog
17. TechCrunch. "OpenAI's Sora video generator is launching for ChatGPT Pro and Plus subscribers, but not in the EU." December 9, 2024. https://techcrunch.com/2024/12/09/openais-sora-video-generator-might-not-be-available-in-the-eu-at-launch/
18. TechCrunch. "Sora is now available on Android in the US, Canada, and other regions." November 4, 2025. https://techcrunch.com/2025/11/04/sora-is-now-available-on-android-in-the-us-canada-and-other-regions/
19. CNBC. "OpenAI launches Sora for Android devices." November 4, 2025. https://www.cnbc.com/2025/11/04/openai-sora-android.html
20. OpenAI Help Center. "Generating content with cameos." 2025. https://help.openai.com/en/articles/12435986-generating-content-with-cameos
21. CNBC. "How to get Sora app invite codes for OpenAI's viral AI video creator." October 3, 2025. https://www.cnbc.com/2025/10/03/sora-app-invite-codes-openai-viral-ai-video.html
22. Skywork AI. "OpenAI Sora 2 Launch: Game-Changer for AI Video in 2025." 2025. https://skywork.ai/blog/openai-sora-2-2025/
23. OpenAI. "How we used Codex to build Sora for Android in 28 days." 2025. https://openai.com/index/shipping-sora-for-android-with-codex/
24. OpenAI Help Center. "Sora - Release Notes." 2026. https://help.openai.com/en/articles/12593142-sora-release-notes
25. Apiyi. "OpenAI Sora 2 Pricing Policy Update: Starting January 2026." 2026. https://help.apiyi.com/en/openai-sora-2-policy-change-plus-pro-only-en.html
26. OpenAI Help Center. "Sora Billing FAQ." https://help.openai.com/en/articles/10245774-sora-billing-faq
27. AI Free API. "Sora 2 API Pricing & Quotas: Complete 2026 Guide." 2026. https://www.aifreeapi.com/en/posts/sora-2-api-pricing-quotas
28. Meta AI. "Movie Gen: A Cast of Media Foundation Models." arXiv, October 2024. https://arxiv.org/abs/2410.13720
29. Skywork AI. "Sora 2 Troubleshooting: How to Fix Its 5 Most Annoying Errors." 2025. https://skywork.ai/blog/sora-2-how-to-fix-its-5-most-annoying-errors/
30. Fast Company. "Why OpenAI's Sora has so much trouble depicting gymnasts." 2024. https://www.fastcompany.com/91245684/why-openai-sora-has-so-much-trouble-depicting-gymnasts
31. The Washington Post. "OpenAI stops Sora video model access after artists leak in protest." November 26, 2024. https://www.washingtonpost.com/technology/2024/11/26/openai-sora-ai-video-model-artists-protest/
32. CNBC. "OpenAI cracks down on Sora 2 deepfakes after pressure from Bryan Cranston, SAG-AFTRA." October 20, 2025. https://www.cnbc.com/2025/10/20/open-ai-sora-bryan-cranston-sag-aftra.html
33. TechCrunch. "OpenAI's Sora app is struggling after its stellar launch." January 29, 2026. https://techcrunch.com/2026/01/29/openais-sora-app-is-struggling-after-its-stellar-launch/
34. Variety. "Tyler Perry Halts $800M Studio Expansion, Citing Concerns Over OpenAI's Sora." February 22, 2024. https://variety.com/2024/film/news/tyler-perry-studio-expansion-openai-sora-concerns-1235920007/
35. NPR. "Sora gives deepfakes 'a publicist and a distribution deal.' It could change the internet." October 10, 2025. https://www.npr.org/2025/10/10/nx-s1-5567162/sora-ai-openai-deepfake
36. NBC News. "OpenAI strengthens Sora 2 guardrails after actor Bryan Cranston raises alarm." October 20, 2025. https://www.nbcnews.com/tech/tech-news/openai-sora-2-guardrails-sag-aftra-bryan-cranston-rcna238715
37. Euronews. "Watchdog group Public Citizen calls on OpenAI to scrap AI video app Sora, citing deepfake risks." November 12, 2025. https://www.euronews.com/next/2025/11/12/watchdog-group-public-citizen-calls-on-openai-to-scrap-ai-video-app-sora-citing-deepfake-r
38. PC Gamer. "Apparently the most popular clip on OpenAI's new AI video app Sora depicts Sam Altman stealing graphics cards." October 2025. https://www.pcgamer.com/software/ai/apparently-the-most-popular-clip-on-openais-new-ai-video-app-sora-depicts-sam-altman-stealing-graphics-cards/
39. 404 Media. "Programs to remove Sora 2 watermarks flooding the web." October 7, 2025. https://www.404media.co/
40. OpenAI. "The Walt Disney Company and OpenAI reach landmark agreement to bring beloved characters from across Disney's brands to Sora." December 11, 2025. https://openai.com/index/disney-sora-agreement/
41. CNBC. "Disney making $1 billion investment in OpenAI, will allow characters on Sora AI video generator." December 11, 2025. https://www.cnbc.com/2025/12/11/disney-openai-sora-characters-video.html
42. TechCrunch. "Why OpenAI really shut down Sora." March 29, 2026. https://techcrunch.com/2026/03/29/why-openai-really-shut-down-sora/
43. OpenAI Help Center. "Sora 1 Sunset FAQ." 2026. https://help.openai.com/en/articles/20001071-sora-1-sunset-faq
44. Sherwood News. "OpenAI's Sora 2 started off scorching hot. Things have slowed down since." 2026. https://sherwood.news/tech/openais-sora-2-started-off-scorching-hot-things-have-slowed-down-since/
45. Creati.ai. "OpenAI Plans to Integrate Sora Video Generation Directly Into ChatGPT." March 14, 2026. https://creati.ai/ai-news/2026-03-14/openai-integrates-sora-video-generation-chatgpt-2026/
46. 80lv. "Sora Was Reportedly Costing OpenAI $1 Million Per Day." 2026. https://80.lv/articles/sora-was-reportedly-costing-openai-usd1-million-per-day
47. The Next Web. "OpenAI loses product chief, Sora head, and enterprise CTO in single-day triple exit." April 18, 2026. https://thenextweb.com/news/openai-departures-kevin-weil-sora-peebles-enterprise-pivot
48. New Atlas. "OpenAI's Sora makes its first official music video." May 2024. https://newatlas.com/technology/openai-sora-first-commissioned-music-video/
49. CNN Business. "Toys R Us made an ad almost entirely AI. It's a sign of how creators are navigating the new tech." June 25, 2024. https://www.cnn.com/2024/06/25/tech/toys-r-us-sora-ai
50. CNBC. "OpenAI's Sora hit 1 million downloads in less than five days." October 9, 2025. https://www.cnbc.com/2025/10/09/openais-sora-downloads.html
