# Captions (app)

> Source: https://aiwiki.ai/wiki/captions_ai
> Updated: 2026-06-04
> Categories: AI Companies, Generative AI, Video Generation
> From AI Wiki (https://aiwiki.ai), a free encyclopedia of artificial intelligence. Quote with attribution.

**Captions** (stylized **captions.ai**, legally NOCAP, Inc.) is an [artificial intelligence](/wiki/generative_ai)-powered video creation and editing company and product based in New York City. Founded in 2021 by Gaurav Misra and Dwight Churchill, the company first became known for a mobile app that automatically generated subtitles for "talking videos" and then corrected a speaker's gaze toward the camera with a feature called AI Eye Contact. It later expanded into a broader AI creative studio offering automated editing, dubbing and translation, and AI-generated avatars, and in 2025 introduced an in-house [video-generation](/wiki/video_generation) foundation model named Mirage. In September 2025 the parent company rebranded itself from Captions to Mirage to reflect a shift toward building proprietary video models, while keeping the Captions name on its consumer app. By 2024 the product had surpassed 10 million mobile downloads, and the company had raised roughly $100 million in venture funding at a reported $500 million valuation, followed by a $75 million growth round in 2026.

## Company

### Founding

Captions was founded in 2021 in New York City. Co-founder and chief executive Gaurav Misra previously led design engineering at Snap, where he worked on social products including Spotlight and Snap Map, and earlier was a software engineer at Microsoft. Co-founder Dwight Churchill, who serves as chief operating officer, was previously an early member of the team that built Marcus, Goldman Sachs' consumer finance business. The legal entity operates as NOCAP, Inc., doing business as Captions.

According to Misra, the product found its market almost by accident. The team had built a video app that was not gaining traction, put a roughly $10-per-month paywall on it, and largely moved on. Returning months later, they discovered that paying users had accumulated and were specifically using the captioning feature, which led the founders to refocus the entire company on AI-assisted video for creators. The app concentrated on the "talking video" format, in which a creator addresses the camera directly to share opinions, advice, or personal stories, a format Misra had watched rise during his time at Snap.

### Funding

Captions raised across several rounds, with early backing from Sequoia Capital and Andreessen Horowitz (a16z) and later rounds led by Kleiner Perkins and Index Ventures. Investors and aggregators describe an early seed investment followed by a Series A co-led by Sequoia and a16z in 2022. The company's Series B, announced on June 22, 2023, raised $25 million led by Kleiner Perkins with participation from Sequoia Capital, Andreessen Horowitz, and SV Angel, bringing total capital raised to about $40 million at that point.

On July 9, 2024, Captions announced a $60 million Series C led by Index Ventures, which valued the company at $500 million and brought total funding to roughly $100 million. Returning investors Kleiner Perkins, Andreessen Horowitz, and Sequoia Capital took part, alongside new investors Adobe Ventures, HubSpot Ventures, and the actor Jared Leto. The company framed the round as a commitment to "invest $100M into advancing generative video research from New York City," with the money going toward expanding its machine-learning team and technical infrastructure.

On March 24, 2026, by then operating as Mirage, the company announced $75 million in growth financing from General Catalyst's Customer Value Fund, capital structured to be repaid out of revenue rather than diluting equity in a traditional priced round.

| Round | Date | Amount | Lead investor | Notable participants | Reported valuation |
|---|---|---|---|---|---|
| Seed / early | 2021-2022 | early-stage | Sequoia Capital, Andreessen Horowitz | SV Angel | not disclosed |
| Series A | 2022 | undisclosed | Sequoia Capital and Andreessen Horowitz (co-led) | various | not disclosed |
| Series B | June 22, 2023 | $25 million | Kleiner Perkins | Sequoia, a16z, SV Angel | not disclosed (total raised ~$40M) |
| Series C | July 9, 2024 | $60 million | Index Ventures | Kleiner Perkins, a16z, Sequoia, Adobe Ventures, HubSpot Ventures, Jared Leto | $500 million (total ~$100M) |
| Growth (as Mirage) | March 24, 2026 | $75 million | General Catalyst (Customer Value Fund) | revenue-based financing | not disclosed |

Because some early-stage figures appear only in third-party funding trackers and are reported inconsistently (one source describes a roughly $15 million seed, others list a smaller seed plus a Series A of around $11 million), the precise amounts and dates of the seed and Series A are treated here as approximate; the $25 million Series B (June 2023), the $60 million Series C (July 2024), and the $75 million growth round (March 2026) are the figures confirmed across multiple independent reports.

### Acquisitions and hires

On November 13, 2024, Captions made its first acquisition, buying AlpacaML (also styled Alpaca), a generative-AI rendering tool founded in 2022 that turned sketches, thumbnails, and images into finished, styled visuals. AlpacaML's chief executive William Buchwalter joined Captions as a research engineer, and job offers were extended to all six of AlpacaML's employees. Around the same time, the company hired Drew Jaegle, formerly of [Google DeepMind](/wiki/google_deepmind), as Head of AI; at DeepMind he had worked on multimodal representation and generative models. The acquisition and hire were positioned as the build-out of an in-house AI research arm.

## Products

### Captions app

The Captions mobile app, launched for iOS in 2021 and later on Android, is an AI-assisted studio for short-form, creator-style video. Its features grew over time and have included:

- **Automatic captions / subtitles:** the original feature, transcribing and styling on-screen text for talking videos.
- **AI Eye Contact:** a post-production tool that analyzes footage and subtly realigns a speaker's gaze so they appear to look into the camera, even when the original take had them glancing at a script or away from the lens. The app also includes a built-in camera and teleprompter for recording.
- **AI Edit:** automated editing that removes filler words such as "ums" and "ahs," reduces background noise, and enhances speech.
- **AI Music:** a feature that suggests and adds fitting music to a video.
- **AI Dubbing / translation:** lip-synced dubbing into many languages, also released as a standalone app called Lipdub.
- **AI Creator and AI Twin:** tools that generate videos with AI avatars. AI Creator produces videos using a 3D avatar, and users can build their own AI Twin from selfies to act as a virtual spokesperson for user-generated-content (UGC) style ads.

The app monetizes through subscriptions; in January 2025 the company moved its mobile app to a freemium model. Earlier reporting cited a subscription tier in the range of roughly $10 per month.

### Lipdub

In October 2023, Captions released Lipdub, a free standalone iOS app for AI dubbing and translation. Lipdub translated a video of a single person speaking (up to about one minute) into 28 languages, including French, Hindi, Spanish, Italian, Portuguese, and Japanese, and adjusted the speaker's lip movements to match the target language. It also offered novelty "languages" such as Gen Z slang and pirate speech. Contemporary coverage noted occasional lag between audio and lip movement. The same lip-modification technology underpinned an in-app "AI Lipdub" feature that could change a speaker's mouth movements when the transcript was edited in post-production.

### Mirage (foundation model)

On March 12, 2025, the company introduced Mirage, which it described as the world's first video foundation model purpose-built for generating UGC-style ads and talking content, developed entirely in-house. Rather than applying dubbing or [lip-sync](/wiki/speech_recognition) to licensed footage of real actors, Mirage generates complete scenes from scratch: hyper-realistic talking people, objects, and backgrounds that do not exist. The company's argument for the approach is that much of human communication is carried by facial expressions, micro-reactions, and body language, so it generates full facial and upper-body motion rather than only matching lips to audio.

Mirage can produce video from a text prompt, a script, a video file, or an audio file alone, and lets users specify a speaker's apparent age, gender, clothing, background, and on-screen products. The company says it supports more than 29 languages and preserves accents in generated speech. Mirage is a [generative](/wiki/generative_ai) media model in the same broad family as other [diffusion-model](/wiki/diffusion_model) and [text-to-speech](/wiki/text_to_speech_ai) systems, combining synthetic video with synthetic voice; the company has emphasized building the underlying [machine learning](/wiki/machine_learning) and [deep learning](/wiki/deep_learning) stack itself rather than licensing third-party models.

### Mirage Studio

Mirage Studio, launched in June 2025, is a web platform aimed at brands and advertisers that uses the Mirage model to generate short ads without human talent, stock footage, voice cloning, or traditional lip-syncing. A user can submit an audio file and the system generates the video, including an AI background and AI avatars, or upload selfies to create an avatar of a specific likeness. The company says these AI actors can laugh, sing, rap, gesture, flinch, and convey micro-expressions. Mirage Studio is offered under a business plan reported at $399 per month for 8,000 credits, with a 50 percent discount on the first month for new users. Following the 2026 growth round, the company also described a web-based "marketing suite" for bulk creation and distribution of videos by companies.

## Rebrand to Mirage

On September 4, 2025, the company announced that it was rebranding from Captions to Mirage at the corporate level, while the consumer app would keep the Captions name. Misra framed the change as a move from being a creator-tools company toward being an AI research lab focused on multimodal foundation models for short-form video on platforms such as TikTok, Reels, and Shorts, telling [TechCrunch](/wiki/techcrunch) that "the real race for AI video hasn't begun." Under the new structure, the Captions app, the Mirage model, and Mirage Studio all sit under the Mirage brand.

Because the rebrand consolidated AI-generated talking-head video, the company addressed concerns about deepfakes and impersonation. It said it had moderation measures intended to prevent impersonation and to require consent for the use of a person's likeness, while acknowledging that product design "isn't a catch-all" and arguing that part of the answer is a broader "new kind of media literacy."

## Adoption

Captions grew quickly as a consumer product. By the time of its June 2023 Series B, the company reported more than three million users; an October 2023 account cited over 100,000 daily active users producing on the order of a million videos a month. At the July 2024 Series C, the company reported more than 10 million mobile downloads and roughly three million monthly active users.

By the 2026 growth round, the company (as Mirage) reported that more than 200 million videos had been created with its tools to date. Third-party app analytics cited at that time indicated over 3.2 million downloads and about $28.4 million in in-app revenue over the preceding 365 days. The company also said its user base was heavily international, with only about a quarter of revenue coming from the United States.

## Related

- [Generative AI](/wiki/generative_ai)
- [Video generation](/wiki/video_generation)
- [Diffusion model](/wiki/diffusion_model)
- [Text-to-speech AI](/wiki/text_to_speech_ai)
- [Runway ML](/wiki/runway_ml)
- [ElevenLabs](/wiki/elevenlabs)
- [Google DeepMind](/wiki/google_deepmind)

## References

1. "Captions rebrands as Mirage, expands beyond creator tools to AI video research." TechCrunch, September 4, 2025. https://techcrunch.com/2025/09/04/captions-rebrands-as-mirage-expands-beyond-creator-tools-to-ai-video-research/
2. "Mirage raises $75M to continue building models for its AI video-editing app Captions." TechCrunch, March 24, 2026. https://techcrunch.com/2026/03/24/mirage-raises-75m-to-continue-building-models-for-its-ai-video-editing-app-captions/
3. "Mirage (Captions) raises $60M in Series C funding to invest in generative video research." Index Ventures, July 2024. https://www.indexventures.com/perspectives/captions-raises-60m-in-series-c-funding-to-invest-in-generative-video-research/
4. "Captions Raises Series C to Invest $100M in Pioneering AI Video Research in New York City." Business Wire, July 9, 2024. https://www.businesswire.com/news/home/20240709061123/en/Captions-Raises-Series-C-to-Invest-$100M-in-Pioneering-AI-Video-Research-in-New-York-City
5. "Captions Announces $60M Series C, Hits $500M Valuation." Maginative, July 9, 2024. https://www.maginative.com/article/captions-announces-60m-series-c-hits-500m-valuation/
6. "AI Video Startup Captions Valued at USD 500M in USD 60M Series C." Slator, July 2024. https://slator.com/ai-video-startup-captions-valued-at-usd-500m-in-usd-60m-series-c/
7. "Captions Celebrates $25 Million In Series B Funding With A Launch Propelled By Stellar Reception On Apple App Store." Business Wire, June 22, 2023. https://www.businesswire.com/news/home/20230622730984/en/Captions-Celebrates-%2425-Million-In-Series-B-Funding-With-A-Launch-Propelled-By-Stellar-Reception-On-Apple-App-Store
8. "AI video creation app Captions bags $25M from top VCs." VentureBeat, June 22, 2023. https://venturebeat.com/ai/ai-video-creation-app-captions-bags-25m-from-top-vcs
9. "Captions Raises $25M in Series B Funding." FinSMEs, June 22, 2023. https://www.finsmes.com/2023/06/captions-raises-25m-in-series-b-funding.html
10. "Investing in Captions." Andreessen Horowitz (a16z), June 22, 2023. https://a16z.com/announcement/investing-in-captions/
11. "Video editing startup Captions launches a dubbing app, Lipdub, with support for 28 languages." TechCrunch, October 11, 2023. https://techcrunch.com/2023/10/11/video-editing-startup-captions-launches-a-dubbing-app-with-support-for-28-languages/
12. "Mirage: The World's First Foundation Model for UGC Video." Captions (Mirage) Blog, March 12, 2025. https://captions.ai/blog/mirage-worlds-first-foundation-model-for-ugc-video
13. "New name, new chapter: Captions is now Mirage." Mirage Blog, September 4, 2025. https://www.captions.ai/blog-post/new-name-new-chapter-captions-is-now-mirage
14. "AI video editing startup Captions acquires rendering tool AlpacaML." American Bazaar, November 29, 2024. https://americanbazaaronline.com/2024/11/29/ai-video-editing-startup-captions-acquires-alpacaml/
15. "Captions Hires DeepMind Alum to Head up AI and Acquires AlpacaML, Expanding AI Research Arm." Captions (Mirage) Blog, November 13, 2024. https://www.captions.ai/blog-post/captions-acquires-alpacaml-and-hires-deepmind-alum-to-head-up-ai-expanding-ai-research-arm
16. "Generative video studio Captions acquires AlpacaML." Axios, November 13, 2024. https://www.axios.com/pro/media-deals/2024/11/13/captions-video-editing-app-alpacaml
17. "Captions launches Mirage Studio for AI-generated UGC videos with lifelike digital avatars." AlternativeTo, June 2025. https://alternativeto.net/news/2025/6/captions-launches-mirage-studio-for-ai-generated-ugc-videos-with-lifelike-digital-avatars/
18. "Gaurav Misra & Dwight Churchill - Building Captions [Invest Like the Best, EP.405]." Colossus, 2025. https://colossus.com/episode/building-captions/
19. "Gaurav Misra On Raising $100 Million To Develop AI-Powered Tools For Video Editing And Creation." Alejandro Cremades, 2024. https://alejandrocremades.com/gaurav-misra/
20. "Captions (app)." Wikipedia. https://en.wikipedia.org/wiki/Captions_(app)
21. "AI Eye Contact Adjustment for Videos." Mirage Help Center. https://help.mirage.app/docs/visual/eye-contact

