Lyria 2

Generative AI Google DeepMind Music & Audio Generation

8 min read

Updated Jun 28, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 28, 2026

Fact-checked

In review queue

Sources

10 citations

Revision

v2 · 1,633 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

Lyria 2 is a high-fidelity, text-to-music generation model built by Google DeepMind that turns text prompts into professional-grade instrumental audio. Announced on April 24, 2025, it is the second generation of Google's Lyria music technology and powers the Music AI Sandbox, with the same release also introducing an interactive streaming variant called Lyria RealTime ^[1]. Lyria 2 became generally available to enterprises on Google Cloud's Vertex AI platform on May 21, 2025, where it produces 48 kHz stereo audio clips of about 30 seconds and embeds a SynthID watermark in every output to mark the audio as AI-generated ^[2]^[5]^[6].

Google DeepMind describes the model as one that "delivers high-fidelity music and professional-grade audio outputs that capture subtle nuances across a range of genres and intricate compositions" ^[1]. It was released alongside the video model Veo 3 and the image model Imagen 4 as part of Google's generative media portfolio for Vertex AI ^[2].

What is Lyria 2?

Lyria 2 is Google DeepMind's text-to-music generation model: a user supplies a written prompt and the model renders high-fidelity instrumental audio in response ^[1]. Google positioned it as a professional-grade text-to-music system, capable of rendering a range of genres and intricate arrangements while capturing the subtle differences between instruments and playing styles ^[1]. It is the underlying generation model in the refreshed Music AI Sandbox and is exposed on Vertex AI as the model lyria-002 ^[1]^[5]. Every clip it generates carries a SynthID watermark, in keeping with the rest of the Lyria line ^[1]^[2].

How does Lyria 2 relate to the original Lyria?

Lyria 2 is the second generation of a model family that Google DeepMind first unveiled on November 16, 2023, in partnership with YouTube ^[3]. That original Lyria was described at the time as the company's most advanced music generation model, and it launched with two applications. The first was Dream Track, an experiment on YouTube Shorts in which a small group of creators could type a topic, pick a participating artist from a carousel, and receive a 30-second soundtrack with an AI-generated voice, lyrics, and backing track in that artist's style ^[3]. The launch artists included Charlie Puth, T-Pain, Demi Lovato, John Legend, Sia, Charli XCX, Troye Sivan, Alec Benjamin, and Papoose ^[3]. The second was a set of Music AI tools developed through YouTube's Music AI Incubator, which later became the Music AI Sandbox ^[3]^[4].

The Lyria work drew on earlier Google research in music synthesis, most directly MusicLM, the text-to-music model introduced in early 2023. From the first release, Google committed to watermarking every Lyria output with SynthID, treating provenance marking as a default part of deployment rather than an optional add-on ^[3]. Lyria 2 inherited that approach and extended the model's fidelity and controllability.

What can Lyria 2 do?

Lyria 2 generates high-fidelity music and professional-grade audio outputs from text prompts, with finer control over instruments, tempo (BPM), and overall musical character than the original model offered ^[1]^[5]. Google describes the output as capturing nuance across genres and complex compositions ^[1].

On Vertex AI the model is exposed as lyria-002. Its output is instrumental-only audio delivered as 48 kHz stereo WAV clips of roughly 30 seconds ^[5]^[6]. The interface supports negative prompting, which lets a user describe elements to steer the model away from, and a single request can return more than one variation ^[5]^[6]. Generation passes through Google's safety stack, which the documentation describes as content safety filters, recitation checking, and artist intent checks meant to block harmful or inappropriate inputs and outputs ^[6].

Attribute	Lyria 2 (`lyria-002`)
Developer	Google DeepMind
Announced	April 24, 2025 ^[1]
Vertex AI general availability	May 21, 2025 ^[2]
Output	Instrumental, 48 kHz stereo WAV, about 30 seconds ^[5]^[6]
Prompt controls	Text prompt, negative prompt, multiple samples ^[5]^[6]
Watermark	SynthID ^[2]^[6]

What is the Music AI Sandbox?

The April 2025 announcement centered on a refreshed Music AI Sandbox, an experimental suite of tools aimed at working musicians, producers, and songwriters ^[1]. The Sandbox is built for tasks such as generating fresh instrumental ideas, crafting vocal arrangements, and breaking through creative blocks, and the April update folded Lyria 2 in as its underlying generation model ^[1]. Alongside the new features, Google widened access, opening the Sandbox to more creators in the United States through a waitlist sign-up ^[1]. The announcement featured early users including the artists The Range, Isabella Kensington, Adrie, and Sidecar Tommy ^[1].

The Sandbox traces back to the November 2023 Music AI tools, which had been limited to participants in YouTube's Music AI Incubator ^[3]^[4]. The 2025 relaunch broadened that footprint and paired the toolset with both Lyria 2 and the new Lyria RealTime model for interactive work ^[1].

What is Lyria RealTime and how is it different?

Lyria RealTime is a distinct model variant announced in the same April 2025 release, designed for interactive, real-time music generation rather than fixed clips ^[1]. Where Lyria 2 renders a finished segment, Lyria RealTime produces a continuous stream of music that a user can steer, shape, and warp while it plays, mixing genres and blending styles moment by moment ^[1]^[7]. Google DeepMind says the model "allows users to interactively create, perform and control music in real-time, mixing genres, blending styles and shaping audio moment by moment" ^[1], and the company has compared the experience to jamming with a musician in a band ^[8].

Technically, the model generates a continuous stream of 48 kHz stereo audio in sequential chunks, an approach Google calls block autoregression that adapts the MusicLM architecture for live use ^[7]^[8]. Each chunk is conditioned on the previous audio and on a style embedding derived from the user's prompts, and the system keeps latency to a maximum of about two seconds between a control change and its audible effect ^[7]^[8]. Beyond blending text descriptors, the API exposes direct controls over tempo, key, note density, and spectral brightness, along with the ability to mute instrument groups such as drums or bass ^[7]^[8].

Google brought Lyria RealTime to developers as an experimental API at Google I/O 2025, on May 20, 2025, making it available through the Gemini API and Google AI Studio ^[9]^[10]. It is the model that powers MusicFX DJ, an experimental music app in Google Labs ^[9]^[10]. On the Gemini API the model identifier is models/lyria-realtime-exp; it generates instrumental music only and outputs 16-bit PCM audio at 48 kHz in stereo, with tempo selectable across roughly 60 to 200 BPM ^[10]. As with Lyria 2, output audio is always watermarked for identification under Google's responsible AI principles ^[10].

How do you access Lyria 2?

Lyria 2 reached general availability on Vertex AI on May 21, 2025, where it can be tested in Google Cloud's Media Studio and called programmatically through the model API ^[2]^[5]. The Vertex release grouped Lyria 2 with Veo 3 and Imagen 4 as a set of generative media tools for enterprise customers, all built with Google DeepMind and shipped with safety as a design principle ^[2]. Lyria RealTime, by contrast, is offered to developers as an experimental capability through the Gemini API and AI Studio rather than as a Vertex enterprise product ^[9]^[10]. The two access paths reflect the split between Lyria 2 as a Vertex enterprise model and Lyria RealTime as a developer-facing experimental API.

Is Lyria 2 output watermarked?

Yes. Both Lyria 2 and Lyria RealTime embed SynthID, Google DeepMind's imperceptible audio watermarking technology, into their outputs by default ^[2]^[6]. SynthID adds a signal that is inaudible to listeners and does not degrade the audio, yet remains detectable so that AI-generated music can be identified later ^[3]. Google has described the SynthID audio watermark as robust to common transformations such as MP3 and AAC compression. On Vertex AI the watermark is applied automatically to generated clips, and Google states that all music generated by the Lyria 2 and Lyria RealTime models is watermarked using SynthID ^[1]^[6].

Why does Lyria 2 matter?

Lyria 2 marked Google's move from music generation as a closed YouTube experiment toward a commercially available service for businesses and developers. By placing the model on Vertex AI next to Veo 3 and Imagen 4, Google assembled a single generative media portfolio spanning music, video, and images, each watermarked with SynthID ^[2]. The accompanying Lyria RealTime model pushed in a different direction, treating music less as a file to download and more as a live, steerable performance, which opened uses such as interactive soundscapes for games and installations ^[1]^[8]. The pairing of a high-fidelity clip generator with a real-time streaming engine, both released together and both provenance-marked, set the template that later Lyria releases would build on. Google has continued to emphasize artist collaboration and responsible deployment, including artist intent checks and default watermarking, as it has broadened access to the technology ^[1]^[6].

References

Google DeepMind, "Music AI Sandbox, now with new features and broader access" (April 24, 2025). https://deepmind.google/blog/music-ai-sandbox-now-with-new-features-and-broader-access/ ↩
Google Cloud, "Announcing Veo 3, Imagen 4, and Lyria 2 on Vertex AI" (May 21, 2025). https://cloud.google.com/blog/products/ai-machine-learning/announcing-veo-3-imagen-4-and-lyria-2-on-vertex-ai ↩
Google DeepMind, "Transforming the future of music creation" (November 16, 2023). https://deepmind.google/blog/transforming-the-future-of-music-creation/ ↩
YouTube Blog, "An early look at our AI Music experiment." https://blog.youtube/inside-youtube/ai-and-music-experiment/ ↩
Google Cloud Documentation, "Lyria 2" (`lyria-002`) model reference. https://docs.cloud.google.com/gemini-enterprise-agent-platform/models/lyria/lyria-002 ↩
Google Cloud Documentation, "Lyria | AI Music Generator | Generative AI on Vertex AI." https://docs.cloud.google.com/vertex-ai/generative-ai/docs/music/overview ↩
Google DeepMind, "Lyria RealTime" model page. https://deepmind.google/models/lyria/lyria-realtime/ ↩
Magenta (Google), "Introducing Lyria RealTime API." https://magenta.withgoogle.com/lyria-realtime ↩
TechCrunch, "Google brings a music-generating AI model to its API with Lyria RealTime" (May 20, 2025). https://techcrunch.com/2025/05/20/google-brings-a-music-generating-ai-model-to-its-api-with-lyria-realtime/ ↩
Google AI for Developers, "Real-time music generation using Lyria RealTime | Gemini API." https://ai.google.dev/gemini-api/docs/realtime-music-generation ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

1 revision by 1 contributors · full history

Suggest edit

What links here

AI Test Kitchen Best AI Music Generators Lyria MusicLM Stable Audio

What is Lyria 2?

How does Lyria 2 relate to the original Lyria?

What can Lyria 2 do?

What is the Music AI Sandbox?

What is Lyria RealTime and how is it different?

How do you access Lyria 2?

Is Lyria 2 output watermarked?

Why does Lyria 2 matter?

References

Improve this article

Related Articles

Lyria

Suno

Udio

Stable Audio

Suno v5

ElevenLabs Music

What links here

Related Articles

Lyria

Suno

Udio

Stable Audio

Suno v5

ElevenLabs Music

What links here