SynthID

AI Safety Generative AI Google DeepMind

29 min read

Updated Jun 21, 2026

Suggest edit History Talk

RawGraph

Last edited

Jun 21, 2026

Fact-checked

In review queue

Sources

26 citations

Revision

v3 · 5,829 words

Fact-checks are independent of edits: a reviewer re-verifies the article against its sources and stamps the date. How we verify

SynthID is a family of digital watermarking technologies developed by Google DeepMind for marking and identifying content generated by generative AI systems. First announced in August 2023 for AI-generated images produced by Google's Imagen model, SynthID has since been extended to audio (November 2023), video (May 2024), and text (May 2024, with an open-source release in October 2024).^[1]^[2]^[3] The technology embeds imperceptible signals into generated content that human observers cannot detect but that a paired detector model can identify with high confidence even after common modifications such as compression, cropping, color changes, or moderate paraphrasing.^[1]^[4] By May 2026, Google reported that SynthID had been used to watermark more than 100 billion images and videos and the equivalent of 60,000 years of audio across its services, and the technology had moved from a single-vendor marker to a cross-industry standard adopted by OpenAI, NVIDIA, ElevenLabs, and Kakao.^[24]^[25]

Infobox

Attribute	Details
Developer	Google DeepMind
First release	August 29, 2023 (image watermarking, Imagen on Vertex AI)
Modalities	Image, audio, video, text
Open-source release	October 23, 2024 (SynthID-Text via Hugging Face Transformers v4.46.0)
Public detector	SynthID Detector portal, May 20, 2025 (early testers)
Reference paper (text)	Dathathri et al., "Scalable watermarking for identifying large language model outputs", Nature 634, 818-823 (October 23, 2024)
License (text reference impl.)	Apache 2.0 (code), CC BY 4.0 (other materials)
Reported scale (May 2026)	More than 100 billion images and videos; 60,000 years of audio
External adopters (2026)	OpenAI, NVIDIA, ElevenLabs, Kakao

History and Timeline

SynthID emerged in the months following the July 21, 2023 White House voluntary commitments on AI, in which Google and six other leading AI developers pledged to deploy "robust technical mechanisms" so that users could know when content is AI-generated, with watermarking explicitly named as an example.^[7] Google DeepMind has been described in subsequent reporting as the first major laboratory to publicly ship a production watermarking tool aligned with that commitment.^[4]

When was SynthID first released? Image watermarking (August 2023)

On August 29, 2023, Google DeepMind announced SynthID for AI-generated images and made it available in beta to a limited set of Vertex AI customers using Imagen on Google Cloud.^[1] The launch announcement described SynthID as embedding "a digital watermark directly into the pixels of an image, making it imperceptible to the human eye, but detectable for identification."^[1] Pushmeet Kohli, Vice President of Research at Google DeepMind, told MIT Technology Review that "the watermark can still be detected even if the image is screenshotted or edited," while characterizing the system as "experimental" at the time of release.^[4]

The architectural decision to train the watermarking and detection networks jointly distinguished SynthID from earlier approaches that relied on metadata, hashing, or hand-crafted signals. Because both networks were optimized end to end, the watermarking model could distribute the watermark across visually busy regions where small pixel adjustments are imperceptible, while the detection model could learn invariances to the transformations expected at deployment time.^[1]^[4]

Independent reactions noted both the practical advance and the limitations of any single-vendor watermark. Ben Zhao of the University of Chicago told MIT Technology Review that "there are few or no watermarks that have proven robust over time," warning that motivated adversaries would invest in attacks beyond casual cropping or compression.^[4] Sasha Luccioni of Hugging Face criticized the proprietary scope of the initial deployment, arguing that broader adoption would be required to materially reduce deepfake harms.^[4] Both criticisms anticipated later directions for SynthID's development, including its external partnerships and open-source text release.

Audio watermarking with Lyria (November 2023)

On November 16, 2023, Google DeepMind announced Lyria, an experimental music generation model, alongside the Dream Track experiment on YouTube Shorts. Every track generated or published by Lyria would carry a SynthID watermark applied to the audio.^[8] The Dream Track collaboration featured tracks made with the voices of participating artists including Alec Benjamin, Charlie Puth, Charli XCX, Demi Lovato, John Legend, Sia, T-Pain, Troye Sivan, and Papoose.^[8] Audio watermarking was the first non-visual modality for SynthID, and Google described the watermark as remaining detectable across common audio modifications such as noise additions, MP3 compression, and changes in playback speed.^[8]

The audio extension was significant for two reasons. First, it demonstrated that the SynthID brand encompassed more than a single algorithm: the audio watermark uses a different signal carrier (a spectrogram representation rather than pixel values) than the image watermark, requiring distinct training and detection pipelines. Second, the YouTube Shorts integration meant that watermarking was being applied at the point of upload to a consumer platform with billions of viewers, rather than only at the point of generation in a developer API.^[8]

Text and video extension (May 2024)

On May 14, 2024, Google DeepMind announced SynthID watermarking for AI-generated text and video at Google I/O 2024.^[2] The text watermark was deployed in the Gemini app and web experience for Gemini, and the video watermark was applied to videos produced by Veo on the VideoFX experimental product.^[2] Google said it intended to open-source the text watermarking implementation through an updated Responsible Generative AI Toolkit later in 2024.^[2]

Text watermarking presented a different technical challenge than image or audio watermarking. Where pixel and spectrogram values can be perturbed within continuous ranges below human perceptual thresholds, the output of a language model is a discrete sequence of tokens drawn from a finite vocabulary. SynthID's text watermark therefore could not be applied as a post-generation pixel adjustment; it had to be embedded by modifying the sampling process itself. Google's blog post on the May 2024 launch described the approach as "introducing additional information in the token distribution at the point of generation by modulating the likelihood of tokens being generated," foreshadowing the tournament-sampling mechanism formalized in the Nature paper.^[2]

Open-source release of SynthID-Text (October 2024)

The text component, designated SynthID-Text, was released as an open-source reference implementation on October 23, 2024, the same day a peer-reviewed paper describing the underlying algorithm was published in Nature.^[3]^[9] The paper, "Scalable watermarking for identifying large language model outputs" by Sumanth Dathathri, Abigail See, Sumedh Ghaisas, Po-Sen Huang, Rob McAdam, and colleagues, appeared in volume 634 of Nature on pages 818-823.^[9] Integration with the Hugging Face Transformers library landed in version 4.46.0 of the library on the same day, exposing the SynthIDTextWatermarkingConfig configuration object that any developer using a compatible causal language model could pass to model.generate() to produce watermarked text.^[3]

SynthID Detector portal (May 2025)

On May 20, 2025, during Google I/O 2025, Google announced the SynthID Detector, a public-facing web portal that accepts uploads of images, audio, video, and text and reports whether the uploaded content contains a SynthID watermark.^[5]^[10] Initial access was restricted to early testers, with a waitlist open to journalists, media professionals, and researchers.^[5] Google stated that the detector also highlights which specific portions of an upload are most likely to carry the watermark, identifying segments in audio and regions in images.^[5]^[10]

Adoption beyond Google

By the end of 2025, SynthID had been integrated into Google's consumer AI surfaces (Gemini app, Imagen, Veo, Lyria) and the watermark was also being applied to media produced by selected third-party systems. Google announced a partnership with NVIDIA to watermark videos generated through the NVIDIA Cosmos preview microservice on build.nvidia.com,^[5] and a partnership with GetReal Security to incorporate SynthID extraction into that vendor's content verification suite.^[5]^[11] A 2025 announcement reported that videos created with Veo 3 carry both a visible "Veo" badge and the invisible SynthID signal.^[6]

A second peer-reviewed publication, "SynthID-Image: Image watermarking at internet scale," appeared as an arXiv preprint on October 10, 2025. It documented the production system and an external variant, SynthID-O, distributed through partnerships, and confirmed that more than ten billion images and video frames had been watermarked across Google's services.^[12]

Industry-wide adoption and a Content Detection API (May 2026)

At Google I/O 2026 on May 19, 2026, Google announced that SynthID was being adopted across the wider industry, with OpenAI, NVIDIA, ElevenLabs, and Kakao bringing the watermark to their own AI systems.^[24]^[25] OpenAI said it would integrate SynthID into images generated through ChatGPT, Codex, and its API, and described watermarking and the C2PA metadata standard as complementary: "Watermarking can be more durable through transformations like screenshots, while metadata can provide more information than a watermark alone."^[26] NVIDIA expanded its earlier work to watermark video from its Cosmos world foundation models, and ElevenLabs and Kakao adopted SynthID for AI-generated audio and content on their platforms.^[24]^[26]

In the same announcement Google reported that it had watermarked more than 100 billion images and videos and the equivalent of 60,000 years of audio with SynthID in the roughly three years since the technology's introduction.^[24] Google also previewed a Content Detection API on its Gemini Enterprise Agent Platform, designed to identify AI-generated media produced by Google and other popular models; the API accepts JPEG, PNG, or WebP images, analyzes pixel-level artifacts, noise patterns, and spectral anomalies, and does not store the images it processes.^[26] Trusted-tester partners for the API included Shutterstock, Snap, Fox Sports, and Canva.^[26] Google said SynthID verification would expand from the Gemini app to Google Search and Chrome in the following weeks, letting users ask "Is this made with AI?" directly within those products, and that C2PA Content Credentials verification would roll out alongside it.^[24]^[26]

The May 2026 cross-industry alignment, in which companies that compete on models cooperated on a shared provenance signal, addressed the open-ecosystem limitation that critics and the Nature paper had highlighted at launch: a watermark applied by only one vendor can mark only that vendor's outputs.^[9]^[24] Earlier in 2026 the standalone SynthID Detector portal had also begun accepting audio uploads, extending its previous image-focused checks.^[24]

The arc from beta launch to peer-reviewed publication of the production image system, with intermediate milestones in audio, text, and video, a public detector portal, and finally multi-vendor adoption, gives SynthID one of the most fully documented deployment trajectories among major-laboratory AI provenance efforts. Each milestone moved the program incrementally from a single proprietary marker (Imagen on Vertex AI) toward a multi-modal, multi-vendor infrastructure that operates across consumer products, accepts external uploads through a verification portal, and ships an open-source reference for the text variant.^[1]^[9]^[12]^[24]

How does SynthID work? Technical Details

SynthID is not a single algorithm but a family of modality-specific watermarks united by the goal of embedding a signal that survives realistic transformations while remaining imperceptible to humans. Different modalities use different signal carriers (pixel perturbations, spectrogram perturbations, sampling perturbations) but share a common deployment philosophy: place a marker at the moment of generation, distribute the marker across the content rather than as metadata, and rely on a paired neural detector for verification.^[1]^[9]^[12]

SynthID-Image

The image watermark is produced by two deep neural networks trained together.^[1]^[4] The first network, an embedder, takes a generated image and outputs a watermarked version in which pixel values have been adjusted by amounts that are mathematically significant to the trained detector but lie below human perceptual thresholds.^[1] The second network, the detector, receives an image at inference time and outputs one of three states: watermark present, watermark suspected, or watermark not detected.^[1]^[4]

The 2025 arXiv paper "SynthID-Image: Image watermarking at internet scale" formalized the design constraints as a constrained optimization over an encoder-decoder system, balancing a detection loss and a robustness loss subject to a perceptual distance budget between the original and watermarked image.^[12] The authors documented that SynthID-Image is applied as a post-hoc step that operates independently of the generative model and can therefore be paired with any image generator, including diffusion-based models.^[12]

Benchmarks reported in the same paper measure true-positive rate at a false-positive rate of 0.1 percent across thirty common transformations grouped into six categories (spatial, color, quality degradation, noise, overlay, and combinations). For the external variant SynthID-O the aggregated true-positive rate was 99.36 percent under randomly sampled transformation strengths and 99.71 percent under worst-case transformations, exceeding the best competing baseline by 9.36 to 16.35 percentage points depending on the setting.^[12] The system supports a 136-bit payload on 512x512 images and achieves more than 99 percent bit accuracy on most transformation categories.^[12]

A human evaluation involving ten thousand external rater judgments on a thousand 1536x1536 Imagen images across twenty-five content categories, including difficult cases such as abstract art, line drawings, pixel art, and logos with uniform color regions, found that SynthID-O produced the smallest increase in perceived artifacts compared to non-watermarked baselines.^[12]

Robustness claims

Official documentation states that the SynthID-Image watermark "will usually still exist even if the image, video, or audio is re-scaled, re-colored, compressed or altered in other ways."^[13] The original 2023 launch post documented robustness against filters, color and brightness adjustments, and lossy compression of the kind used for JPEG.^[1] The MIT Technology Review coverage added screenshots, rotation, and resizing to the list of operations the watermark was designed to survive.^[4] Google has not claimed that the watermark survives all manipulations: aggressive recompression, severe stylization, content-aware warping, and targeted removal attacks can reduce or destroy the signal.^[1]^[4]

SynthID-Audio

For audio, SynthID converts the waveform into a time-frequency representation (a spectrogram) and modifies that representation to encode the watermark before reconstructing the waveform.^[8] A spectrogram is a two-dimensional visualization showing how the spectrum of frequencies in a sound evolves over time; Google's announcement described the watermark as encoded "within the spectrogram using encoding techniques aligned with psychoacoustic properties" so that perturbations fall in frequency bands where the human auditory system has limited sensitivity to small changes.^[8] Once embedded, the spectrogram is converted back into a waveform; the embedded pattern remains in the audio but is inaudible during playback.^[8]

The result is designed to survive common audio manipulations including the addition of background noise, MP3 compression, speed changes (both faster and slower playback), and pitch shifts that preserve the music's identifiable content.^[8] Audio watermarks are applied automatically to outputs from the Lyria music model and to audio generated by NotebookLM Audio Overviews and other Google audio products.^[14] The SynthID Detector portal indicates which specific time segments of an uploaded audio file are most likely to contain the watermark, allowing users to identify sections of a longer recording that were AI-generated even when the rest is from another source.^[5]

SynthID-Text

The text watermark, SynthID-Text, alters the token sampling distribution of a large language model at generation time so that the resulting text carries a statistical signature without changing perceived quality.^[9]^[3] The full algorithm and large-scale evaluation appear in Dathathri et al., Nature 634:818-823.^[9]

Tournament sampling

The central mechanism is tournament sampling. For each token to be generated, SynthID-Text draws a set of M = 2^m candidate tokens from the language model's existing output distribution and then runs a knockout-style tournament across m layers.^[9] At each layer, candidate tokens are paired and compete: a pseudorandom scoring function, the g-function, assigns a score to each candidate, and the higher scorer advances. Ties are broken at random. After m rounds a single winner remains and is emitted as the generated token.^[9]

Because the candidate tokens at every layer were drawn from the language model's own distribution, the tournament does not introduce token choices that the model would not have made anyway. The procedure can be tuned to be non-distortionary on average, meaning the marginal distribution over emitted tokens matches the unwatermarked distribution.^[9] Critically, the detector does not need to know the language model's underlying probabilities at detection time; it only needs the watermarking key and the same g-function to compute the expected statistical signature of watermarked text.^[9]

g-values

The g-function returns pseudorandom scores called g-values. Scores are computed by hashing the candidate token together with the layer index and a context seed derived from a sliding window of the previous H tokens (the paper uses H = 4) and a private watermarking key.^[9] In the primary configuration g-values are drawn from a Bernoulli(0.5) distribution, although the framework supports other distributions.^[9] Because detection requires only the same key and the same g-function, the watermark can be verified by a lightweight statistical test that does not need access to the underlying language model.^[9]^[3]

Distortionary versus non-distortionary variants

SynthID-Text supports two regimes.^[9] In the non-distortionary setting (also called distortion-free), tournament sampling is configured so that the marginal token distribution equals the unwatermarked model's distribution; combined with a K-sequence repeated-context masking rule, the watermark can be applied without measurable degradation of text quality. In the distortionary setting, more than two candidates can compete in each match and the scoring rule favors high-g tokens more aggressively, producing a stronger watermark signal at the cost of some token-level distribution shift.^[9] Google has stated that its production deployment in Gemini uses the non-distortionary configuration.^[15]

Detection

Detection treats the sequence of observed g-values along a candidate text as a draw from either a non-watermarked or a watermarked distribution. The reference implementation provides two detectors: a training-free Weighted Mean detector and a Bayesian detector that requires a small labeled training set to fit.^[15]^[16] Detection can therefore be performed without invoking the language model that produced the text, making verification orders of magnitude cheaper than generation.^[9]

Configuration parameters in Hugging Face Transformers

The Hugging Face integration exposes the watermark through a SynthIDTextWatermarkingConfig object passed to model.generate():

from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    SynthIDTextWatermarkingConfig,
)

tokenizer = AutoTokenizer.from_pretrained('repo/id')
model = AutoModelForCausalLM.from_pretrained('repo/id')

watermarking_config = SynthIDTextWatermarkingConfig(
    keys=[654, 400, 836, 123, 340, 443, 597, 160, 57],
    ngram_len=5,
)

output_sequences = model.generate(
    **tokenizer(["your prompts here"], return_tensors="pt"),
    watermarking_config=watermarking_config,
    do_sample=True,
)

The two principal parameters are keys, a list of unique random integers (twenty to thirty are recommended) used to compute the g-function scores, and ngram_len, the sliding-context length that balances robustness against detectability with a default of five.^[3]^[15] The keys must be stored privately; an attacker who knows the keys can both forge the watermark in arbitrary text and remove it from watermarked output.^[3]

Evaluation in the Nature paper

The Nature paper reports two principal evaluations.^[9] The first is a controlled benchmark comparing tournament sampling to Gumbel-style baselines on Gemma 7B-IT and other Gemma-family models. At a fixed false-positive rate, the true-positive rate of SynthID-Text exceeds the baselines, with the largest improvements visible in low-entropy generation settings where alternative methods have the least room to embed signal. Independent reporting summarized the comparison as a true-positive rate of approximately 85 percent against roughly 73 percent for prior state-of-the-art methods at a false-positive rate of 1 percent over 1,500 watermarked and 10,000 unwatermarked text samples.^[9]

The second evaluation is a large-scale field study covering nearly twenty million live Gemini interactions in which user thumbs-up and thumbs-down feedback was compared between watermarked and unwatermarked outputs. The thumbs-up rate differed by 0.01 percentage points and the thumbs-down rate by 0.02 percentage points; neither difference was statistically significant, supporting the conclusion that the non-distortionary configuration does not degrade end-user-visible quality.^[9] This field study is one of the largest reported real-world evaluations of any AI safety intervention to date, and it was conducted on a production consumer LLM rather than a research prototype, lending unusual weight to the claim of zero quality regression.^[9]

An auxiliary human-preference study on 3,000 questions sourced from the ELI5 (Explain Like I'm Five) dataset found no significant differences between watermarked and unwatermarked outputs in grammaticality, relevance, correctness, or helpfulness as judged by human raters.^[9] Latency overhead from tournament sampling was 0.57 percent for a 30-layer configuration on Gemma 7B-IT, compared to 0.26 percent for the Gumbel baseline, both negligible in production serving environments.^[9]

SynthID-Video

Video watermarking is described in Google announcements as an extension of the image watermarking pipeline applied across video frames.^[2]^[5] Because each frame of a video can be watermarked independently using the same SynthID-Image network, the detector benefits from many redundant samples of the watermark across the temporal axis of a clip, which makes detection more reliable than for any single frame in isolation. Veo videos carry both a visible textual label and the invisible SynthID signal beginning with Veo 3; earlier Veo outputs included the invisible mark only.^[6] The video watermark is detectable by the same Gemini app and SynthID Detector pipelines used for stills, and the SynthID-Image paper reports more than ten billion images and video frames watermarked across Google's services as of its 2025 publication.^[12]^[13]

Where is SynthID used? Implementations and Deployment

Product	Modality	SynthID role
Imagen 3, Imagen 4	Image	Automatic SynthID-Image at generation^[5]^[13]
Veo, Veo 3	Video	Invisible SynthID; Veo 3 adds visible "Veo" badge^[6]
Lyria	Audio	Automatic SynthID-Audio on all outputs^[8]
NotebookLM Audio Overviews	Audio	SynthID watermark on generated podcasts^[14]
Gemini app text	Text	SynthID-Text logits processor^[2]^[15]
Gemini app verifier	Image/audio/video/text	Upload-to-verify interface for end users^[13]
SynthID Detector portal	Image/audio/video/text	Web portal for journalists, researchers^[5]
Content Detection API (Gemini Enterprise)	Image	Enterprise API to detect AI-generated media^[26]
NVIDIA Cosmos	Video	Third-party watermarking partnership^[5]^[24]
OpenAI (ChatGPT, Codex, API)	Image	SynthID watermarking on generated images (2026)^[24]^[26]
ElevenLabs	Audio	SynthID adoption for AI-generated audio (2026)^[24]
Kakao	Image/audio	SynthID adoption on platform AI content (2026)^[24]
GetReal Security	All	Detection capability in third-party verification suite^[5]^[11]

The Gemini-app verifier (rolled out under the title "Verify Google AI-generated images, videos, and audio with SynthID") accepts a single file at a time up to 100 MB, with videos limited to 90 seconds and audio to one hour.^[13] The verifier reports detected portions for videos and audio and instructs users to crop tightly around screenshots for best accuracy.^[13] If no watermark is detected, the documentation cautions that the content may still have been produced by a non-Google AI system, since SynthID can only identify content marked by Google's pipelines.^[13]

The open-source GitHub repository google-deepmind/synthid-text provides a reference implementation containing mix-in classes that wrap Gemma and GPT-2 model architectures, a watermarking configuration system based on integer keys, training-free Weighted Mean detection, and training-based Bayesian detection. The repository is dual-licensed under Apache 2.0 (software) and Creative Commons Attribution 4.0 (other materials).^[16] The reference code is also published on PyPI as synthid-text.^[17] A community-trained Bayesian detector and a human evaluation dataset comparing watermarked and unwatermarked Gemma outputs accompany the release.^[16]

For developers, the practical effect of the open-source release is that SynthID-Text can be added to any compatible causal language model with only configuration changes, no fine-tuning of the model weights, and detection can be performed either via the lightweight Weighted Mean detector or via a Bayesian detector trained on roughly ten thousand examples split between watermarked and non-watermarked generations.^[3]^[15] Models that share a tokenizer can also share a single watermarking configuration and detector, which simplifies operating multiple models behind a common verification stack.^[15]

Significance

SynthID arrived at a moment when policymakers, platform operators, and AI laboratories were converging on the idea that AI-generated content should be identifiable through some combination of provenance metadata and durable watermarking. The July 2023 voluntary commitments to the White House named watermarking explicitly, and the subsequent America's AI Action Plan continued to emphasize provenance and labeling.^[7]^[18] Industry standards work on content provenance, notably the C2PA Content Credentials specification, has focused on cryptographically signed metadata stored in file headers. SynthID is complementary to such metadata standards: pixel-level or token-level watermarks persist after the metadata layer is stripped (for example by a screenshot of an image or copy-paste of generated text), while metadata can carry richer context than a watermark alone.^[5]^[19]

Within the AI watermarking field, SynthID-Text is notable for being the first watermarking method demonstrated at the scale of a major consumer LLM deployment and the first to be peer-reviewed in a top general-science journal alongside an open-source release.^[9]^[3] The Nature paper's combination of controlled benchmarks and a field study on twenty million live interactions established that the non-distortionary tournament-sampling watermark could be turned on by default in Gemini without user-visible quality changes.^[9]

The 2025 SynthID-Image paper extended a similar argument for images: by publishing not only detection numbers under thirty common transformations but also a human evaluation against alternative watermarking systems, the authors framed SynthID-Image as the first invisible image watermark whose visual quality, robustness, and deployment behavior have all been characterized at internet scale.^[12]

The May 2026 cross-industry adoption marked a further shift: by having OpenAI, NVIDIA, ElevenLabs, and Kakao apply the same watermark to their own outputs, SynthID moved closer to the ecosystem-wide identification that the Nature paper argued was necessary but that no single-vendor scheme could achieve on its own.^[9]^[24] Coverage described the moment as a rare instance of cross-industry alignment, with companies that compete on models cooperating on a shared provenance signal.^[25] The SynthID Detector portal, the Gemini-app verifier, the enterprise Content Detection API, and the external partnerships together mark a change in how content provenance is intended to function. Rather than relying solely on cooperative metadata that requires distribution platforms to preserve a separate signed manifest, SynthID's watermarks remain in the content as it moves across platforms; rather than expecting end users to inspect metadata, the detector portal and the in-app verifier let users upload media and receive a verdict. Whether this approach succeeds in changing how internet audiences interact with AI content will depend on how widely both the watermark and the verification interface are adopted.^[5]^[13]^[24]

What are the limitations of SynthID?

SynthID's developers and independent researchers have documented several limitations.

Text-specific limitations

The Nature paper and Google's documentation acknowledge that SynthID-Text is less effective on factual responses, because such responses leave fewer high-entropy choices in which to embed the watermark signal without affecting correctness.^[9]^[15]^[2] The watermark is also weaker when generation temperatures are low or when the model's output distribution has low entropy for other reasons.^[9]

Meaning-preserving attacks reduce detection rates. Paraphrasing, especially extensive paraphrasing where more than roughly half of words are changed, can drive detector confidence below useful thresholds.^[15]^[20] Back-translation through another language has a similar effect.^[15] The Google for Developers documentation states explicitly that SynthID-Text "is not designed to directly stop motivated adversaries from causing harm" and recommends pairing it with other defenses.^[15]

Independent academic work has probed these limitations further. A 2026 arXiv preprint by Omidi, Dong, and Wang, "On Google's SynthID-Text LLM Watermarking System: Theoretical Analysis and Empirical Validation," demonstrated a "layer inflation" attack against the Mean detector score that exploits the structure of multi-layer tournaments; the same paper showed that the Bayesian detector is more robust against the attack and proved an optimality result for the choice of Bernoulli(0.5) for the g-value distribution.^[21]

A 2025 robustness study, "Robustness Assessment and Enhancement of Text Watermarking for Google's SynthID," characterized detection rates under a range of paraphrasing, copy-paste, and synonym-substitution attacks and proposed enhancement strategies to improve recovery.^[20]

For image and video, the watermark can be degraded or destroyed by extreme manipulations: aggressive recompression, heavy stylization, content-aware warping, severe downscaling, or targeted removal attacks.^[1]^[4] The 2023 launch reporting quoted Pushmeet Kohli describing SynthID as "experimental" and "still not perfectly immune" to tampering.^[4] University of Chicago researcher Ben Zhao said in the same article that "there are few or no watermarks that have proven robust over time" and warned that motivated bad actors would invest in sophisticated removal techniques.^[4]

For audio, very aggressive transformations such as severe denoising, voice changers, or recording the output through a microphone in a noisy environment can also degrade the watermark.^[8]

Ecosystem and policy limitations

A more structural limitation is that SynthID can only mark content produced by systems that integrate it.^[9]^[13] Content from models that do not embed SynthID, including most open-weights image and text models, will simply return a "not detected" verdict from the Gemini app and SynthID Detector, which the documentation expressly cautions does not establish that the content is human-made.^[13] The Nature paper notes that effective ecosystem-level identification of AI-generated content would require coordination across providers, since open-source models deployed locally cannot be made to apply a centralized watermark.^[9] The May 2026 adoption by OpenAI, NVIDIA, ElevenLabs, and Kakao broadened the set of integrating systems but did not change the underlying constraint, since locally run open-weights models still cannot be compelled to watermark their outputs.^[24]

The proprietary scope of the watermark detector has also been a point of criticism. Sasha Luccioni of Hugging Face argued in 2023 that the watermark would only meaningfully reduce deepfake harms if its application became widespread across providers, and the Nature commentary "AI watermarking must be watertight to be effective" raised related concerns about robustness and adversarial settings.^[4]^[22] The Columbia Journalism Review noted in subsequent coverage that even widespread watermarking does not solve the harder problem of authenticating content that is human-made, and that watermarking is best understood as one component of a broader provenance stack rather than a standalone solution.^[23]

Asymmetry of false positives and false negatives

The asymmetry between false-positive and false-negative errors deserves emphasis. A false positive, in which a non-watermarked piece of content is flagged as watermarked, is potentially highly damaging because it would falsely attribute generation to Google's AI systems. SynthID detectors therefore operate at very low false-positive operating points: the SynthID-Image paper reports performance at a 0.1 percent false-positive rate as its headline metric.^[12] False negatives, in which watermarked content is mistakenly reported as not watermarked, are less damaging in absolute terms (the user receives a "not detected" verdict and is shown documentation noting the watermark may be absent for legitimate reasons), but a high false-negative rate would erode the practical utility of the detector. The Google-published evaluation evidence indicates that SynthID maintains high true-positive rates under common transformations while keeping the false-positive rate very low; the academic literature has primarily focused on demonstrating that this favorable trade-off degrades under adversarial conditions such as targeted paraphrasing or content-aware image edits.^[9]^[12]^[20]^[21]

How does SynthID compare to other provenance approaches?

Approach	Where signal lives	Survives metadata stripping	Survives screenshots/copy-paste	Standardized openly
C2PA Content Credentials	File header metadata	No	No	Yes (open spec)
Visible watermark	Pixels (visible)	Yes	Yes	No
SynthID-Image	Pixels (invisible)	Yes	Mostly (lossy compression OK)	Reference variant open via partnerships^[12]
SynthID-Audio	Spectrogram perturbations	Yes	Mostly (MP3 compression OK)	Closed
SynthID-Text	Token sampling distribution	Yes	Partial (extensive paraphrase breaks)	Open-source (Apache 2.0 reference)^[16]
Statistical AI-text classifiers	None embedded	n/a	Variable	Mixed

Google and outside commentators have generally framed SynthID and C2PA Content Credentials as complementary layers in a content-provenance stack rather than competitors, with the watermark providing a durable signal that survives transformations and the signed metadata providing richer context when it is present.^[19]^[5] OpenAI echoed this framing in 2026, describing watermarking as "more durable through transformations like screenshots" while metadata "can provide more information than a watermark alone."^[26]

Tournament sampling joins a small family of LLM watermarking techniques. The Kirchenbauer-style "green list" watermark partitions the vocabulary into green and red lists at each step and biases sampling toward the green list; the Aaronson-Christ exponential / Gumbel-trick watermark uses a structured pseudorandom permutation of token scores; SynthID's tournament approach is novel in that it preserves the marginal distribution under the non-distortionary configuration while still producing a strong detectable signal.^[9]^[21] The Nature paper provides controlled comparisons against Gumbel-style baselines.^[9]

For images, SynthID-Image sits alongside post-hoc watermarking systems including TrustMark, StegaStamp, and Meta's VideoSeal; the 2025 SynthID-Image paper reports superior visual quality and robustness against thirty common transformations in head-to-head benchmarks.^[12]

References

Hawkins, S. and Doshi, T., "Identifying AI-generated images with SynthID", Google DeepMind, 2023-08-29. https://deepmind.google/blog/identifying-ai-generated-images-with-synthid/. Accessed 2026-05-20. ↩
Doshi, T. and Hawkins, S., "Watermarking AI-generated text and video with SynthID", Google DeepMind, 2024-05-14. https://deepmind.google/blog/watermarking-ai-generated-text-and-video-with-synthid/. Accessed 2026-05-20. ↩
Gante, J. et al., "Introducing SynthID Text", Hugging Face Blog, 2024-10-23. https://huggingface.co/blog/synthid-text. Accessed 2026-05-20. ↩
Heikkilae, M., "Google DeepMind has launched a watermarking tool for AI-generated images", MIT Technology Review, 2023-08-29. https://www.technologyreview.com/2023/08/29/1078620/google-deepmind-has-launched-a-watermarking-tool-for-ai-generated-images/. Accessed 2026-05-20. ↩
Kavukcuoglu, K., "SynthID Detector, a new portal to help identify AI-generated content", Google, 2025-05-20. https://blog.google/innovation-and-ai/products/google-synthid-ai-content-detector/. Accessed 2026-05-20. ↩
Google, "SynthID", Google DeepMind models page. https://deepmind.google/models/synthid/. Accessed 2026-05-20. ↩
The White House, "Voluntary Commitments from Leading Artificial Intelligence Companies", Harvard Law Review, 2023-07-21. https://harvardlawreview.org/print/vol-137/voluntary-commitments-from-leading-artificial-intelligence-companies-on-july-21-2023/. Accessed 2026-05-20. ↩
Google DeepMind, "Transforming the future of music creation", Google DeepMind, 2023-11-16. https://deepmind.google/discover/blog/transforming-the-future-of-music-creation/. Accessed 2026-05-20. ↩
Dathathri, S., See, A., Ghaisas, S., Huang, P.-S., McAdam, R., et al., "Scalable watermarking for identifying large language model outputs", Nature 634, 818-823, 2024-10-23. https://www.nature.com/articles/s41586-024-08025-4. Accessed 2026-05-20. ↩
Dathathri, S., See, A., Ghaisas, S., Huang, P.-S., McAdam, R., et al., "Scalable watermarking for identifying large language model outputs", PubMed Central PMC11499265, 2024-10-23. https://pmc.ncbi.nlm.nih.gov/articles/PMC11499265/. Accessed 2026-05-20. ↩
GetReal Security, "SynthID Detector, a new portal to help identify AI-generated content", GetReal Security, 2025-05-20. https://www.getrealsecurity.com/resources/synthid-detector----a-new-portal-to-help-identify-ai-generated-content. Accessed 2026-05-20. ↩
Gowal, S., Bunel, R., Stimberg, F., Stutz, D., Ortiz-Jimenez, G., et al., "SynthID-Image: Image watermarking at internet scale", arXiv:2510.09263, 2025-10-10. https://arxiv.org/abs/2510.09263. Accessed 2026-05-20. ↩
Google, "Verify Google AI-generated images, videos, and audio with SynthID", Gemini Apps Help, 2025. https://support.google.com/gemini/answer/16722517. Accessed 2026-05-20. ↩
Google DeepMind, "SynthID overview", deepmind.google/models/synthid. https://deepmind.google/models/synthid/. Accessed 2026-05-20. ↩
Google, "SynthID: Tools for watermarking and detecting LLM-generated Text", Responsible Generative AI Toolkit, Google AI for Developers. https://ai.google.dev/responsible/docs/safeguards/synthid. Accessed 2026-05-20. ↩
Google DeepMind, "synthid-text reference implementation", GitHub. https://github.com/google-deepmind/synthid-text. Accessed 2026-05-20. ↩
Google DeepMind, "synthid-text package", PyPI. https://pypi.org/project/synthid-text/. Accessed 2026-05-20. ↩
The White House, "America's AI Action Plan", whitehouse.gov, 2025. https://www.whitehouse.gov/wp-content/uploads/2025/07/Americas-AI-Action-Plan.pdf. Accessed 2026-05-20. ↩
Coalition for Content Provenance and Authenticity, "C2PA versus watermarking", c2pa.ai, 2026. https://c2pa.ai/vs-watermarking. Accessed 2026-05-20. ↩
Authors, "Robustness Assessment and Enhancement of Text Watermarking for Google's SynthID", arXiv:2508.20228, 2025. https://arxiv.org/abs/2508.20228. Accessed 2026-05-20. ↩
Omidi, R., Dong, Y., and Wang, B., "On Google's SynthID-Text LLM Watermarking System: Theoretical Analysis and Empirical Validation", arXiv:2603.03410, 2026. https://arxiv.org/abs/2603.03410. Accessed 2026-05-20. ↩
Nature Editorial, "AI watermarking must be watertight to be effective", Nature, 2024-10. https://www.nature.com/articles/d41586-024-03418-x. Accessed 2026-05-20. ↩
Columbia Journalism Review, "Just add watermarks", Columbia Journalism Review, 2024. https://www.cjr.org/the_media_today/just-add-watermarks.php. Accessed 2026-05-20. ↩
Google, "Making it easier to understand how content was created and edited", The Keyword (blog.google), 2026-05-19. https://blog.google/innovation-and-ai/products/identifying-ai-generated-media-online/. Accessed 2026-06-21. ↩
Cryptobriefing, "Google expands SynthID watermarking to OpenAI, Kakao, and ElevenLabs for AI content transparency", 2026-05-19. https://cryptobriefing.com/google-synthid-watermarking-openai-partnerships/. Accessed 2026-06-21. ↩
InfoQ, "Google Expands SynthID Adoption for AI Watermarking, Previews Content Detection API", 2026-05. https://www.infoq.com/news/2026/05/google-synthid-content-detection/. Accessed 2026-06-21. ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation. Every suggestion is reviewed for sourcing before it goes live.

2 revisions by 1 contributor · full history

Suggest edit

SynthID

Infobox

History and Timeline

When was SynthID first released? Image watermarking (August 2023)

Audio watermarking with Lyria (November 2023)

Text and video extension (May 2024)

Open-source release of SynthID-Text (October 2024)

SynthID Detector portal (May 2025)

Adoption beyond Google

Industry-wide adoption and a Content Detection API (May 2026)

How does SynthID work? Technical Details

SynthID-Image

Robustness claims

SynthID-Audio

SynthID-Text

Tournament sampling

g-values

Distortionary versus non-distortionary variants

Detection

Configuration parameters in Hugging Face Transformers

Evaluation in the Nature paper

SynthID-Video

Where is SynthID used? Implementations and Deployment

Significance

What are the limitations of SynthID?

Text-specific limitations

Ecosystem and policy limitations

Asymmetry of false positives and false negatives

How does SynthID compare to other provenance approaches?

See also

References

Improve this article

What links here (24 of 25)

What links here (24 of 25)

Infobox

History and Timeline

When was SynthID first released? Image watermarking (August 2023)

Audio watermarking with Lyria (November 2023)

Text and video extension (May 2024)

Open-source release of SynthID-Text (October 2024)

SynthID Detector portal (May 2025)

Adoption beyond Google

Industry-wide adoption and a Content Detection API (May 2026)

How does SynthID work? Technical Details

SynthID-Image

Robustness claims

SynthID-Audio

SynthID-Text

Tournament sampling

g-values

Distortionary versus non-distortionary variants

Detection

Configuration parameters in Hugging Face Transformers

Evaluation in the Nature paper

SynthID-Video

Where is SynthID used? Implementations and Deployment

Significance

What are the limitations of SynthID?

Text-specific limitations

Cross-modal limitations

Ecosystem and policy limitations

Asymmetry of false positives and false negatives

How does SynthID compare to other provenance approaches?

Related Work

See also

References

Improve this article

Related Articles

Frontier Safety Framework (Google DeepMind)

Gemini (language model)

Imagen (text-to-image model)

Veo

BigGAN

Genie 3

What links here (24 of 25)

Related Articles

Frontier Safety Framework (Google DeepMind)

Gemini (language model)

Imagen (text-to-image model)

Veo

BigGAN

Genie 3

What links here (24 of 25)