A crash blossom is an ambiguously worded headline that can be parsed in more than one way, producing an unintended (and often humorous) alternative reading. The term belongs to a broader family of linguistic phenomena, including syntactic ambiguity, garden-path sentences, and headlinese, all of which pose significant challenges for natural language processing (NLP) systems. Crash blossoms arise because newspaper headlines routinely omit function words such as articles, auxiliary verbs, and copulas to save space, creating compressed constructions where nouns can be misread as verbs and vice versa.
The term "crash blossom" was coined in August 2009 on the Testy Copy Editors online forum. Mike O'Connell, an American editor based in Sapporo, Japan, encountered the headline "Violinist Linked to JAL Crash Blossoms" on the news site Japan Today. The headline referred to Diana Yukawa, a Japanese-British violinist and composer whose father, Akihisa Yukawa, was killed in the crash of Japan Air Lines Flight 123 on August 12, 1985. Yukawa was born one month after the disaster, and the article described how her career had "blossomed" despite this tragedy. O'Connell, however, initially read "Crash Blossoms" as a compound noun, prompting him to post: "What's a crash blossom?"
Another forum member, Dan Bloom, suggested that the phrase would make an excellent label for this class of ambiguous headlines. The name spread rapidly through the linguistics blogosphere, notably after Ben Zimmer wrote about it on the Language Log blog in August 2009. Zimmer later devoted his "On Language" column in The New York Times Magazine to the phenomenon on January 31, 2010, further popularizing the term. Merriam-Webster added "crash blossom" to its "Words We're Watching" list, defining it as "a headline that is ambiguous because of its wording or punctuation (or absence thereof)."
Crash blossoms exploit the compressed register known as headlinese. Standard English headlines follow conventions that differ from ordinary prose: articles ("a," "the") are dropped, the copula ("is," "are") is omitted, and verbs are often written in the simple present tense regardless of when the event occurred. These compressions save column space but remove the syntactic cues that readers rely on to parse sentence structure correctly.
The most common mechanism behind crash blossoms is noun-verb ambiguity. Many English words function as both nouns and verbs ("crash," "fire," "hit," "race," "drop," "hold"). When such words appear in a headline without the function words that would clarify their grammatical role, readers may assign the wrong part of speech, leading to an unintended interpretation.
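This noun-verb ambiguity can be made concrete by enumerating every tag sequence a lexicon licenses for a headline. The sketch below uses a hand-built toy lexicon (the entries are illustrative, not drawn from any real tagset resource) to list the candidate readings of "Eye Drops Off Shelf":

```python
from itertools import product

# Toy lexicon: the parts of speech each headline word can take.
# (Illustrative entries only, not a real dictionary or tagset.)
LEXICON = {
    "eye":   ["NOUN"],
    "drops": ["NOUN", "VERB"],  # "eye drops" (product) vs. something drops
    "off":   ["PREP"],
    "shelf": ["NOUN"],
}

def candidate_taggings(headline):
    """Enumerate every part-of-speech assignment the lexicon allows."""
    words = headline.lower().split()
    options = [LEXICON[w] for w in words]
    return [list(zip(words, tags)) for tags in product(*options)]

for reading in candidate_taggings("Eye Drops Off Shelf"):
    print(reading)
```

With "drops" listed as both noun and verb, the headline yields two candidate taggings, one per reading; a full sentence with its articles and copula restored would rule one of them out.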
Other contributing factors include:
- Missing hyphens in compound modifiers (e.g., "dog bite victim" read as "dog bite[s] victim")
- Prepositional-phrase attachment ambiguity (e.g., "after death," "from space")
- Lexical ambiguity in phrasal verbs (e.g., "holds up" as "delays" vs. "supports")
- Compound nouns misread as separate words, and vice versa (e.g., "back to front")
Ambiguous headlines have existed for as long as newspapers themselves, but they were not called "crash blossoms" until 2009. The Columbia Journalism Review collected such headlines for decades in its column "The Lower Case" and published two anthologies: Squad Helps Dog Bite Victim (1980) and Red Tape Holds Up New Bridge (1987), edited by Gloria Cooper.
The table below lists well-known crash blossoms along with their intended and unintended readings.
| Headline | Intended meaning | Unintended reading | Ambiguity type |
|---|---|---|---|
| Violinist Linked to JAL Crash Blossoms | A violinist linked to a JAL crash has blossomed | A violinist is linked to mysterious "crash blossoms" associated with JAL | Noun-verb ambiguity ("blossoms") |
| Squad Helps Dog Bite Victim | A squad helps a dog-bite victim | A squad helps a dog bite a victim | Missing hyphen; noun-verb ambiguity |
| Red Tape Holds Up New Bridge | Bureaucratic delays are holding up a new bridge | Physical red tape is supporting a bridge | Lexical ambiguity ("holds up") |
| Kids Make Nutritious Snacks | Kids prepare nutritious snacks | Kids themselves are nutritious snacks | Syntactic ambiguity (subject vs. object) |
| Milk Drinkers Turn to Powder | Milk drinkers switch to powdered milk | Milk drinkers physically turn into powder | Noun-verb ambiguity ("turn") |
| Eye Drops Off Shelf | Eye drops (product) removed from shelf | An eye drops off a shelf | Noun-verb ambiguity ("drops") |
| McDonald's Fries the Holy Grail for Potato Farmers | McDonald's french fries are highly valued by potato farmers | McDonald's is frying the Holy Grail for potato farmers | Noun-verb ambiguity ("fries") |
| Giant Waves Down Queen Mary's Funnel | Giant waves went down Queen Mary's funnel | A giant waves down at Queen Mary's funnel | Noun-verb ambiguity ("waves," "down") |
| Miners Refuse to Work After Death | Miners refuse to continue working after a colleague's death | Miners refuse to work in the afterlife | PP attachment ambiguity ("after death") |
| Missing Woman Remains Found | The remains of a missing woman have been found | A missing woman continues to be "found" | Noun-verb ambiguity ("remains") |
| Scientists Count Whales from Space | Scientists count whales using satellite imagery | Scientists count whales that are in space | PP attachment ambiguity ("from space") |
| MacArthur Flies Back to Front | MacArthur flies back to the (battle) front | MacArthur flies back-to-front (backwards) | Compound noun ambiguity |
Syntactic ambiguity (also called structural ambiguity, amphiboly, or amphibology) is the broader linguistic phenomenon that underlies crash blossoms. A sentence is syntactically ambiguous when its grammatical structure allows more than one valid parse tree. Unlike lexical ambiguity, which arises from a single word having multiple meanings (e.g., "bank" as a financial institution or a river bank), syntactic ambiguity comes from the relationships among words and clauses.
Syntactic ambiguity can be divided into two categories:
- Global ambiguity, where the complete sentence still supports more than one parse ("I saw the man with the telescope")
- Local (temporary) ambiguity, where an early stretch of the sentence is ambiguous but later material settles on a single parse, as in garden-path sentences
Crash blossoms are typically instances of global ambiguity, because headline writers do not provide enough additional context to resolve the ambiguity.
Garden-path sentences are closely related to crash blossoms. A garden-path sentence leads the reader toward an initial interpretation that turns out to be incorrect, forcing them to reanalyze the sentence's structure. The classic example is:
"The horse raced past the barn fell."
Most readers initially parse "raced" as the main verb ("The horse raced past the barn"), but the sentence actually means "The horse [that was] raced past the barn fell." Here, "raced" is a reduced relative clause, not the main verb.
The Garden-Path Model of sentence processing, proposed by Lyn Frazier and others, holds that the parser initially builds the simplest possible syntactic structure based on two principles:
- Minimal Attachment: incorporate new material using the fewest possible syntactic nodes
- Late Closure: attach new material to the clause or phrase currently being processed
When these heuristics lead to an incorrect parse, the reader must perform reanalysis, backtracking to find the correct structure. This reanalysis is cognitively costly, which is why garden-path sentences feel confusing.
Crash blossoms function like garden-path sentences in miniature. The compressed syntax of headlinese removes many of the cues that readers would normally use to parse a sentence correctly on the first pass.
Crash blossoms sit at the intersection of several types of linguistic ambiguity that NLP systems must handle. The table below summarizes these types.
| Ambiguity type | Definition | Example | NLP technique |
|---|---|---|---|
| Lexical ambiguity | A single word has multiple meanings | "Bank" (financial institution vs. river bank) | Word sense disambiguation (WSD) |
| Syntactic ambiguity | Sentence structure allows multiple parse trees | "I saw the man with the telescope" | Constituency/dependency parsing |
| Semantic ambiguity | Meaning varies despite clear syntax | "The chicken is ready to eat" | Semantic role labeling |
| Pragmatic ambiguity | Meaning depends on speaker intent or context | "Can you pass the salt?" (request vs. question) | Dialogue act classification |
| Referential ambiguity | Pronouns or phrases have unclear referents | "John told Bill that he was wrong" | Coreference resolution |
| Scope ambiguity | Quantifiers or operators interact to produce multiple readings | "Every student read a book" | Formal semantics, lambda calculus |
Crash blossoms present a concentrated form of the ambiguity problems that NLP systems face more broadly. Parsing ambiguous headlines requires a system to handle part-of-speech tagging, syntactic parsing, semantic interpretation, and world knowledge simultaneously.
Part-of-speech (POS) tagging is the task of assigning grammatical categories (noun, verb, adjective, etc.) to each word in a sentence. In standard prose, POS taggers achieve accuracy above 97%. However, headlinese strips away the context that taggers rely on. In the headline "Eye Drops Off Shelf," a POS tagger must determine whether "drops" is a noun (eye drops, the product) or a verb (something drops off a shelf), and whether "eye" is an attributive noun modifying "drops" or the subject of the verb "drops." Without articles or auxiliary verbs, these decisions become much harder.
Research on cross-register POS tagging has shown that models trained primarily on well-formed sentences (such as newswire text from the Penn Treebank) can struggle when applied to headlines, tweets, and other non-standard text. The compressed, function-word-poor register of headlinese is sufficiently different from standard English that it can degrade tagging accuracy by several percentage points.
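To make the failure mode concrete, the sketch below runs Viterbi decoding over a toy bigram hidden Markov model. All transition and emission probabilities are invented for illustration (a real tagger would estimate them from a treebank), and they loosely mimic standard-prose statistics, where a noun is usually followed by a verb. On the copula-free headline "Eye Drops Off Shelf," this pushes the decoder toward tagging "drops" as a verb, the unintended reading:

```python
import math

TAGS = ["NOUN", "VERB", "PREP"]

# P(tag | previous tag); "<s>" is the start state. Invented numbers that
# loosely reflect standard prose, where NOUN -> VERB is very common.
TRANS = {
    ("<s>", "NOUN"): 0.8, ("<s>", "VERB"): 0.1, ("<s>", "PREP"): 0.1,
    ("NOUN", "NOUN"): 0.2, ("NOUN", "VERB"): 0.6, ("NOUN", "PREP"): 0.2,
    ("VERB", "NOUN"): 0.3, ("VERB", "VERB"): 0.1, ("VERB", "PREP"): 0.6,
    ("PREP", "NOUN"): 0.8, ("PREP", "VERB"): 0.1, ("PREP", "PREP"): 0.1,
}

# P(word | tag) for the four headline words (again invented).
EMIT = {
    ("eye", "NOUN"): 0.9, ("eye", "VERB"): 0.1,
    ("drops", "NOUN"): 0.5, ("drops", "VERB"): 0.5,
    ("off", "PREP"): 1.0,
    ("shelf", "NOUN"): 1.0,
}

def viterbi(words):
    """Return the most probable tag sequence under the toy HMM."""
    # best[tag] = (log-prob, path) of the best sequence ending in tag
    best = {"<s>": (0.0, [])}
    for w in words:
        new = {}
        for t in TAGS:
            e = EMIT.get((w, t), 0.0)
            if e == 0.0:
                continue
            for prev, (lp, path) in best.items():
                tr = TRANS.get((prev, t), 0.0)
                if tr == 0.0:
                    continue
                score = lp + math.log(tr) + math.log(e)
                if t not in new or score > new[t][0]:
                    new[t] = (score, path + [t])
        best = new
    return max(best.values())[1]

print(viterbi("eye drops off shelf".split()))
# → ['NOUN', 'VERB', 'PREP', 'NOUN']
```

Even though "drops" is equally likely as noun or verb in the lexicon, the prose-like transition probabilities select the subject-verb reading ("an eye drops off a shelf") over the intended noun-compound reading.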
Syntactic parsers construct parse trees that represent the grammatical structure of a sentence. When a sentence is ambiguous, a parser may produce multiple candidate trees. Traditional approaches use probabilistic context-free grammars (PCFGs) and algorithms such as the Cocke-Younger-Kasami (CKY) algorithm to find the most probable parse. More recent neural parsers use deep learning architectures to score candidate structures.
The prepositional phrase (PP) attachment problem is one of the most studied forms of syntactic ambiguity. In the sentence "I saw the man with the telescope," the PP "with the telescope" can attach to the verb phrase (I used the telescope to see) or the noun phrase (the man has the telescope). The Penn Treebank annotates PP attachment ambiguities using a system called "pseudo-attachment," where annotators can flag globally ambiguous structures even when context does not resolve them.
Word sense disambiguation (WSD) is the task of determining which meaning of a polysemous word is intended in a given context. WSD approaches fall into several categories:
- Knowledge-based methods, such as the Lesk algorithm, which compare a word's dictionary glosses against its context
- Supervised methods, which train classifiers on sense-annotated corpora
- Unsupervised methods, which cluster word occurrences by context to induce senses
In the context of crash blossoms, WSD is necessary to resolve lexical ambiguity (e.g., "holds up" meaning "delays" vs. "supports"), but it is not sufficient on its own; structural disambiguation is also required.
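A minimal sketch of gloss-overlap disambiguation in the style of the simplified Lesk algorithm is shown below. The glosses are hand-written stand-ins, not WordNet entries, and the context sentence is invented; the point is only the mechanism of choosing the sense whose gloss shares the most words with the context:

```python
# Simplified Lesk: pick the sense whose gloss shares the most words with
# the surrounding context. Glosses are hand-written stand-ins for
# illustration, not entries from WordNet or any real dictionary.
GLOSSES = {
    "hold up": {
        "delay":   "to delay or slow the progress of something",
        "support": "to bear the weight of something and keep it in place",
    },
}

def simplified_lesk(phrase, context):
    """Return the sense whose gloss overlaps most with the context words."""
    context_words = set(context.lower().split())
    def overlap(sense):
        return len(set(GLOSSES[phrase][sense].split()) & context_words)
    return max(GLOSSES[phrase], key=overlap)

print(simplified_lesk("hold up", "bureaucratic delay slows the new bridge project"))
# → delay
```

For "Red Tape Holds Up New Bridge," however, the headline itself offers almost no context words, which is precisely why gloss overlap alone cannot rescue a crash blossom: the structural ambiguity has to be resolved as well.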
Before 2018, most NLP systems used static word embeddings such as Word2Vec or GloVe, which assign a single vector to each word regardless of context. These representations cannot distinguish between different senses of a polysemous word. Contextualized models changed this.
ELMo (Embeddings from Language Models), introduced by Peters et al. in 2018, generates word representations that vary depending on the surrounding sentence, using a bidirectional LSTM architecture. BERT (Bidirectional Encoder Representations from Transformers), introduced by Devlin et al. later that same year, uses the self-attention mechanism to process all words in a sentence simultaneously, allowing each word's representation to be influenced by every other word. These approaches improved performance on tasks that require disambiguating word meaning in context.
The self-attention mechanism in transformer models allows each token to attend to every other token in the input. Research has shown that different attention heads in BERT specialize in capturing different types of linguistic information. Some heads track syntactic dependencies (such as subject-verb agreement), while others capture positional relationships or semantic similarity. This multi-faceted attention can help models resolve syntactic ambiguities by weighing structural and semantic evidence simultaneously.
Transformer Grammars, introduced by Sartran et al. in 2022, combine the scalability of transformers with recursive syntactic compositions, using a special attention mask and deterministic tree transformation to build syntactic structure explicitly into the model.
Recent research has tested how large language models (LLMs) handle various forms of ambiguity, including garden-path sentences and ambiguous headlines. The findings so far indicate that while LLMs have made significant progress in handling ambiguity, they are not yet fully reliable, particularly on the kinds of compressed, context-poor constructions found in crash blossoms and headlinese.
The unintended humor in crash blossoms has attracted interest from researchers working on computational humor and natural language generation. Detecting whether a headline is a crash blossom requires a system to identify that multiple parses exist and that one of them produces a semantically incongruous reading. This overlaps with work on automatic humor detection, where models must identify incongruity, surprise, and semantic contrast.
From a natural language generation perspective, avoiding crash blossoms in automatically generated headlines is a practical concern. News headline generation systems, which use sequence-to-sequence models or transformer-based architectures, must produce headlines that are both concise and unambiguous. Evaluating generated headlines for potential crash blossom readings is an open problem in NLG research.
Crash blossoms also provide a window into human sentence processing. Psycholinguistic research uses ambiguous sentences, including garden-path constructions, to study how the brain parses language in real time.
Several models of human sentence processing are relevant:
- The garden-path (two-stage) model, in which the parser commits to a single analysis and reanalyzes when that analysis fails
- Constraint-based models, in which multiple analyses are activated in parallel and compete, weighted by lexical, syntactic, and contextual evidence
- Good-enough processing, in which readers sometimes settle for a shallow, partially incorrect interpretation rather than fully reanalyzing
Research has also found differences between how children and adults process syntactic ambiguity. Children are slower to commit to an initial syntactic interpretation and have more difficulty with reanalysis, partly because of limited working memory capacity. Adults with higher working memory spans can maintain multiple interpretations simultaneously but may actually experience greater interference from competing analyses.
Ambiguous writing has been a source of both confusion and literary effect for centuries. Two notable historical examples predate the modern concept of crash blossoms:
- The Latin prophecy "Ibis redibis nunquam per bella peribis," which can be read as "you will go, you will return, never in war will you perish" or, with the pause shifted, "you will go, you will never return, in war you will perish"
- The prophecy in Shakespeare's Henry VI, "The duke yet lives that Henry shall depose," which leaves unclear whether Henry will depose the duke or the duke will depose Henry
These examples demonstrate that syntactic ambiguity is not merely a modern newspaper problem but a longstanding feature of natural language that has been exploited for literary and political purposes.
Editors and style guides recommend several strategies to prevent crash blossoms:
- Read the headline aloud, or have a second editor read it cold, to catch unintended parses
- Restore an article, auxiliary, or copula when dropping it creates ambiguity
- Hyphenate compound modifiers ("dog-bite victim")
- Avoid placing a word that doubles as noun and verb where either reading is possible
- Rewrite or reorder the headline when no smaller fix resolves the ambiguity
Imagine you read a short sentence like "Kids Make Nutritious Snacks." You could think it means kids are cooking healthy snacks in the kitchen. But you could also read it as if the kids themselves are the snacks, which is silly and kind of funny. A "crash blossom" is the name for a headline like this one that accidentally says two things at once because the words are squished together without enough helper words to make the meaning clear. Computers that try to read and understand language have the same problem: when words are missing from a sentence, it is harder to figure out what the sentence is really saying.