1 business Simon Willison single-source 1 article

Analysis: synthetic pelican images used to poison data sets

Simon Willison comments on efforts to inject synthetic pelican-bicycle images into AI training datasets as a form of data poisoning.

via Simon Willison

🔍 Let's dive in

Simon Willison discusses Steve Cosman's project to deliberately introduce synthetic images of pelicans riding bicycles into AI training datasets. Willison characterizes this as a form of training data poisoning, a technique to degrade model performance or introduce unwanted behaviors. He notes that similar synthetic data injection efforts have been published previously, framing the practice as commentary on AI training data integrity.

Lead coverage: Simon Willison, scosman/pelicans_riding_bicycles

🕰 The timeline · 1 source

Simon Willison · first-party opinion · 2d ago · 1/5

scosman/pelicans_riding_bicycles


I firmly approve of Steve Cosman's efforts to pollute the training set of pelicans riding bicycles.
- Simon Willison

🔧 Debug

Cluster ID: 02fd63673f
Importance (max): 1
Members: 1
Sources: Simon Willison
Earliest: 2026-04-21T15:54:43.000Z
Latest: 2026-04-21T15:54:43.000Z
Lead URL: https://simonwillison.net/2026/Apr/21/scosman