← Back to Timeline
2021From Labs to Tools

> DALL-E & Copilot_

Text-to-image generation. AI pair programming arrived.

> DEEP DIVE_

In January 2021, OpenAI unveiled DALL-E, a 12-billion-parameter neural network that could generate images from text descriptions. The name, a playful fusion of Salvador Dali and Pixar's WALL-E, hinted at the system's blend of surrealism and engineering. Built on a variant of GPT-3 adapted to treat image generation as a sequence prediction problem, DALL-E could produce images for prompts as whimsical as "an armchair in the shape of an avocado" or "a professional high-quality illustration of a giraffe turtle chimera." The results were imperfect but unmistakable: a machine could now translate natural language imagination into visual form.

DALL-E represented the convergence of two powerful threads in AI research: large language models and generative image models. By treating images as sequences of discrete tokens (using a technique called dVAE, or discrete variational autoencoder), the system could leverage the same autoregressive training that made GPT-3 so powerful. The approach was conceptually elegant: text and images were reduced to the same representation, allowing the model to learn cross-modal relationships between words and visual concepts. DALL-E 2, released in April 2022, dramatically improved quality by using a diffusion-based approach, producing photorealistic images that sparked equal measures of wonder and anxiety.

The same year saw another milestone in AI-assisted creativity. In June 2021, GitHub launched Copilot, powered by OpenAI's Codex model, a descendant of GPT-3 fine-tuned on billions of lines of public code. Copilot sat inside developers' code editors, suggesting entire functions, completing complex algorithms, and translating natural language comments into working code. Early studies suggested that developers using Copilot completed tasks up to 55% faster. The tool quickly became one of the most widely adopted AI products in history, with millions of developers relying on it daily.

But both DALL-E and Copilot raised urgent questions about intellectual property and the nature of creativity. DALL-E was trained on images scraped from the internet, many created by professional artists who had never consented to their work being used as training data. Copilot was trained on open-source code, and developers quickly discovered it could sometimes reproduce substantial portions of copyrighted code verbatim. Lawsuits followed, and the legal question of whether training AI on copyrighted material constitutes fair use remains unresolved. The year 2021 proved that AI could be a powerful creative collaborator, but it also proved that the legal and ethical frameworks for this collaboration were woefully unprepared for the reality.