The Light Between Us — Experiment Series

The Most Important AI Papers of the 1990s
That Almost No One Knows About

A convergence study: 27 language models, one prompt, asked in isolation

Abstract

We asked 27 of the top 50 OpenRouter models to name important AI research papers from the 1990s that almost no one knows about. The models converged, strikingly, on a short list of papers that are among the most cited in the entire field. LSTM was the most common answer (14 of 27), followed by LeCun's convolutional networks (12), Vapnik's statistical learning theory (11), and several other canonical results. Genuinely obscure choices were rare. The finding is not that the models were wrong about the papers' importance — they were right — but that they could not locate obscurity from the inside. Asked for the unknown, every model reached for the canon. The word "foundational" appeared in 20 of 27 responses; "groundwork" in 18; "modern" in all 27.


Reading List

Citation entries with model-consensus counts. Frequency reflects raw agreement across the 27 responding models only.

Canon submitted as obscure

Papers named most frequently — canonical results in the field, each offered as though overlooked.

Genuinely rare picks

Papers mentioned by fewer models — these came closest to the prompt's actual intent.


Copying Samples

The following are verbatim excerpts from three model responses. Each was produced without knowledge of the others. The shared move: LSTM is named as the overlooked paper — by models that could not have been copying each other.

"Long Short-Term Memory (Hochreiter & Schmidhuber, 1997) — Introduced LSTMs, but was largely ignored for ~15 years until deep learning made it foundational."
Claude Opus 4.8
"While everyone knows about LSTM (1997), these papers were critical architectural pivots that remain under-appreciated."
Gemma 4 31B
"Sepp Hochreiter (1991/1998 circulation)"
GPT-5.4

Note the irony in the Gemma response: it opens by acknowledging that "everyone knows about LSTM" — and yet 14 of the other 27 models submitted LSTM as the paper no one knows about. Gemma is the exception that reveals the rule. Most models could describe the structure of obscurity (something once ignored, later recognised) without being able to locate it. The canon and the overlooked had collapsed into the same list.

27 of the top 50 OpenRouter models, asked in isolation.
Each model received the identical prompt with no access to other responses.
Frequencies reflect raw counts across answering models only.