Eric Horvitz
The qualitative shape of language-model capability: what it looks like as a phenomenon, and whether we have the vocabulary to describe it before we have the methodology to measure it.
The Microsoft Research paper "Sparks of Artificial General Intelligence", which Horvitz co-authored in 2023, became one of the field's most-cited qualitative evaluations of GPT-4: a hundred-plus pages of examples showing the model doing things its predecessors couldn't, framed as preliminary evidence of capabilities approaching general intelligence. The paper drew immediate criticism for its lack of a systematic protocol, but the criticism mostly missed what made the document influential: it gave the industry a vocabulary for what it was looking at before anyone had built rigorous instruments to measure it. Horvitz's longer career (probabilistic AI, medical decision support, the AAAI presidency, the build-out of Microsoft's responsible-AI infrastructure) gives him a standing that casual readers of "Sparks" usually miss.