← Back to the list
Jesse Dodge
What has to be in a paper about a language model for another lab to be able to verify the result.
Before Dodge's work, most LLM-efficiency publications compared results obtained under incompatible conditions — different hyperparameter-search budgets, different dataset variants, undocumented normalization steps that made head-to-head numbers misleading. The "Show Your Work" reporting standard he formulated and pushed at NeurIPS and ACL as a reviewer requirement turned reproducibility from a convention into an enforceable line. The same instinct drives his Green AI work — report not just accuracy but compute cost, because "percent correct" is not a comparable number across labs without it.