Yulia Tsvetkov
What kinds of bias and harm look like in language-model outputs across languages — especially the languages and communities the field's standard evaluation has historically ignored.
The standard pipeline for studying bias in language models has been English-first by default: train on English, evaluate on English, draw conclusions, then port to other languages as an afterthought. Tsvetkov's UW group runs the inverse pipeline — start with the languages and communities that English-centric methods don't reach, ask which bias and harm categories actually surface in those settings, and let the multilingual data shape the analytical categories from the start. For ai100, which evaluates engines in five language regions, this is the methodological backing for treating each locale's bias profile as its own object of study.