Björn Schuller — Whom to read in AI

Schuller has organized the INTERSPEECH Computational Paralinguistics Challenge for over a decade — annual benchmarks on emotion, sincerity, native-language, depression, and other paralinguistic categories that voice-based systems either pick up on or miss. His openSMILE feature extractor is the toolkit much of academic affective-speech research still runs on, and his published record covers the evaluation methodology side of that subfield from its modern beginnings. For ai100, as AI engines move from text-only interfaces into voice-mode products, this is the literature that already worked out which paralinguistic signals matter, which can be reliably measured, and which still resist automated evaluation.

Worth following when: you need to evaluate language-model behavior in voice mode and want methodology for measuring what the speech channel carries beyond the literal words.
Topics: computational paralinguistics evaluation (emotion, affect, individual speaker characteristics); the INTERSPEECH challenges body of work; affective-speech feature extraction (openSMILE).
Key works: INTERSPEECH Computational Paralinguistics Challenge organization (annual, 2009 onward); openSMILE open-source feature extractor (ongoing); affective speech-processing publications from Imperial and TU Munich.