Gao led the 2024 "Large Language Models: A Survey" — one of the broader synthesis papers of the LLM era, organizing the literature not by paper but by the questions that distinguish one architectural lineage from another (decoder-only versus encoder-decoder, instruction-tuning lineages, alignment-training variations, modality extensions). The framing is unusual in that it treats LLMs as a research artifact to be classified and analyzed at the species level, not as a single thing to be cheered for or against. His longer body of work in conversational AI and grounding gives that taxonomical view depth — he has seen which architectural choices survive contact with deployment and which don't.

Worth following when
you want a taxonomically organized map of the LLM landscape from someone with the industrial-research vantage point to know which distinctions actually matter in practice.
Topics
large-scale survey of the LLM landscape; conversational AI and grounding; the architectural lineages of modern language models.
Key works
"Large Language Models: A Survey" (2024, senior author with collaborators); long body of work on conversational AI and language grounding; Microsoft Research Redmond publications on foundation models.