SpeechLLMs, that is, multimodal LLMs that ingest language in both text and speech form, have been a source of excitement and a subject of research for several years now.  The panel will examine the status of SpeechLLMs, and LLMs applied to speech processing in AI industry settings more broadly, discuss successes and obstacles along the way, and speculate about future directions.

  • Screenshot 2025-10-26 120512.png

    Amazon

  • Jinyu Li.jpg

    Microsoft

  • Screenshot 2025-10-14 154255.png

    IBM Research

  • Mike Seltzer

    Meta

  • kadrihacioglu.jpeg

    Uniphore, USA

  • Andreas_Stolcke.jpg

    Uniphore, US