Retrieving and Sampling Diverse Outputs
- Speaker: Prof. Eunsol Choi (NYU)
- Date & Time: Thursday 13 November 2025, 14:00 - 15:00
- Venue: https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09
Abstract: Real-world user queries often contain questions that admit a wide range of valid answers without a single ground truth. However, large language models (LLMs) often struggle to generate diverse and comprehensive responses. In this talk, I will discuss two paths towards such responses: (1) retrieving a diverse set of documents and (2) sampling a large number of responses from LLMs. In the first part of the talk, I will quantify the limitations of existing dense retrievers, which generate a single query vector. Even strong retrievers struggle when the gold document set contains dissimilar targets. To address this, we present a new retriever architecture that autoregressively generates multiple distinct query vectors, each of which is used to retrieve documents from the corpus. In the second part of the talk, I will discuss inference strategies for sampling diverse outputs from LLMs. Prompting LLMs to sequentially generate a diverse set of answers works well for simpler factoid queries, but is less effective for more complex queries. We further explore merging outputs from multiple LLMs, showing its potential and challenges. I will conclude by discussing a multi-turn agentic framework that interleaves retrieval and generation from LLMs to craft a comprehensive answer.
Bio: Eunsol Choi is an assistant professor of computer science and data science at New York University. Her research spans natural language processing and machine learning, with a focus on interpreting and reasoning about text in dynamic real-world contexts. Prior to joining NYU, she was an assistant professor at the University of Texas at Austin and a visiting researcher at Google. She holds a Ph.D. in computer science and engineering from the University of Washington. She is a recipient of a Facebook research fellowship, a Google faculty research award, a Sony faculty award, an NSF CAREER award, and an outstanding paper award at EMNLP.
Series: This talk is part of the Language Technology Lab Seminars series.