Models that prove their own correctness
- đ¤ Speaker: Orr Paradise (UC Berkeley) đ Website
- đ Date & Time: Thursday 09 January 2025, 14:00 - 15:00
- đ Venue: Computer Laboratory, William Gates Building, FW26
Abstract
This talk introduces Self-Proving models, a new class of models that formally prove the correctness of their outputs via an Interactive Proof system. After reviewing some related literature, I will formally define Self-Proving models and their per-input (worst-case) guarantees. I will then present algorithms for learning these models and explain how the complexity of the proof system affects the complexity of the learning algorithms. Finally, I will show experiments where Self-Proving models are trained to compute the Greatest Common Divisor of two integers, and to prove the correctness of their results to a simple verifier.
No prior knowledge of autoregressive models or Interactive Proofs will be assumed of the listener. This is a joint work with Noga Amit, Shafi Goldwasser, and Guy Rothblum.
Series This talk is part of the Algorithms and Complexity Seminar series.
Included in Lists
- Algorithms and Complexity Seminar
- All Talks (aka the CURE list)
- bld31
- Cambridge talks
- Computer Laboratory, William Gates Building, FW26
- Department of Computer Science and Technology talks and seminars
- Interested Talks
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)



Thursday 09 January 2025, 14:00-15:00