Eric Wong (University of Pennsylvania): “Provable vs Impossible Trust: Reasoning, Steering, and Safety”
Amy Gutmann Hall, Room 414 3333 Chestnut Street, PhiladelphiaAbstract: Abstract: In this talk, I will discuss a collection of highlights from our recent work in trustworthy AI. (1) Certifying reasoning explanations with reliability guarantees and aligning with expert […]