Yiqiao Zhong (UWM): “How do LLMs generalize on out-of-distribution tasks? Insights from model’s internal representations”
Amy Gutmann Hall, Room 414, 3333 Chestnut Street, Philadelphia, United States
Abstract: A mystery of large language models (LLMs) is their ability to solve novel tasks, notably through a few demonstrations in the prompt (in-context learning). Such tasks often require the […]