AI Seminar Series 2023: Yuqiao Wen

The AI Seminar is a weekly meeting at the University of Alberta where researchers interested in artificial intelligence (AI) can share their research. Presenters include both local speakers from the University of Alberta and visitors from other institutions. Topics can be related in any way to artificial intelligence, from foundational theoretical work to innovative applications of AI techniques to new fields and problems.

On March 17, Yuqiao Wen, a PhD student at the University of Alberta, presented “f-Divergence Minimization for Sequential Knowledge Distillation” at the AI Seminar.

Knowledge distillation (KD) is the process of transferring knowledge from a large model to a small one. It has gained increasing attention in the natural language processing community, driven by the demands of compressing ever-growing language models.

In this work, Wen proposes f-distill, a framework that formulates sequential knowledge distillation as minimizing a generalized f-divergence function. He and his co-authors propose four distilling variants under this framework and show that the existing SeqKD and ENGINE approaches are approximations of their f-distill methods. They further derive a step-wise decomposition for f-distill, reducing the intractable sequence-level divergence to word-level losses that can be computed tractably.
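To make the idea of word-level losses concrete, here is a minimal sketch in Python. It is an illustration only, not the authors' implementation. The summary above does not name the four variants, so the divergences used here (KL, reverse KL, Jensen-Shannon, and total variation distance) are an assumption chosen because they are common f-divergences. The helper names and the toy distributions are hypothetical, and the per-step teacher and student distributions are assumed to be given already (in practice they would come from the models' softmax outputs along sampled sequences).

```python
import math

def kl(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary.
    Assumes q is strictly positive wherever p is (true for softmax outputs)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def reverse_kl(p, q):
    """KL with the argument order swapped: KL(q || p)."""
    return kl(q, p)

def js(p, q):
    """Jensen-Shannon divergence, a symmetric f-divergence."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def tvd(p, q):
    """Total variation distance, another symmetric f-divergence."""
    return 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))

def stepwise_loss(teacher_steps, student_steps, divergence):
    """Sum a word-level divergence over the decoding steps of one sequence."""
    return sum(divergence(p, q) for p, q in zip(teacher_steps, student_steps))

# Toy example (made-up numbers): two decoding steps, vocabulary of three words.
teacher = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
student = [[0.5, 0.3, 0.2], [0.2, 0.6, 0.2]]

for name, div in [("KL", kl), ("reverse KL", reverse_kl), ("JS", js), ("TVD", tvd)]:
    print(name, stepwise_loss(teacher, student, div))
```

KL and reverse KL give different values when the two arguments are swapped, while JS and TVD do not. That is the sense in which the latter two are symmetric losses.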

Experiments across four datasets show that the f-distill methods outperform existing KD approaches, and that symmetric distilling losses can better force the student to learn from the teacher distribution.
