Attention Forcing: Improving attention-based sequence-to-sequence models
- Speaker: Qingyun Dou, University of Cambridge
- Date & Time: Thursday 30 March 2023, 14:00-15:00
- đ Venue: Hybrid: LT6, First floor Baker building, Engineering Dept or Zoom: https://eng-cam.zoom.us/j/89657740934?pwd=d1RUR29PenZXUlFQNVNVeU8zN2xoUT09
Abstract
Autoregressive sequence-to-sequence models with attention mechanisms have achieved state-of-the-art performance in various tasks, including Neural Machine Translation (NMT), Automatic Speech Recognition (ASR) and Text-To-Speech (TTS). This talk introduces attention forcing, a group of training approaches that address the training-inference mismatch of such models. Under teacher forcing, the standard training approach for autoregressive models, the model is guided by the reference output history; during inference, however, it must rely on the generated output history. To reduce this mismatch, attention forcing guides the model with the generated output history and the reference attention. Extensions of this general framework will be introduced for more challenging applications. For example, most approaches addressing the training-inference mismatch are incompatible with parallel training, which is essential for Transformer models; in contrast, the parallel version of attention forcing supports parallel training, and hence Transformer models. The effectiveness of attention forcing will be demonstrated by experiments in TTS and NMT.
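The contrast between the two training regimes can be illustrated with a minimal sketch, assuming a PyTorch-style attention decoder; all module and method names here (`decoder.attend`, `decoder.step`, `decoder.nll`, etc.) are hypothetical stand-ins, not an API from the talk:

```python
import torch
import torch.nn.functional as F

# Minimal sketch of teacher forcing vs. attention forcing for an
# autoregressive attention-based decoder. All names are hypothetical.

def teacher_forcing_step(decoder, encodings, reference):
    """Standard training: condition each step on the reference history."""
    loss = 0.0
    state, y_prev = decoder.init_state(), decoder.start_token()
    for t in range(reference.size(0)):
        # Attention and context come from the model's own state.
        context, _ = decoder.attend(state, encodings)
        y_pred, state = decoder.step(y_prev, context, state)
        loss = loss + decoder.nll(y_pred, reference[t])
        y_prev = reference[t]            # reference output history
    return loss

def attention_forcing_step(decoder, encodings, reference, ref_attention):
    """Attention forcing: condition on the *generated* output history,
    while guiding the model with reference attention (e.g. obtained from
    a teacher-forced pass of a pretrained model)."""
    loss = 0.0
    state, y_prev = decoder.init_state(), decoder.start_token()
    for t in range(reference.size(0)):
        _, alpha = decoder.attend(state, encodings)
        # Form the context vector from the reference alignment.
        context_ref = ref_attention[t] @ encodings
        y_pred, state = decoder.step(y_prev, context_ref, state)
        # Output loss, plus a KL term pulling the model's attention
        # towards the reference attention.
        loss = loss + decoder.nll(y_pred, reference[t])
        loss = loss + F.kl_div(alpha.log(), ref_attention[t],
                               reduction="sum")
        y_prev = y_pred.detach()         # generated output history
    return loss
```

The key design point the sketch tries to capture: the output history seen during training matches what the model will see at inference, while the reference attention keeps the alignment stable enough for training to converge.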
Series
This talk is part of the CUED Speech Group Seminars series.
Included in Lists
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- CUED Speech Group Seminars
- Guy Emerson's list
- Information Engineering Division seminar list
- PhD related