Machines that can read lips
- đ¤ Speaker: Pingchuan Ma (Imperial College London)
- đ Date & Time: Monday 25 October 2021, 12:00 - 13:00
- đ Venue: Zoom: https://eng-cam.zoom.us/j/81927138251?pwd=TVd3MXliV003dUdYVlFwU2NDWGpmdz09
Abstract
Abstract: Decades of research in acoustic speech recognition have led to systems that we use in our everyday life. However, even the most advanced speech recognition systems fail in the presence of noise. The degraded performance can be (partially) addressed by introducing visual speech information. In this talk, we will see how deep learning has made this possible and also present our works in visual speech recognition (lip-reading).
Bio: Pingchuan Ma is a fourth-year Ph.D. student in the Intelligent Behaviour Understanding Group (IBUG) at Imperial College London, advised by Prof. Maja Pantic and Dr. Stavros Petridis. Before that, He received an MSc degree in Machine Learning from Imperial College London in 2017 and received a BSc degree in Automation from Beihang University in 2015. He was a research intern at Facebook AI Applied Research (FAIAR) in 2021.
Series This talk is part of the CUED Speech Group Seminars series.
Included in Lists
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- CUED Speech Group Seminars
- Guy Emerson's list
- Information Engineering Division seminar list
- PhD related
- Zoom: https://eng-cam.zoom.us/j/81927138251?pwd=TVd3MXliV003dUdYVlFwU2NDWGpmdz09
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Monday 25 October 2021, 12:00-13:00