Audio-Visual Learning: Challenges and New Approaches
- đ¤ Speaker: Dr Jie Pu, Cambridge University Engineering Department
- đ Date & Time: Tuesday 18 May 2021, 12:00 - 13:00
- đ Venue: Zoom: https://zoom.us/j/95352633552?pwd=RzJVK2UzOGZyNU5mVHd1Y1VPT2tDUT09
Abstract
Abstract: Audio-visual learning is a research topic that aims at exploiting the relationship between audio and visual modalities. By leveraging these two modalities, we could either improve the performance of previously considered single-modality tasks or address new challenging problems. With the success of deep-learning base methods, some challenging audio-visual problems that are infeasible before becomes possible, e.g. audio-visual generation. In this talk, I will present the recent development of audio-visual learning, along with my PhD works. Several interesting applications in audio-visual learning will be visited, such as audio-visual separation and localization, audio-visual speech recognition and enhancement, audio-visual generation. I will review state-of-the-art approaches on these applications, and also discuss some of the challenges and opportunities in the future.
Bio: Jie Pu is a research associate in the Machine Intelligence Laboratory, University of Cambridge. He is a member of the Speech Research Group and works with Professor Mark Gales. His work primarily focuses on audio-visual learning, computer vision and speech analysis. Prior to this, Jie completed his PhD with Professor Maja Pantic at Imperial College London.
Series This talk is part of the CUED Speech Group Seminars series.
Included in Lists
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- CUED Speech Group Seminars
- Guy Emerson's list
- Information Engineering Division seminar list
- PhD related
- Zoom: https://zoom.us/j/95352633552?pwd=RzJVK2UzOGZyNU5mVHd1Y1VPT2tDUT09
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Tuesday 18 May 2021, 12:00-13:00