TALK CANCELLED: Cross domain similarities and intra-person changes

  • Professor Maria Liakata - University of Warwick
  • Wednesday 10 February 2021, 15:00-16:00
  • Online.

If you have a question about this talk, please contact Ben Karniely.

This talk has been cancelled.

I will talk about two conceptually interconnected lines of NLP work in my group: on the one hand, identifying semantic similarities between instances (sentences or longer texts, but also entities) across domains; on the other, detecting changes within the same person or domain over time.

Even though semantic similarity is a fundamental task within NLP, it can be very challenging when comparisons are made across domains, as the vocabulary and context can differ greatly from one domain to another. I will talk about recent work of ours that addresses semantic similarity between two texts in a variety of datasets, including community question answering, by injecting domain-specific topic model information into pre-trained language models [1]. I will also discuss how current models struggle with cross-domain entity similarity (and co-reference more specifically), some of the reasons behind this, and a new resource to help address the problem [2].

The second part of my talk can be seen as the flip side of semantic similarity, where the goal is to look for differences in the representation of the same individual (word or person) that signal a change. I will discuss our work on sequential modelling of the evolution of a word for semantic change detection [3] and how we are developing methods to detect changes in individuals as part of my UKRI Turing AI fellowship.

[1] Peinelt, N., Nguyen, D., & Liakata, M. (2020, July). tBERT: Topic models and BERT joining forces for semantic similarity detection. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (pp. 7047-7055).

[2] Ravenscroft, J., Cattan, A., Clare, A., Dagan, I., & Liakata, M. (2021). CD2CR: Co-reference Resolution Across Documents and Domains. arXiv preprint arXiv:2101.12637. (Accepted at EACL 2021.)

[3] Tsakalidis, A., & Liakata, M. (2020, November). Sequential Modelling of the Evolution of Word Representations for Semantic Change Detection. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 8485-8497).

This talk is part of the Computer Laboratory Wednesday Seminars series.


© 2006-2021, University of Cambridge.