Modelling of idiomaticity in language
Tuesday 30 January 10:00 until 11:30
University of Sussex Campus : CHICHESTER 3, 3R143, GTS Lecture Theatre
Speaker: Aline Villavicencio
Part of the series: Sussex AI seminar

Aline Villavicencio is the Director of the Institute of Data Science and Artificial Intelligence, University of Exeter and also affiliated to the University of Sheffield.
Read more about Aline here: Professor Aline Villavicencio | Computer Science | The University of Sheffield
Title: Modelling of idiomaticity in language
Large language models have been successfully used for capturing distinct (and very specific) word usages, and therefore could provide an attractive alternative for accurately determining meaning in language. However, these models still face a serious challenge when dealing with non-literal language, like that involved in Multiword Expressions (MWEs) such as idioms (make ends meet), light verb constructions (give a sigh), verb particle constructions (shake up) and noun compounds (loan shark). MWEs are an integral part of the mental lexicon of native speakers often used to express complex ideas in a simple and conventionalised way accepted by a given linguistic community. Although they may display a wealth of idiosyncrasies, from lexical, syntactic and semantic to statistical, that represents a real challenge for current NLP techniques, their accurate integration has the potential for improving the precision, naturalness and fluency of downstream tasks like machine translation. In this talk, I will present an overview of how advances in word representations have made an impact for the identification and modelling of idiomaticity and MWEs. I will concentrate on what models seem to incorporate of idiomaticity, as idiomatic interpretation may require knowledge that goes beyond what can be gathered from the individual words of an expression (e.g. “dark horse” as an unknown candidate who unexpectedly succeeds).
By: Aleks Kossowska
Last updated: Tuesday, 23 January 2024