5th Workshop "Computational Linguistics and Language Science"

The workshop on Computational Linguistics and Language Science will be held online on April 26.

As the number of digital texts grows rapidly, there is a pressing need for more advanced and diverse natural language processing tools. While purely statistical approaches have proved powerful and efficient for many NLP tasks, many applications would benefit from the formal models and approaches that traditional language science has to offer. The workshop hopes to facilitate this interaction between theory and practical implementation.

Invited speakers

Valentin Malykh (Huawei) Topic: SumTitles: a Summarization Dataset with Low Extractiveness
Boris Galitsky (NRU HSE, Oracle Inc.) Topic: Discourse Trees for Dialogue Management

Program of the workshop

10.00 – 11.00
Invited talk. SumTitles: a Summarization Dataset with Low Extractiveness

Valentin Malykh (Huawei)

email: valentin.malykh@huawei.com

11.00 – 11.20
Language model for error correction in Russian-language texts of foreign-language authors

Nikita Remnev (NRU HSE)

email: remnev.nikita@gmail.com

11.20 – 11.40
Constructing a dialog graph based on the play script

Alexey Podchezertsev (NRU HSE)

email: aepodchezertsev@edu.hse.ru

11.40 – 12.00
On a system for automatic correction of word formation errors in texts of students studying Russian as a foreign language

Ivan Smirnov (NRU HSE)

email: smirnof.van@gmail.com

12.10 – 12.30
Applying deep learning models extended with the structural linguistic information to the classification and ranking of texts

Alexander Chernyavskiy (NRU HSE)

email: alschernyavskiy@gmail.com

12.30 – 12.50
On the methods of computer processing of texts for low-resource languages

Maxim Kulaev (NRU HSE)

email: mkulaev@hse.ru

12.50 – 13.10
Sentiment analysis of the "VKontakte" social media posts within the context of the COVID-19 pandemic

Alina Lozovskaya (UrFU)

email: a.i.lozovskaya@yandex.ru

13.10 – 13.30
Classification of Texts Using a Vocabulary of Antonyms

Albina Giliazova (ICS RAS)

email: giliazova@mail.ru

14.10 – 15.10
Invited talk. Discourse Trees for Dialogue Management

Boris Galitsky (NRU HSE, Oracle Inc.)

email: bgalitskiy@hse.ru

15.10 – 15.30
Method of taxonomic content analysis of thematic text collections

Boris Mirkin, Dmitry Frolov (NRU HSE)

email: bmirkin@hse.ru

15.30 – 15.50
Linguistic representativeness of word embeddings

Amir Bakarov (NRU HSE)

email: amirbakarov@gmail.com

15.50 – 16.10
Morphophonological learning with multiple processes

Daniel Akim (Rutgers University)

email: dla91@scarletmail.rutgers.edu

16.10 – 16.30
On the language model's knowledge of linguistic features for the NLI problem

Maria Tikhonova (NRU HSE, Sber)

email: mtihonova@hse.ru