Text Technology/Digital Linguistics colloquium FS 2023

Colloquium FS 2023: Reports from current research at the institute, bachelor and master theses, programming projects, guest lectures

Time & Location: every 2-3 weeks on Tuesdays from 10:15 am to 12:00 pm, BIN-2.A.10 (Karte)

Online participation via the MS Teams Team CL Colloquium is also possible.

Colloquium Schedule

Date

Speaker

Topic
Tuesday, 07.03.2023 Jannis Vamvas Challenges in Language-adaptive Pre-training
   
Tuesday, 21.03.2023

Amit Moryossef

Real-time Multilingual Sign Language Processing

Zifan Jiang
Machine Translation between Spoken Languages and Signed Languages Represented in SignWriting

Tuesday, 04.04.2023

Omar Sanseviero

 

 
Tuesday, 02.05.2023

Chiara Tschirner

 
Chantal Amrhein  
Tuesday, 16.05.2023

Janis Goldzycher

 
Alessia Battisti  
Tuesday, 30.05.2023

Lena Bolliger

 

Noëmi Aepli  

Abstracts

 

Jannis Vamvas: Challenges in Language-adaptive Pre-training.

The multilingual language situation of Switzerland calls for multilingual NLP tools. We present an ongoing project that involves training a masked language model in all the national languages of Switzerland. A main challenge of the project is learning multilingual representations with a highly imbalanced training corpus, with many more texts in German or French than in Italian or Romansh. Another challenge is building on existing pre-trained models in a modular way. We adopt a recently proposed approach involving language adapters and show that the resulting model performs well in tasks such as Named Entity Recognition or German–Romansh alignment. We also highlight some limitations of our approach.

Amit Moryossef: Real-time Multilingual Sign Language Processing

Zifan Jiang: Machine Translation between Spoken Languages and Signed Languages Represented in SignWriting

Omar Sanseviero: TBA

Chiara Tschirner: TBA

Chantal Amrhein: TBA

Janis Goldzycher: TBA

Alessia Battisti: TBA

Lena Bolliger: TBA

Noëmi Aepli: TBA