publications

publications and talks

Publications

Preprints

  1. Language Modelling with Pixels
    Phillip Rust, Jonas F. Lotz, Emanuele Bugliarello, Elizabeth Salesky, Miryam de Lhoneux, and Desmond Elliott
    arXiv preprint, 2022


              Conference papers and journal articles

              2022

              1. ACL
                Finding Structural Knowledge in Multimodal-BERT
                Victor Milewski, Miryam de Lhoneux, and Marie-Francine Moens
                In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
              2. ACL
                Zero-Shot Dependency Parsing with Worst-Case Aware Automated Curriculum Learning
                Miryam de Lhoneux, Sheng Zhang, and Anders Søgaard
                In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022
              3. ACL
                Challenges and Strategies in Cross-Cultural NLP
                Daniel Hershcovich, Stella Frank, Heather Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello, Laura Cabello Piqueras, Ilias Chalkidis, Ruixiang Cui, Constanza Fierro, Katerina Margatina, Phillip Rust, and Anders Søgaard
                In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022
              4. LREC
                What a Creole Wants, What a Creole Needs
                Heather Lent, Kelechi Ogueji, Miryam de Lhoneux, Orevaoghene Ahia, and Anders Søgaard
                In LREC, 2022
              5. LT4HALA
                Syntactic parsing of a Neo-Latin mathematical text: a pilot study
                Margherita Fantoli, and Miryam de Lhoneux
                In LT4HALA, 2022

              2021

              1. TLT
                Parsing with Pretrained Language Models, Multiple Datasets, and Dataset Embeddings
                Rob van der Goot, and Miryam de Lhoneux
                In Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021), 2021
              2. CoNLL
                On Language Models for Creoles
                Heather Lent, Emanuele Bugliarello, Miryam de Lhoneux, Chen Qiu, and Anders Søgaard
                In Proceedings of the 25th Conference on Computational Natural Language Learning, 2021
              3. CoNLL
                A Multilingual Benchmark for Probing Negation-Awareness with Minimal Pairs
                Mareike Hartmann, Miryam de Lhoneux, Daniel Hershcovich, Yova Kementchedjhieva, Lukas Nielsen, Chen Qiu, and Anders Søgaard
                In Proceedings of the 25th Conference on Computational Natural Language Learning, 2021
              4. WAT
                Itihasa: A large-scale corpus for Sanskrit to English translation
                Rahul Aralikatte, Miryam de Lhoneux, Anoop Kunchukuttan, and Anders Søgaard
                In Proceedings of the 8th Workshop on Asian Translation (WAT2021), 2021
              5. WAT
                How far can we get with one GPU in 100 hours? CoAStaL at MultiIndicMT Shared Task
                Rahul Aralikatte, Héctor Ricardo Murrieta Bello, Miryam de Lhoneux, Daniel Hershcovich, Marcel Bollmann, and Anders Søgaard
                In Proceedings of the 8th Workshop on Asian Translation (WAT2021), 2021
              6. AmericasNLP ST
                Moses and the Character-Based Random Babbling Baseline: CoAStaL at AmericasNLP 2021 Shared Task
                Marcel Bollmann, Rahul Aralikatte, Héctor Murrieta Bello, Daniel Hershcovich, Miryam de Lhoneux, and Anders Søgaard
                In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, 2021

              2020

              1. IWPT ST
                Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
                Daniel Hershcovich, Miryam de Lhoneux, Artur Kulmizev, Elham Pejhan, and Joakim Nivre
                In Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020
              2. COLING
                Comparison by Conversion: Reverse-Engineering UCCA from Syntax and Lexical Semantics
                Daniel Hershcovich, Nathan Schneider, Dotan Dvir, Jakob Prange, Miryam de Lhoneux, and Omri Abend
                In Proceedings of the 28th International Conference on Computational Linguistics, 2020
              3. CL
                What Should/Do/Can LSTMs Learn When Parsing Auxiliary Verb Constructions?
                Miryam de Lhoneux, Sara Stymne, and Joakim Nivre
                Computational Linguistics, 2020

              2019

              1. NAACL
                Recursive Subtree Composition in LSTM-Based Dependency Parsing
                Miryam de Lhoneux, Miguel Ballesteros, and Joakim Nivre
                In NAACL, 2019
              2. EMNLP
                Deep Contextualized Word Embeddings in Transition-Based and Graph-Based Dependency Parsing - A Tale of Two Parsers Revisited
                Artur Kulmizev, Miryam de Lhoneux, Johannes Gontrum, Elena Fano, and Joakim Nivre
                In EMNLP-IJCNLP, 2019

              2018

              1. ACL
                Parser Training with Heterogeneous Treebanks
                Sara Stymne, Miryam de Lhoneux, Aaron Smith, and Joakim Nivre
                In ACL, 2018
              2. EMNLP
                Parameter sharing between dependency parsers for related languages
                Miryam de Lhoneux, Johannes Bjerva, Isabelle Augenstein, and Anders Søgaard
                In EMNLP , 2018
              3. EMNLP
                An Investigation of the Interactions Between Pre-Trained Word Embeddings, Character Models and POS Tags in Dependency Parsing
                Aaron Smith, Miryam de Lhoneux, Sara Stymne, and Joakim Nivre
                In EMNLP, 2018
              4. CoNLL ST
                82 Treebanks, 34 Models: Universal Dependency Parsing with Multi-Treebank Models
                Aaron Smith, Bernd Bohnet, Miryam de Lhoneux, Joakim Nivre, Yan Shao, and Sara Stymne
                In Proc. of the CoNLL 2018 Shared Task, 2018
              5. Blackbox NLP
                Nightmare at test time: How punctuation prevents parsers from generalizing
                Anders Søgaard, Miryam de Lhoneux, and Isabelle Augenstein
                In Proc. of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2018

              2017

              1. TLT
                Old School vs. New School: Comparing Transition-Based Parsers with and without Neural Network Enhancement.
                Miryam de Lhoneux, Sara Stymne, and Joakim Nivre
                In TLT , 2017
              2. CoNLL ST
                From Raw Text to Universal Dependencies - Look, No Tags!
                Miryam de Lhoneux, Yan Shao, Ali Basirat, Eliyahu Kiperwasser, Sara Stymne, Yoav Goldberg, and Joakim Nivre
                In Proc. of the CoNLL 2017 Shared Task, 2017
              3. IWPT
                Arc-Hybrid Non-Projective Dependency Parsing with a Static-Dynamic Oracle
                Miryam de Lhoneux, Sara Stymne, and Joakim Nivre
                In IWPT, 2017

              2016

              1. Should Have, Would Have, Could Have. Investigating Verb Group Representations for Parsing with Universal Dependencies.
                Miryam de Lhoneux, and Joakim Nivre
                In Proc. of the Workshop on Multilingual and Cross-lingual Methods in NLP, 2016


              Book chapters

                    1. Investigating the effect of automatic MWE recognition on CCG parsing
                      Miryam de Lhoneux, Omri Abend, and Mark Steedman
                      2019

                          Theses

                          1. Linguistically Informed Neural Dependency Parsing for Typologically Diverse Languages
                            Miryam de Lhoneux
                            2019
                          1. CCG Parsing and Multiword Expressions
                            Miryam de Lhoneux
                            2014
                          1. Towards a Systematic Contrastive Constructional Approach to the Resultative Construction: An Exploratory Study on English and French
                            Miryam de Lhoneux
                            2013

                          Non-archival stuff

                          1. From manuscript to syntactic tree: the long journey of a mathematical text
                            Margherita Fantoli, Miryam de Lhoneux, and Beatrice Sisana
                            In DHBenelux, 2022
                          2. Not all zero-shot settings are created equal in multilingual NLP
                            Miryam de Lhoneux
                            In CLIN, 2022
                          1. Probing structures in the visual region embeddings from multimodal BERT
                            Victor Milewski, Miryam de Lhoneux, and Marie-Francine Moens
                            In BlackboxNLP, 2021
                            1. Polyglot Parsing for One Thousand and One Languages (And Then Some)
                              Ali Basirat, Miryam de Lhoneux, Artur Kulmizev, Murathan Kurfal, Joakim Nivre, and Robert Ă–stling
                              In First workshop on Typology for Polyglot NLP, 2019
                            1. Universal Dependency Parsing at Uppsala University
                              Joakim Nivre, Miryam de Lhoneux, Aaron Smith, and Sara Stymne
                              In SLTC, 2018
                            2. Parameter Sharing in Multilingual Dependency Parsing
                              Miryam de Lhoneux
                              In SLTC, 2018
                              1. UD treebank sampling for comparative parser evaluation
                                Miryam de Lhoneux, and Joakim Nivre
                                In SLTC, 2016


                              Invited talks

                              • Low-resource NLP with no reliance on language relatedness. Seminar series at the University of Groningen, 25 March 2022.
                              • Low-resource NLP: Lessons from Dependency Parsing. Magnet seminar series at Inria Lille, 16 December 2021.
                              • Low-resource NLP: Lessons from Dependency Parsing. CLingDing seminar series at Indiana University, 3 November 2021.
                              • Low-resource NLP: Lessons from Dependency Parsing. Sigtyp 2021, 10 June 2021. [video|slides]
                              • Parsing Typologically Diverse Languages. Aix Marseille University, 26 Nov 2020.
                              • Parsing Typologically Diverse Languages. Workshop on Treebanks ang Linguistic Theories (TLT), 27 October 2020. [video|slides]
                              • Do we need recursive subtree composition in dependency parsing? Invited talk at the Workshop on Data-driven Approaches to Parsing and Semantic Composition, TĂĽbingen 10 December 2019. [slides]