We present the results and the findings of the Second VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects. The campaign was organized as part of the fifth edition of the VarDial workshop, collocated with COLING’2018. This year, the campaign included five shared tasks, including two task re-runs – Arabic Dialect Identification (ADI) and German Dialect Identification (GDI) –, and three new tasks – Morphosyntactic Tagging of Tweets (MTT), Discriminating between Dutch and Flemish in Subtitles (DFS), and Indo-Aryan Language Identification (ILI). A total of 24 teams submitted runs across the five shared tasks, and contributed 22 system description papers, which were included in the VarDial workshop proceedings and are referred to in this report.
Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James Glass, Yves Scherrer, Tanja Samardžić, Nikola Ljubešić, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, and Mayank Jain. 2018.Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. InProceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), pages 1–17, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
@inproceedings{zampieri-etal-2018-language, title = "Language Identification and Morphosyntactic Tagging: The Second {V}ar{D}ial Evaluation Campaign", author = {Zampieri, Marcos and Malmasi, Shervin and Nakov, Preslav and Ali, Ahmed and Shon, Suwon and Glass, James and Scherrer, Yves and Samard{\v{z}}i{\'c}, Tanja and Ljube{\v{s}}i{\'c}, Nikola and Tiedemann, J{\"o}rg and van der Lee, Chris and Grondelaers, Stefan and Oostdijk, Nelleke and Speelman, Dirk and van den Bosch, Antal and Kumar, Ritesh and Lahiri, Bornini and Jain, Mayank}, editor = {Zampieri, Marcos and Nakov, Preslav and Ljube{\v{s}}i{\'c}, Nikola and Tiedemann, J{\"o}rg and Malmasi, Shervin and Ali, Ahmed}, booktitle = "Proceedings of the Fifth Workshop on {NLP} for Similar Languages, Varieties and Dialects ({V}ar{D}ial 2018)", month = aug, year = "2018", address = "Santa Fe, New Mexico, USA", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/W18-3901/", pages = "1--17", abstract = "We present the results and the findings of the Second VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects. The campaign was organized as part of the fifth edition of the VarDial workshop, collocated with COLING`2018. This year, the campaign included five shared tasks, including two task re-runs {--} Arabic Dialect Identification (ADI) and German Dialect Identification (GDI) {--}, and three new tasks {--} Morphosyntactic Tagging of Tweets (MTT), Discriminating between Dutch and Flemish in Subtitles (DFS), and Indo-Aryan Language Identification (ILI). A total of 24 teams submitted runs across the five shared tasks, and contributed 22 system description papers, which were included in the VarDial workshop proceedings and are referred to in this report."}
%0 Conference Proceedings%T Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign%A Zampieri, Marcos%A Malmasi, Shervin%A Nakov, Preslav%A Ali, Ahmed%A Shon, Suwon%A Glass, James%A Scherrer, Yves%A Samardžić, Tanja%A Ljubešić, Nikola%A Tiedemann, Jörg%A van der Lee, Chris%A Grondelaers, Stefan%A Oostdijk, Nelleke%A Speelman, Dirk%A van den Bosch, Antal%A Kumar, Ritesh%A Lahiri, Bornini%A Jain, Mayank%Y Zampieri, Marcos%Y Nakov, Preslav%Y Ljubešić, Nikola%Y Tiedemann, Jörg%Y Malmasi, Shervin%Y Ali, Ahmed%S Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018)%D 2018%8 August%I Association for Computational Linguistics%C Santa Fe, New Mexico, USA%F zampieri-etal-2018-language%X We present the results and the findings of the Second VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects. The campaign was organized as part of the fifth edition of the VarDial workshop, collocated with COLING‘2018. This year, the campaign included five shared tasks, including two task re-runs – Arabic Dialect Identification (ADI) and German Dialect Identification (GDI) –, and three new tasks – Morphosyntactic Tagging of Tweets (MTT), Discriminating between Dutch and Flemish in Subtitles (DFS), and Indo-Aryan Language Identification (ILI). A total of 24 teams submitted runs across the five shared tasks, and contributed 22 system description papers, which were included in the VarDial workshop proceedings and are referred to in this report.%U https://aclanthology.org/W18-3901/%P 1-17
[Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign](https://aclanthology.org/W18-3901/) (Zampieri et al., VarDial 2018)
Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James Glass, Yves Scherrer, Tanja Samardžić, Nikola Ljubešić, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, and Mayank Jain. 2018.Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. InProceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), pages 1–17, Santa Fe, New Mexico, USA. Association for Computational Linguistics.