Publications

You can also find my articles on Google Scholar or Semantic Scholar.

Conference Papers

WebIE: Faithful and Robust Information Extraction on the Web. Chenxi Whitehouse, Clara Vania, Alham Fikri Aji, Christos Christodoulopoulos, Andrea Pierleoni. ACL 2023. GitHub repo
IndoNLI: A Natural Language Inference Dataset for Indonesian. Rahmad Mahendra, Alham Fikri Aji, Samuel Louvan, Fahrurrozi Rahman, Clara Vania. EMNLP 2021. GitHub repo
Comparing Test Sets with Item Response Theory. {Clara Vania, Phu Mon Htut, William Huang}, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman. ACL 2021.
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? {Nikita Nangia, Saku Sugawara}, Harsh Trivedi, Alex Warstadt, Clara Vania, Samuel R. Bowman. ACL 2021.
CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. {Nikita Nangia, Clara Vania, Rasika Bhalerao}, and Samuel R. Bowman. EMNLP 2020.
Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options. Clara Vania, Ruijie Chen, and Samuel R. Bowman. AACL 2020.
English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too. {Jason Phang, Iacer Calixto}, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, and Samuel R. Bowman. AACL 2020.
Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work? {Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut}, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, and Samuel R. Bowman. ACL 2020.
A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages. Clara Vania, Yova Kementchedjhieva, Anders Søgaard, and Adam Lopez. EMNLP 2019.
What do character-level models learn about morphology? The case of dependency parsing. Clara Vania, Andreas Grivas, and Adam Lopez. EMNLP 2018.
From characters to words to in between: Do we capture morphology? Clara Vania and Adam Lopez. ACL 2017. [code]

Workshop Papers

Improving distantly supervised document-level relation extraction through natural language inference. Clara Vania, Grace E. Lee, and Andrea Pierleoni. The 3rd Workshop on Deep Learning for Low-Resource NLP (co-located with NAACL 2022).
VisualSem: a high-quality knowledge graph for vision and language. Houda Alberts, Ningyuan Huang, Yash Deshpande, Yibo Liu, Kyunghyun Cho, Clara Vania, Iacer Calixto. The 1st Workshop on Multilingual Representation Learning (co-located with EMNLP 2021).
UParse: the Edinburgh system for the CoNLL 2017 UD shared task. Clara Vania, Xingxing Zhang, and Adam Lopez. CoNLL 2017 UD Shared Task.

Journals

Multilingual Probing Tasks for Word Representations. Gözde Gül Şahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych. Computational Linguistics 2020. [demo]

Thesis

On Understanding Character-level Models for Representing Morphology. PhD Thesis. University of Edinburgh, UK. 2019.

Clara Vania
