Publications
You can also find my articles on Google Scholar or Semantic Scholar.
Conference Papers
- WebIE: Faithful and Robust Information Extraction on the Web. Chenxi Whitehouse, Clara Vania, Alham Fikri Aji, Christos Christodoulopoulos, Andrea Pierleoni. ACL 2023. GitHub repo
- IndoNLI: A Natural Language Inference Dataset for Indonesian. Rahmad Mahendra, Alham Fikri Aji, Samuel Louvan, Fahrurrozi Rahman, Clara Vania. EMNLP 2021. GitHub repo
- Comparing Test Sets with Item Response Theory. {Clara Vania, Phu Mon Htut, William Huang}, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman. ACL 2021.
- What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? {Nikita Nangia, Saku Sugawara}, Harsh Trivedi, Alex Warstadt, Clara Vania, Samuel R. Bowman. ACL 2021.
- CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models. {Nikita Nangia, Clara Vania, Rasika Bhalerao}, and Samuel R. Bowman. EMNLP 2020.
- Asking Crowdworkers to Write Entailment Examples: The Best of Bad Options. Clara Vania, Ruijie Chen, and Samuel R. Bowman. AACL 2020.
- English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too. {Jason Phang, Iacer Calixto}, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, and Samuel R. Bowman. AACL 2020.
- Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work? {Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut}, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, and Samuel R. Bowman. ACL 2020.
- A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages. Clara Vania, Yova Kementchedjhieva, Anders Søgaard, and Adam Lopez. EMNLP 2019.
- What do character-level models learn about morphology? The case of dependency parsing. Clara Vania, Andreas Grivas, and Adam Lopez. EMNLP 2018.
- From characters to words to in between: Do we capture morphology? Clara Vania and Adam Lopez. ACL 2017. [code]
Workshop Papers
- Improving distantly supervised document-level relation extraction through natural language inference. Clara Vania, Grace E. Lee, and Andrea Pierleoni. The 3rd Workshop on Deep Learning for Low-Resource NLP (co-located with NAACL 2022).
- VisualSem: a high-quality knowledge graph for vision and language. Houda Alberts, Ningyuan Huang, Yash Deshpande, Yibo Liu, Kyunghyun Cho, Clara Vania, Iacer Calixto. The 1st Workshop on Multilingual Representation Learning (co-located with EMNLP 2021).
- UParse: the Edinburgh system for the CoNLL 2017 UD shared task. Clara Vania, Xingxing Zhang, and Adam Lopez. CoNLL 2017 UD Shared Task.
Journals
- Multilingual Probing Tasks for Word Representations. Gözde Gül Şahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych. Computational Linguistics 2020. [demo]
Thesis
- On Understanding Character-level Models for Representing Morphology. PhD Thesis. University of Edinburgh, UK. 2019.