School of Informatics, IF 3.32
University of Edinburgh
Email: f.lastname(at)
GitHub: claravania

I am a second-year PhD student at the Institute for Language, Cognition and Computation (ILCC), University of Edinburgh. I am supervised by Adam Lopez and Sharon Goldwater.

My research interests include natural language understanding, multilingual NLP applications, representation learning, morphology, and low-resource NLP. My PhD focuses on learning morphological representations for multilingual NLP applications.

I am also affiliated with the CDT in Data Science programme (2014 cohort).


  • Clara Vania, Xingxing Zhang, and Adam Lopez. UParse: the Edinburgh system for the CoNLL 2017 UD shared task. CoNLL 2017 UD Shared Task [pdf] [bib]

  • Clara Vania and Adam Lopez. From characters to words to in between: Do we capture morphology? ACL 2017 [arXiv]

  • Alfan Farizki Wicaksono, Clara Vania, Bayu Distiawan, and Mirna Adriani. Automatically Building a Corpus for Sentiment Analysis on Indonesian Tweets. PACLIC 2014 [pdf] [bib]

  • Clara Vania, Moh. Ibrahim, and Mirna Adriani. Sentiment Lexicon Generation for an Under-Resourced Language. CICLING 2014 (IJCLA) [pdf]

  • Samuel Louvan, Moh. Ibrahim, Mirna Adriani, Clara Vania, B. Distiawan, and MZ. Wanagiri. University of Indonesia at TREC 2011 Microblog Task. TREC 2011 [pdf]

  • Clara Vania and Mirna Adriani. The effect of syllable and word stress on the quality of Indonesian HMM-based speech synthesis system. ICACSIS 2011 [pdf]

  • Clara Vania and Mirna Adriani. Automatic external plagiarism detection using passage similarities. PAN Plagiarism Detection @ CLEF 2010 [pdf]


  • 2015 - present : PhD student at ILCC/CDT in Data Science, University of Edinburgh
  • 2014 - 2015 : MSc by Research in Data Science, University of Edinburgh
  • 2009 - 2011 : MSc in Computer Science, Universitas Indonesia
  • 2005 - 2009 : BSc in Computer Science, Universitas Indonesia