I am a second year PhD student at the Institute for Language, Cognition and Computation (ILCC), University of Edinburgh. I am supervised by Adam Lopez and Sharon Goldwater.
My research interests include natural language understanding, multilingual NLP application, representation learning, morphology, and low-resource NLP. My PhD focuses on learning morphological representation for multilingual NLP application.
I am also affiliated with the CDT in Data Science programme (2014 cohort).
Clara Vania, Xingxing Zhang, and Adam Lopez. UParse: the Edinburgh system for the CoNLL 2017 UD shared task. CoNLL 2017 UD Shared Task
Clara Vania and Adam Lopez. From characters to words to in between: Do we capture morphology? ACL 2017 [arXiv]
Clara Vania, Moh. Ibrahim, and Mirna Adriani. Sentiment Lexicon Generation for an Under-Resourced Language. CICLING 2014 (IJCLA) [pdf]
Samuel Louvan, Moh. Ibrahim, Mirna Adriani, Clara Vania, B. Distiawan, and MZ. Wanagiri. University of Indonesia at TREC 2011 Microblog Task. TREC 2011 [pdf]
Clara Vania and Mirna Adriani. The effect of syllable and word stress on the quality of Indonesian HMM-based speech synthesis system. ICACSIS 2011 [pdf]
Clara Vania and Mirna Adriani. Automatic external plagiarism detection using passage similarities. PAN Plagiarism Detection @ CLEF 2010 [pdf]
- 2015 - present : PhD student at ILCC/CDT in Data Science, University of Edinburgh
- 2014 - 2015 : MSc by Research in Data Science, University of Edinburgh
- 2009 - 2011 : MSc in Computer Science, Universitas Indonesia
- 2005 - 2009 : BSc in Computer Science, Universitas Indonesia