Các bài báo công bố quốc tế

Duy K. Van, Huy M. Huynh, Hien T. Nguyen, Vinh T. Vo; Entity Linking for Vietnamese Tweets; The Sixth International Conference on Knowledge and Systems Engineering (2014).

Abstract

We study the task of entity linking for Vietnamese tweets, which aims at detecting entity mentions and linking them to corresponding entries in a given knowledge base. Unlike authored news or textual web content, tweets are noisy, irregular, and short, which causes entity linking in tweets much more challenging.We propose an approach to build an end-to-end entity linking system for Vietnamese tweets. The system consists of two stages. The first stage is to detect mentions and the second one performs entity disambiguation. We create a dataset including 524 Vietnamese tweets with 1,061 mentions and evaluate the system on this dataset. Our system achieves 69.2% F1-score. In order to show that our system is language-independent,we evaluate the system on a public dataset including 562 English tweets. The experiment results show that our system achieves 54.5% F1-score and outperforms the state-of-the-art end-to-end entity linking methods for tweets. To the best of our knowledge, this is the first attempt to build an end-to-end entity linking system for Vietnamese tweets and the system achieves very encouraging performance.

Keywords

  • Entity Linking
  • Wikification
  • Entity Disambiguation