skip to main content
10.1145/3155133.3155200acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

Towards Building Vietnamese Discourse Treebank

Published:07 December 2017Publication History

ABSTRACT

Discourse analysis is an important natural language processing task. There are many discourse parsers in many languages, such as English and Chinese, constructing discourse trees from text documents for further semantic analysis. However, there is no official release of Vietnamese discourse treebank for research in Vietnamese discourse parser. Therefore, this paper presents our preliminary result in building Vietnamese discourse treebank. some problems when building discourse treebank and proposes a discourse annotation framework for it. In order to show the feasibility of developing discourse parsers for Vietnamese documents, two experiments in discourse relation classification and in discourse nucleus classification are conducted using the discourse annotated documents.

References

  1. V. W. Feng, G. A. Hirst. 2014. Linear-Time Bottom-Up Discourse Parser with Constraints and Post-Editing. In ACL 1 (2014), 511--521.Google ScholarGoogle Scholar
  2. W. Feng, G. Hirst. 2012. Text-level Discourse Parsing with Rich Linguistic Features. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, ACL '12, Jeju Island, Korea (2012), 60--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. H. Hernault, H. Prendinger and M. Ishizuka. 2010. HILDA: A discourse parser using support vector machine classification. Dialogue & Discourse 1, 3 (2010). 1--33Google ScholarGoogle Scholar
  4. Z. Lin, H.T. Ng, and M.Y. Kan. 2014. A PDTB-styled end-to-end discourse parser. Natural Language Engineering 20, 2 (2014), 151--184.Google ScholarGoogle ScholarCross RefCross Ref
  5. Ghosh, S., Johansson, R. and Tonelli, S., 2011. Shallow discourse parsing with conditional random fields. In Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP 2011), Chiang Mai, Thailand, 1071--1079.Google ScholarGoogle Scholar
  6. W. C. Mann, S. A. Thompson. 1988. Rhetorical structure theory: towards a functional theory of text organization. Text 3, 8 (1988), 243--281Google ScholarGoogle Scholar
  7. S. Verberne. 2009. In Search of Why: Developing a system for answering why-questions. Ph.D. Dissertation. Radboud University, Nijmegen, Germany.Google ScholarGoogle Scholar
  8. M. Taboada, W. C. Mann. 2006. Applications of rhetorical structure theory. Discourse studies 8, 4 (2006), 567--588.Google ScholarGoogle Scholar
  9. L. Carlson, D. Marcu, M. E. Okurowski. 2003. Building a discourse-tagged corpus in the framework of rhetorical structure theory. Current and new directions in discourse and dialogue. Springer. Nethrelands, 85--112.Google ScholarGoogle Scholar
  10. E. Miltsakaki, L. Robaldo, A. Lee and A. Joshi. 2008. Sense annotation in the penn discourse treebank. In International Conference on Intelligent Text Processing and Computational Linguistics. Springer, Berlin, Heidelberg, 275--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Y. Zhou, N. Xue. 2012. PDTB-style discourse annotation of Chinese text. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. ACL 1 (2012), 69--77. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Zhou, N. Xue. 2015 The Chinese discourse treebank: a Chinese corpus annotated with discourse relations. Language Resources and Evaluation 49, 2 (2015), 397--431. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. Zeyrek, I. Demirsahin, A. Sevdik-Callı, R. Çakici. 2013. Turkish Discourse Bank: Porting a discourse annotation style to a morphologically rich language. Dialog and Discourse 4, 2 (2013), 174--184.Google ScholarGoogle ScholarCross RefCross Ref
  14. A. Al-Saif, K. Markert 2010. The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic. In LREC (2010).Google ScholarGoogle Scholar
  15. D. Marcu 1998. The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. Ph.D. Dissertation, University of Toronto, Canada. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Towards Building Vietnamese Discourse Treebank

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          SoICT '17: Proceedings of the 8th International Symposium on Information and Communication Technology
          December 2017
          486 pages
          ISBN:9781450353281
          DOI:10.1145/3155133

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 December 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate147of318submissions,46%
        • Article Metrics

          • Downloads (Last 12 months)1
          • Downloads (Last 6 weeks)0

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader