casa 16(9): e3

Research Article

An approach for summarization of two-sentences Vietnamese paragraph

  • @ARTICLE{10.4108/eai.2-5-2016.151210,
        author={Trung Tran and Dang Tuan Nguyen},
        title={An approach for summarization of two-sentences Vietnamese paragraph},
        journal={EAI Endorsed Transactions on Context-aware Systems and Applications},
        volume={3},
        number={9},
        publisher={EAI},
        journal_a={CASA},
        year={2016},
        month={5},
        keywords={inter-sentential anaphoric pronoun, referent resolution, discourse representation, meaning summarization, sentence generation.},
        doi={10.4108/eai.2-5-2016.151210}
    }
    
  • Trung Tran
    Dang Tuan Nguyen
    Year: 2016
    An approach for summarization of two-sentences Vietnamese paragraph
    CASA
    EAI
    DOI: 10.4108/eai.2-5-2016.151210
Trung Tran¹,*, Dang Tuan Nguyen¹
  • 1: Faculty of Computer Science, University of Information Technology, VNU-HCM, Ho Chi Minh City, Vietnam
*Contact email: ttrung@nlke-group.net

Abstract

This paper introduces a general approach for summarizing the meaning of simple Vietnamese paragraphs that consist of two sentences. The paragraphs studied share two characteristics: the first sentence contains one or two nouns denoting human objects, and the second sentence contains one or two anaphoric pronouns. We consider only two types of Vietnamese human pronouns in this research: pronouns that stand alone in the sentence, and pronouns accompanied by a demonstrative adjective. In the first phase, depending on the context of the pronouns in the second sentence, we propose appropriate strategies for identifying exactly which human object in the first sentence each pronoun refers to. A discourse structure is also built to represent the meaning of each paragraph. In the second phase, each discourse representation is transformed into the syntactic structure of a meaning-summarizing sentence. The final phase completes the new summary sentence.
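
The three phases described in the abstract can be illustrated with a minimal Python sketch. It is a toy under stated assumptions, not the authors' implementation: the abstract does not specify the resolution strategies, so the positional rule used here (the i-th pronoun refers to the i-th human noun) is only a placeholder. The names `PRONOUNS`, `Clause`, `resolve`, `build_discourse`, and `summarize` are hypothetical, and every argument of the first sentence is assumed to be a human noun.

```python
from dataclasses import dataclass

# Hypothetical mini-lexicon of human pronouns: standing alone ("nó"),
# or accompanied by the demonstrative "ấy" ("anh ấy", "cô ấy").
PRONOUNS = {"nó", "anh ấy", "cô ấy"}


@dataclass
class Clause:
    predicate: str
    args: list  # [subject] or [subject, object]


def resolve(first, second):
    """Phase 1 (assumed rule, not the paper's): the i-th pronoun in
    sentence 2 refers to the i-th human noun in sentence 1; a surplus
    pronoun falls back to the last noun."""
    resolved, k = [], 0
    for arg in second.args:
        if arg in PRONOUNS:
            resolved.append(first.args[min(k, len(first.args) - 1)])
            k += 1
        else:
            resolved.append(arg)
    return Clause(second.predicate, resolved)


def build_discourse(first, second_resolved):
    """Phase 1 output: discourse referents plus one condition per clause."""
    referents = list(dict.fromkeys(first.args + second_resolved.args))
    conditions = [(c.predicate, tuple(c.args)) for c in (first, second_resolved)]
    return {"referents": referents, "conditions": conditions}


def summarize(discourse):
    """Phases 2 and 3: choose a syntactic frame, then realize the sentence."""
    (p1, a1), (p2, a2) = discourse["conditions"]
    if a1 == a2:
        # Same participants in the same roles: coordinate the two verbs.
        words = [a1[0], p1, "và", p2, *a1[1:]]
    else:
        # Otherwise coordinate the two full clauses.
        words = [a1[0], p1, *a1[1:], "và", a2[0], p2, *a2[1:]]
    return " ".join(words) + "."


# "Nam gặp Lan. Anh ấy chào cô ấy." (Nam met Lan. He greeted her.)
first = Clause("gặp", ["Nam", "Lan"])
second = Clause("chào", ["anh ấy", "cô ấy"])
discourse = build_discourse(first, resolve(first, second))
print(summarize(discourse))
```

In this sketch the intermediate dictionary plays the role of the discourse structure: generation reads only from it, never from the original sentences, which mirrors the separation of phases described in the abstract.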