From web to SMS: A text summarization of Wikipedia pages with character limitation

Fendji, J.L.E.K and Aminatou, B.A.H. (2020) From web to SMS: A text summarization of Wikipedia pages with character limitation. EAI Endorsed Transactions on Creative Technologies, 7 (24): e5. ISSN 2409-9708

[img]
Preview
Text
eai.11-6-2020.165277.pdf - Published Version

Download (2MB) | Preview

Abstract

Wikipedia is one of the main sources of information on the Web. But the access to this content may be difficult especially when using a basic telephone without browsing capability and only a GSM network. The only means of text-based communication remains through SMS. Due to the limitation of the number of characters, a Wikipedia page cannot always be sent through SMS. This work raises the issue of text summarization with character limitation. To solve this issue, two extractive approaches have been combined: LSA and TextRank algorithms. Generated summaries have been evaluated using ROUGE metrics. Since ROUGE metrics do not consider character limitation, a new threshold named Threshold of Acceptability for Character-Oriented Summaries (TACOS) has been proposed to appreciate ROUGE metrics. The evaluation showed the relevance of the approach for pages of at most 2000 characters. The system has been tested using the SMS simulator of RapidSMS without a GSM gateway to simulate the deployment in a real environment. To the best of our knowledge, this is the first work tackling text summarization issue with character limitation.

Item Type: Article
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
QA75 Electronic computers. Computer science
Depositing User: EAI Editor II.
Date Deposited: 07 Sep 2020 09:56
Last Modified: 07 Sep 2020 09:56
URI: https://eprints.eudl.eu/id/eprint/6

Actions (login required)

View Item View Item