Article

WATS-SMS: A T5-Based French Wikipedia Abstractive Text Summarizer for SMS

1
Department of Computer Engineering, University Institute of Technology, University of Ngaoundere, Ngaoundere P.O. Box 454, Cameroon
2
Department of Mathematics and Computer Science, Faculty of Science, University of Ngaoundere, Ngaoundere P.O. Box 454, Cameroon
3
Department of Mathematics, Rhodes University, Grahamstown 6140, South Africa
*
Authors to whom correspondence should be addressed.
Academic Editors: Efstathios Stamatatos and Paolo Bellavista
Future Internet 2021, 13(9), 238; https://doi.org/10.3390/fi13090238
Received: 4 August 2021 / Revised: 27 August 2021 / Accepted: 16 September 2021 / Published: 18 September 2021
(This article belongs to the Section Big Data and Augmented Intelligence)
Text summarization remains a challenging task in natural language processing despite its many applications in enterprises and daily life. A common use case is the summarization of web pages, which can give devices with limited features an overview of a page's content. Although the penetration rate of mobile devices in rural areas is increasing, most of those devices offer limited features, and these areas are covered only by limited connectivity such as the GSM network. Summarizing web pages into SMS is therefore an important way to deliver information to such devices. This work introduces WATS-SMS, a T5-based French Wikipedia Abstractive Text Summarizer for SMS, built through a transfer learning approach. The English pre-trained T5 model is used to generate a French text summarization model by retraining it on 25,000 Wikipedia pages; the resulting model is then compared with different approaches in the literature. The objective is twofold: (1) to check the assumption made in the literature that abstractive models provide better results than extractive ones; and (2) to evaluate the performance of our model against other existing abstractive models. A score based on ROUGE metrics reached 52% for articles of up to 500 characters, against 34.2% for transformer-ED and 12.7% for seq-2seq-attention, and 77% for longer articles, against 37% for transformers-DMCA. Moreover, an architecture including a software SMS gateway has been developed to allow owners of mobile devices with limited features to send requests and receive summaries through the GSM network.
Keywords: text summarization; fine-tuning; transformers; SMS; gateway; French Wikipedia
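
The abstract reports a score "based on ROUGE metrics" without saying which variant is used. As a rough illustration of what such a score measures, here is a minimal sketch of ROUGE-1 (unigram overlap) between a generated summary and a reference summary. The function name `rouge_1`, the whitespace tokenization, and the French example sentences are assumptions made for this illustration; this is not the authors' evaluation code.

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> dict:
    """Illustrative ROUGE-1: unigram precision, recall and F1.

    Assumption: tokens are lowercased and split on whitespace, which is
    simpler than the tokenization used by standard ROUGE tooling.
    """
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()
    if not cand_tokens or not ref_tokens:
        return {"precision": 0.0, "recall": 0.0, "f1": 0.0}

    # Clipped overlap: a token counts at most as often as it appears
    # in both the candidate and the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    precision = overlap / len(cand_tokens)
    recall = overlap / len(ref_tokens)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical generated summary versus hypothetical reference summary.
scores = rouge_1("le chat dort sur le tapis", "le chat dort sur le canapé")
print(scores)
```

In this example five of the six candidate tokens also appear in the reference, so precision, recall and F1 all equal 5/6. Percentages such as those quoted in the abstract would come from aggregating scores of this kind over a test set.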
MDPI and ACS Style

Fendji, J.L.E.K.; Taira, D.M.; Atemkeng, M.; Ali, A.M. WATS-SMS: A T5-Based French Wikipedia Abstractive Text Summarizer for SMS. Future Internet 2021, 13, 238. https://doi.org/10.3390/fi13090238

AMA Style

Fendji JLEK, Taira DM, Atemkeng M, Ali AM. WATS-SMS: A T5-Based French Wikipedia Abstractive Text Summarizer for SMS. Future Internet. 2021; 13(9):238. https://doi.org/10.3390/fi13090238

Chicago/Turabian Style

Fendji, Jean L.E.K., Désiré M. Taira, Marcellin Atemkeng, and Adam M. Ali. 2021. "WATS-SMS: A T5-Based French Wikipedia Abstractive Text Summarizer for SMS" Future Internet 13, no. 9: 238. https://doi.org/10.3390/fi13090238
