Indonesian Document Text Summarization Based on Extractive Using Sentences Scoring and Fuzzy Logic
- DOI
- 10.2991/978-94-6463-480-8_11How to use a DOI?
- Keywords
- Text Summarization; Sentence Scoring; Fuzzy Logic; Indonesian News Document
- Abstract
The large number of text documents available on the internet has resulted in a demand for quick access to get the essence for making decisions based on the available information. One method to overcome this problem is to use text summarization. There are 2 ways to summarize, namely abstractive and extractive. In this study, the extractive method was used to find sentences that were considered important. By using sentence scoring as a sentence weighting feature, the sentences are given weight by paying attention to the features frequency, Uppercase, Proper Noun, Sentence to Sentence Similarity, Numerical Data, Sentence Length, Sentence Position, and Similarity to the Title. Then the fuzzy logic algorithm is used to select words based on the value of the sentence score which creates a total of 6561 rules. Evaluation in this research uses a confusion matrix, using precision, recall and f-measure measurements. In Indonesian CNN data, the evaluation results show an average precision value of 0.581, an average recall of 1.394 and an average f-measure of 0.796. Meanwhile, Indonesian media documents produce an average precision value of 0.481, recall 1.267 and f-measure 0.709.
- Copyright
- © 2024 The Author(s)
- Open Access
- Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
Cite this article
TY - CONF AU - Basirudin Ansor AU - Achmad Solichan AU - Muhammad Zainudin Al Amin AU - Aditya Putra Ramdani AU - Mulil Khaira AU - Nova Christina Sari PY - 2024 DA - 2024/07/29 TI - Indonesian Document Text Summarization Based on Extractive Using Sentences Scoring and Fuzzy Logic BT - Proceedings of the 2nd Lawang Sewu Internasional Symposium on Engineering and Applied Sciences (LEWIS-EAS 2023) PB - Atlantis Press SP - 131 EP - 146 SN - 2352-5401 UR - https://doi.org/10.2991/978-94-6463-480-8_11 DO - 10.2991/978-94-6463-480-8_11 ID - Ansor2024 ER -