An experimental comparison of unsupervised keyphrase extraction techniques for extracting significant information from scientific research articles

Sarwar, Talha and Mohd Noor, Noorhuzaimi Karimah (2021) An experimental comparison of unsupervised keyphrase extraction techniques for extracting significant information from scientific research articles. In: 7th International Conference on Software Engineering and Computer Systems and 4th International Conference on Computational Science and Information Management, ICSECS-ICOCSIM 2021 , 24 - 26 August 2021 , Pekan, Online. 130 -135.. ISBN 9781665414074

[img]
Preview
Pdf
An experimental comparison of unsupervised keyphrase extraction techniques .pdf

Download (134kB) | Preview

Abstract

The automatic extraction of key information from an article that expresses all of the document’s main elements is referred to as keyphrase extraction. The number of scientific research articles each year is growing. Finding a research article on relevant topics or summarizing a particular research article using important information has become time-consuming by going through the entire article. Therefore, the textual information processing task involves the automatic keyphrase extraction from a document that expresses all of the document’s main elements. This article aims to make an experimental comparison of different unsupervised keyphrase extraction approaches, namely statistical-based, graph-based, and tree-based. The experiment is conducted upon 120 research articles from different subject areas of the computer science. The comparison between different techniques is made by calculating the precision, recall, and Fl-score. The overall performance of the experimental result shows that KP-Miner, a statistical-based technique, outperforms all the other graph-based and tree-based techniques. Among the other techniques, the tree-based technique TeKET performs better after KPMiner. The statistical-based and tree-based approach performs better than the graph-based approach.

Item Type: Conference or Workshop Item (Lecture)
Additional Information: Indexed by Scopus
Uncontrolled Keywords: Unsupervised keyphrase extraction; Automatic keyphrase extraction; Statistical-based technique; Tree-based technique; Graph-based technique; Comparison analysis
Subjects: Q Science > QA Mathematics > QA76 Computer software
T Technology > TK Electrical engineering. Electronics Nuclear engineering
Faculty/Division: Institute of Postgraduate Studies
Faculty of Computing
Depositing User: Mrs Norsaini Abdul Samat
Date Deposited: 29 Dec 2021 02:14
Last Modified: 29 Dec 2021 02:14
URI: http://umpir.ump.edu.my/id/eprint/32693
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item