LMRank: Utilizing Pre-Trained Language Models and Dependency Parsing for Keyphrase Extraction

Giarelis, Nikolaos; Karacapilidis, N.

Javascript is disabled or not supported in your browser. JavaScript must be enabled in order for you to use WIKINDX fully. Enable JavaScript through your browser options then try again, otherwise, try using a different browser.

WIKINDX

WIKINDX Resources

Giarelis, N., & Karacapilidis, N. LMRank: Utilizing Pre-Trained Language Models and Dependency Parsing for Keyphrase Extraction.

Resource type: Journal Article
BibTeX citation key: anon.65
View all bibliographic details

Categories: General
Creators: Giarelis, Karacapilidis

Attachments

URLs https://www.semant ... 564ec076f33e2a10e6

Abstract

A novel approach that utilizes dependency parsing and the sentence embeddings of pre-trained language models to improve the accuracy of the keyphrase extraction task is presented, which showcases that it scales far better than similar ones in terms of execution time. Keyphrase extraction is a Natural Language Processing task pertaining to the automatic extraction of salient terms that semantically encapsulate the major theme and topics of a document. In this article, we present LMRank, a novel approach that utilizes dependency parsing and the sentence embeddings of pre-trained language models to improve the accuracy of the keyphrase extraction task. In addition, we conduct a benchmark analysis of our approach, which showcases that it scales far better than similar ones in terms of execution time. The contribution of this work is threefold: (i) we propose a novel approach that significantly outperforms the state-of-the-art keyphrase extraction approaches in terms of time performance and accuracy in selected datasets; (ii) we provide a comparative evaluation of our approach against previous ones, by utilizing broadly used datasets in the literature and established evaluation metrics (e.g., the F1 and pF1 scores); (iii) we make the datasets and code used in our experiments public, aiming to further increase the reproducibility of this work and facilitate future research in the field.

WIKINDX 6.11.0 | Total resources: 209 | Username: -- | Bibliography: WIKINDX Master Bibliography | Style: American Psychological Association (APA)