AAAI Publications, Thirty-First AAAI Conference on Artificial Intelligence

Font Size: 
Keyphrase Extraction with Sequential Pattern Mining
Qingren Wang, Victor S. Sheng, Xindong Wu

Last modified: 2017-02-12


Existing studies show that extracting a complete keyphrase candidate set is the first and crucial step to extract high quality keyphrases from documents. Based on a common sense that words do not repeatedly appear in an effective keyphrase, we propose a novel algorithm named KCSP for document-specific keyphrase candidate search using sequential pattern mining with gap constraints, which only needs to scan a document once and automatically specifies appropriate gap constraints for words without users’ participation. The experimental results confirm that it helps improve the quality of keyphrase extraction.


keyphrase extraction; sequential pattern mining; entropy; interval calculation

Full Text: PDF