DeepVar: An End-to-End Deep Learning Approach for Genomic Variant Recognition in Biomedical Literature

Authors

  • Chaoran Cheng New Jersey Institute of Technology
  • Fei Tan Yahoo Research
  • Zhi Wei New Jersey Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v34i01.5399

Abstract

We consider the problem of Named Entity Recognition (NER) on biomedical scientific literature, and more specifically the genomic variants recognition in this work. Significant success has been achieved for NER on canonical tasks in recent years where large data sets are generally available. However, it remains a challenging problem on many domain-specific areas, especially the domains where only small gold annotations can be obtained. In addition, genomic variant entities exhibit diverse linguistic heterogeneity, differing much from those that have been characterized in existing canonical NER tasks. The state-of-the-art machine learning approaches heavily rely on arduous feature engineering to characterize those unique patterns. In this work, we present the first successful end-to-end deep learning approach to bridge the gap between generic NER algorithms and low-resource applications through genomic variants recognition. Our proposed model can result in promising performance without any hand-crafted features or post-processing rules. Our extensive experiments and results may shed light on other similar low-resource NER applications.

Downloads

Published

2020-04-03

How to Cite

Cheng, C., Tan, F., & Wei, Z. (2020). DeepVar: An End-to-End Deep Learning Approach for Genomic Variant Recognition in Biomedical Literature. Proceedings of the AAAI Conference on Artificial Intelligence, 34(01), 598-605. https://doi.org/10.1609/aaai.v34i01.5399

Issue

Section

AAAI Technical Track: Applications