BioCreAtIvE Task 1A: gene mention finding evaluation-Reference-Cited by-同舟云学术

BioCreAtIvE Task 1A: gene mention finding evaluation

Published:2005-05 Issue:S1 Volume:6 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Yeh Alexander,Morgan Alexander,Colosimo Marc,Hirschman Lynette

Abstract

Abstract Background The biological research literature is a major repository of knowledge. As the amount of literature increases, it will get harder to find the information of interest on a particular topic. There has been an increasing amount of work on text mining this literature, but comparing this work is hard because of a lack of standards for making comparisons. To address this, we worked with colleagues at the Protein Design Group, CNB-CSIC, Madrid to develop BioCreAtIvE (Critical Assessment for Information Extraction in Biology), an open common evaluation of systems on a number of biological text mining tasks. We report here on task 1A, which deals with finding mentions of genes and related entities in text. "Finding mentions" is a basic task, which can be used as a building block for other text mining tasks. The task makes use of data and evaluation software provided by the (US) National Center for Biotechnology Information (NCBI). Results 15 teams took part in task 1A. A number of teams achieved scores over 80% F-measure (balanced precision and recall). The teams that tried to use their task 1A systems to help on other BioCreAtIvE tasks reported mixed results. Conclusion The 80% plus F-measure results are good, but still somewhat lag the best scores achieved in some other domains such as newswire, due in part to the complexity and length of gene names, compared to person or organization names in newswire.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-6-S1-S2.pdf

Reference26 articles.

1. Hirschman L, Park JC, Tsujii J, Wong L, Wu CH: Accomplishments and challenges in literature data mining for biology. Bioinformatics 2002, 18: 1553–1561. 10.1093/bioinformatics/18.12.1553

2. Critical Assessment of Techniques for Protein Structure Prediction[http://predictioncenter.llnl.gov/]

3. Hirschman L: The evolution of evaluation: lessons from the message understanding conferences. Computer Speech and Language 1998, 12: 281–305. 10.1006/csla.1998.0102

4. Text REtrieval Conference[http://trec.nist.gov/]

5. Voorhees EM, Buckland LP, Ed:J. The Eleventh Text Retrieval Conference (TREC 2002): NIST Special Publication 500-XXX, Gaithersburg, Maryland. 2002. [http://trec.nist.gov/pubs/trec11/t11_proceedings.html]

Cited by 113 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Mining microbe–disease interactions from literature via a transfer learning model;BMC Bioinformatics;2021-09-10

2. Character level and word level embedding with bidirectional LSTM – Dynamic recurrent neural network for biomedical named entity recognition from literature;Journal of Biomedical Informatics;2020-12

3. Active Learning Query Strategies for Classification, Regression, and Clustering: A Survey;Journal of Computer Science and Technology;2020-07

4. Biomedical named entity recognition and linking datasets: survey and our recent development;Briefings in Bioinformatics;2020-06-30

5. ModEx: A text mining system for extracting mode of regulation of transcription factor-gene regulatory interaction;Journal of Biomedical Informatics;2020-02