Lightweight Technology Stacks for Assistive Linked Annotations

Author:

Thalhath Nishad1

Affiliation:

1. RIKEN Center for Integrative Medical Sciences

Abstract

Abstract

This report presents the findings of a project from the 8th Biomedical Linked Annotation Hackathon (BLAH) to explore lightweight technology stacks to enhance assistive linked annotations. Using modern JavaScript frameworks and edge functions, in-browser Named Entity Recognition (NER), serverless embedding and vector search within web interfaces, and efficient serverless full-text search were implemented. Through this experimental approach, a proof of concept to demonstrate the feasibility and performance of these technologies was demonstrated. The results show that lightweight stacks can significantly improve the efficiency and cost-effectiveness of annotation tools and provide a local-first, privacy-oriented, and secure alternative to traditional server-based solutions in various use cases. This work emphasizes the potential of developing annotation interfaces that are more responsive, scalable, and user-friendly, which would benefit bioinformatics researchers, practitioners, and software developers.

Publisher

Springer Science and Business Media LLC

Reference7 articles.

1. Roddy, Jack W and Lesica, George T and Wheeler, Travis J (2022) {{SODA}}: A {{TypeScript}}/{{JavaScript}} Library for Visualizing Biological Sequence Annotation. NAR Genomics and Bioinformatics 4(4): lqac077 https://doi.org/10.1093/nargab/lqac077, /Users/nishad/Zotero/storage/CTH6BQ5L/Roddy et al. - 2022 - SODA a TypeScriptJavaScript library for visualiz.pdf;/Users/nishad/Zotero/storage/B2TTVVTQ/6749379.html, We present SODA, a lightweight and open-source visualization library for biological sequence annotations that enables straightforward development of flexible, dynamic and interactive web graphics. SODA is implemented in TypeScript and can be used as a library within TypeScript and JavaScript., https://doi.org/10.1093/nargab/lqac077, 2631-9268, December, {{SODA}}

2. Neves, Mariana and Ševa, Jurica (2019) An extensive review of tools for manual annotation of documents. Briefings in Bioinformatics 22(1): 146 –163 https://doi.org/10.1093/bib/bbz130, December, Annotation tools are applied to build training and test corpora, which are essential for the development and evaluation of new natural language processing algorithms. Further, annotation tools are also used to extract new information for a particular use case. However, owing to the high number of existing annotation tools, finding the one that best fits particular needs is a demanding task that requires searching the scientific literature followed by installing and trying various tools.We searched for annotation tools and selected a subset of them according to five requirements with which they should comply, such as being Web-based or supporting the definition of a schema. We installed the selected tools (when necessary), carried out hands-on experiments and evaluated them using 26 criteria that covered functional and technical aspects. We defined each criterion on three levels of matches and a score for the final evaluation of the tools.We evaluated 78 tools and selected the following 15 for a detailed evaluation: BioQRator, brat, Catma, Djangology, ezTag, FLAT, LightTag, MAT, MyMiner, PDFAnno, prodigy, tagtog, TextAE, WAT-SL and WebAnno. Full compliance with our 26 criteria ranged from only 9 up to 20 criteria, which demonstrated that some tools are comprehensive and mature enough to be used on most annotation projects. The highest score of 0.81 was obtained by WebAnno (of a maximum value of 1.0)., 1477-4054

3. Enberg, Pekka (2024) Latency - Reduce delay in software systems. Manning Publications Co., New York, ISBN 9781633438088, May

4. Diehl, Alexander D. and Meehan, Terrence F. and Bradford, Yvonne M. and Brush, Matthew H. and Dahdul, Wasila M. and Dougall, David S. and He, Yongqun and {Osumi-Sutherland}, David and Ruttenberg, Alan and Sarntivijai, Sirarat and Van Slyke, Ceri E. and Vasilevsky, Nicole A. and Haendel, Melissa A. and Blake, Judith A. and Mungall, Christopher J. (2016) The {{Cell Ontology}} 2016: Enhanced Content, Modularization, and Ontology Interoperability. Journal of Biomedical Semantics 7(1): 44 https://doi.org/10.1186/s13326-016-0088-7, /Users/nishad/Zotero/storage/F7K7MLC5/Diehl et al. - 2016 - The Cell Ontology 2016 enhanced content, modulari.pdf;/Users/nishad/Zotero/storage/7NDBUBUT/s13326-016-0088-7.html, Anatomy Ontology,Cell Line Cell,Gene Ontology,Logical Definition,Neuroscience Information Framework, The Cell Ontology

5. (CL) is an OBO Foundry candidate ontology covering the domain of canonical, natural biological cell types. Since its inception in 2005, the CL has undergone multiple rounds of revision and expansion, most notably in its representation of hematopoietic cells. For in vivo cells, the CL focuses on vertebrates but provides general classes that can be used for other metazoans, which can be subtyped in species-specific ontologies., https://doi.org/10.1186/s13326-016-0088-7, 2041-1480, July, The {{Cell Ontology}} 2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3