FungiRegEx: A Tool for Pattern Identification in Fungal Proteomic Sequences Using Regular Expressions
-
Published:2024-05-23
Issue:11
Volume:14
Page:4429
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Terrón-Macias Victor1ORCID, Mejia Jezreel1ORCID, Canseco-Pérez Miguel Angel2, Muñoz Mirna1ORCID, Terrón-Hernández Miguel3
Affiliation:
1. Ingeniería de Software, Centro de Investigación en Matemáticas (CIMAT, A.C) Unidad Zacatecas, Zacatecas 98068, Mexico 2. Ingeniería Agroindustrial, Universidad Politécnica de Chiapas, Suchiapa 29150, Mexico 3. Ingeniería en Mantenimiento Industrial, Universidad Tecnológica de Tlaxcala, Huamantla 90500, Mexico
Abstract
In the context of proteomic-scale research, it is imperative to automatically analyze numerous species and subspecies to discern distinctive characteristics present in multiple species of the fungi kingdom that contain sequences of interest that could fulfill a specific biological function. To achieve this, complex sequences must be recognized within an organism’s entire set of proteomes. Our study presents FungiRegEx, a piece of software that facilitates the identification of regular expressions of proteomes of fungal organisms and uses real-time data retrieval of the different species from the JGI Mycocosm database without the need to download any file. Integrating a graphical user interface that makes it easy to use, the tool offers regular expression searches on 2402 fungal species from the JGI Mycocosm portal. The tool was validated with the AXSXG sequence and the RXRL effector, demonstrating the effectiveness of FungiRegEx in identifying user-defined patterns in the recovered sequences. This tool allows customization and filtering, and it can save results if required, combining speed, adaptability, and ease of use. It provides an experience without a console and programming, displaying the results in a GUI and making them easier to read. Its architecture guarantees optimized use of resources, time consumption, and implementation flexibility, allowing the customization of specific software parameters for resource management. The tool’s potential for future research and exploration is emphasized, providing a nuanced perspective on its practical use within the fungal genomics community. The tools are available at the addresses mentioned in the text.
Funder
Council of Science Technology and Innovation of Zacatecas state
Reference26 articles.
1. Muggia, L., Ametrano, C.G., Sterflinger, K., and Tesei, D. (2020). An Overview of Genomics, Phylogenomics and Proteomics Approaches in Ascomycota. Life, 10. 2. Proteins and Their Interacting Partners: An Introduction to Protein–Ligand Binding Site Prediction Methods;Roche;Int. J. Mol. Sci.,2015 3. Merski, M., Młynarczyk, K., Ludwiczak, J., Skrzeczkowski, J., Dunin-Horkawicz, S., and Górna, M.W. (2020). Self-analysis of repeat proteins reveals evolutionarily conserved patterns. BMC Bioinform., 21. 4. Bull, R.I., Trevors, A., Malton, A.J., and Godfrey, M.W. (November, January 29). Semantic grep: Regular expressions + relational abstraction. Proceedings of the Ninth Working Conference on Reverse Engineering, Richmond, VA, USA. 5. Nagaev, B., Yashina, K., and Palmblad, M. (2022, March 02). msgfdb2pepxml (Version 2.0) [Python Script]. Available online: https://ms-utils.org/msgfdb2pepxml/.
|
|