Data format standards in analytical chemistry
Author:
Rauh David1ORCID, Blankenburg Claudia1, Fischer Tillmann G.1, Jung Nicole2, Kuhn Stefan3, Schatzschneider Ulrich4, Schulze Tobias5, Neumann Steffen1ORCID
Affiliation:
1. Leibniz Institute of Plant Biochemistry, Bioinformatics and Scientific Data , Weinberg 3 , 06120 Halle , Germany 2. Karlsruhe Institute of Technology, Institute for Chemical and Biological Systems (IBCS-FMS) , Hermann von Helmholtz Platz 1 , 76344 Eggenstein-Leopolshafen , Germany 3. School of Computer Science and Informatics , De Montfort University , Leicester , UK 4. Institut für Anorganische Chemie , Julius-Maximilians-Universität Würzburg , Am Hubland , D-97074 Würzburg , Germany 5. Department of Effect-Directed Analysis , Helmholtz Centre for Environmental Research – UFZ , Permoserstr. 15, 04318 Leipzig , Germany
Abstract
Abstract
Research data is an essential part of research and almost every publication in chemistry. The data itself can be valuable for reuse if sustainably deposited, annotated and archived. Thus, it is important to publish data following the FAIR principles, to make it findable, accessible, interoperable and reusable not only for humans but also in machine-readable form. This also improves transparency and reproducibility of research findings and fosters analytical work with scientific data to generate new insights, being only accessible with manifold and diverse datasets. Research data requires complete and informative metadata and use of open data formats to obtain interoperable data. Generic data formats like AnIML and JCAMP-DX have been used for many applications. Special formats for some analytical methods are already accepted, like mzML for mass spectrometry or nmrML and NMReDATA for NMR spectroscopy data. Other methods still lack common standards for data. Only a joint effort of chemists, instrument and software vendors, publishers and infrastructure maintainers can make sure that the analytical data will be of value in the future. In this review, we describe existing data formats in analytical chemistry and introduce guidelines for the development and use of standardized and open data formats.
Publisher
Walter de Gruyter GmbH
Subject
General Chemical Engineering,General Chemistry
Reference69 articles.
1. K. Rajan, H. O. Brinkhaus, A. Zielesny, C. Steinbeck. J. Cheminf. 12, 60 (2020), https://doi.org/10.1186/s13321-020-00465-0. 2. M. D. Wilkinson, M. Dumontier, I. J. Jan Aalbersberg, G. Appleton, M. Axton, A. Baak, N. Blomberg, J.-W. Boiten, L. B. da Silva Santos, P. E. Bourne, J. Bouwman, A. J. Brookes, T. Clark, M. Crosas, I. Dillo, O. Dumon, S. Edmunds, C. T. Evelo, R. Finkers, A. Gonzalez-Beltran, A. J. G. Gray, P. Groth, C. Goble, J. S. Grethe, J. Heringa, P. A. C. ’t Hoen, R. Hooft, T. Kuhn, R. Kok, J. Kok, S. J. Lusher, M. E. Martone, A. Mons, A. L. Packer, B. Persson, P. Rocca-Serra, M. Roos, R. van Schaik, S.-A. Sansone, E. Schultes, T. Sengstag, T. Slater, G. Strawn, M. A. Swertz, M. Thompson, J. van der Lei, E. van Mulligen, J. Velterop, A. Waagmeester, P. Wittenburg, K. Wolstencroft, J. Zhao, B. Mons. Sci. Data 3, 160018 (2016), https://doi.org/10.1038/sdata.2016.18. 3. T. Habermann. Patterns (N Y) 1, 100004 (2020), https://doi.org/10.1016/j.patter.2020.100004. 4. M. J. Harvey, N. J. Mason, A. McLean, P. Murray-Rust, H. S. Rzepa, J. J. P. Stewart. J. Cheminf. 7, 43 (2015), https://doi.org/10.1186/s13321-015-0093-3. 5. J. B. McAlpine, S.-N. Chen, A. Kutateladze, J. B. MacMillan, G. Appendino, B. Andersson, M. A. Beniddir, M. W. Biavatti, S. Bluml, A. Boufridi, M. S. Butler, R. J. Capon, Y. H. Choi, D. Coppage, P. Crews, M. T. Crimmins, M. Csete, P. Dewapriya, J. M. Egan, M. J. Garson, G. Genta-Jouve, W. H. Gerwick, H. Gross, M. K. Harper, P. Hermanto, J. M. Hook, L. Hunter, D. Jeannerat, N.-Y. Ji, T. A. Johnson, D. G. I. Kingston, H. Koshino, H.-W. Lee, L. Guy, J. Li, R. G. Linington, M. Liu, K. L. McPhail, T. F. Molinski, B. S. Moore, J.-W. Nam, R. P. Neupane, M. Niemitz, J.-M. Nuzillard, N. H. Oberlies, F. M. M. Ocampos, G. Pan, R. J. Quinn, D. S. Reddy, J.-H. Renault, J. Rivera-Chávez, W. Robien, C. M. Saunders, T. J. Schmidt, C. Seger, B. Shen, C. Steinbeck, H. Stuppner, S. Sturm, O. Taglialatela-Scafati, D. J. Tantillo, R. Verpoorte, B.-G. Wang, C. M. Williams, P. G. Williams, J. Wist, J.-M. Yue, C. Zhang, Z. Xu, C. Simmler, D. C. Lankin, J. Bisson, G. F. Pauli. Nat. Prod. Rep. 36, 35 (2019), https://doi.org/10.1039/c7np00064b.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|