• search hit 9 of 1344
Back to Result List

Metadata integrity in bioinformatics: bridging the gap between data and knowledge

Please always quote using this URN: urn:nbn:de:bvb:20-opus-349990
  • In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data imply also a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata areIn the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data imply also a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata are imperative. This includes not only the availability of metadata but also their correctness. Metadata integrity serves as a fundamental determinant of research credibility, supporting the reliability and reproducibility of data-driven findings. Ensuring metadata availability, curation, and accuracy are therefore essential for bioinformatic research. Not only must metadata be readily available, but they must also be meticulously curated and ideally error-free. Motivated by an accidental discovery of a critical metadata error in patient data published in two high-impact journals, we aim to raise awareness for the need of correct, complete, and curated metadata. We describe how the metadata error was found, addressed, and present examples for metadata-related challenges in omics research, along with supporting measures, including tools for checking metadata and software to facilitate various steps from data analysis to published research. Highlights • Data awareness and data integrity underpins the trustworthiness of results and subsequent further analysis. • Big data and bioinformatics enable efficient resource use by repurposing publicly available RNA-Sequencing data. • Manual checks of data quality and integrity are insufficient due to the overwhelming volume and rapidly growing data. • Automation and artificial intelligence provide cost-effective and efficient solutions for data integrity and quality checks. • FAIR data management, various software solutions and analysis tools assist metadata maintenance.show moreshow less

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar Statistics
Metadaten
Author: Aylin Caliskan, Seema Dangwal, Thomas Dandekar
URN:urn:nbn:de:bvb:20-opus-349990
Document Type:Journal article
Faculties:Medizinische Fakultät / Theodor-Boveri-Institut für Biowissenschaften
Language:English
Parent Title (English):Computational and Structural Biotechnology Journal
ISSN:2001-0370
Year of Completion:2023
Volume:21
Pagenumber:4895-4913
Source:Computational and Structural Biotechnology Journal (2023) 21:4895-4913. DOI: 10.1016/j.csbj.2023.10.006
DOI:https://doi.org/10.1016/j.csbj.2023.10.006
Dewey Decimal Classification:5 Naturwissenschaften und Mathematik / 57 Biowissenschaften; Biologie / 570 Biowissenschaften; Biologie
Tag:annotation; control group; error; error-transfer; meta-data; patient data; tools overview; wrong labelling
Release Date:2024/03/28
Licence (German):License LogoCC BY-NC-ND: Creative-Commons-Lizenz: Namensnennung, Nicht kommerziell, Keine Bearbeitungen 4.0 International