TY  - INPR
A1  - Dandekar, Thomas
T1  - Analysing the phase space of the standard model and its basic four forces from a qubit phase transition perspective: implications for large-scale structure generation and early cosmological events
N2  - The phase space for the standard model of the basic four forces for n quanta includes all possible ensemble combinations of their quantum states m, a total of n**m states. Neighbor states reach according to transition possibilities (S-matrix) with emergent time from entropic ensemble gradients.
We replace the “big bang” by a condensation event (interacting qubits become decoherent) and inflation by a crystallization event – the crystal unit cell guarantees same symmetries everywhere. Interacting qubits solidify and form a rapidly growing domain where the n**m states become separated ensemble states, rising long-range forces stop ultimately further growth. After that very early events, standard cosmology with the hot fireball model takes over. Our theory agrees well with lack of inflation traces in cosmic background measurements, large-scale structure of voids and filaments, supercluster formation, galaxy formation, dominance of matter and life-friendliness.

We prove qubit interactions to be 1,2,4 or 8 dimensional (agrees with E8 symmetry of our universe). Repulsive forces at ultrashort distances result from quantization, long-range forces limit crystal growth. Crystals come and go in the qubit ocean. This selects for the ability to lay seeds for new crystals, for self-organization and life-friendliness. 
We give energy estimates for free qubits vs bound qubits, misplacements in the qubit crystal and entropy increase during qubit decoherence / crystal formation. Scalar fields for color interaction and gravity derive from the permeating qubit-interaction field. Hence, vacuum energy gets low only inside the qubit crystal. Condensed mathematics may advantageously model free / bound qubits in phase space.
KW  - phase space
KW  - cosmology
KW  - emergent time
KW  - qubit
KW  - phase transition
KW  - bit
Y1  - 2023
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-298580
ER  - 
TY  - JOUR
A1  - Salihoglu, Rana
A1  - Srivastava, Mugdha
A1  - Liang, Chunguang
A1  - Schilling, Klaus
A1  - Szalay, Aladar
A1  - Bencurova, Elena
A1  - Dandekar, Thomas
T1  - PRO-Simat: Protein network simulation and design tool
JF  - Computational and Structural Biotechnology Journal
N2  - PRO-Simat is a simulation tool for analysing protein interaction networks, their dynamic change and pathway engineering. It provides GO enrichment, KEGG pathway analyses, and network visualisation from an integrated database of more than 8 million protein-protein interactions across 32 model organisms and the human proteome. We integrated dynamical network simulation using the Jimena framework, which quickly and efficiently simulates Boolean genetic regulatory networks. It enables simulation outputs with in-depth analysis of the type, strength, duration and pathway of the protein interactions on the website. Furthermore, the user can efficiently edit and analyse the effect of network modifications and engineering experiments. In case studies, applications of PRO-Simat are demonstrated: (i) understanding mutually exclusive differentiation pathways in Bacillus subtilis, (ii) making Vaccinia virus oncolytic by switching on its viral replication mainly in cancer cells and triggering cancer cell apoptosis and (iii) optogenetic control of nucleotide processing protein networks to operate DNA storage. Multilevel communication between components is critical for efficient network switching, as demonstrated by a general census on prokaryotic and eukaryotic networks and comparing design with synthetic networks using PRO-Simat. The tool is available at https://prosimat.heinzelab.de/ as a web-based query server.
KW  - network simulation
KW  - protein analysis
KW  - signalling pathways
KW  - dynamic protein-protein interactions
KW  - optogenetics
KW  - oncolytic virus
KW  - DNA storage
Y1  - 2023
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-350034
SN  - 2001-0370
VL  - 21
ER  - 
TY  - JOUR
A1  - Caliskan, Aylin
A1  - Caliskan, Deniz
A1  - Rasbach, Lauritz
A1  - Yu, Weimeng
A1  - Dandekar, Thomas
A1  - Breitenbach, Tim
T1  - Optimized cell type signatures revealed from single-cell data by combining principal feature analysis, mutual information, and machine learning
JF  - Computational and Structural Biotechnology Journal
N2  - Machine learning techniques are excellent to analyze expression data from single cells. These techniques impact all fields ranging from cell annotation and clustering to signature identification. The presented framework evaluates gene selection sets how far they optimally separate defined phenotypes or cell groups. This innovation overcomes the present limitation to objectively and correctly identify a small gene set of high information content regarding separating phenotypes for which corresponding code scripts are provided. The small but meaningful subset of the original genes (or feature space) facilitates human interpretability of the differences of the phenotypes including those found by machine learning results and may even turn correlations between genes and phenotypes into a causal explanation. For the feature selection task, the principal feature analysis is utilized which reduces redundant information while selecting genes that carry the information for separating the phenotypes. In this context, the presented framework shows explainability of unsupervised learning as it reveals cell-type specific signatures. Apart from a Seurat preprocessing tool and the PFA script, the pipeline uses mutual information to balance accuracy and size of the gene set if desired. A validation part to evaluate the gene selection for their information content regarding the separation of the phenotypes is provided as well, binary and multiclass classification of 3 or 4 groups are studied. Results from different single-cell data are presented. In each, only about ten out of more than 30000 genes are identified as carrying the relevant information. The code is provided in a GitHub repository at https://github.com/AC-PHD/Seurat_PFA_pipeline.
KW  - single cell analysis
KW  - machine learning
KW  - explainability of machine learning
KW  - principal
KW  - feature analysis
KW  - model reduction
KW  - feature selection
Y1  - 2023
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-349989
SN  - 2001-0370
VL  - 21
ER  - 
TY  - JOUR
A1  - Caliskan, Aylin
A1  - Dangwal, Seema
A1  - Dandekar, Thomas
T1  - Metadata integrity in bioinformatics: bridging the gap between data and knowledge
JF  - Computational and Structural Biotechnology Journal
N2  - In the fast-evolving landscape of biomedical research, the emergence of big data has presented researchers with extraordinary opportunities to explore biological complexities. In biomedical research, big data imply also a big responsibility. This is not only due to genomics data being sensitive information but also due to genomics data being shared and re-analysed among the scientific community. This saves valuable resources and can even help to find new insights in silico. To fully use these opportunities, detailed and correct metadata are imperative. This includes not only the availability of metadata but also their correctness. Metadata integrity serves as a fundamental determinant of research credibility, supporting the reliability and reproducibility of data-driven findings. Ensuring metadata availability, curation, and accuracy are therefore essential for bioinformatic research. Not only must metadata be readily available, but they must also be meticulously curated and ideally error-free. Motivated by an accidental discovery of a critical metadata error in patient data published in two high-impact journals, we aim to raise awareness for the need of correct, complete, and curated metadata. We describe how the metadata error was found, addressed, and present examples for metadata-related challenges in omics research, along with supporting measures, including tools for checking metadata and software to facilitate various steps from data analysis to published research.

Highlights
• Data awareness and data integrity underpins the trustworthiness of results and subsequent further analysis.
• Big data and bioinformatics enable efficient resource use by repurposing publicly available RNA-Sequencing data.
• Manual checks of data quality and integrity are insufficient due to the overwhelming volume and rapidly growing data.
• Automation and artificial intelligence provide cost-effective and efficient solutions for data integrity and quality checks.
• FAIR data management, various software solutions and analysis tools assist metadata maintenance.
KW  - meta-data
KW  - error
KW  - annotation
KW  - error-transfer
KW  - wrong labelling
KW  - patient data
KW  - control group
KW  - tools overview
Y1  - 2023
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-349990
SN  - 2001-0370
VL  - 21
ER  - 
TY  - JOUR
A1  - Bencurova, Elena
A1  - Akash, Aman
A1  - Dobson, Renwick C.J.
A1  - Dandekar, Thomas
T1  - DNA storage-from natural biology to synthetic biology
JF  - Computational and Structural Biotechnology Journal
N2  - Natural DNA storage allows cellular differentiation, evolution, the growth of our children and controls all our ecosystems. Here, we discuss the fundamental aspects of DNA storage and recent advances in this field, with special emphasis on natural processes and solutions that can be exploited. We point out new ways of efficient DNA and nucleotide storage that are inspired by nature. Within a few years DNA-based information storage may become an attractive and natural complementation to current electronic data storage systems. We discuss rapid and directed access (e.g. DNA elements such as promotors, enhancers), regulatory signals and modulation (e.g. lncRNA) as well as integrated high-density storage and processing modules (e.g. chromosomal territories). There is pragmatic DNA storage for use in biotechnology and human genetics. We examine DNA storage as an approach for synthetic biology (e.g. light-controlled nucleotide processing enzymes). The natural polymers of DNA and RNA offer much for direct storage operations (read-in, read-out, access control). The inbuilt parallelism (many molecules at many places working at the same time) is important for fast processing of information. Using biology concepts from chromosomal storage, nucleic acid processing as well as polymer material sciences such as electronical effects in enzymes, graphene, nanocellulose up to DNA macramé , DNA wires and DNA-based aptamer field effect transistors will open up new applications gradually replacing classical information storage methods in ever more areas over time (decades).
KW  - DNA
KW  - RNA
KW  - data storage
KW  - natural processing
KW  - synthetic biology
Y1  - 2023
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-349971
SN  - 2001-0370
VL  - 21
ER  - 
TY  - JOUR
A1  - Osmanoglu, Özge
A1  - Gupta, Shishir K.
A1  - Almasi, Anna
A1  - Yagci, Seray
A1  - Srivastava, Mugdha
A1  - Araujo, Gabriel H. M.
A1  - Nagy, Zoltan
A1  - Balkenhol, Johannes
A1  - Dandekar, Thomas
T1  - Signaling network analysis reveals fostamatinib as a potential drug to control platelet hyperactivation during SARS-CoV-2 infection
JF  - Frontiers in Immunology
N2  - Introduction 
Pro-thrombotic events are one of the prevalent causes of intensive care unit (ICU) admissions among COVID-19 patients, although the signaling events in the stimulated platelets are still unclear.

Methods 
We conducted a comparative analysis of platelet transcriptome data from healthy donors, ICU, and non-ICU COVID-19 patients to elucidate these mechanisms. To surpass previous analyses, we constructed models of involved networks and control cascades by integrating a global human signaling network with transcriptome data. We investigated the control of platelet hyperactivation and the specific proteins involved.

Results
Our study revealed that control of the platelet network in ICU patients is significantly higher than in non-ICU patients. Non-ICU patients require control over fewer proteins for managing platelet hyperactivity compared to ICU patients. Identification of indispensable proteins highlighted key subnetworks, that are targetable for system control in COVID-19-related platelet hyperactivity. We scrutinized FDA-approved drugs targeting indispensable proteins and identified fostamatinib as a potent candidate for preventing thrombosis in COVID-19 patients.

Discussion 
Our findings shed light on how SARS-CoV-2 efficiently affects host platelets by targeting indispensable and critical proteins involved in the control of platelet activity. We evaluated several drugs for specific control of platelet hyperactivity in ICU patients suffering from platelet hyperactivation. The focus of our approach is repurposing existing drugs for optimal control over the signaling network responsible for platelet hyperactivity in COVID-19 patients. Our study offers specific pharmacological recommendations, with drug prioritization tailored to the distinct network states observed in each patient condition. Interactive networks and detailed results can be accessed at https://fostamatinib.bioinfo-wuerz.eu/.
KW  - signaling network
KW  - controllability
KW  - platelet
KW  - SARS-CoV-2
KW  - fostamatinib
KW  - drug repurposing
KW  - COVID-19
Y1  - 2023
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-354158
VL  - 14
ER  -