TY - JOUR
A1 - Sahlol, Ahmed T.
A1 - Kollmannsberger, Philip
A1 - Ewees, Ahmed A.
T1 - Efficient Classification of White Blood Cell Leukemia with Improved Swarm Optimization of Deep Features
JF - Scientific Reports
N2 - White Blood Cell (WBC) Leukaemia is caused by excessive production of leukocytes in the bone marrow, and image-based detection of malignant WBCs is important for its detection. Convolutional Neural Networks (CNNs) present the current state-of-the-art for this type of image classification, but their computational cost for training and deployment can be high. We here present an improved hybrid approach for efficient classification of WBC Leukemia. We first extract features from WBC images using VGGNet, a powerful CNN architecture, pre-trained on ImageNet. The extracted features are then filtered using a statistically enhanced Salp Swarm Algorithm (SESSA). This bio-inspired optimization algorithm selects the most relevant features and removes highly correlated and noisy features. We applied the proposed approach to two public WBC Leukemia reference datasets and achieve both high accuracy and reduced computational complexity. The SESSA optimization selected only 1 K out of 25 K features extracted with VGGNet, while improving accuracy at the same time. The results are among the best achieved on these datasets and outperform several convolutional network models. We expect that the combination of CNN feature extraction and SESSA feature optimization could be useful for many other image classification tasks.
KW - Acute lymphocytic leukaemia
KW - Computer science
KW - Image processing
Y1 - 2020
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-229398
VL - 10
IS - 1
ER -

TY - JOUR
A1 - Reinhard, Sebastian
A1 - Helmerich, Dominic A.
A1 - Boras, Dominik
A1 - Sauer, Markus
A1 - Kollmannsberger, Philip
T1 - ReCSAI: recursive compressed sensing artificial intelligence for confocal lifetime localization microscopy
JF - BMC Bioinformatics
N2 - Background Localization-based super-resolution microscopy resolves macromolecular structures down to a few nanometers by computationally reconstructing fluorescent emitter coordinates from diffraction-limited spots. The most commonly used algorithms are based on fitting parametric models of the point spread function (PSF) to a measured photon distribution. These algorithms make assumptions about the symmetry of the PSF and thus, do not work well with irregular, non-linear PSFs that occur for example in confocal lifetime imaging, where a laser is scanned across the sample. An alternative method for reconstructing sparse emitter sets from noisy, diffraction-limited images is compressed sensing, but due to its high computational cost it has not yet been widely adopted. Deep neural network fitters have recently emerged as a new competitive method for localization microscopy. They can learn to fit arbitrary PSFs, but require extensive simulated training data and do not generalize well. A method to efficiently fit the irregular PSFs from confocal lifetime localization microscopy combining the advantages of deep learning and compressed sensing would greatly improve the acquisition speed and throughput of this method. Results Here we introduce ReCSAI, a compressed sensing neural network to reconstruct localizations for confocal dSTORM, together with a simulation tool to generate training data. We implemented and compared different artificial network architectures, aiming to combine the advantages of compressed sensing and deep learning. We found that a U-Net with a recursive structure inspired by iterative compressed sensing showed the best results on realistic simulated datasets with noise, as well as on real experimentally measured confocal lifetime scanning data.
Adding a trainable wavelet denoising layer as prior step further improved the reconstruction quality. Conclusions Our deep learning approach can reach a similar reconstruction accuracy for confocal dSTORM as frame binning with traditional fitting without requiring the acquisition of multiple frames. In addition, our work offers generic insights on the reconstruction of sparse measurements from noisy experimental data by combining compressed sensing and deep learning. We provide the trained networks, the code for network training and inference as well as the simulation tool as python code and Jupyter notebooks for easy reproducibility.
KW - compressed sensing
KW - AI
KW - SMLM
KW - FLIMbee
KW - dSTORM
Y1 - 2022
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-299768
VL - 23
IS - 1
ER -

TY - JOUR
A1 - Pauli, Martin
A1 - Paul, Mila M.
A1 - Proppert, Sven
A1 - Mrestani, Achmed
A1 - Sharifi, Marzieh
A1 - Repp, Felix
A1 - Kürzinger, Lydia
A1 - Kollmannsberger, Philip
A1 - Sauer, Markus
A1 - Heckmann, Manfred
A1 - Sirén, Anna-Leena
T1 - Targeted volumetric single-molecule localization microscopy of defined presynaptic structures in brain sections
JF - Communications Biology
N2 - Revealing the molecular organization of anatomically precisely defined brain regions is necessary for refined understanding of synaptic plasticity. Although three-dimensional (3D) single-molecule localization microscopy can provide the required resolution, imaging more than a few micrometers deep into tissue remains challenging. To quantify presynaptic active zones (AZ) of entire, large, conditional detonator hippocampal mossy fiber (MF) boutons with diameters as large as 10 µm, we developed a method for targeted volumetric direct stochastic optical reconstruction microscopy (dSTORM).
An optimized protocol for fast repeated axial scanning and efficient sequential labeling of the AZ scaffold Bassoon and membrane bound GFP with Alexa Fluor 647 enabled 3D-dSTORM imaging of 25 µm thick mouse brain sections and assignment of AZs to specific neuronal substructures. Quantitative data analysis revealed large differences in Bassoon cluster size and density for distinct hippocampal regions with largest clusters in MF boutons. Pauli et al. develop targeted volumetric dSTORM in order to image large hippocampal mossy fiber boutons (MFBs) in brain slices. They can identify synaptic targets of individual MFBs and measured size and density of Bassoon clusters within individual untruncated MFBs at nanoscopic resolution.
KW - mossy fiber synapses
KW - CA3 pyramidal cells
KW - Ca2+ channels
KW - active zone
KW - hippocampal
KW - release
KW - plasticity
KW - proteins
KW - platform
KW - reveals
Y1 - 2021
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-259830
VL - 4
ER -

TY - JOUR
A1 - Paul, Torsten Johann
A1 - Kollmannsberger, Philip
T1 - Biological network growth in complex environments: A computational framework
JF - PLoS Computational Biology
N2 - Spatial biological networks are abundant on all scales of life, from single cells to ecosystems, and perform various important functions including signal transmission and nutrient transport. These biological functions depend on the architecture of the network, which emerges as the result of a dynamic, feedback-driven developmental process. While cell behavior during growth can be genetically encoded, the resulting network structure depends on spatial constraints and tissue architecture. Since network growth is often difficult to observe experimentally, computer simulations can help to understand how local cell behavior determines the resulting network architecture.
We present here a computational framework based on directional statistics to model network formation in space and time under arbitrary spatial constraints. Growth is described as a biased correlated random walk where direction and branching depend on the local environmental conditions and constraints, which are presented as 3D multilayer grid. To demonstrate the application of our tool, we perform growth simulations of a dense network between cells and compare the results to experimental data from osteocyte networks in bone. Our generic framework might help to better understand how network patterns depend on spatial constraints, or to identify the biological cause of deviations from healthy network function. Author summary We present a novel modeling approach and computational implementation to better understand the development of spatial biological networks under the influence of external signals. Our tool allows us to study the relationship between local biological growth parameters and the emerging macroscopic network function using simulations. This computational approach can generate plausible network graphs that take local feedback into account and provide a basis for comparative studies using graph-based methods.
KW - osteocyte network
KW - connectome
KW - mechanisms
KW - generation
KW - shape
Y1 - 2020
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-231373
VL - 16
IS - 11
ER -

TY - JOUR
A1 - Mrestani, Achmed
A1 - Pauli, Martin
A1 - Kollmannsberger, Philip
A1 - Repp, Felix
A1 - Kittel, Robert J.
A1 - Eilers, Jens
A1 - Doose, Sören
A1 - Sauer, Markus
A1 - Sirén, Anna-Leena
A1 - Heckmann, Manfred
A1 - Paul, Mila M.
T1 - Active zone compaction correlates with presynaptic homeostatic potentiation
JF - Cell Reports
N2 - Neurotransmitter release is stabilized by homeostatic plasticity.
Presynaptic homeostatic potentiation (PHP) operates on timescales ranging from minute- to life-long adaptations and likely involves reorganization of presynaptic active zones (AZs). At Drosophila melanogaster neuromuscular junctions, earlier work ascribed AZ enlargement by incorporating more Bruchpilot (Brp) scaffold protein a role in PHP. We use localization microscopy (direct stochastic optical reconstruction microscopy [dSTORM]) and hierarchical density-based spatial clustering of applications with noise (HDBSCAN) to study AZ plasticity during PHP at the synaptic mesoscale. We find compaction of individual AZs in acute philanthotoxin-induced and chronic genetically induced PHP but unchanged copy numbers of AZ proteins. Compaction even occurs at the level of Brp subclusters, which move toward AZ centers, and in Rab3 interacting molecule (RIM)-binding protein (RBP) subclusters. Furthermore, correlative confocal and dSTORM imaging reveals how AZ compaction in PHP translates into apparent increases in AZ area and Brp protein content, as implied earlier.
KW - active zone
KW - Bruchpilot
KW - RIM-binding protein
KW - compaction
KW - homeostasis
KW - presynaptic plasticity
KW - super-resolution microscopy
Y1 - 2021
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-265497
VL - 37
IS - 1
ER -

TY - JOUR
A1 - Mostosi, Philipp
A1 - Schindelin, Hermann
A1 - Kollmannsberger, Philip
A1 - Thorn, Andrea
T1 - Haruspex: A Neural Network for the Automatic Identification of Oligonucleotides and Protein Secondary Structure in Cryo‐Electron Microscopy Maps
JF - Angewandte Chemie International Edition
N2 - In recent years, three‐dimensional density maps reconstructed from single particle images obtained by electron cryo‐microscopy (cryo‐EM) have reached unprecedented resolution. However, map interpretation can be challenging, in particular if the constituting structures require de‐novo model building or are very mobile.
Herein, we demonstrate the potential of convolutional neural networks for the annotation of cryo‐EM maps: our network Haruspex has been trained on a carefully curated set of 293 experimentally derived reconstruction maps to automatically annotate RNA/DNA as well as protein secondary structure elements. It can be straightforwardly applied to newly reconstructed maps in order to support domain placement or as a starting point for main‐chain placement. Due to its high recall and precision rates of 95.1 % and 80.3 %, respectively, on an independent test set of 122 maps, it can also be used for validation during model building. The trained network will be available as part of the CCP‐EM suite.
KW - DNA structures
KW - electron microscopy
KW - neural networks
KW - protein structures
KW - RNA structures
Y1 - 2020
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-214763
VL - 59
IS - 35
SP - 14788
EP - 14795
ER -

TY - JOUR
A1 - Marquardt, André
A1 - Solimando, Antonio Giovanni
A1 - Kerscher, Alexander
A1 - Bittrich, Max
A1 - Kalogirou, Charis
A1 - Kübler, Hubert
A1 - Rosenwald, Andreas
A1 - Bargou, Ralf
A1 - Kollmannsberger, Philip
A1 - Schilling, Bastian
A1 - Meierjohann, Svenja
A1 - Krebs, Markus
T1 - Subgroup-Independent Mapping of Renal Cell Carcinoma — Machine Learning Reveals Prognostic Mitochondrial Gene Signature Beyond Histopathologic Boundaries
JF - Frontiers in Oncology
N2 - Background: Renal cell carcinoma (RCC) is divided into three major histopathologic groups—clear cell (ccRCC), papillary (pRCC) and chromophobe RCC (chRCC). We performed a comprehensive re-analysis of publicly available RCC datasets from the TCGA (The Cancer Genome Atlas) database, thereby combining samples from all three subgroups, for an exploratory transcriptome profiling of RCC subgroups.
Materials and Methods: We used FPKM (fragments per kilobase per million) files derived from the ccRCC, pRCC and chRCC cohorts of the TCGA database, representing transcriptomic data of 891 patients. Using principal component analysis, we visualized datasets as t-SNE plot for cluster detection. Clusters were characterized by machine learning, resulting gene signatures were validated by correlation analyses in the TCGA dataset and three external datasets (ICGC RECA-EU, CPTAC-3-Kidney, and GSE157256). Results: Many RCC samples co-clustered according to histopathology. However, a substantial number of samples clustered independently from histopathologic origin (mixed subgroup)—demonstrating divergence between histopathology and transcriptomic data. Further analyses of mixed subgroup via machine learning revealed a predominant mitochondrial gene signature—a trait previously known for chRCC—across all histopathologic subgroups. Additionally, ccRCC samples from mixed subgroup presented an inverse correlation of mitochondrial and angiogenesis-related genes in the TCGA and in three external validation cohorts. Moreover, mixed subgroup affiliation was associated with a highly significant shorter overall survival for patients with ccRCC—and a highly significant longer overall survival for chRCC patients. Conclusions: Pan-RCC clustering according to RNA-sequencing data revealed a distinct histology-independent subgroup characterized by strengthened mitochondrial and weakened angiogenesis-related gene signatures. Moreover, affiliation to mixed subgroup went along with a significantly shorter overall survival for ccRCC and a longer overall survival for chRCC patients. Further research could offer a therapy stratification by specifically addressing the mitochondrial metabolism of such tumors and its microenvironment. 
KW - kidney cancer
KW - pan-RCC
KW - machine learning
KW - mitochondrial DNA
KW - mtDNA
KW - mTOR
Y1 - 2021
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-232107
SN - 2234-943X
VL - 11
ER -

TY - JOUR
A1 - Marquardt, André
A1 - Landwehr, Laura-Sophie
A1 - Ronchi, Cristina L.
A1 - di Dalmazi, Guido
A1 - Riester, Anna
A1 - Kollmannsberger, Philip
A1 - Altieri, Barbara
A1 - Fassnacht, Martin
A1 - Sbiera, Silviu
T1 - Identifying New Potential Biomarkers in Adrenocortical Tumors Based on mRNA Expression Data Using Machine Learning
JF - Cancers
N2 - Simple Summary Using a visual-based clustering method on the TCGA RNA sequencing data of a large adrenocortical carcinoma (ACC) cohort, we were able to classify these tumors in two distinct clusters largely overlapping with previously identified ones. As previously shown, the identified clusters also correlated with patient survival. Applying the visual clustering method to a second dataset also including benign adrenocortical samples additionally revealed that one of the ACC clusters is more closely located to the benign samples, providing a possible explanation for the better survival of this ACC cluster. Furthermore, the subsequent use of machine learning identified new possible biomarker genes with prognostic potential for this rare disease, that are significantly differentially expressed in the different survival clusters and should be further evaluated. Abstract Adrenocortical carcinoma (ACC) is a rare disease, associated with poor survival. Several “multiple-omics” studies characterizing ACC on a molecular level identified two different clusters correlating with patient survival (C1A and C1B). We here used the publicly available transcriptome data from the TCGA-ACC dataset (n = 79), applying machine learning (ML) methods to classify the ACC based on expression pattern in an unbiased manner.
UMAP (uniform manifold approximation and projection)-based clustering resulted in two distinct groups, ACC-UMAP1 and ACC-UMAP2, that largely overlap with clusters C1B and C1A, respectively. However, subsequent use of random-forest-based learning revealed a set of new possible marker genes showing significant differential expression in the described clusters (e.g., SOAT1, EIF2A1). For validation purposes, we used a secondary dataset based on a previous study from our group, consisting of 4 normal adrenal glands and 52 benign and 7 malignant tumor samples. The results largely confirmed those obtained for the TCGA-ACC cohort. In addition, the ENSAT dataset showed a correlation between benign adrenocortical tumors and the good prognosis ACC cluster ACC-UMAP1/C1B. In conclusion, the use of ML approaches re-identified and redefined known prognostic ACC subgroups. On the other hand, the subsequent use of random-forest-based learning identified new possible prognostic marker genes for ACC.
KW - adrenocortical carcinoma
KW - in silico analysis
KW - machine learning
KW - bioinformatic clustering
KW - biomarker prediction
Y1 - 2021
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-246245
SN - 2072-6694
VL - 13
IS - 18
ER -

TY - JOUR
A1 - Marquardt, André
A1 - Kollmannsberger, Philip
A1 - Krebs, Markus
A1 - Argentiero, Antonella
A1 - Knott, Markus
A1 - Solimando, Antonio Giovanni
A1 - Kerscher, Alexander Georg
T1 - Visual clustering of transcriptomic data from primary and metastatic tumors — dependencies and novel pitfalls
JF - Genes
N2 - Personalized oncology is a rapidly evolving area and offers cancer patients therapy options that are more specific than ever. However, there is still a lack of understanding regarding transcriptomic similarities or differences of metastases and corresponding primary sites.
Applying two unsupervised dimension reduction methods (t-Distributed Stochastic Neighbor Embedding (t-SNE) and Uniform Manifold Approximation and Projection (UMAP)) on three datasets of metastases (n = 682 samples) with three different data transformations (unprocessed, log10 as well as log10 + 1 transformed values), we visualized potential underlying clusters. Additionally, we analyzed two datasets (n = 616 samples) containing metastases and primary tumors of one entity, to point out potential familiarities. Using these methods, no tight link between the site of resection and cluster formation outcome could be demonstrated, or for datasets consisting of solely metastasis or mixed datasets. Instead, dimension reduction methods and data transformation significantly impacted visual clustering results. Our findings strongly suggest data transformation to be considered as another key element in the interpretation of visual clustering approaches along with initialization and different parameters. Furthermore, the results highlight the need for a more thorough examination of parameters used in the analysis of clusters.
KW - visual clustering
KW - t-SNE
KW - UMAP
KW - transcriptomic analysis
KW - cancer
KW - metastasis
Y1 - 2022
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-281872
SN - 2073-4425
VL - 13
IS - 8
ER -

TY - JOUR
A1 - Marquardt, André
A1 - Hartrampf, Philipp
A1 - Kollmannsberger, Philip
A1 - Solimando, Antonio G.
A1 - Meierjohann, Svenja
A1 - Kübler, Hubert
A1 - Bargou, Ralf
A1 - Schilling, Bastian
A1 - Serfling, Sebastian E.
A1 - Buck, Andreas
A1 - Werner, Rudolf A.
A1 - Lapa, Constantin
A1 - Krebs, Markus
T1 - Predicting microenvironment in CXCR4- and FAP-positive solid tumors — a pan-cancer machine learning workflow for theranostic target structures
JF - Cancers
N2 - (1) Background: C-X-C Motif Chemokine Receptor 4 (CXCR4) and Fibroblast Activation Protein Alpha (FAP) are promising theranostic targets.
However, it is unclear whether CXCR4 and FAP positivity mark distinct microenvironments, especially in solid tumors. (2) Methods: Using Random Forest (RF) analysis, we searched for entity-independent mRNA and microRNA signatures related to CXCR4 and FAP overexpression in our pan-cancer cohort from The Cancer Genome Atlas (TCGA) database — representing n = 9242 specimens from 29 tumor entities. CXCR4- and FAP-positive samples were assessed via StringDB cluster analysis, EnrichR, Metascape, and Gene Set Enrichment Analysis (GSEA). Findings were validated via correlation analyses in n = 1541 tumor samples. TIMER2.0 analyzed the association of CXCR4 / FAP expression and infiltration levels of immune-related cells. (3) Results: We identified entity-independent CXCR4 and FAP gene signatures representative for the majority of solid cancers. While CXCR4 positivity marked an immune-related microenvironment, FAP overexpression highlighted an angiogenesis-associated niche. TIMER2.0 analysis confirmed characteristic infiltration levels of CD8+ cells for CXCR4-positive tumors and endothelial cells for FAP-positive tumors. (4) Conclusions: CXCR4- and FAP-directed PET imaging could provide a non-invasive decision aid for entity-agnostic treatment of microenvironment in solid malignancies. Moreover, this machine learning workflow can easily be transferred towards other theranostic targets.
KW - machine learning
KW - tumor microenvironment
KW - immune infiltration
KW - angiogenesis
KW - mRNA
KW - miRNA
KW - transcriptome
Y1 - 2023
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:20-opus-305036
SN - 2072-6694
VL - 15
IS - 2
ER -