Refine
Has Fulltext
- yes (3)
Is part of the Bibliography
- yes (3)
Document Type
- Journal article (3)
Language
- English (3)
Keywords
- DM2 (1)
- Disease gene prioritization (1)
- Fabry disease (1)
- Fabry genotype (1)
- Fabry phenotype (1)
- Protein function prediction (1)
- expansion (1)
- intergenerational contraction (1)
- lyso‐Gb3 (1)
- penetrance (1)
An expanded evaluation of protein function prediction methods shows an improvement in accuracy
(2016)
Background
A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.
Results
We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2.
Conclusions
The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.
Autosomal dominant inherited Myotonic dystrophy type 1 and 2 (DM1 and DM2) are the most frequent muscle dystrophies in the European population and are caused by repeat expansion mutations. For Germany cumulative empiric evidence suggests an estimated prevalence of DM2 of roughly 9 in 100,000, therefore being as prevalent as DM1. In DM2, a (CCTG)n repeat tract located in the first intron of the CNBP gene is expanded. The CCTG repeat tract is part of a complex repeat structure comprising not only CCTG tetraplets but also repeated TG dinucleotides and TCTG tetraplet elements as well as NCTG interruptions. Here, we provide the distribution of normal sized alleles in the German population, which was found to be highly similar to the Slovak population. Sequencing of 34 unexpanded healthy range alleles in DM2 positive patients (heterozygous for a full expansion) revealed that the CCTG repeat tract is usually interrupted by at least three tetraplets which according to current opinion is supposed to render it stable against expansion. Interestingly, only the largest analyzed normal allele had 23 uninterrupted CCTGs and consequently could represent an instable early premutation allele. In our diagnostic history of DM2 cases, a total of 18 premutations were detected in 16 independent cases. Here, we describe two premutation families, one with an expansion from a premutation allele and the other with a contraction of a full expansion down to a premutation allele. Our diagnostic results support the general assumption that the premutation range of unstable CCTG stretches lies obviously between 25 and 75 CCTGs. However, the clinical significance of premutation alleles is still unclear. In the light of the two described families we suggest incomplete penetrance. Thus, as it was proposed for other repeat expansion diseases (e.g., Huntington's disease), a fluid transition of penetrance is more likely rather than a clear cut CCTG number threshold.
Background
Fabry disease (FD) is an X‐linked lysosomal storage and multi‐system disorder due to mutations in the α‐galactosidase A (α‐GalA) gene. We investigated the impact of individual amino acid exchanges in the α‐GalA 3D‐structure on the clinical phenotype of FD patients.
Patients and methods
We enrolled 80 adult FD patients with α‐GalA missense mutations and stratified them into three groups based on the amino acid exchange location in the α‐GalA 3D‐structure: patients with active site mutations, buried mutations and other mutations. Patient subgroups were deep phenotyped for clinical and laboratory parameters and FD‐specific treatment.
Results
Patients with active site or buried mutations showed a severe phenotype with multi‐organ involvement and early disease manifestation. Patients with other mutations had a milder phenotype with less organ impairment and later disease onset. α‐GalA activity was lower in patients with active site or buried mutations than in those with other mutations (P < 0.01 in men; P < 0.05 in women) whilst lyso‐Gb3 levels were higher (P < 0.01 in men; <0.05 in women).
Conclusions
The type of amino acid exchange location in the α‐GalA 3D‐structure determines disease severity and temporal course of symptom onset. Patient stratification using this parameter may become a useful tool in the management of FD patients.