Refine
Has Fulltext
- yes (3)
Is part of the Bibliography
- yes (3)
Document Type
- Journal article (3)
Language
- English (3)
Keywords
Institute
An expanded evaluation of protein function prediction methods shows an improvement in accuracy
(2016)
Background
A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging.
Results
We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2.
Conclusions
The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.
Duplications at 15q11.2-q13.3 overlapping the Prader-Willi/Angelman syndrome (PWS/AS) region have been associated with developmental delay (DD), autism spectrum disorder (ASD) and schizophrenia (SZ). Due to presence of imprinted genes within the region, the parental origin of these duplications may be key to the pathogenicity. Duplications of maternal origin are associated with disease, whereas the pathogenicity of paternal ones is unclear. To clarify the role of maternal and paternal duplications, we conducted the largest and most detailed study to date of parental origin of 15q11.2-q13.3 interstitial duplications in DD, ASD and SZ cohorts. We show, for the first time, that paternal duplications lead to an increased risk of developing DD/ASD/multiple congenital anomalies (MCA), but do not appear to increase risk for SZ. The importance of the epigenetic status of 15q11.2-q13.3 duplications was further underlined by analysis of a number of families, in which the duplication was paternally derived in the mother, who was unaffected, whereas her offspring, who inherited a maternally derived duplication, suffered from psychotic illness. Interestingly, the most consistent clinical characteristics of SZ patients with 15q11.2-q13.3 duplications were learning or developmental problems, found in 76% of carriers. Despite their lower pathogenicity, paternal duplications are less frequent in the general population with a general population prevalence of 0.0033% compared to 0.0069% for maternal duplications. This may be due to lower fecundity of male carriers and differential survival of embryos, something echoed in the findings that both types of duplications are de novo in just over 50% of cases. Isodicentric chromosome 15 (idic15) or interstitial triplications were not observed in SZ patients or in controls. Overall, this study refines the distinct roles of maternal and paternal interstitial duplications at 15q11.2-q13.3, underlining the critical importance of maternally expressed imprinted genes in the contribution of Copy Number Variants (CNVs) at this interval to the incidence of psychotic illness. This work will have tangible benefits for patients with 15q11.2-q13.3 duplications by aiding genetic counseling.
DCLK1 Variants Are Associated across Schizophrenia and Attention Deficit/Hyperactivity Disorder
(2012)
Doublecortin and calmodulin like kinase 1 (DCLK1) is implicated in synaptic plasticity and neurodevelopment. Genetic variants in DCLK1 are associated with cognitive traits, specifically verbal memory and general cognition. We investigated the role of DCLK1 variants in three psychiatric disorders that have neuro-cognitive dysfunctions: schizophrenia (SCZ), bipolar affective disorder (BP) and attention deficit/hyperactivity disorder (ADHD). We mined six genome wide association studies (GWASs) that were available publically or through collaboration; three for BP, two for SCZ and one for ADHD. We also genotyped the DCLK1 region in additional samples of cases with SCZ, BP or ADHD and controls that had not been whole-genome typed. In total, 9895 subjects were analysed, including 5308 normal controls and 4,587 patients (1,125 with SCZ, 2,496 with BP and 966 with ADHD). Several DCLK1 variants were associated with disease phenotypes in the different samples. The main effect was observed for rs7989807 in intron 3, which was strongly associated with SCZ alone and even more so when cases with SCZ and ADHD were combined (P-value = 4x10\(^{-5}\) and 4x10\(^{-6}\), respectively). Associations were also observed with additional markers in intron 3 (combination of SCZ, ADHD and BP), intron 19 (SCZ+BP) and the 3'UTR (SCZ+BP). Our results suggest that genetic variants in DCLK1 are associated with SCZ and, to a lesser extent, with ADHD and BP. Interestingly the association is strongest when SCZ and ADHD are considered together, suggesting common genetic susceptibility. Given that DCLK1 variants were previously found to be associated with cognitive traits, these results are consistent with the role of DCLK1 in neurodevelopment and synaptic plasticity.