-
The New Phytologist Jul 2024Understanding the genetic basis of how plants defend against pathogens is important to monitor and maintain resilient tree populations. Swiss needle cast (SNC) and...
Understanding the genetic basis of how plants defend against pathogens is important to monitor and maintain resilient tree populations. Swiss needle cast (SNC) and Rhabdocline needle cast (RNC) epidemics are responsible for major damage of forest ecosystems in North America. Here we investigate the genetic architecture of tolerance and resistance to needle cast diseases in Douglas-fir (Pseudotsuga menziesii) caused by two fungal pathogens: SNC caused by Nothophaeocryptopus gaeumannii, and RNC caused by Rhabdocline pseudotsugae. We performed case-control genome-wide association analyses and found disease resistance and tolerance in Douglas-fir to be polygenic and under strong selection. We show that stomatal regulation as well as ethylene and jasmonic acid pathways are important for resisting SNC infection, and secondary metabolite pathways play a role in tolerating SNC once the plant is infected. We identify a major transcriptional regulator of plant defense, ERF1, as the top candidate for RNC resistance. Our findings shed light on the highly polygenic architectures underlying fungal disease resistance and tolerance and have important implications for forestry and conservation as the climate changes.
Topics: Disease Resistance; Plant Diseases; Pseudotsuga; Genome-Wide Association Study; Ascomycota; Trees; Adaptation, Physiological; Multifactorial Inheritance; Gene Expression Regulation, Plant; Genes, Plant
PubMed: 38803110
DOI: 10.1111/nph.19797 -
ELife May 2024Rich data from large biobanks, coupled with increasingly accessible association statistics from genome-wide association studies (GWAS), provide great opportunities to...
Rich data from large biobanks, coupled with increasingly accessible association statistics from genome-wide association studies (GWAS), provide great opportunities to dissect the complex relationships among human traits and diseases. We introduce BADGERS, a powerful method to perform polygenic score-based biobank-wide association scans. Compared to traditional approaches, BADGERS uses GWAS summary statistics as input and does not require multiple traits to be measured in the same cohort. We applied BADGERS to two independent datasets for late-onset Alzheimer's disease (AD; n=61,212). Among 1738 traits in the UK biobank, we identified 48 significant associations for AD. Family history, high cholesterol, and numerous traits related to intelligence and education showed strong and independent associations with AD. Furthermore, we identified 41 significant associations for a variety of AD endophenotypes. While family history and high cholesterol were strongly associated with AD subgroups and pathologies, only intelligence and education-related traits predicted pre-clinical cognitive phenotypes. These results provide novel insights into the distinct biological processes underlying various risk factors for AD.
Topics: Alzheimer Disease; Humans; Risk Factors; Endophenotypes; Genome-Wide Association Study; Male; Biological Specimen Banks; Female; United Kingdom; Aged; Genetic Predisposition to Disease; Multifactorial Inheritance; Aged, 80 and over
PubMed: 38787369
DOI: 10.7554/eLife.91360 -
Arteriosclerosis, Thrombosis, and... Jul 2024Heterozygous familial hypercholesterolemia (FH) is among the most common genetic conditions worldwide that affects ≈ 1 in 300 individuals. FH is characterized by...
BACKGROUND
Heterozygous familial hypercholesterolemia (FH) is among the most common genetic conditions worldwide that affects ≈ 1 in 300 individuals. FH is characterized by increased levels of low-density lipoprotein cholesterol (LDL-C) and increased risk of coronary artery disease (CAD), but there is a wide spectrum of severity within the FH population. This variability in expression is incompletely explained by known risk factors. We hypothesized that genome-wide genetic influences, as represented by polygenic risk scores (PRSs) for cardiometabolic traits, would influence the phenotypic severity of FH.
METHODS
We studied individuals with clinically diagnosed FH (n=1123) from the FH Canada National Registry, as well as individuals with genetically identified FH from the UK Biobank (n=723). For all individuals, we used genome-wide gene array data to calculate PRSs for CAD, LDL-C, lipoprotein(a), and other cardiometabolic traits. We compared the distribution of PRSs in individuals with clinically diagnosed FH, genetically diagnosed FH, and non-FH controls and examined the association of the PRSs with the risk of atherosclerotic cardiovascular disease.
RESULTS
Individuals with clinically diagnosed FH had higher levels of LDL-C, and the incidence of atherosclerotic cardiovascular disease was higher in individuals with clinically diagnosed compared with genetically identified FH. Individuals with clinically diagnosed FH displayed enrichment for higher PRSs for CAD, LDL-C, and lipoprotein(a) but not for other cardiometabolic risk factors. The CAD PRS was associated with a risk of atherosclerotic cardiovascular disease among individuals with an FH-causing genetic variant.
CONCLUSIONS
Genetic background, as expressed by genome-wide PRSs for CAD, LDL-C, and lipoprotein(a), influences the phenotypic severity of FH, expanding our understanding of the determinants that contribute to the variable expressivity of FH. A PRS for CAD may aid in risk prediction among individuals with FH.
Topics: Humans; Hyperlipoproteinemia Type II; Female; Male; Middle Aged; Multifactorial Inheritance; Cholesterol, LDL; Phenotype; Registries; Genetic Predisposition to Disease; Coronary Artery Disease; Risk Assessment; Genome-Wide Association Study; Lipoprotein(a); Adult; Aged; Canada; United Kingdom; Severity of Illness Index; Risk Factors; Case-Control Studies; Biomarkers; Incidence
PubMed: 38779854
DOI: 10.1161/ATVBAHA.123.320287 -
Human Genomics May 2024Given the high prevalence of BPH among elderly men, pinpointing those at elevated risk can aid in early intervention and effective management. This study aimed to...
BACKGROUND
Given the high prevalence of BPH among elderly men, pinpointing those at elevated risk can aid in early intervention and effective management. This study aimed to explore that polygenic risk score (PRS) is effective in predicting benign prostatic hyperplasia (BPH) incidence, prognosis and risk of operation in Han Chinese.
METHODS
A retrospective cohort study included 12,474 male participants (6,237 with BPH and 6,237 non-BPH controls) from the Taiwan Precision Medicine Initiative (TPMI). Genotyping was performed using the Affymetrix Genome-Wide TWB 2.0 SNP Array. PRS was calculated using PGS001865, comprising 1,712 single nucleotide polymorphisms. Logistic regression models assessed the association between PRS and BPH incidence, adjusting for age and prostate-specific antigen (PSA) levels. The study also examined the relationship between PSA, prostate volume, and response to 5-α-reductase inhibitor (5ARI) treatment, as well as the association between PRS and the risk of TURP.
RESULTS
Individuals in the highest PRS quartile (Q4) had a significantly higher risk of BPH compared to the lowest quartile (Q1) (OR = 1.51, 95% CI = 1.274-1.783, p < 0.0001), after adjusting for PSA level. The Q4 group exhibited larger prostate volumes and a smaller volume reduction after 5ARI treatment. The Q1 group had a lower cumulative TURP probability at 3, 5, and 10 years compared to the Q4 group. PRS Q4 was an independent risk factor for TURP.
CONCLUSIONS
In this Han Chinese cohort, higher PRS was associated with an increased susceptibility to BPH, larger prostate volumes, poorer response to 5ARI treatment, and a higher risk of TURP. Larger prospective studies with longer follow-up are warranted to further validate these findings.
Topics: Humans; Male; Prostatic Hyperplasia; Aged; Middle Aged; Polymorphism, Single Nucleotide; Genetic Predisposition to Disease; Retrospective Studies; Multifactorial Inheritance; Asian People; Risk Factors; 5-alpha Reductase Inhibitors; Prostate-Specific Antigen; Taiwan; Prognosis; Prostate; Genetic Risk Score; East Asian People
PubMed: 38778357
DOI: 10.1186/s40246-024-00619-3 -
Scientific Reports May 2024In recent years, the utility of polygenic risk scores (PRS) in forecasting disease susceptibility from genome-wide association studies (GWAS) results has been widely...
In recent years, the utility of polygenic risk scores (PRS) in forecasting disease susceptibility from genome-wide association studies (GWAS) results has been widely recognised. Yet, these models face limitations due to overfitting and the potential overestimation of effect sizes in correlated variants. To surmount these obstacles, we devised the Stacked Neural Network Polygenic Risk Score (SNPRS). This novel approach synthesises outputs from multiple neural network models, each calibrated using genetic variants chosen based on diverse p-value thresholds. By doing so, SNPRS captures a broader array of genetic variants, enabling a more nuanced interpretation of the combined effects of these variants. We assessed the efficacy of SNPRS using the UK Biobank data, focusing on the genetic risks associated with breast and prostate cancers, as well as quantitative traits like height and BMI. We also extended our analysis to the Korea Genome and Epidemiology Study (KoGES) dataset. Impressively, our results indicate that SNPRS surpasses traditional PRS models and an isolated deep neural network in terms of accuracy, highlighting its promise in refining the efficacy and relevance of PRS in genetic studies.
Topics: Humans; Multifactorial Inheritance; Genome-Wide Association Study; Neural Networks, Computer; Genetic Predisposition to Disease; Polymorphism, Single Nucleotide; Female; Male; Prostatic Neoplasms; Breast Neoplasms; Risk Factors; Genetic Risk Score
PubMed: 38773257
DOI: 10.1038/s41598-024-62513-1 -
Scientific Reports May 2024Predisposing factors underlying familial aggregation of non-syndromic gliomas are still to be uncovered. Whole-exome sequencing was performed in four Finnish families...
Predisposing factors underlying familial aggregation of non-syndromic gliomas are still to be uncovered. Whole-exome sequencing was performed in four Finnish families with brain tumors to identify rare predisposing variants. A total of 417 detected exome variants and 102 previously reported glioma-related variants were further genotyped in 19 Finnish families with brain tumors using targeted sequencing. Rare damaging variants in GALNT13, MYO10 and AR were identified. Two families carried either c.553C>T (R185C) or c.1214T>A (L405Q) on GALNT13. Variant c.553C>T is located on the substrate-binding site of GALNT13. AR c.2180G>T (R727L), which is located on a ligand-binding domain of AR, was detected in two families, one of which also carried a GALNT13 variant. MYO10 c.4448A>G (N1483S) was detected in two families and c.1511C>T (A504V) variant was detected in one family. Both variants are located on functional domains related to MYO10 activity in filopodia formation. In addition, affected cases in six families carried a known glioma risk variant rs55705857 in CCDC26 and low-risk glioma variants. These novel findings indicate polygenic inheritance of familial glioma in Finland and increase our understanding of the genetic contribution to familial glioma susceptibility.
Topics: Humans; Finland; Glioma; Female; Male; N-Acetylgalactosaminyltransferases; Genetic Predisposition to Disease; Pedigree; Polypeptide N-acetylgalactosaminyltransferase; Germ-Line Mutation; Adult; Middle Aged; Brain Neoplasms; Exome Sequencing
PubMed: 38773237
DOI: 10.1038/s41598-024-62296-5 -
Briefings in Bioinformatics Mar 2024Polygenetic Risk Scores are used to evaluate an individual's vulnerability to developing specific diseases or conditions based on their genetic composition, by taking... (Review)
Review
Polygenetic Risk Scores are used to evaluate an individual's vulnerability to developing specific diseases or conditions based on their genetic composition, by taking into account numerous genetic variations. This article provides an overview of the concept of Polygenic Risk Scores (PRS). We elucidate the historical advancements of PRS, their advantages and shortcomings in comparison with other predictive methods, and discuss their conceptual limitations in light of the complexity of biological systems. Furthermore, we provide a survey of published tools for computing PRS and associated resources. The various tools and software packages are categorized based on their technical utility for users or prospective developers. Understanding the array of available tools and their limitations is crucial for accurately assessing and predicting disease risks, facilitating early interventions, and guiding personalized healthcare decisions. Additionally, we also identify potential new avenues for future bioinformatic analyzes and advancements related to PRS.
Topics: Humans; Multifactorial Inheritance; Genetic Predisposition to Disease; Software; Computational Biology; Genome-Wide Association Study; Risk Factors; Risk Assessment; Genetic Risk Score
PubMed: 38770718
DOI: 10.1093/bib/bbae240 -
Nature Communications May 2024Transcriptome-wide association study (TWAS) is a popular approach to dissect the functional consequence of disease associated non-coding variants. Most existing TWAS use...
Transcriptome-wide association study (TWAS) is a popular approach to dissect the functional consequence of disease associated non-coding variants. Most existing TWAS use bulk tissues and may not have the resolution to reveal cell-type specific target genes. Single-cell expression quantitative trait loci (sc-eQTL) datasets are emerging. The largest bulk- and sc-eQTL datasets are most conveniently available as summary statistics, but have not been broadly utilized in TWAS. Here, we present a new method EXPRESSO (EXpression PREdiction with Summary Statistics Only), to analyze sc-eQTL summary statistics, which also integrates 3D genomic data and epigenomic annotation to prioritize causal variants. EXPRESSO substantially improves existing methods. We apply EXPRESSO to analyze multi-ancestry GWAS datasets for 14 autoimmune diseases. EXPRESSO uniquely identifies 958 novel gene x trait associations, which is 26% more than the second-best method. Among them, 492 are unique to cell type level analysis and missed by TWAS using whole blood. We also develop a cell type aware drug repurposing pipeline, which leverages EXPRESSO results to identify drug compounds that can reverse disease gene expressions in relevant cell types. Our results point to multiple drugs with therapeutic potentials, including metformin for type 1 diabetes, and vitamin K for ulcerative colitis.
Topics: Quantitative Trait Loci; Humans; Single-Cell Analysis; Genome-Wide Association Study; Genetic Predisposition to Disease; Transcriptome; Autoimmune Diseases; Polymorphism, Single Nucleotide; Multifactorial Inheritance; Gene Expression Profiling
PubMed: 38769300
DOI: 10.1038/s41467-024-48143-1 -
Translational Vision Science &... May 2024The purpose of this study was to conduct a large-scale genome-wide association study (GWAS) and construct a polygenic risk score (PRS) for risk stratification in...
PURPOSE
The purpose of this study was to conduct a large-scale genome-wide association study (GWAS) and construct a polygenic risk score (PRS) for risk stratification in patients with dry eye disease (DED) using the Taiwan Biobank (TWB) databases.
METHODS
This retrospective case-control study involved 40,112 subjects of Han Chinese ancestry, sourced from the publicly available TWB. Cases were patients with DED (n = 14,185), and controls were individuals without DED (n = 25,927). The patients with DED were further divided into 8072 young (<60 years old) and 6113 old participants (≥60 years old). Using PLINK (version 1.9) software, quality control was carried out, followed by logistic regression analysis with adjustments for sex, age, body mass index, depression, and manic episodes as covariates. We also built PRS prediction models using the standard clumping and thresholding method and evaluated their performance (area under the curve [AUC]) through five-fold cross-validation.
RESULTS
Eleven independent risk loci were identified for these patients with DED at the genome-wide significance levels, including DNAJB6, MAML3, LINC02267, DCHS1, SIRPB3P, HULC, MUC16, GAS2L3, and ZFPM2. Among these, MUC16 encodes mucin family protein. The PRS model incorporated 932 and 740 genetic loci for young and old populations, respectively. A higher PRS score indicated a greater DED risk, with the top 5% of PRS individuals having a 10-fold higher risk. After integrating these covariates into the PRS model, the area under the receiver operating curve (AUROC) increased from 0.509 and 0.537 to 0.600 and 0.648 for young and old populations, respectively, demonstrating the genetic-environmental interaction.
CONCLUSIONS
Our study prompts potential candidates for the mechanism of DED and paves the way for more personalized medication in the future.
TRANSLATIONAL RELEVANCE
Our study identified genes related to DED and constructed a PRS model to improve DED prediction.
Topics: Humans; Genome-Wide Association Study; Female; Male; Middle Aged; Retrospective Studies; Dry Eye Syndromes; Case-Control Studies; Genetic Predisposition to Disease; Adult; Multifactorial Inheritance; Aged; Risk Factors; Risk Assessment; Polymorphism, Single Nucleotide; Taiwan; Genetic Risk Score
PubMed: 38767906
DOI: 10.1167/tvst.13.5.13 -
Nature Communications May 2024Type 2 diabetes (T2D) presents a formidable global health challenge, highlighted by its escalating prevalence, underscoring the critical need for precision health...
Type 2 diabetes (T2D) presents a formidable global health challenge, highlighted by its escalating prevalence, underscoring the critical need for precision health strategies and early detection initiatives. Leveraging artificial intelligence, particularly eXtreme Gradient Boosting (XGBoost), we devise robust risk assessment models for T2D. Drawing upon comprehensive genetic and medical imaging datasets from 68,911 individuals in the Taiwan Biobank, our models integrate Polygenic Risk Scores (PRS), Multi-image Risk Scores (MRS), and demographic variables, such as age, sex, and T2D family history. Here, we show that our model achieves an Area Under the Receiver Operating Curve (AUC) of 0.94, effectively identifying high-risk T2D subgroups. A streamlined model featuring eight key variables also maintains a high AUC of 0.939. This high accuracy for T2D risk assessment promises to catalyze early detection and preventive strategies. Moreover, we introduce an accessible online risk assessment tool for T2D, facilitating broader applicability and dissemination of our findings.
Topics: Diabetes Mellitus, Type 2; Humans; Risk Assessment; Female; Male; Middle Aged; Artificial Intelligence; Taiwan; Genetic Predisposition to Disease; Adult; Diagnostic Imaging; Aged; Risk Factors; ROC Curve; Multifactorial Inheritance
PubMed: 38762475
DOI: 10.1038/s41467-024-48618-1