-
Science (New York, N.Y.) Oct 2023Epistasis between genes is traditionally studied with mutations that eliminate protein activity, but most natural genetic variation is in cis-regulatory DNA and...
Epistasis between genes is traditionally studied with mutations that eliminate protein activity, but most natural genetic variation is in cis-regulatory DNA and influences gene expression and function quantitatively. In this study, we used natural and engineered cis-regulatory alleles in a plant stem-cell circuit to systematically evaluate epistatic relationships controlling tomato fruit size. Combining a promoter allelic series with two other loci, we collected over 30,000 phenotypic data points from 46 genotypes to quantify how allele strength transforms epistasis. We revealed a saturating dose-dependent relationship but also allele-specific idiosyncratic interactions, including between alleles driving a step change in fruit size during domestication. Our approach and findings expose an underexplored dimension of epistasis, in which cis-regulatory allelic diversity within gene regulatory networks elicits nonlinear, unpredictable interactions that shape phenotypes.
Topics: Alleles; Epistasis, Genetic; Fruit; Genetic Variation; Genotype; Phenotype; Solanum lycopersicum; Gene Expression Regulation, Plant; Promoter Regions, Genetic; Gene Dosage
PubMed: 37856609
DOI: 10.1126/science.adi5222 -
Science Advances Feb 2023The CRISPR-Cas system can treat autosomal dominant diseases by nonhomologous end joining (NHEJ) gene disruption of mutant alleles. However, many single-nucleotide...
The CRISPR-Cas system can treat autosomal dominant diseases by nonhomologous end joining (NHEJ) gene disruption of mutant alleles. However, many single-nucleotide mutations cannot be discriminated from wild-type alleles by current CRISPR-Cas systems. Here, we functionally screened six Cas12j nucleases and determined Cas12j-8 as an ideal genome editor with a hypercompact size. Cas12j-8 displayed comparable activity to AsCas12a and Un1Cas12f1. Cas12j-8 is a highly specific nuclease sensitive to single-nucleotide mismatches in the protospacer adjacent motif (PAM)-proximal region. We experimentally proved that Cas12j-8 enabled allele-specific disruption of genes with a single-nucleotide polymorphism (SNP). Cas12j-8 recognizes a simple TTN PAM that provides for high target site density. In silico analysis reveals that Cas12j-8 enables allele-specific disruption of 25,931 clinically relevant variants in the ClinVar database, and 485,130,147 SNPs in the dbSNP database. Therefore, Cas12j-8 would be particularly suitable for therapeutic applications.
Topics: Gene Editing; Alleles; CRISPR-Cas Systems; Endonucleases; Nucleotides
PubMed: 36763662
DOI: 10.1126/sciadv.abo6405 -
Forensic Science International. Genetics May 2022The Hardy-Weinberg law is shown to be transitive in the sense that a multi-allelic polymorphism that is in equilibrium will retain its equilibrium status if any allele...
The Hardy-Weinberg law is shown to be transitive in the sense that a multi-allelic polymorphism that is in equilibrium will retain its equilibrium status if any allele together with its corresponding genotypes is deleted from the population. Similarly, the transitivity principle also applies if alleles are joined, which leads to the summation of allele frequencies and their corresponding genotype frequencies. These basic polymorphism properties are intuitive, but they have apparently not been formalized or investigated. This article provides a straightforward proof of the transitivity principle, and its usefulness in genetic data analysis is explored, using high-quality autosomal microsatellite databases from the US National Institute of Standards and Technology. We address the reduction of multi-allelic polymorphisms to variants with fewer alleles, two in the limit. Equilibrium test results obtained with the original and reduced polymorphisms are generally observed to be coherent, in particular when results obtained with length-based and sequence-based microsatellites are compared. We exploit the transitivity principle in order to identify disequilibrium-related alleles, and show its usefulness for detecting population substructure and genotyping problems that relate to null alleles and allele imbalance.
Topics: Alleles; Gene Frequency; Genotype; Humans; Polymorphism, Genetic
PubMed: 35313226
DOI: 10.1016/j.fsigen.2022.102680 -
Mammalian Genome : Official Journal of... Mar 2022In addition to naturally occurring sequence variation and spontaneous mutations, a wide array of technologies exist for modifying the mouse genome. Standardized... (Review)
Review
In addition to naturally occurring sequence variation and spontaneous mutations, a wide array of technologies exist for modifying the mouse genome. Standardized nomenclature, including allele, transgene, and other mutation nomenclature, as well as persistent unique identifiers (PUID) are critical for effective scientific communication, comparison of results, and integration of data into knowledgebases such as Mouse Genome Informatics (MGI), Alliance for Genome Resources, and International Mouse Strain Resource (IMSR). As well as being the authoritative source for mouse gene, allele, and strain nomenclature, MGI integrates published and unpublished genomic, phenotypic, and expression data while linking to other online resources for a complete view of the mouse as a valuable model organism. The International Committee on Standardized Genetic Nomenclature for Mice has developed allele nomenclature rules and guidelines that take into account the number of genes impacted, the method of allele generation, and the nature of the sequence alteration. To capture details that cannot be included in allele symbols, MGI has further developed allele to gene relationships using sequence ontology (SO) definitions for mutations that provide links between alleles and the genes affected. MGI is also using (HGVS) variant nomenclature for variants associated with alleles that will enhance searching for mutations and will improve cross-species comparison. With the ability to assign unique and informative symbols as well as to link alleles with more than one gene, allele and transgene nomenclature rules and guidelines provide an unambiguous way to represent alterations in the mouse genome and facilitate data integration among multiple resources such the Alliance of Genome Resources and International Mouse Strain Resource.
Topics: Alleles; Animals; Databases, Genetic; Genomics; Mice; Mutation; Transgenes
PubMed: 34389871
DOI: 10.1007/s00335-021-09902-3 -
Laboratory Animals Feb 2022The emergence of an array of genome-editing tools in recent years has facilitated the introduction of genetic modifications directly into the embryo, increasing the...
The emergence of an array of genome-editing tools in recent years has facilitated the introduction of genetic modifications directly into the embryo, increasing the ease, efficiency and catalogue of alleles accessible to researchers across a range of species. Bypassing the requirement for a selection cassette and resulting in a broad range of outcomes besides the desired allele, genome editing has altered the allele validation process both temporally and technically. Whereas traditional gene targeting relies upon selection and allows allele validation at the embryonic stem cell modification stage, screening for the presence of the intended allele now occurs in the (frequently mosaic) founder animals. Final confirmation of the edited allele can only take place at the subsequent G1 generation and the validation strategy must differentiate the desired allele from a range of unintended outcomes. Here we present some of the challenges posed by gene editing, strategies for validation and considerations for animal colony management.
Topics: Alleles; Animals; Embryo, Mammalian; Gene Editing; Genetic Testing; Reproducibility of Results
PubMed: 34192966
DOI: 10.1177/00236772211016922 -
Human Genomics Feb 2022CYP2D6 is a key drug-metabolizing enzyme implicated in the biotransformation of approximately 25% of currently prescribed drugs. Interindividual and interethnic... (Review)
Review
CYP2D6 is a key drug-metabolizing enzyme implicated in the biotransformation of approximately 25% of currently prescribed drugs. Interindividual and interethnic differences in CYP2D6 enzymatic activity, and hence variability in substrate drug efficacy and safety, are attributed to a highly polymorphic corresponding gene. This study aims at reviewing the frequencies of the most clinically relevant CYP2D6 alleles in the Arabs countries. Articles published before May 2021 that reported CYP2D6 genotype and allelic frequencies in the Arab populations of the Middle East and North Africa (MENA) region were retrieved from PubMed and Google Scholar databases. This review included 15 original articles encompassing 2737 individuals from 11 countries of the 22 members of the League of Arab States. Active CYP2D6 gene duplications reached the highest frequencies of 28.3% and 10.4% in Algeria and Saudi Arabia, respectively, and lowest in Egypt (2.41%) and Palestine (4.9%). Frequencies of the loss-of-function allele CYP2D6*4 ranged from 3.5% in Saudi Arabia to 18.8% in Egypt. The disparity in frequencies of the reduced-function CYP2D6*10 allele was perceptible, with the highest frequency reported in Jordan (14.8%) and the lowest in neighboring Palestine (2%), and in Algeria (0%). The reduced-function allele CYP2D6*41 was more prevalent in the Arabian Peninsula countries; Saudi Arabia (18.4%) and the United Arab Emirates (15.2%), in comparison with the Northern Arab-Levantine Syria (9.7%) and Algeria (8.3%). Our study demonstrates heterogeneity of CYP2D6 alleles among Arab populations. The incongruities of the frequencies of alleles in neighboring countries with similar demographic composition emphasize the necessity for harmonizing criteria of genotype assignment and conducting comprehensive studies on larger MENA Arab populations to determine their CYP2D6 allelic makeup and improve therapeutic outcomes of CYP2D6- metabolized drugs.
Topics: Alleles; Arabs; Cytochrome P-450 CYP2D6; Gene Frequency; Humans; Polymorphism, Genetic
PubMed: 35123571
DOI: 10.1186/s40246-022-00378-z -
Heredity Jan 2022The two alleles an individual carries at a locus are identical by descent (ibd) if they have descended from a single ancestral allele in a reference population, and the...
The two alleles an individual carries at a locus are identical by descent (ibd) if they have descended from a single ancestral allele in a reference population, and the probability of such identity is the inbreeding coefficient of the individual. Inbreeding coefficients can be predicted from pedigrees with founders constituting the reference population, but estimation from genetic data is not possible without data from the reference population. Most inbreeding estimators that make explicit use of sample allele frequencies as estimates of allele probabilities in the reference population are confounded by average kinships with other individuals. This means that the ranking of those estimates depends on the scope of the study sample and we show the variation in rankings for common estimators applied to different subdivisions of 1000 Genomes data. Allele-sharing estimators of within-population inbreeding relative to average kinship in a study sample, however, do have invariant rankings across all studies including those individuals. They are unbiased with a large number of SNPs. We discuss how allele sharing estimates are the relevant quantities for a range of empirical applications.
Topics: Alleles; Gene Frequency; Humans; Inbreeding; Models, Genetic; Pedigree; Polymorphism, Single Nucleotide
PubMed: 34824382
DOI: 10.1038/s41437-021-00471-4 -
Biometrics Jun 2022The allele-based association test, comparing allele frequency difference between case and control groups, is locally most powerful. However, application of the classical...
The allele-based association test, comparing allele frequency difference between case and control groups, is locally most powerful. However, application of the classical allelic test is limited in practice, because the method is sensitive to the Hardy-Weinberg equilibrium (HWE) assumption, not applicable to continuous traits, and not easy to account for covariate effect or sample correlation. To develop a generalized robust allelic test, we propose a new allele-based regression model with individual allele as the response variable. We show that the score test statistic derived from this robust and unifying regression framework contains a correction factor that explicitly adjusts for potential departure from HWE and encompasses the classical allelic test as a special case. When the trait of interest is continuous, the corresponding allelic test evaluates a weighted difference between individual-level allele frequency estimate and sample estimate where the weight is proportional to an individual's trait value, and the test remains valid under Y-dependent sampling. Finally, the proposed allele-based method can analyze multiple (continuous or binary) phenotypes simultaneously and multiallelic genetic markers, while accounting for covariate effect, sample correlation, and population heterogeneity. To support our analytical findings, we provide empirical evidence from both simulation and application studies.
Topics: Alleles; Computer Simulation; Gene Frequency; Genotype; Models, Genetic; Phenotype
PubMed: 33729547
DOI: 10.1111/biom.13456 -
Human Mutation Nov 2022To determine the phase of NUDT15 sequence variants for more comprehensive star (*) allele diplotyping, we developed a novel long-read single-molecule real-time HiFi...
To determine the phase of NUDT15 sequence variants for more comprehensive star (*) allele diplotyping, we developed a novel long-read single-molecule real-time HiFi amplicon sequencing method. A 10.5 kb NUDT15 amplicon assay was validated using reference material positive controls and additional samples for specimen type and blinded accuracy assessment. Triplicate NUDT15 HiFi sequencing of two reference material samples had nonreference genotype concordances of >99.9%, indicating that the assay is robust. Notably, short-read genome sequencing of a subset of samples was unable to determine the phase of star (*) allele-defining NUDT15 variants, resulting in ambiguous diplotype results. In contrast, long-read HiFi sequencing phased all variants across the NUDT15 amplicons, including a *2/*9 diplotype that previously was characterized as *1/*2 in the 1000 Genomes Project v3 data set. Assay throughput was also tested using 8.5 kb amplicons from 100 Ashkenazi Jewish individuals, which identified a novel NUDT15 *1 suballele (c.-121G>A) and a rare likely deleterious coding variant (p.Pro129Arg). Both novel alleles were Sanger confirmed and assigned as *1.007 and *20, respectively, by the PharmVar Consortium. Taken together, NUDT15 HiFi amplicon sequencing is an innovative method for phased full-gene characterization and novel allele discovery, which could improve NUDT15 pharmacogenomic testing and subsequent phenotype prediction.
Topics: Alleles; Genotype; Haplotypes; Humans; Pharmacogenetics; Sequence Analysis, DNA
PubMed: 36057977
DOI: 10.1002/humu.24457 -
Trends in Ecology & Evolution Mar 2020The particular combinations of alleles that define haplotypes along individual chromosomes can be determined with increasing ease and accuracy by using current... (Review)
Review
The particular combinations of alleles that define haplotypes along individual chromosomes can be determined with increasing ease and accuracy by using current sequencing technologies. Beyond allele frequencies, haplotype data collected in population samples contain information about the history of allelic associations in gene genealogies, and this is of tremendous potential for conservation genomics. We provide an overview of how haplotype information can be used to assess historical demography, gene flow, selection, and the evolutionary outcomes of hybridization across different timescales relevant to conservation issues. We address technical aspects of applying such approaches to nonmodel species. We conclude that there is much to be gained by integrating haplotype-based analyses in future conservation genomics studies.
Topics: Alleles; Gene Flow; Gene Frequency; Genomics; Haplotypes
PubMed: 31810774
DOI: 10.1016/j.tree.2019.10.012