Results 1 - 14 of 14
1.
Cell ; 176(6): 1310-1324.e10, 2019 03 07.
Article in English | MEDLINE | ID: mdl-30827684

ABSTRACT

DNA rearrangements resulting in human genome structural variants (SVs) are caused by diverse mutational mechanisms. We used long- and short-read sequencing technologies to investigate end products of de novo chromosome 17p11.2 rearrangements and query the molecular mechanisms underlying both recurrent and non-recurrent events. Evidence for an increased rate of clustered single-nucleotide variant (SNV) mutation in cis with non-recurrent rearrangements was found. Indel and SNV formation are associated with both copy-number gains and losses of 17p11.2, occur up to ∼1 Mb away from the breakpoint junctions, and favor C > G transversion substitutions; results suggest that single-stranded DNA is formed during the genesis of the SV and provide compelling support for a microhomology-mediated break-induced replication (MMBIR) mechanism for SV formation. Our data show an additional mutational burden of MMBIR consisting of hypermutation confined to the locus and manifesting as SNVs and indels predominantly within genes.


Subjects
Chromosomes, Human, Pair 17; Mutation; Abnormalities, Multiple/genetics; Chromosome Breakpoints; Chromosome Disorders/genetics; Chromosome Duplication/genetics; DNA Copy Number Variations; DNA Repair/genetics; DNA Replication; Gene Rearrangement; Genome, Human; Genomic Structural Variation; Humans; INDEL Mutation; Models, Genetic; Polymorphism, Single Nucleotide; Recombination, Genetic; Sequence Analysis, DNA/methods; Smith-Magenis Syndrome/genetics
2.
Genet Med ; 24(5): 1062-1072, 2022 05.
Article in English | MEDLINE | ID: mdl-35331649

ABSTRACT

PURPOSE: The Mayo-Baylor RIGHT 10K Study enabled preemptive, sequence-based pharmacogenomics (PGx)-driven drug prescribing practices in routine clinical care within a large cohort. We also generated the tools and resources necessary for clinical PGx implementation and identified challenges that need to be overcome. Furthermore, we measured the frequency of both common genetic variation for which clinical guidelines already exist and rare variation that could be detected by DNA sequencing, rather than genotyping. METHODS: Targeted oligonucleotide-capture sequencing of 77 pharmacogenes was performed using DNA from 10,077 consented Mayo Clinic Biobank volunteers. The resulting predicted drug response-related phenotypes for 13 genes, including CYP2D6 and HLA, affecting 21 drug-gene pairs, were deposited preemptively in the Mayo electronic health record. RESULTS: For the 13 pharmacogenes of interest, the genomes of 79% of participants carried clinically actionable variants in 3 or more genes, and DNA sequencing identified an average of 3.3 additional conservatively predicted deleterious variants that would not have been evident using genotyping. CONCLUSION: Implementation of preemptive rather than reactive and sequence-based rather than genotype-based PGx prescribing revealed nearly universal patient applicability and required integrated institution-wide resources to fully realize individualized drug therapy and to show more efficient use of health care resources.


Subjects
Cytochrome P-450 CYP2D6; Pharmacogenetics; Academic Medical Centers; Base Sequence; Cytochrome P-450 CYP2D6/genetics; Genotype; Humans; Pharmacogenetics/methods
3.
Genome Res ; 24(7): 1209-23, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24985915

ABSTRACT

Accurate gene model annotation of reference genomes is critical for making them useful. The modENCODE project has improved the D. melanogaster genome annotation by using deep and diverse high-throughput data. Since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function, we have performed large-scale interspecific comparisons to increase confidence in predicted annotations. To support comparative genomics, we filled in divergence gaps in the Drosophila phylogeny by generating draft genomes for eight new species. For comparative transcriptome analysis, we generated mRNA expression profiles on 81 samples from multiple tissues and developmental stages of 15 Drosophila species, and we performed cap analysis of gene expression in D. melanogaster and D. pseudoobscura. We also describe conservation of four distinct core promoter structures composed of combinations of elements at three positions. Overall, each type of genomic feature shows a characteristic divergence rate relative to neutral models, highlighting the value of multispecies alignment in annotating a target genome that should prove useful in the annotation of other high priority genomes, especially human and other mammalian genomes that are rich in noncoding sequences. We report that the vast majority of elements in the annotation are evolutionarily conserved, indicating that the annotation will be an important springboard for functional genetic testing by the Drosophila community.


Subjects
Computational Biology/methods; Drosophila melanogaster/genetics; Gene Expression Profiling; Molecular Sequence Annotation; Transcriptome; Animals; Cluster Analysis; Drosophila melanogaster/classification; Evolution, Molecular; Exons; Female; Genome, Insect; Humans; Male; Nucleotide Motifs; Phylogeny; Position-Specific Scoring Matrices; Promoter Regions, Genetic; RNA Editing; RNA Splice Sites; RNA Splicing; Reproducibility of Results; Transcription Initiation Site
4.
Genome Res ; 24(7): 1193-208, 2014 Jul.
Article in English | MEDLINE | ID: mdl-24714809

ABSTRACT

The Drosophila melanogaster Genetic Reference Panel (DGRP) is a community resource of 205 sequenced inbred lines, derived to improve our understanding of the effects of naturally occurring genetic variation on molecular and organismal phenotypes. We used an integrated genotyping strategy to identify 4,853,802 single nucleotide polymorphisms (SNPs) and 1,296,080 non-SNP variants. Our molecular population genomic analyses show higher deletion than insertion mutation rates and stronger purifying selection on deletions. Weaker selection on insertions than deletions is consistent with our observed distribution of genome size determined by flow cytometry, which is skewed toward larger genomes. Insertion/deletion and single nucleotide polymorphisms are positively correlated with each other and with local recombination, suggesting that their nonrandom distributions are due to hitchhiking and background selection. Our cytogenetic analysis identified 16 polymorphic inversions in the DGRP. Common inverted and standard karyotypes are genetically divergent and account for most of the variation in relatedness among the DGRP lines. Intriguingly, variation in genome size and many quantitative traits are significantly associated with inversions. Approximately 50% of the DGRP lines are infected with Wolbachia, and four lines have germline insertions of Wolbachia sequences, but effects of Wolbachia infection on quantitative traits are rarely significant. The DGRP complements ongoing efforts to functionally annotate the Drosophila genome. Indeed, 15% of all D. melanogaster genes segregate for potentially damaged proteins in the DGRP, and genome-wide analyses of quantitative traits identify novel candidate genes. The DGRP lines, sequence data, genotypes, quality scores, phenotypes, and analysis and visualization tools are publicly available.


Subjects
Drosophila melanogaster/genetics; Genetic Variation; Genome, Insect; Phenotype; Animals; Chromatin/genetics; Chromatin/metabolism; Drosophila melanogaster/microbiology; Female; Genetic Linkage; Genome Size; Genome-Wide Association Study; Genotype; Genotyping Techniques; High-Throughput Nucleotide Sequencing; INDEL Mutation; Linkage Disequilibrium; Male; Molecular Sequence Annotation; Polymorphism, Single Nucleotide; Quantitative Trait, Heritable; Reproducibility of Results
5.
Commun Biol ; 7(1): 174, 2024 Feb 19.
Article in English | MEDLINE | ID: mdl-38374434

ABSTRACT

Disparities in data underlying clinical genomic interpretation are an acknowledged problem, but there is a paucity of data demonstrating it. The All of Us Research Program is collecting data including whole-genome sequences, health records, and surveys for at least a million participants with diverse ancestry and access to healthcare, representing one of the largest biomedical research repositories of its kind. Here, we examine pathogenic and likely pathogenic variants that were identified in the All of Us cohort. The European ancestry subgroup showed the highest overall rate of pathogenic variation, with 2.26% of participants having a pathogenic variant. Other ancestry groups had lower rates of pathogenic variation, including 1.62% for the African ancestry group and 1.32% in the Latino/Admixed American ancestry group. Pathogenic variants were most frequently observed in genes related to Breast/Ovarian Cancer or Hypercholesterolemia. Variant frequencies in many genes were consistent with the data from the public gnomAD database, with some notable exceptions resolved using gnomAD subsets. Differences in pathogenic variant frequency observed between ancestral groups generally indicate biases of ascertainment of knowledge about those variants, but some deviations may be indicative of differences in disease prevalence. This work will allow targeted precision medicine efforts at revealed disparities.


Subjects
Genetic Predisposition to Disease; Population Health; Humans; Black People; Genomics; Hispanic or Latino/genetics; United States/epidemiology; European People; African People
6.
medRxiv ; 2024 Apr 12.
Article in English | MEDLINE | ID: mdl-38645101

ABSTRACT

Background: Multiplexed Assays of Variant Effects (MAVEs) can test all possible single variants in a gene of interest. The resulting saturation-style data may help resolve variant classification disparities between populations, especially for variants of uncertain significance (VUS). Methods: We analyzed clinical significance classifications in 213,663 individuals of European-like genetic ancestry versus 206,975 individuals of non-European-like genetic ancestry from All of Us and the Genome Aggregation Database. Then, we incorporated clinically calibrated MAVE data into the Clinical Genome Resource's Variant Curation Expert Panel rules to automate VUS reclassification for BRCA1, TP53, and PTEN. Results: Using two orthogonal statistical approaches, we show a higher prevalence (p ≤ 5.95e-06) of VUS in individuals of non-European-like genetic ancestry across all medical specialties assessed in all three databases. Further, in the non-European-like genetic ancestry group, higher rates of Benign or Likely Benign and variants with no clinical designation (p ≤ 2.5e-05) were found across many medical specialties, whereas Pathogenic or Likely Pathogenic assignments were higher in individuals of European-like genetic ancestry (p ≤ 2.5e-05). Using MAVE data, we reclassified VUS in individuals of non-European-like genetic ancestry at a significantly higher rate in comparison to reclassified VUS from European-like genetic ancestry (p = 9.1e-03), effectively compensating for the VUS disparity. Further, essential code analysis showed equitable impact of MAVE evidence codes but inequitable impact of allele frequency (p = 7.47e-06) and computational predictor (p = 6.92e-05) evidence codes for individuals of non-European-like genetic ancestry. Conclusions: Generation of saturation-style MAVE data should be a priority to reduce VUS disparities and produce equitable training data for future computational predictors.

7.
F1000Res ; 11: 530, 2022.
Article in English | MEDLINE | ID: mdl-36262335

ABSTRACT

In October 2021, 59 scientists from 14 countries and 13 U.S. states collaborated virtually in the Third Annual Baylor College of Medicine & DNAnexus Structural Variation hackathon. The goal of the hackathon was to advance research on structural variants (SVs) by prototyping and iterating on open-source software. This led to nine hackathon projects focused on diverse genomics research interests, including various SV discovery and genotyping methods, SV sequence reconstruction, and clinically relevant structural variation, including SARS-CoV-2 variants. Repositories for the projects that participated in the hackathon are available at https://github.com/collaborativebioinformatics.


Subjects
COVID-19; SARS-CoV-2; Humans; SARS-CoV-2/genetics; Genomics; Software
8.
F1000Res ; 10: 246, 2021.
Article in English | MEDLINE | ID: mdl-34621504

ABSTRACT

In October 2020, 62 scientists from nine nations worked together remotely in the Second Baylor College of Medicine & DNAnexus hackathon, focusing on related topics in structural variation, pan-genomes, and SARS-CoV-2 research. The overarching focus was to assess the current status of the field, identify the remaining challenges, and determine how to combine the strengths of the different interests to drive research and method development forward. Over the four days, eight groups each designed and developed new open-source methods to improve the identification and analysis of variations among species, including humans and SARS-CoV-2. These included improvements in SV calling, genotyping, annotation, and filtering, together with advancements in benchmarking existing methods. In addition, groups focused on the diversity of SARS-CoV-2. Daily discussion summaries and methods are publicly available at https://github.com/collaborativebioinformatics and provide valuable insights for both participants and the research community.


Subjects
COVID-19; SARS-CoV-2; Animals; Genome, Viral; Humans; Vertebrates
9.
Genome Biol ; 21(1): 15, 2020 01 23.
Article in English | MEDLINE | ID: mdl-31969194

ABSTRACT

BACKGROUND: Arthropods comprise the largest and most diverse phylum on Earth and play vital roles in nearly every ecosystem. Their diversity stems in part from variations on a conserved body plan, resulting from and recorded in adaptive changes in the genome. Dissection of the genomic record of sequence change enables broad questions regarding genome evolution to be addressed, even across hyper-diverse taxa within arthropods. RESULTS: Using 76 whole genome sequences representing 21 orders spanning more than 500 million years of arthropod evolution, we document changes in gene and protein domain content and provide temporal and phylogenetic context for interpreting these innovations. We identify many novel gene families that arose early in the evolution of arthropods and during the diversification of insects into modern orders. We reveal unexpected variation in patterns of DNA methylation across arthropods and examples of gene family and protein domain evolution coincident with the appearance of notable phenotypic and physiological adaptations such as flight, metamorphosis, sociality, and chemoperception. CONCLUSIONS: These analyses demonstrate how large-scale comparative genomics can provide broad new insights into the genotype to phenotype map and generate testable hypotheses about the evolution of animal diversity.


Subjects
Arthropods/genetics; Evolution, Molecular; Animals; Arthropods/classification; DNA Methylation; Genetic Speciation; Genetic Variation; Phylogeny
11.
Obesity (Silver Spring) ; 25(7): 1270-1276, 2017 07.
Article in English | MEDLINE | ID: mdl-28508493

ABSTRACT

OBJECTIVE: To perform whole exome sequencing in 928 Hispanic children and identify variants and genes associated with childhood obesity. METHODS: Single-nucleotide variants (SNVs) were identified from Illumina whole exome sequencing data using integrated read mapping, variant calling, and an annotation pipeline (Mercury). Association analyses of 74 obesity-related traits and exonic variants were performed using SeqMeta software. Rare autosomal variants were analyzed using gene-based association analyses, and common autosomal variants were analyzed at the SNV level. RESULTS: (1) Rare exonic variants in 10 genes and 16 common SNVs in 11 genes that were associated with obesity traits in a cohort of Hispanic children were identified, (2) novel rare variants in peroxisome biogenesis factor 1 (PEX1) associated with several obesity traits (weight, weight z score, BMI, BMI z score, waist circumference, fat mass, trunk fat mass) were discovered, and (3) previously reported SNVs associated with childhood obesity were replicated. CONCLUSIONS: Convergence of whole exome sequencing, a family-based design, and extensive phenotyping discovered novel rare and common variants associated with childhood obesity. Linking PEX1 to obesity phenotypes poses a novel mechanism of peroxisomal biogenesis and metabolism underlying the development of childhood obesity.


Subjects
Exome; Genetic Loci; Hispanic or Latino/genetics; Pediatric Obesity/genetics; Sequence Analysis, DNA; ATPases Associated with Diverse Cellular Activities/genetics; ATPases Associated with Diverse Cellular Activities/metabolism; Adolescent; Body Mass Index; Body Weight; Child; Child, Preschool; Cohort Studies; Genome-Wide Association Study; Humans; Membrane Proteins/genetics; Membrane Proteins/metabolism; Pediatric Obesity/ethnology; Polymorphism, Single Nucleotide; Risk Factors; Software; Waist Circumference; Young Adult
12.
Insect Biochem Mol Biol ; 76: 118-147, 2016 09.
Article in English | MEDLINE | ID: mdl-27522922

ABSTRACT

Manduca sexta, known as the tobacco hornworm or Carolina sphinx moth, is a lepidopteran insect that is used extensively as a model system for research in insect biochemistry, physiology, neurobiology, development, and immunity. One important benefit of this species as an experimental model is its extremely large size, reaching more than 10 g in the larval stage. M. sexta larvae feed on solanaceous plants and thus must tolerate a substantial challenge from plant allelochemicals, including nicotine. We report the sequence and annotation of the M. sexta genome, and a survey of gene expression in various tissues and developmental stages. The Msex_1.0 genome assembly resulted in a total genome size of 419.4 Mbp. Repetitive sequences accounted for 25.8% of the assembled genome. The official gene set comprises 15,451 protein-coding genes, of which 2498 were manually curated. Extensive RNA-seq data from many tissues and developmental stages were used to improve gene models and for insights into gene expression patterns. Genome-wide synteny analysis indicated a high level of macrosynteny in the Lepidoptera. Annotation and analyses were carried out for gene families involved in a wide spectrum of biological processes, including apoptosis, vacuole sorting, growth and development, structures of exoskeleton, egg shells, and muscle, vision, chemosensation, ion channels, signal transduction, neuropeptide signaling, neurotransmitter synthesis and transport, nicotine tolerance, lipid metabolism, and immunity. This genome sequence, annotation, and analysis provide an important new resource from a well-studied model insect species and will facilitate further biochemical and mechanistic experimental studies of many biological systems in insects.


Subjects
Gene Expression; Genome, Insect; Manduca/genetics; Animals; Gene Expression Profiling; Larva/genetics; Larva/growth & development; Manduca/growth & development; Pupa/genetics; Pupa/growth & development; Sequence Analysis, DNA; Synteny
13.
Curr Biol ; 25(5): 613-20, 2015 Mar 02.
Article in English | MEDLINE | ID: mdl-25660540

ABSTRACT

Gall-forming arthropods are highly specialized herbivores that, in combination with their hosts, produce extended phenotypes with unique morphologies [1]. Many are economically important, and others have improved our understanding of ecology and adaptive radiation [2]. However, the mechanisms that these arthropods use to induce plant galls are poorly understood. We sequenced the genome of the Hessian fly (Mayetiola destructor; Diptera: Cecidomyiidae), a plant parasitic gall midge and a pest of wheat (Triticum spp.), with the aim of identifying genic modifications that contribute to its plant-parasitic lifestyle. Among several adaptive modifications, we discovered an expansive reservoir of potential effector proteins. Nearly 5% of the 20,163 predicted gene models matched putative effector gene transcripts present in the M. destructor larval salivary gland. Another 466 putative effectors were discovered among the genes that have no sequence similarities in other organisms. The largest known arthropod gene family (family SSGP-71) was also discovered within the effector reservoir. SSGP-71 proteins lack sequence homologies to other proteins, but their structures resemble both ubiquitin E3 ligases in plants and E3-ligase-mimicking effectors in plant pathogenic bacteria. SSGP-71 proteins and wheat Skp proteins interact in vivo. Mutations in different SSGP-71 genes avoid the effector-triggered immunity that is directed by the wheat resistance genes H6 and H9. Results point to effectors as the agents responsible for arthropod-induced plant gall formation.


Subjects
Chromosomes/genetics; Diptera/genetics; Multigene Family/genetics; Phylogeny; Plant Tumors/genetics; Triticum/parasitology; Adaptation, Biological/genetics; Amino Acid Sequence; Animals; Base Sequence; Diptera/metabolism; Larva/metabolism; Models, Genetic; Molecular Sequence Data; Sequence Analysis, DNA; Sequence Homology; Sexual Behavior, Animal/physiology; Two-Hybrid System Techniques; Ubiquitin-Protein Ligases/genetics
14.
Science ; 344(6188): 1168-1173, 2014 Jun 06.
Article in English | MEDLINE | ID: mdl-24904168

ABSTRACT

Sheep (Ovis aries) are a major source of meat, milk, and fiber in the form of wool and represent a distinct class of animals that have a specialized digestive organ, the rumen, that carries out the initial digestion of plant material. We have developed and analyzed a high-quality reference sheep genome and transcriptomes from 40 different tissues. We identified highly expressed genes encoding keratin cross-linking proteins associated with rumen evolution. We also identified genes involved in lipid metabolism that had been amplified and/or had altered tissue expression patterns. This may be in response to changes in the barrier lipids of the skin, an interaction between lipid metabolism and wool synthesis, and an increased role of volatile fatty acids in ruminants compared with nonruminant animals.


Subjects
Lipid Metabolism/physiology; Rumen/physiology; Sheep, Domestic/genetics; Sheep, Domestic/metabolism; Amino Acid Sequence; Animals; Fatty Acids, Volatile/metabolism; Fatty Acids, Volatile/physiology; Gene Expression Regulation; Genome; Keratins, Hair-Specific/genetics; Lipid Metabolism/genetics; Molecular Sequence Data; Phylogeny; Rumen/metabolism; Sheep, Domestic/classification; Transcriptome; Wool/growth & development