Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
PLoS Genet ; 16(7): e1008903, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32678846

RESUMO

Genome wide association studies (GWAS) of human diseases have generally identified many loci associated with risk with relatively small effect sizes. The omnigenic model attempts to explain this observation by suggesting that diseases can be thought of as networks, where genes with direct involvement in disease-relevant biological pathways are named 'core genes', while peripheral genes influence disease risk via their interactions or regulatory effects on core genes. Here, we demonstrate a method for identifying candidate core genes solely from genes in or near disease-associated SNPs (GWAS hits) in conjunction with protein-protein interaction network data. Applied to 1,381 GWAS studies from 5 ancestries, we identify a total of 1,865 candidate core genes in 343 GWAS studies. Our analysis identifies several well-known disease-related genes that are not identified by GWAS, including BRCA1 in Breast Cancer, Amyloid Precursor Protein (APP) in Alzheimer's Disease, INS in A1C measurement and Type 2 Diabetes, and PCSK9 in LDL cholesterol, amongst others. Notably candidate core genes are preferentially enriched for disease relevance over GWAS hits and are enriched for both Clinvar pathogenic variants and known drug targets-consistent with the predictions of the omnigenic model. We subsequently use parent term annotations provided by the GWAS catalog, to merge related GWAS studies and identify candidate core genes in over-arching disease processes such as cancer-where we identify 109 candidate core genes.


Assuntos
Doença de Alzheimer/genética , Neoplasias da Mama/genética , Diabetes Mellitus Tipo 2/genética , Estudo de Associação Genômica Ampla , Mapas de Interação de Proteínas/genética , Doença de Alzheimer/patologia , Precursor de Proteína beta-Amiloide/genética , Proteína BRCA1/genética , Neoplasias da Mama/patologia , Diabetes Mellitus Tipo 2/patologia , Feminino , Humanos , Insulina/genética , Polimorfismo de Nucleotídeo Único/genética , Pró-Proteína Convertase 9/genética , Fatores de Risco
2.
Nature ; 495(7441): 360-4, 2013 Mar 21.
Artigo em Inglês | MEDLINE | ID: mdl-23354050

RESUMO

The domestication of dogs was an important episode in the development of human civilization. The precise timing and location of this event is debated and little is known about the genetic changes that accompanied the transformation of ancient wolves into domestic dogs. Here we conduct whole-genome resequencing of dogs and wolves to identify 3.8 million genetic variants used to identify 36 genomic regions that probably represent targets for selection during dog domestication. Nineteen of these regions contain genes important in brain function, eight of which belong to nervous system development pathways and potentially underlie behavioural changes central to dog domestication. Ten genes with key roles in starch digestion and fat metabolism also show signals of selection. We identify candidate mutations in key genes and provide functional support for an increased starch digestion in dogs relative to wolves. Our results indicate that novel adaptations allowing the early ancestors of modern dogs to thrive on a diet rich in starch, relative to the carnivorous diet of wolves, constituted a crucial step in the early domestication of dogs.


Assuntos
Animais Domésticos/genética , Dieta/veterinária , Cães/genética , Genoma/genética , Amido , Amilases/genética , Animais , Doença de Depósito de Glicogênio Tipo II , Mutação , Lobos/genética , alfa-Glucosidases/genética
3.
Genome Res ; 22(1): 51-63, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22006216

RESUMO

Analysis of diverse eukaryotes has revealed that recombination events cluster in discrete genomic locations known as hotspots. In humans, a zinc-finger protein, PRDM9, is believed to initiate recombination in >40% of hotspots by binding to a specific DNA sequence motif. However, the PRDM9 coding sequence is disrupted in the dog genome assembly, raising questions regarding the nature and control of recombination in dogs. By analyzing the sequences of PRDM9 orthologs in a number of dog breeds and several carnivores, we show here that this gene was inactivated early in canid evolution. We next use patterns of linkage disequilibrium using more than 170,000 SNP markers typed in almost 500 dogs to estimate the recombination rates in the dog genome using a coalescent-based approach. Broad-scale recombination rates show good correspondence with an existing linkage-based map. Significant variation in recombination rate is observed on the fine scale, and we are able to detect over 4000 recombination hotspots with high confidence. In contrast to human hotspots, 40% of canine hotspots are characterized by a distinct peak in GC content. A comparative genomic analysis indicates that these peaks are present also as weaker peaks in the panda, suggesting that the hotspots have been continually reinforced by accelerated and strongly GC biased nucleotide substitutions, consistent with the long-term action of biased gene conversion on the dog lineage. These results are consistent with the loss of PRDM9 in canids, resulting in a greater evolutionary stability of recombination hotspots. The genetic determinants of recombination hotspots in the dog genome may thus reflect a fundamental process of relevance to diverse animal species.


Assuntos
Evolução Molecular , Genoma/fisiologia , Instabilidade Genômica/fisiologia , Histona-Lisina N-Metiltransferase/genética , Polimorfismo de Nucleotídeo Único , Recombinação Genética/fisiologia , Animais , Cães , Sequência Rica em GC/fisiologia , Humanos
4.
PLoS Genet ; 7(10): e1002316, 2011 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22022279

RESUMO

The extraordinary phenotypic diversity of dog breeds has been sculpted by a unique population history accompanied by selection for novel and desirable traits. Here we perform a comprehensive analysis using multiple test statistics to identify regions under selection in 509 dogs from 46 diverse breeds using a newly developed high-density genotyping array consisting of >170,000 evenly spaced SNPs. We first identify 44 genomic regions exhibiting extreme differentiation across multiple breeds. Genetic variation in these regions correlates with variation in several phenotypic traits that vary between breeds, and we identify novel associations with both morphological and behavioral traits. We next scan the genome for signatures of selective sweeps in single breeds, characterized by long regions of reduced heterozygosity and fixation of extended haplotypes. These scans identify hundreds of regions, including 22 blocks of homozygosity longer than one megabase in certain breeds. Candidate selection loci are strongly enriched for developmental genes. We chose one highly differentiated region, associated with body size and ear morphology, and characterized it using high-throughput sequencing to provide a list of variants that may directly affect these traits. This study provides a catalogue of genomic regions showing extreme reduction in genetic variation or population differentiation in dogs, including many linked to phenotypic variation. The many blocks of reduced haplotype diversity observed across the genome in dog breeds are the result of both selection and genetic drift, but extended blocks of homozygosity on a megabase scale appear to be best explained by selection. Further elucidation of the variants under selection will help to uncover the genetic basis of complex traits and disease.


Assuntos
Comportamento Animal , Cruzamento , Cães/genética , Variação Genética/genética , Seleção Genética , Animais , Tamanho Corporal/genética , Cães/anatomia & histologia , Orelha/anatomia & histologia , Estudo de Associação Genômica Ampla , Técnicas de Genotipagem , Haplótipos , Heterozigoto , Homozigoto , Fenótipo , Filogenia , Polimorfismo de Nucleotídeo Único
5.
Nat Commun ; 15(1): 7776, 2024 Sep 05.
Artigo em Inglês | MEDLINE | ID: mdl-39237529

RESUMO

Collisions of the transcription and replication machineries on the same DNA strand can pose a significant threat to genomic stability. These collisions occur in part due to the formation of RNA-DNA hybrids termed R-loops, in which a newly transcribed RNA molecule hybridizes with the DNA template strand. This study investigated the role of RAD52, a known DNA repair factor, in preventing collisions by directing R-loop formation and resolution. We show that RAD52 deficiency increases R-loop accumulation, exacerbating collisions and resulting in elevated DNA damage. Furthermore, RAD52's ability to interact with the transcription machinery, coupled with its capacity to facilitate R-loop dissolution, highlights its role in preventing collisions. Lastly, we provide evidence of an increased mutational burden from double-strand breaks at conserved R-loop sites in human tumor samples, which is increased in tumors with low RAD52 expression. In summary, this study underscores the importance of RAD52 in orchestrating the balance between replication and transcription processes to prevent collisions and maintain genome stability.


Assuntos
Replicação do DNA , Instabilidade Genômica , Estruturas R-Loop , Proteína Rad52 de Recombinação e Reparo de DNA , Transcrição Gênica , Proteína Rad52 de Recombinação e Reparo de DNA/metabolismo , Proteína Rad52 de Recombinação e Reparo de DNA/genética , Replicação do DNA/genética , Estruturas R-Loop/genética , Humanos , Dano ao DNA , Quebras de DNA de Cadeia Dupla , DNA/metabolismo , DNA/genética , Reparo do DNA , Mutação , Neoplasias/genética , Neoplasias/metabolismo
6.
J Natl Cancer Inst ; 113(6): 742-751, 2021 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-33429428

RESUMO

BACKGROUND: Patients with human papillomavirus-related oropharyngeal cancers have excellent outcomes but experience clinically significant toxicities when treated with standard chemoradiotherapy (70 Gy). We hypothesized that functional imaging could identify patients who could be safely deescalated to 30 Gy of radiotherapy. METHODS: In 19 patients, pre- and intratreatment dynamic fluorine-18-labeled fluoromisonidazole positron emission tomography (PET) was used to assess tumor hypoxia. Patients without hypoxia at baseline or intratreatment received 30 Gy; patients with persistent hypoxia received 70 Gy. Neck dissection was performed at 4 months in deescalated patients to assess pathologic response. Magnetic resonance imaging (weekly), circulating plasma cell-free DNA, RNA-sequencing, and whole-genome sequencing (WGS) were performed to identify potential molecular determinants of response. Samples from an independent prospective study were obtained to reproduce molecular findings. All statistical tests were 2-sided. RESULTS: Fifteen of 19 patients had no hypoxia on baseline PET or resolution on intratreatment PET and were deescalated to 30 Gy. Of these 15 patients, 11 had a pathologic complete response. Two-year locoregional control and overall survival were 94.4% (95% confidence interval = 84.4% to 100%) and 94.7% (95% confidence interval = 85.2% to 100%), respectively. No acute grade 3 radiation-related toxicities were observed. Microenvironmental features on serial imaging correlated better with pathologic response than tumor burden metrics or circulating plasma cell-free DNA. A WGS-based DNA repair defect was associated with response (P = .02) and was reproduced in an independent cohort (P = .03). CONCLUSIONS: Deescalation of radiotherapy to 30 Gy on the basis of intratreatment hypoxia imaging was feasible, safe, and associated with minimal toxicity. A DNA repair defect identified by WGS was predictive of response. Intratherapy personalization of chemoradiotherapy may facilitate marked deescalation of radiotherapy.


Assuntos
Neoplasias Orofaríngeas , Quimiorradioterapia/métodos , Humanos , Neoplasias Orofaríngeas/radioterapia , Tomografia por Emissão de Pósitrons , Estudos Prospectivos , Dosagem Radioterapêutica , Hipóxia Tumoral
7.
BMC Genomics ; 11: 458, 2010 Aug 03.
Artigo em Inglês | MEDLINE | ID: mdl-20678236

RESUMO

BACKGROUND: The advent of cheap high through-put sequencing methods has facilitated low coverage skims of a large number of organisms. To maximise the utility of the sequences, assembly into contigs and then ordering of those contigs is required. Whilst sequences can be assembled into contigs de novo, using assembled genomes of closely related organisms as a framework can considerably aid the process. However, the preferred search programs and parameters that will optimise the sensitivity and specificity of the alignments between the sequence reads and the framework genome(s) are not necessarily obvious. Here we demonstrate a process that uses paired-end sequence reads to choose an optimal program and alignment parameters. RESULTS: Unlike two single fragment reads, in paired-end sequence reads, such as BAC-end sequences, the two sequences in the pair have a known positional relationship in the original genome. This provides an additional level of confidence over match scores and e-values in the accuracy of the positional assignment of the reads in the comparative genome. Three commonly used sequence alignment programs: MegaBLAST, Blastz and PatternHunter were used to align a set of ovine BAC-end sequences against the equine genome assembly. A range of different search parameters, with a particular focus on contiguous and discontiguous seeds, were used for each program. The number of reads with a hit and the number of read pairs with hits for the two end sequences in the tail-to-tail paired-end configuration were plotted relative to the theoretical maximum expected curve. Of the programs tested, MegaBLAST with short contiguous seed lengths (word size 8-11) performed best in this particular task. In addition the data also provides estimates of the false positive and false negative rates, which can be used to determine the appropriate values of additional parameters, such as score cut-off, to balance sensitivity and specificity. To determine whether the approach also worked for the alignment of shorter reads, the first 240 bases of each BAC end sequence were also aligned to the equine genome. Again, contiguous MegaBLAST performed the best in optimising the sensitivity and specificity with which sheep BAC end reads map to the equine and bovine genomes. CONCLUSIONS: Paired-end reads, such as BAC-end sequences, provide an efficient mechanism to optimise sequence alignment parameters, for example for comparative genome assemblies, by providing an objective standard to evaluate performance.


Assuntos
Genoma , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Animais , Sequência de Bases , Bovinos , Ovinos , Software
8.
BMC Genomics ; 10: 46, 2009 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-19166603

RESUMO

BACKGROUND: In large genomics projects involving many different types of analyses of bacterial artificial chromosomes (BACs), such as fingerprinting, end sequencing (BES) and full BAC sequencing there are many opportunities for the identities of BACs to become confused. However, by comparing the results from the different analyses, inconsistencies can be identified and a set of high integrity BACs preferred for future research can be defined. RESULTS: The location of each bovine BAC in the BAC fingerprint-based genome map and in the genome assembly were compared based on the reported BESs, and for a smaller number of BACs the full sequence. BACs with consistent positions in all three datasets, or if the full sequence was not available, for both the fingerprint map and BES-based alignments, were deemed to be correctly positioned. BACs with consistent BES-based and fingerprint-based locations, but with conflicting locations based on the fully sequenced BAC, appeared to have been misidentified during sequencing, and included a number of apparently swapped BACs. Inconsistencies between BES-based and fingerprint map positions identified thirty one plates from the CHORI-240 library that appear to have suffered substantial systematic problems during the end-sequencing of the BACs. No systematic problems were identified in the fingerprinting of the BACs. Analysis of BACs overlapping in the assembly identified a small overrepresentation of clones with substantial overlap in the library and a substantial enrichment of highly overlapping BACs on the same plate in the CHORI-240 library. More than half of these BACs appear to have been present as duplicates on the original BAC-library plates and thus should be avoided in subsequent projects. CONCLUSION: Our analysis shows that approximately 95% of the bovine CHORI-240 library clones with both a BAC fingerprint and two BESs mapping to the genome in the expected orientations (approximately 27% of all BACs) have consistent locations in the BAC fingerprint map and the genome assembly. We have developed a broadly applicable methodology for checking the integrity of BAC-based datasets even where only incomplete and partially assembled genomic sequence is available.


Assuntos
Bovinos/genética , Cromossomos Artificiais Bacterianos/genética , Genoma , Genômica/métodos , Animais , Mapeamento Cromossômico , Impressões Digitais de DNA , Biblioteca Gênica , Marcadores Genéticos , Análise de Sequência de DNA
9.
Alzheimers Dement (N Y) ; 5: 906-917, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31890855

RESUMO

INTRODUCTION: Women are at increased risk for Alzheimer's disease (AD), but the reason why remains unknown. One hypothesis is that low estrogen levels at menopause increases vulnerability to AD, but this remains unproven. METHODS: We compared neuronal genes upregulated by estrogen in ovariectomized female rhesus macaques with a database of >17,000 diverse gene sets and applied a rare variant burden test to exome sequencing data from 1208 female AD patients with the age of onset < 75 years and 2162 female AD controls. RESULTS: We found a striking overlap between genes upregulated by estrogen in macaques and genes downregulated in the human postmortem AD brain, and we found that estrogen upregulates the APOE gene and that progesterone acts antagonistically to estrogen genome-wide. We also found that female patients with AD have excess rare mutations in the early menopause gene MCM8. DISCUSSION: We show with genomic data that the menopausal loss of estrogen could underlie the increased risk for AD in women.

10.
Bioinformatics ; 22(19): 2396-404, 2006 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-16864591

RESUMO

MOTIVATION: Biological differences between classes are reflected in transcriptional changes which in turn affect the levels by which essential genes are individually expressed and collectively connected. The purpose of this communication is to introduce an analytical procedure to simultaneously identify genes that are differentially expressed (DE) as well as differentially connected (DC) in two or more classes of interest. RESULTS: Our procedure is based on a two-step approach: First, mixed-model equations are applied to obtain the normalized expression levels of each gene in each class treatment. These normalized expressions form the basis to compute a measure of (possible) DE as well as the correlation structure existing among genes. Second, a two-component mixture of bi-variate distributions is fitted to identify the component that encapsulates those genes that are DE and/or DC. We demonstrate our approach using three distinct datasets including a human systemic inflammation oligonucleotide data; a spotted cDNA data dealing with bovine in vitro adipogenesis and SAGE database on cancerous and normal tissue samples.


Assuntos
Adipogenia/fisiologia , Perfilação da Expressão Gênica/métodos , Inflamação/metabolismo , Família Multigênica/fisiologia , Neoplasias/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Proteínas/metabolismo , Animais , Biomarcadores/análise , Biomarcadores/metabolismo , Bovinos , Simulação por Computador , Humanos , Modelos Biológicos , Modelos Estatísticos , Proteínas/análise
11.
BMC Res Notes ; 3: 334, 2010 Dec 13.
Artigo em Inglês | MEDLINE | ID: mdl-21144002

RESUMO

BACKGROUND: The sheep CHORI-243 bacterial artificial chromosome (BAC) library is being used in the construction of the virtual sheep genome, the sequencing and construction of the actual sheep genome assembly and as a source of DNA for regions of the genome of biological interest. The objective of our study is to assess the integrity of the clones and plates which make up the CHORI-243 library using the virtual sheep genome. FINDINGS: A series of analyses were undertaken based on the mapping the sheep BAC-end sequences (BESs) to the virtual sheep genome. Overall, very few plate specific biases were identified, with only three of the 528 plates in the library significantly affected. The analysis of the number of tail-to-tail (concordant) BACs on the plates identified a number of plates with lower than average numbers of such BACs. For plates 198 and 213 a partial swap of the BESs determined with one of the two primers appear to have occurred. A third plate, 341, also with a significant deficit in tail-to-tail BACs, appeared to contain a substantial number of sequences determined from contaminating eubacterial 16 S rRNA DNA. Additionally a small number of eubacterial 16 S rRNA DNA sequences were present on two other plates, 111 and 338, in the library. CONCLUSIONS: The comparative genomic approach can be used to assess BAC library integrity in the absence of fingerprinting. The sequences of the sheep CHORI-243 library BACs have high integrity, especially with the corrections detailed above. The library represents a high quality resource for use by the sheep genomics community.

12.
Philos Trans R Soc Lond B Biol Sci ; 365(1552): 2571-80, 2010 Aug 27.
Artigo em Inglês | MEDLINE | ID: mdl-20643747

RESUMO

The identification of loci influenced by positive selection is a major goal of evolutionary genetics. A popular approach is to perform scans of alignments on a genome-wide scale in order to find regions evolving at accelerated rates on a particular branch of a phylogenetic tree. However, positive selection is not the only process that can lead to accelerated evolution. Notably, GC-biased gene conversion (gBGC) is a recombination-associated process that results in the biased fixation of G and C nucleotides. This process can potentially generate bursts of nucleotide substitutions within hotspots of meiotic recombination. Here, we analyse the results of a scan for positive selection on genes on branches across the primate phylogeny. We show that genes identified as targets of positive selection have a significant tendency to exhibit the genomic signature of gBGC. Using a maximum-likelihood framework, we estimate that more than 20 per cent of cases of significantly elevated non-synonymous to synonymous substitution rates ratio (d(N)/d(S)), particularly in shorter branches, could be due to gBGC. We demonstrate that in some cases, gBGC can lead to very high d(N)/d(S) (more than 2). Our results indicate that gBGC significantly affects the evolution of coding sequences in primates, often leading to patterns of evolution that can be mistaken for positive selection.


Assuntos
Evolução Molecular , Conversão Gênica/genética , Genoma/genética , Filogenia , Primatas/genética , Seleção Genética , Animais , Sequência de Bases , Humanos , Funções Verossimilhança , Modelos Genéticos , Recombinação Genética/genética , Alinhamento de Sequência
13.
Genome Biol ; 8(7): R152, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-17663790

RESUMO

BACKGROUND: Is it possible to construct an accurate and detailed subgene-level map of a genome using bacterial artificial chromosome (BAC) end sequences, a sparse marker map, and the sequences of other genomes? RESULTS: A sheep BAC library, CHORI-243, was constructed and the BAC end sequences were determined and mapped with high sensitivity and low specificity onto the frameworks of the human, dog, and cow genomes. To maximize genome coverage, the coordinates of all BAC end sequence hits to the cow and dog genomes were also converted to the equivalent human genome coordinates. The 84,624 sheep BACs (about 5.4-fold genome coverage) with paired ends in the correct orientation (tail-to-tail) and spacing, combined with information from sheep BAC comparative genome contigs (CGCs) built separately on the dog and cow genomes, were used to construct 1,172 sheep BAC-CGCs, covering 91.2% of the human genome. Clustered non-tail-to-tail and outsize BACs located close to the ends of many BAC-CGCs linked BAC-CGCs covering about 70% of the genome to at least one other BAC-CGC on the same chromosome. Using the BAC-CGCs, the intrachromosomal and interchromosomal BAC-CGC linkage information, human/cow and vertebrate synteny, and the sheep marker map, a virtual sheep genome was constructed. To identify BACs potentially located in gaps between BAC-CGCs, an additional set of 55,668 sheep BACs were positioned on the sheep genome with lower confidence. A coordinate conversion process allowed us to transfer human genes and other genome features to the virtual sheep genome to display on a sheep genome browser. CONCLUSION: We demonstrate that limited sequencing of BACs combined with positioning on a well assembled genome and integrating locations from other less well assembled genomes can yield extensive, detailed subgene-level maps of mammalian genomes, for which genomic resources are currently limited.


Assuntos
Genoma , Genômica , Mapeamento Físico do Cromossomo , Carneiro Doméstico/genética , Animais , Sequência de Bases , Bovinos , Cromossomos Artificiais Bacterianos , Cães , Biblioteca Gênica , Genoma Humano , Humanos , Dados de Sequência Molecular , Análise de Sequência de DNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA