RESUMO
BACKGROUND: Indigenous Chinese cattle have abundant genetic diversity and a long history of artificial selection, giving local breeds advantages in adaptability, forage tolerance and resistance. The detection of selective sweeps and comparative genome analysis of selected breeds and ancestral populations provide a basis for understanding differences among breeds and for the identification and utilization of candidate genes. We investigated genetic diversity, population structure, and signatures of selection using genome-wide sequencing data for a new breed of Qinchuan cattle (QNC, n = 21), ancestral Qinchuan cattle (QCC, n = 20), and Zaosheng cattle (ZSC, n = 19). RESULTS: A population structure analysis showed that the ancestry components of QNC and ZSC were similar. In addition, the QNC and ZSC groups showed higher proportions of European taurine ancestry than that of QCC, and this may explain the larger body size of QNC, approaching that of European cattle under long-term domestication and selection. A neighbor-joining tree revealed that QCC individuals were closely related, whereas QNC formed a distinct group. To search for signatures of selection in the QNC genome, we evaluated nucleotide diversity (θπ), the fixation index (FST) and Tajima's D. Overlapping selective sweeps were enriched for one KEGG pathway, the apelin signaling pathway, and included five candidate genes (MEF2A, SMAD2, CAMK4, RPS6, and PIK3CG). We performed a comprehensive review of genomic variants in QNC, QCC, and ZSC using whole-genome sequencing data. QCC was rich in novel genetic diversity, while diversity in QNC and ZSC cattle was reduced due to strong artificial selection, with divergence from the original cattle. CONCLUSIONS: We identified candidate genes associated with production traits. These results support the success of selective breeding and can guide further breeding and resource conservation of Qinchuan cattle.
Assuntos
Variação Genética , Seleção Genética , Animais , Bovinos/genética , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Genética Populacional , Estudo de Associação Genômica Ampla , Genoma , CruzamentoRESUMO
Ribonucleotides are the most abundant non-canonical nucleotides in the genome. Their vast presence and influence over genome biology is becoming increasingly appreciated. Here we review the recent progress made in understanding their genomic presence, incorporation characteristics and usefulness as biomarkers for polymerase enzymology. We also discuss ribonucleotide processing, the genetic consequences of unrepaired ribonucleotides in DNA and evidence supporting the significance of their transient presence in the nuclear genome.
Assuntos
Replicação do DNA/genética , DNA/genética , DNA/metabolismo , Ribonucleotídeos/genética , Ribonucleotídeos/metabolismo , Animais , Biomarcadores/metabolismo , Núcleo Celular/metabolismo , Reparo do DNA/genética , DNA Polimerase Dirigida por DNA/metabolismo , Genoma Mitocondrial , Instabilidade Genômica , Humanos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismoRESUMO
Increasing use of unbiased genomic sequencing in critically ill infants can expand understanding of rare diseases such as Kabuki syndrome (KS). Infants diagnosed with KS through genome-wide sequencing performed during the initial hospitalization underwent retrospective review of medical records. Human phenotype ontology terms used in genomic analysis were aggregated and analyzed. Clinicians were surveyed regarding changes in management and other care changes. Fifteen infants met inclusion criteria. KS was not suspected prior to genomic sequencing. Variants were classified as Pathogenic (n = 10) or Likely Pathogenic (n = 5) by American College of Medical Genetics and Genomics Guidelines. Fourteen variants were de novo (KMT2D, n = 12, KDM6A, n = 2). One infant inherited a likely pathogenic variant in KMT2D from an affected father. Frequent findings involved cardiovascular (14/15) and renal (7/15) systems, with palatal defects also identified (6/15). Three infants had non-immune hydrops. No minor anomalies were universally documented; ear anomalies, micrognathia, redundant nuchal skin, and hypoplastic nails were common. Changes in management were reported in 14 infants. Early use of unbiased genome-wide sequencing enabled a molecular diagnosis prior to clinical recognition including infants with atypical or rarely reported features of KS while also expanding the phenotypic spectrum of this rare disorder.
Assuntos
Anormalidades Múltiplas , Doenças Hematológicas , Doenças Vestibulares , Gravidez , Feminino , Humanos , Lactente , Anormalidades Múltiplas/genética , Face/anormalidades , Doenças Hematológicas/genética , Doenças Vestibulares/genética , Fenótipo , Histona Desmetilases/genéticaRESUMO
OBJECTIVES: To explore the molecular characteristics of Staphylococcus aureus (S. aureus) in children, and to compare the molecular characteristics of different types of strains (infection and colonization strains) so as to reveal pathogenic molecular markers of S. aureus. METHODS: A cross-sectional study design was used to conduct nasopharyngeal swab sampling from healthy children in the community and clinical samples from infected children in the hospital. Whole genome sequencing was used to detect antibiotic resistance genes and virulence genes. A random forest method to used to screen pathogenic markers. RESULTS: A total of 512 S. aureus strains were detected, including 272 infection strains and 240 colonization strains. For virulence genes, the carrying rates of enterotoxin genes (seb and sep), extracellular enzyme coding genes (splA, splB, splE and edinC), leukocytotoxin genes (lukD, lukE, lukF-PV and lukS-PV) and epidermal exfoliating genes (eta and etb) in infection strains were higher than those in colonization strains. But the carrying rates of enterotoxin genes (sec, sec3, seg, seh, sei, sel, sem, sen, seo and seu) were lower in infection strains than in colonization strains (P<0.05). For antibiotic resistance genes, the carrying rates of lnuA, lnuG, aadD, tetK and dfrG were significantly higher in infection strains than in colonization strains (P<0.05). The accuracy of cross-validation of the random forest model for screening pathogenic markers of S. aureus before and after screening was 69% and 68%, respectively, and the area under the curve was 0.75 and 0.70, respectively. The random forest model finally screened out 16 pathogenic markers (sem, etb, splE, sep, ser, mecA, lnuA, sea, blaZ, cat(pC233), blaTEm-1A, aph(3')-III, ermB, ermA, ant(9)-Ia and ant(6)-Ia). The top five variables in the variable importance ranking were sem (OR=0.40), etb (OR=3.95), splE (OR=1.68), sep (OR=3.97), and ser (OR=1.68). CONCLUSIONS: The random forest model can screen out pathogenic markers of S. aureus and exhibits a superior predictive performance, providing genetic evidence for tracing highly pathogenic S. aureus and conducting precise targeted interventions.
Assuntos
Infecções Estafilocócicas , Staphylococcus aureus , Criança , Humanos , Staphylococcus aureus/genética , Estudos Transversais , Enterotoxinas/genética , Sequenciamento Completo do GenomaRESUMO
Genome-wide sequencing (exome and whole genome) has transformed our ability to diagnose patients with suspected genetic disorders. Cerebral palsy (CP), although historically thought to be due to birth injury (perinatal hypoxia), represents a clinical spectrum of disorders, many of which have been attributed to a genetic cause. GWS has elucidated the underlying single gene cause for many patients with CP and has important implications for the customization of treatment, management, and genetic counseling. International guidelines recommend genetic counseling for all families considering genome-wide sequencing. Genetic counselors educate and support families and help them to make testing decisions based on their values. They can help families adapt to, and understand the implications of a genomic diagnosis. Here, we review advances in sequencing for CP, clinical features suggestive of a genetic etiology of CP, practice guidelines for GWS, and a practical approach to the genetic counseling of these families. This includes: the content to be addressed in pre-test and post-test genetic counseling sessions, the benefits of a establishing a genetic cause and importantly, the need for ongoing support.
Assuntos
Paralisia Cerebral , Aconselhamento Genético , Gravidez , Feminino , Humanos , Paralisia Cerebral/diagnóstico , Paralisia Cerebral/genética , Exoma , Genômica , Testes GenéticosRESUMO
PURPOSE: Genetic testing is routine practice for individuals with unexplained developmental disabilities and multiple congenital anomalies. However, current testing pathways can be costly and time consuming, and the diagnostic yield low. Genome-wide sequencing, including exome sequencing (ES) and genome sequencing (GS), can improve diagnosis, but at a higher cost. This study aimed to assess the cost-effectiveness of genome-wide sequencing in Ontario, Canada. METHODS: A cost-effectiveness analysis was conducted using a discrete event simulation from a public payer perspective. Six strategies involving ES or GS were compared. Outcomes reported were direct medical costs, number of molecular diagnoses, number of positive findings, and number of active treatment changes. RESULTS: If ES was used as a second-tier test (after the current first-tier, chromosomal microarray, fails to provide a diagnosis), it would be less costly and more effective than standard testing (CAN$6357 [95% CI: 6179-6520] vs. CAN$8783 per patient [95% CI: 2309-31,123]). If ES was used after standard testing, it would cost an additional CAN$15,228 to identify the genetic diagnosis for one additional patient compared with standard testing. The results remained robust when parameters and assumptions were varied. CONCLUSION: ES would likely be cost-saving if used earlier in the diagnostic pathway.
Assuntos
Anormalidades Múltiplas , Deficiências do Desenvolvimento , Criança , Análise Custo-Benefício , Deficiências do Desenvolvimento/diagnóstico , Deficiências do Desenvolvimento/genética , Humanos , Ontário , Sequenciamento do ExomaRESUMO
BACKGROUND: The use of genome-wide sequencing in pediatric medicine and research is growing exponentially. While this has many potential benefits, the normative and empirical literature has highlighted various ethical issues. There have not been, however, any systematic reviews of these issues. The aim of this systematic review is to determine systematically the spectrum of ethical issues that is raised for stakeholders in in pediatric genome-wide sequencing. METHODS: A systematic review in PubMed and Google Books (publications in English or German between 2004 and 2021) was conducted. Further references were identified via reference screening. Data were analyzed and synthesized using qualitative content analysis. Ethical issues were defined as arising when a relevant normative principle is not adequately considered or when two principles come into conflict. RESULTS: Our literature search retrieved 3175 publications of which 143 were included in the analysis. Together these mentioned 106 ethical issues in pediatric genome-wide sequencing, categorized into five themes along the pediatric genome-wide sequencing lifecycle. Most ethical issues identified in relation to genome-wide sequencing typically reflect ethical issues that arise in general genetic testing, but they are often amplified by the increased quantity of data obtained, and associated uncertainties. The most frequently discussed ethical aspects concern the issue of unsolicited findings. CONCLUSION: Concentration of the debate on unsolicited findings risks overlooking other ethical challenges. An overarching difficulty presents the terminological confusion: both with regard to both the test procedure/ the scope of analysis, as well as with the topic of unsolicited findings. It is important that the genetics and ethics communities together with other medical professions involved work jointly on specific case related guidelines to grant the maximum benefit for the care of the children, while preventing patient harm and disproportionate overload of clinicians and the healthcare system by the wealth of available options and economic incentives to increase testing.
Assuntos
Atenção à Saúde , Testes Genéticos , Criança , HumanosRESUMO
GenCOUNSEL is the largest genetic counseling research grant awarded to date and brings together experts in genetic counseling, genomics, law and policy, health services implementation, and health economics research. It is the first project of its kind to examine the genetic counseling issues associated with the clinical implementation of genome-wide sequencing (exome and genome sequencing). GenCOUNSEL is a Canadian-based, multi-method research study that takes place over a variety of sites, including non-clinical, clinical, and laboratory research sites and includes the training of undergraduate and graduate students. The COVID-19 pandemic will likely have a lasting impact on genetic counseling service delivery, research, and training. Almost every aspect of the GenCOUNSEL research project has been impacted by the COVID-19 pandemic. Here we describe how our research recruitment strategies, methods, resource allocation, and training capacity have been affected. We discuss ways that we have adapted to the pandemic including revision of our research methods and work to understand the barriers in order to optimize opportunities. We finish with take-home messages to fellow researchers highlighting the importance of resiliency in genetic counseling research.
Assuntos
COVID-19 , Aconselhamento Genético , Canadá , Humanos , Pandemias , SARS-CoV-2RESUMO
Genome-wide sequencing (GWS) is increasingly being used in neonatal intensive care units. While studies have explored its clinical utility, little is known about parental experiences with this testing post-return of results. We conducted a qualitative study, using an interpretive description framework and thematic analysis, to gain further insight into parents' perceptions of the value and utility of GWS for their infant. We sought to explore whether parents' perceptions differ if their child received a diagnosis or not, and whether their child is living or deceased. Semi-structured, telephone interviews were conducted with parents of infants who had rapid exome sequencing while in the neonatal intensive care unit at BC Women's Hospital in Vancouver, Canada. Interviews addressed perceived benefits and harms of GWS and included an evaluation of decisional regret. Parents of 27 probands were approached and 14 (52%; 13 mothers and 1 father) participated in interviews. On average, 26 months had elapsed from the time of results to the interview. Six themes were identified. Firstly, parents had a positive regard for GWS. The results of GWS helped provide context for their child's admission to the NICU, and all parents experienced relief following receiving the results. A diagnosis by GWS enabled parents to picture the future, form connections with other parents, and coordinate their child's care. Lastly, some parents experienced discomfort with the concept of a genetic diagnosis, and interestingly felt lack of a genomic diagnosis indicated a reduced severity of their infant's condition. Decisional regret post-results was found to be low. Our results highlight how parents cope with the results of GWS and suggest that a genetic counselor can have an important role in helping families understand and adjust to these results in the neonatal intensive care unit.
Assuntos
Unidades de Terapia Intensiva Neonatal , Pais , Criança , Feminino , Humanos , Lactente , Recém-Nascido , Pesquisa Qualitativa , Estudos Retrospectivos , Sequenciamento do ExomaRESUMO
OBJECTIVES: Systemic lupus erythematosus (SLE) is a kind of autoimmune inflammatory connective tissue disease which seriously endangers human health. Genetic factors play a key role in the pathogenesis of SLE. This study aims to investigate a novel phospholipase D2 (PLD2) mutation associated with familial SLE, and further explore the underlying mechanism of the mutation in SLE. METHODS: The blood samples from a SLE patient, the patient's parents, and 147 normal controls were collected and DNA was extracted. Whole genome high-throughput sequencing was performed in the patient and her parents and the results were further analyzed by various bioinformatics methods. The wild type (wt), mutant type (mu), and negative control PLD2 plasmids were further constructed and transfected into 293 cells. The expression level of HRAS protein in 293 cells was detected by Western blotting. RESULTS: In this SLE family, the female SLE patient and her mother, 1 in generation II and 1 in generation III had typical clinical manifestations of SLE, and all of them had lupus nephritis at early stage. The genetic characteristics are consistent with autosomal dominant inheritance. A novel PLD2 heterozygous mutation (c.2722C>T) was found in the patient and her mother, but not in her father and other normal controls. Compared with wtPLD2 plasmid and negative control PLD2 plasmid, the expression of HRAS in 293 cells transfected with muPLD2 plasmid was significantly up-regulated (both P<0.05). CONCLUSIONS: PLD2 c.2722C>T mutation may be one of the pathogeny of SLE in this family.
Assuntos
Lúpus Eritematoso Sistêmico , Nefrite Lúpica , Estudos de Casos e Controles , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Lúpus Eritematoso Sistêmico/genética , Mutação , Fosfolipase DRESUMO
PURPOSE: This study describes the cost trajectory of the standard diagnostic care pathway for children with suspected genetic disorders in British Columbia, Canada. METHODS: Average annual per-patient costs were estimated using medical records review and a caregiver survey for a cohort of 498 children referred to BC Children's and Women's Hospitals (C&W) with unexplained intellectual disability (the TIDE-BC study) and families enrolled in the CAUSES study, which offered diagnostic genome-wide sequencing (GWS; exome and genome sequencing) to 500 families of children with suspected genetic disorders. RESULTS: Direct costs peaked in the first year of patients' diagnostic odyssey, with an average of C$2257 per patient (95% confidence interval [CI] C$2074, C$2441) for diagnostic testing and C$631 (95% CI C$543, C$727) for specialist consultations at C&W. In subsequent years, direct costs accrued at a constant rate, with an estimated annual per-patient cost of C$511 (95% CI C$473, C$551) for diagnostic testing and C$334 (95% CI C$295, C$369) for consultations at C&W. Travel costs and caregiver productivity loss associated with attending diagnosis-related physician appointments averaged C$1907/family/year. CONCLUSIONS: The continuing long-term accrual of costs by undiagnosed patients suggests that economic evaluations of diagnostic GWS services should use longer time horizons than have typically been used.
Assuntos
Doenças Genéticas Inatas/economia , Testes Genéticos/economia , Custos de Cuidados de Saúde/tendências , Adulto , Colúmbia Britânica/epidemiologia , Cuidadores/economia , Cuidadores/psicologia , Estudos de Coortes , Análise Custo-Benefício , Exoma/genética , Feminino , Custos de Cuidados de Saúde/ética , Humanos , Deficiência Intelectual/genética , Masculino , Análise de Sequência de DNA/economia , Sequenciamento do Exoma/economia , Sequenciamento do Exoma/métodosRESUMO
Elucidating demographic history during the settlement of ecological communities is crucial for properly inferring the mechanisms that shape patterns of species diversity and their persistence through time. Here, we used genomic data and coalescent-based approaches to elucidate for the first time the demographic dynamics associated with the settlement by endemic reef fish fauna of one of the most remote peripheral islands of the Pacific Ocean, Rapa Nui (Easter Island). We compared the demographic history of nine endemic species in order to explore their demographic responses to Pleistocene climatic fluctuations. We found that species endemic to Rapa Nui share a common demographic history, as signatures of population expansions were retrieved for almost all of the species studied here, and synchronous demographic expansions initiated during the last glacial period were recovered for more than half of the studied species. These results suggest that eustatic fluctuations associated with Milankovitch cycles have played a central role in species demographic histories and in the final stage of the community assembly of many Rapa Nui reef fishes. Specifically, sea level lowstands resulted in the maximum reef habitat extension for Rapa Nui endemic species; we discuss the potential role of seamounts in allowing endemic species to cope with Pleistocene climatic fluctuations, and we highlight the importance of local historical processes over regional ones. Overall, our results shed light on the mechanisms by which endemism arises and is maintained in peripheral reef fish fauna.
Assuntos
Adaptação Biológica/genética , Evolução Biológica , Mudança Climática , Peixes , Animais , Recifes de Corais , Peixes/classificação , Peixes/genética , Ilhas , Oceano Pacífico , PolinésiaRESUMO
Objective: To analyze the molecular characteristics of Listeria monocytogenes strains from ready-to eat food in China. Methods: A total of 239 Listeria monocytogenes strains isolated from ready-to-eat food in 2017, all strains underwent whole-genome sequencing (WGS) , and comparisons uncovered population structure derived from lineages, clonal complex, serogroups, antimicrobial susceptibility and virulence, which were inferred in silico from the WGS data. Core genome multilocus sequence typing was used to subtype isolates. Results: All strains were categorized into three different lineages, lineage â ¡ was the predominant types in food, and IIa was the main serogroups. CC8, CC101 and CC87 were the first three prevalent CCs among 23 detected CCs, accounting for 49.4%. Only 4.6% (11 isolates) of tested strains harbored antibiotic resistance genes, which were mostly trimethoprim genes (7 isolates, 2.9%). All strains were positive for LIPI-1, and only a part of strains harbored LIPI-3 and LIPI-4, accounting for 13.8% (33 isolates) and 14.2% (34 isolates), respectively. ST619 carried both LIPI-3 and LIPI-4. 51.5% (123 isolates) of strains carried SSI-1, and all CC121 strains harbored SSI-2. Different lineages, serogroups and CCs can be separated obviously through cgMLST analysis, and 24 sublineages were highly concordant with CCs. Conclusion: â ¡a was the main serogroups in ready-to-eat food isolates in China; CC8, CC101 and CC87 were the prevalent CCs, and CC87 isolates was hypervirulent isolates, cgMLST method can be adopted for prospective foodborne disease surveillance and outbreaks detection.
Assuntos
Microbiologia de Alimentos , Listeria monocytogenes/isolamento & purificação , Listeriose/microbiologia , China/epidemiologia , Humanos , Listeriose/epidemiologiaRESUMO
The relevance of the chosen topic is due to the need to resolve ethical problems that arise in the framework of legal regulation of genome-wide sequencing in Russia and foreign countries. The purpose of this research is to form ethical principles that should become a reference point for law - making in this area. In order to achieve this goal, we have solved the tasks of studying the normative legal acts of Russia and a number of foreign countries from an ethical point of view. General scientific, private scientific and special methods of scientific knowledge (system-structural, formal-legal) are used. In order to comply with the ethical boundaries of legal regulation, to store access and protect full-genome sequencing data in Russia and foreign countries, it is proposed to develop a set of restrictions that prevent possible discrimination on genetic grounds, to create the necessary conditions for the inadmissibility of disclosure of personalized data, disclosure of information about a genetic disease to the subject and his relatives, as well as the boundaries of editing the genome of a human embryo. For the first time, the authors substantiate the need to establish clear ethical boundaries in the implementation of genome-wide sequencing in Russia based on foreign experience.
Assuntos
Internacionalidade , Sequenciamento Completo do Genoma , Humanos , Federação Russa , Sequenciamento Completo do Genoma/éticaRESUMO
The relevance of the study of the general principles and patterns of legal regulation of access storage processes and data protection of genome sequencing in foreign countries is determined by the need to develop a general concept of legal regulation of this type of activity in Russia. The purpose of this study is to develop the system-forming principles and patterns of access storage and data protection of genome sequencing in Russia. To achieve this goal, tasks were set and solved to identify and study the general principles and patterns of legal regulation of access storage processes and data protection of genome sequencing in foreign countries. The international documents regulating the features of regulation of access storage processes and data protection of genome-wide sequencing, the doctrinal sources of Great Britain, the USA, France, Israel, and Japan are studied. Methods used: general philosophical, general scientific, private scientific, special (structural-legal, comparative-legal, formal-legal). The general principles for the formation of the concept of legal regulation of genome sequencing in Russia are proposed. It was revealed that the creation of a universal regulatory regulator aimed at protecting the subject of personal data in view of the prevalence of public interests over private ones and the constant expansion of the scope of application of genetic data obtained as a result of genome-wide sequencing is the main problem in developing a legal regulation mechanism in the studied area. For the first time, the authors determine the basic principles for developing the concept of genome-wide sequencing in Russia, including: recognition of human rights and human dignity as the highest value, the necessity of researchers' responsibility for the well-being of participants in view of the obtained research results, the mandatory informed consent of which should be voluntary, permanent, their right to get acquainted with the results obtained if it concerns their health, access to such information, ensuring the right to non-knowledge of research results and others.
Assuntos
Segurança Computacional , Direitos Humanos , Humanos , Internacionalidade , Japão , Federação RussaRESUMO
Power estimations are important for optimizing genotype-phenotype association study designs. However, existing frameworks are designed for common disorders, and thus ill-suited for the inherent challenges of studies for low-prevalence conditions such as rare diseases and infrequent adverse drug reactions. These challenges include small sample sizes and the need to leverage genetic annotation resources in association analyses for the purpose of ranking potential causal genes. We present SimPEL, a simulation-based program providing power estimations for the design of low-prevalence condition studies. SimPEL integrates the usage of gene annotation resources for association analyses. Customizable parameters, including the penetrance of the putative causal allele and the employed pathogenic scoring system, allow SimPEL to realistically model a large range of study designs. To demonstrate the effects of various parameters on power, we estimated the power of several simulated designs using SimPEL and captured power trends in agreement with observations from current literature on low-frequency condition studies. SimPEL, as a tool, provides researchers studying low-frequency conditions with an intuitive and highly flexible avenue for statistical power estimation. The platform-independent "batteries included" executable and default input files are available at https://github.com/precisionomics/SimPEL.
Assuntos
Simulação por Computador , Modelos Genéticos , Análise de Sequência de DNA , Alelos , Estudos de Associação Genética , Estudo de Associação Genômica Ampla , Humanos , Penetrância , Prevalência , Tamanho da AmostraRESUMO
BACKGROUND: The next generation sequencing technology allows us to obtain a large amount of short DNA sequence (DNA-seq) reads at a genome-wide level. DNA-seq data have been increasingly collected during the recent years. Count-type data analysis is a widely used approach for DNA-seq data. However, the related data pre-processing is based on the moving window method, in which a window size need to be defined in order to obtain count-type data. Furthermore, useful information can be reduced after data pre-processing for count-type data. RESULTS: In this study, we propose to analyze DNA-seq data based on the related distance-type measure. Distances are measured in base pairs (bps) between two adjacent alignments of short reads mapped to a reference genome. Our experimental data based simulation study confirms the advantages of distance-type measure approach in both detection power and detection accuracy. Furthermore, we propose artificial censoring for the distance data so that distances larger than a given value are considered potential outliers. Our purpose is to simplify the pre-processing of DNA-seq data. Statistically, we consider a mixture of right censored geometric distributions to model the distance data. Additionally, to reduce the GC-content bias, we extend the mixture model to a mixture of generalized linear models (GLMs). The estimation of model can be achieved by the Newton-Raphson algorithm as well as the Expectation-Maximization (E-M) algorithm. We have conducted simulations to evaluate the performance of our approach. Based on the rank based inverse normal transformation of distance data, we can obtain the related z-values for a follow-up analysis. For an illustration, an application to the DNA-seq data from a pair of normal and tumor cell lines is presented with a change-point analysis of z-values to detect DNA copy number alterations. CONCLUSION: Our distance-type measure approach is novel. It does not require either a fixed or a sliding window procedure for generating count-type data. Its advantages have been demonstrated by our simulation studies and its practical usefulness has been illustrated by an experimental data application.
Assuntos
Algoritmos , Variações do Número de Cópias de DNA , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias/genética , Análise de Sequência de DNA/métodos , Estudos de Casos e Controles , HumanosRESUMO
Low temperature is a vital effector of rice at different growth stages. MicroRNAs (miRNAs) play important roles in responding to abiotic and biotic stresses. Here, we confirmed the cold tolerance of Dongxiang common wild rice and explored the miRNAs differentially expressed under cold stress using genome-wide small RNA sequencing. In total, 16 miRNAs, nine upregulated and seven downregulated by cold stress, were characterized in Dongxiang common wild rice, and their target genes were predicted. Additionally, an AgriGO analysis of the target genes revealed that they were enriched in several terms related to cold-stress tolerance, suggesting a complex response mechanism, involving miRNAs, to cold stress in Dongxiang common wild rice.
Assuntos
MicroRNAs/metabolismo , Oryza/genética , RNA de Plantas/metabolismo , Aclimatação/genética , Resposta ao Choque Frio/genética , Oryza/fisiologia , Análise de Sequência de RNARESUMO
For ionising radiation (IR)-induced cellular toxicity, DNA cleavage is thought to be a crucial step. In this paper, the genome-wide DNA sequence preference of gamma radiation-induced cleavage was investigated in purified human DNA. We utilised Illumina short read technology and over 80 million double-strand breaks (DSBs) were analysed in this study. The frequency of occurrence of individual nucleotides at the 50,000 most frequently cleaved sites was calculated and C nucleotides were found to be most prevalent at the cleavage site, followed by G and T, with A being the least prevalent. 5'-C*C and 5'-CC* dinucleotides (where * is the cleavage site) were found to be the present at the highest frequency at the cleavage site; while it was 5'-CC*C for trinucleotides and 5'-GCC*C and 5'-CC*CC for tetranucleotides. The frequency of occurrence of individual nucleotides at the most frequently cleaved sites was determined and the nucleotides in the sequence 5'-GGC*MH (where M is A or C, H is any nucleotide except G) were found to occur most frequently for DNA that was treated with endonuclease IV (to remove blocking 3'-phosphoglycolate termini); and 5'-GSC*MH (where S is G or C) for non-endonuclease IV-treated DNA. It was concluded that GC-rich sequences were preferentially targeted for cleavage by gamma irradiation. This was the first occasion that an extensive examination of the genome-wide DNA sequence preference of IR-induced DSBs has been performed.
Assuntos
Sequência de Bases/genética , Ilhas de CpG/genética , Quebras de DNA de Cadeia Dupla/efeitos da radiação , Clivagem do DNA/efeitos da radiação , Dano ao DNA/efeitos da radiação , DNA/efeitos da radiação , Sequência de Bases/efeitos da radiação , Ilhas de CpG/efeitos da radiação , DNA/genética , Raios gama , Estudo de Associação Genômica Ampla , Células HeLa , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Radiação IonizanteRESUMO
Genetic disorders are one of the leading causes of infant mortality and are frequent in neonatal intensive care units (NICUs). Rapid genome-wide sequencing (GWS; whole genome or exome sequencing (ES)), due to its diagnostic capabilities and immediate impacts on medical management, is becoming an appealing testing option in the NICU setting. RAPIDOMICS was a trio-based rapid ES pilot study of 25 babies with suspected genetic disorders in the BC Women's Hospital NICU. ES and bioinformatic analysis were performed after careful patient ascertainment. Trio analysis was performed using an in-house pipeline reporting variants in known disease-causing genes. Variants interpreted by the research team as definitely or possibly causal of the infant's phenotype were Sanger validated in a clinical laboratory. The average time to preliminary diagnosis was 7.2 days. Sanger validation was pursued in 15 patients for 13 autosomal dominant and 2 autosomal recessive disorders, with an overall diagnostic rate (partial or complete) of 60%.Conclusion: In total, 72% of patients enrolled had a genomic diagnosis achieved through ES, multi-gene panel testing or chromosomal microarray analysis. Among these, there was an 83% rate of significant and immediate impact on medical decision-making directly related to new knowledge of the diagnosis. Health service implementation challenges and successes are discussed. What is Known: ⢠Rapid genome-wide sequencing in the neonatal intensive care setting has a greater diagnostic hit rate and impact on medical management than conventional genetic testing. However, the impact of consultation with genetics and patient ascertainment requires further investigation. What is New: ⢠This study demonstrates the importance of genetic consultation and careful patient selection prior to pursuing exome sequencing (ES). ⢠In total, 15/25 (60%) patients achieved a diagnosis through ES and 18/25 (72%) through ES, multi-gene panel testing or chromosomal microarray analysis with 83% of those having immediate effects on medical management.