Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
GigaByte ; 2024: gigabyte118, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38746537

RESUMO

Marsupials exhibit distinctive modes of reproduction and early development that set them apart from their eutherian counterparts and render them invaluable for comparative studies. However, marsupial genomic resources still lag far behind those of eutherian mammals. We present a series of novel genomic resources for the fat-tailed dunnart (Sminthopsis crassicaudata), a mouse-like marsupial that, due to its ease of husbandry and ex-utero development, is emerging as a laboratory model. We constructed a highly representative multi-tissue de novo transcriptome assembly of dunnart RNA-seq reads spanning 12 tissues. The transcriptome includes 2,093,982 assembled transcripts and has a mammalian transcriptome BUSCO completeness score of 93.3%, the highest amongst currently published marsupial transcriptomes. This global transcriptome, along with ab initio predictions, supported annotation of the existing dunnart genome, revealing 21,622 protein-coding genes. Altogether, these resources will enable wider use of the dunnart as a model marsupial and deepen our understanding of mammalian genome evolution.

2.
Genome Biol ; 25(1): 94, 2024 Apr 15.
Artigo em Inglês | MEDLINE | ID: mdl-38622708

RESUMO

Recent innovations in single-cell RNA-sequencing (scRNA-seq) provide the technology to investigate biological questions at cellular resolution. Pooling cells from multiple individuals has become a common strategy, and droplets can subsequently be assigned to a specific individual by leveraging their inherent genetic differences. An implicit challenge with scRNA-seq is the occurrence of doublets-droplets containing two or more cells. We develop Demuxafy, a framework to enhance donor assignment and doublet removal through the consensus intersection of multiple demultiplexing and doublet detecting methods. Demuxafy significantly improves droplet assignment by separating singlets from doublets and classifying the correct individual.


Assuntos
Análise de Célula Única , Humanos , Análise de Célula Única/métodos , Análise de Sequência de RNA/métodos
4.
Nat Genet ; 56(4): 595-604, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38548990

RESUMO

Common genetic variants confer substantial risk for chronic lung diseases, including pulmonary fibrosis. Defining the genetic control of gene expression in a cell-type-specific and context-dependent manner is critical for understanding the mechanisms through which genetic variation influences complex traits and disease pathobiology. To this end, we performed single-cell RNA sequencing of lung tissue from 66 individuals with pulmonary fibrosis and 48 unaffected donors. Using a pseudobulk approach, we mapped expression quantitative trait loci (eQTLs) across 38 cell types, observing both shared and cell-type-specific regulatory effects. Furthermore, we identified disease interaction eQTLs and demonstrated that this class of associations is more likely to be cell-type-specific and linked to cellular dysregulation in pulmonary fibrosis. Finally, we connected lung disease risk variants to their regulatory targets in disease-relevant cell types. These results indicate that cellular context determines the impact of genetic variation on gene expression and implicates context-specific eQTLs as key regulators of lung homeostasis and disease.


Assuntos
Fibrose Pulmonar , Locos de Características Quantitativas , Humanos , Locos de Características Quantitativas/genética , Fibrose Pulmonar/genética , Regulação da Expressão Gênica/genética , Pulmão , Herança Multifatorial , Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único
5.
Genome Biol ; 25(1): 56, 2024 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-38409056

RESUMO

BACKGROUND: The development of single-cell RNA sequencing (scRNA-seq) has enabled scientists to catalog and probe the transcriptional heterogeneity of individual cells in unprecedented detail. A common step in the analysis of scRNA-seq data is the selection of so-called marker genes, most commonly to enable annotation of the biological cell types present in the sample. In this paper, we benchmark 59 computational methods for selecting marker genes in scRNA-seq data. RESULTS: We compare the performance of the methods using 14 real scRNA-seq datasets and over 170 additional simulated datasets. Methods are compared on their ability to recover simulated and expert-annotated marker genes, the predictive performance and characteristics of the gene sets they select, their memory usage and speed, and their implementation quality. In addition, various case studies are used to scrutinize the most commonly used methods, highlighting issues and inconsistencies. CONCLUSIONS: Overall, we present a comprehensive evaluation of methods for selecting marker genes in scRNA-seq data. Our results highlight the efficacy of simple methods, especially the Wilcoxon rank-sum test, Student's t-test, and logistic regression.


Assuntos
Benchmarking , Análise de Célula Única , Análise de Célula Única/métodos , Sequenciamento do Exoma , Análise de Sequência de RNA , Perfilação da Expressão Gênica , Software
6.
Cell Genom ; 3(8): 100349, 2023 Aug 09.
Artigo em Inglês | MEDLINE | ID: mdl-37601968

RESUMO

Meiotic crossovers are required for accurate chromosome segregation and producing new allelic combinations. Meiotic crossover numbers are tightly regulated within a narrow range, despite an excess of initiating DNA double-strand breaks. Here, we reveal the tumor suppressor FANCM as a meiotic anti-crossover factor in mammals. We use unique large-scale crossover analyses with both single-gamete sequencing and pedigree-based bulk-sequencing datasets to identify a genome-wide increase in crossover frequencies in Fancm-deficient mice. Gametogenesis is heavily perturbed in Fancm loss-of-function mice, which is consistent with the reproductive defects reported in humans with biallelic FANCM mutations. A portion of the gametogenesis defects can be attributed to the cGAS-STING pathway after birth. Despite the gametogenesis phenotypes in Fancm mutants, both sexes are capable of producing offspring. We propose that the anti-crossover function and role in gametogenesis of Fancm are separable and will inform diagnostic pathways for human genomic instability disorders.

7.
Radiol Artif Intell ; 5(2): e220072, 2023 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-37035431

RESUMO

Supplemental material is available for this article. Keywords: Mammography, Screening, Convolutional Neural Network (CNN) Published under a CC BY 4.0 license. See also the commentary by Cadrin-Chênevert in this issue.

8.
bioRxiv ; 2023 Jun 29.
Artigo em Inglês | MEDLINE | ID: mdl-36993211

RESUMO

Common genetic variants confer substantial risk for chronic lung diseases, including pulmonary fibrosis (PF). Defining the genetic control of gene expression in a cell-type-specific and context-dependent manner is critical for understanding the mechanisms through which genetic variation influences complex traits and disease pathobiology. To this end, we performed single-cell RNA-sequencing of lung tissue from 67 PF and 49 unaffected donors. Employing a pseudo-bulk approach, we mapped expression quantitative trait loci (eQTL) across 38 cell types, observing both shared and cell type-specific regulatory effects. Further, we identified disease-interaction eQTL and demonstrated that this class of associations is more likely to be cell-type specific and linked to cellular dysregulation in PF. Finally, we connected PF risk variants to their regulatory targets in disease-relevant cell types. These results indicate that cellular context determines the impact of genetic variation on gene expression, and implicates context-specific eQTL as key regulators of lung homeostasis and disease.

9.
bioRxiv ; 2023 Dec 17.
Artigo em Inglês | MEDLINE | ID: mdl-38168317

RESUMO

The human lung is structurally complex, with a diversity of specialized epithelial, stromal and immune cells playing specific functional roles in anatomically distinct locations, and large-scale changes in the structure and cellular makeup of this distal lung is a hallmark of pulmonary fibrosis (PF) and other progressive chronic lung diseases. Single-cell transcriptomic studies have revealed numerous disease-emergent/enriched cell types/states in PF lungs, but the spatial contexts wherein these cells contribute to disease pathogenesis has remained uncertain. Using sub-cellular resolution image-based spatial transcriptomics, we analyzed the gene expression of more than 1 million cells from 19 unique lungs. Through complementary cell-based and innovative cell-agnostic analyses, we characterized the localization of PF-emergent cell-types, established the cellular and molecular basis of classical PF histopathologic disease features, and identified a diversity of distinct molecularly-defined spatial niches in control and PF lungs. Using machine-learning and trajectory analysis methods to segment and rank airspaces on a gradient from normal to most severely remodeled, we identified a sequence of compositional and molecular changes that associate with progressive distal lung pathology, beginning with alveolar epithelial dysregulation and culminating with changes in macrophage polarization. Together, these results provide a unique, spatially-resolved characterization of the cellular and molecular programs of PF and control lungs, provide new insights into the heterogeneous pathobiology of PF, and establish analytical approaches which should be broadly applicable to other imaging-based spatial transcriptomic studies.

10.
BMC Bioinformatics ; 23(1): 460, 2022 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-36329399

RESUMO

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) technology has contributed significantly to diverse research areas in biology, from cancer to development. Since scRNA-seq data is high-dimensional, a common strategy is to learn low-dimensional latent representations better to understand overall structure in the data. In this work, we build upon scVI, a powerful deep generative model which can learn biologically meaningful latent representations, but which has limited explicit control of batch effects. Rather than prioritizing batch effect removal over conservation of biological variation, or vice versa, our goal is to provide a bird's eye view of the trade-offs between these two conflicting objectives. Specifically, using the well established concept of Pareto front from economics and engineering, we seek to learn the entire trade-off curve between conservation of biological variation and removal of batch effects. RESULTS: A multi-objective optimisation technique known as Pareto multi-task learning (Pareto MTL) is used to obtain the Pareto front between conservation of biological variation and batch effect removal. Our results indicate Pareto MTL can obtain a better Pareto front than the naive scalarization approach typically encountered in the literature. In addition, we propose to measure batch effect by applying a neural-network based estimator called Mutual Information Neural Estimation (MINE) and show benefits over the more standard maximum mean discrepancy measure. CONCLUSION: The Pareto front between conservation of biological variation and batch effect removal is a valuable tool for researchers in computational biology. Our results demonstrate the efficacy of applying Pareto MTL to estimate the Pareto front in conjunction with applying MINE to measure the batch effect.


Assuntos
Algoritmos , Transcriptoma , Biologia Computacional/métodos , Análise de Célula Única
11.
Nucleic Acids Res ; 50(20): e118, 2022 11 11.
Artigo em Inglês | MEDLINE | ID: mdl-36107768

RESUMO

Profiling gametes of an individual enables the construction of personalised haplotypes and meiotic crossover landscapes, now achievable at larger scale than ever through the availability of high-throughput single-cell sequencing technologies. However, high-throughput single-gamete data commonly have low depth of coverage per gamete, which challenges existing gamete-based haplotype phasing methods. In addition, haplotyping a large number of single gametes from high-throughput single-cell DNA sequencing data and constructing meiotic crossover profiles using existing methods requires intensive processing. Here, we introduce efficient software tools for the essential tasks of generating personalised haplotypes and calling crossovers in gametes from single-gamete DNA sequencing data (sgcocaller), and constructing, visualising, and comparing individualised crossover landscapes from single gametes (comapr). With additional data pre-possessing, the tools can also be applied to bulk-sequenced samples. We demonstrate that sgcocaller is able to generate impeccable phasing results for high-coverage datasets, on which it is more accurate and stable than existing methods, and also performs well on low-coverage single-gamete sequencing datasets for which current methods fail. Our tools achieve highly accurate results with user-friendly installation, comprehensive documentation, efficient computation times and minimal memory usage.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA , Algoritmos , Células Germinativas , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Análise da Expressão Gênica de Célula Única , Software , Troca Genética
12.
PLoS One ; 17(9): e0275168, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36173986

RESUMO

We developed a simple and reliable method for the isolation of haploid nuclei from fresh and frozen testes. The described protocol uses readily available reagents in combination with flow cytometry to separate haploid and diploid nuclei. The protocol can be completed within 1 hour and the resulting individual haploid nuclei have intact morphology. The isolated nuclei are suitable for library preparation for high-throughput DNA and RNA sequencing using bulk or single nuclei. The protocol was optimised with mouse testes and we anticipate that it can be applied for the isolation of mature sperm from other mammals including humans.


Assuntos
Ácidos Nucleicos , Animais , Biblioteca Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Mamíferos , Camundongos , Sêmen , Espermatozoides
13.
Genome Biol ; 22(1): 341, 2021 12 15.
Artigo em Inglês | MEDLINE | ID: mdl-34911537

RESUMO

Population-scale single-cell RNA sequencing (scRNA-seq) is now viable, enabling finer resolution functional genomics studies and leading to a rush to adapt bulk methods and develop new single-cell-specific methods to perform these studies. Simulations are useful for developing, testing, and benchmarking methods but current scRNA-seq simulation frameworks do not simulate population-scale data with genetic effects. Here, we present splatPop, a model for flexible, reproducible, and well-documented simulation of population-scale scRNA-seq data with known expression quantitative trait loci. splatPop can also simulate complex batch, cell group, and conditional effects between individuals from different cohorts as well as genetically-driven co-expression.


Assuntos
Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Benchmarking , Análise por Conglomerados , Simulação por Computador , Perfilação da Expressão Gênica/métodos , Genômica , Humanos , Locos de Características Quantitativas , Software
14.
Genome Biol ; 22(1): 188, 2021 06 24.
Artigo em Inglês | MEDLINE | ID: mdl-34167583

RESUMO

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) has enabled the unbiased, high-throughput quantification of gene expression specific to cell types and states. With the cost of scRNA-seq decreasing and techniques for sample multiplexing improving, population-scale scRNA-seq, and thus single-cell expression quantitative trait locus (sc-eQTL) mapping, is increasingly feasible. Mapping of sc-eQTL provides additional resolution to study the regulatory role of common genetic variants on gene expression across a plethora of cell types and states and promises to improve our understanding of genetic regulation across tissues in both health and disease. RESULTS: While previously established methods for bulk eQTL mapping can, in principle, be applied to sc-eQTL mapping, there are a number of open questions about how best to process scRNA-seq data and adapt bulk methods to optimize sc-eQTL mapping. Here, we evaluate the role of different normalization and aggregation strategies, covariate adjustment techniques, and multiple testing correction methods to establish best practice guidelines. We use both real and simulated datasets across single-cell technologies to systematically assess the impact of these different statistical approaches. CONCLUSION: We provide recommendations for future single-cell eQTL studies that can yield up to twice as many eQTL discoveries as default approaches ported from bulk studies.


Assuntos
Mapeamento Cromossômico/estatística & dados numéricos , Genoma Humano , Células-Tronco Pluripotentes Induzidas/metabolismo , Locos de Características Quantitativas , Análise de Célula Única/métodos , Alelos , Linhagem Celular , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Células-Tronco Pluripotentes Induzidas/citologia , Análise de Sequência de RNA , Software , Sequenciamento do Exoma
15.
Genome Biol ; 22(1): 112, 2021 04 19.
Artigo em Inglês | MEDLINE | ID: mdl-33874978

RESUMO

Genetic maps have been fundamental to building our understanding of disease genetics and evolutionary processes. The gametes of an individual contain all of the information required to perform a de novo chromosome-scale assembly of an individual's genome, which historically has been performed with populations and pedigrees. Here, we discuss how single-cell gamete sequencing offers the potential to merge the advantages of short-read sequencing with the ability to build personalized genetic maps and open up an entirely new space in personalized genetics.


Assuntos
Genoma , Genômica/métodos , Células Germinativas/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala , Medicina de Precisão/métodos , Análise de Célula Única/métodos , Animais , Mapeamento Cromossômico , Biologia Computacional/métodos , Biologia Computacional/normas , Interpretação Estatística de Dados , Heterogeneidade Genética , Genômica/normas , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Medicina de Precisão/normas , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Análise de Célula Única/normas , Sequenciamento Completo do Genoma
16.
Nat Biotechnol ; 38(6): 747-755, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32518403

RESUMO

Single-cell RNA sequencing (scRNA-seq) is the leading technique for characterizing the transcriptomes of individual cells in a sample. The latest protocols are scalable to thousands of cells and are being used to compile cell atlases of tissues, organs and organisms. However, the protocols differ substantially with respect to their RNA capture efficiency, bias, scale and costs, and their relative advantages for different applications are unclear. In the present study, we generated benchmark datasets to systematically evaluate protocols in terms of their power to comprehensively describe cell types and states. We performed a multicenter study comparing 13 commonly used scRNA-seq and single-nucleus RNA-seq protocols applied to a heterogeneous reference sample resource. Comparative analysis revealed marked differences in protocol performance. The protocols differed in library complexity and their ability to detect cell-type markers, impacting their predictive value and suitability for integration into reference cell atlases. These results provide guidance both for individual researchers and for consortium projects such as the Human Cell Atlas.


Assuntos
Análise de Sequência de RNA , Análise de Célula Única , Animais , Benchmarking , Linhagem Celular , Bases de Dados Genéticas , Genômica/métodos , Genômica/normas , Humanos , Camundongos , Análise de Sequência de RNA/métodos , Análise de Sequência de RNA/normas , Análise de Célula Única/métodos , Análise de Célula Única/normas
18.
Nat Methods ; 17(4): 414-421, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-32203388

RESUMO

Bulk and single-cell DNA sequencing has enabled reconstructing clonal substructures of somatic tissues from frequency and cooccurrence patterns of somatic variants. However, approaches to characterize phenotypic variations between clones are not established. Here we present cardelino (https://github.com/single-cell-genetics/cardelino), a computational method for inferring the clonal tree configuration and the clone of origin of individual cells assayed using single-cell RNA-seq (scRNA-seq). Cardelino flexibly integrates information from imperfect clonal trees inferred based on bulk exome-seq data, and sparse variant alleles expressed in scRNA-seq data. We apply cardelino to a published cancer dataset and to newly generated matched scRNA-seq and exome-seq data from 32 human dermal fibroblast lines, identifying hundreds of differentially expressed genes between cells from different somatic clones. These genes are frequently enriched for cell cycle and proliferation pathways, indicating a role for cell division genes in somatic evolution in healthy skin.


Assuntos
Fibroblastos/metabolismo , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Software , Algoritmos , Ciclo Celular , Proliferação de Células , Humanos , Melanoma , Mutação , Transcriptoma
19.
Genome Biol ; 21(1): 31, 2020 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-32033589

RESUMO

The recent boom in microfluidics and combinatorial indexing strategies, combined with low sequencing costs, has empowered single-cell sequencing technology. Thousands-or even millions-of cells analyzed in a single experiment amount to a data revolution in single-cell biology and pose unique data science problems. Here, we outline eleven challenges that will be central to bringing this emerging field of single-cell data science forward. For each challenge, we highlight motivating research questions, review prior work, and formulate open problems. This compendium is for established researchers, newcomers, and students alike, highlighting interesting and rewarding problems for the coming years.


Assuntos
Ciência de Dados/métodos , Genômica/métodos , RNA-Seq/métodos , Análise de Célula Única/métodos , Animais , Humanos
20.
Nat Commun ; 11(1): 810, 2020 02 10.
Artigo em Inglês | MEDLINE | ID: mdl-32041960

RESUMO

Recent developments in stem cell biology have enabled the study of cell fate decisions in early human development that are impossible to study in vivo. However, understanding how development varies across individuals and, in particular, the influence of common genetic variants during this process has not been characterised. Here, we exploit human iPS cell lines from 125 donors, a pooled experimental design, and single-cell RNA-sequencing to study population variation of endoderm differentiation. We identify molecular markers that are predictive of differentiation efficiency of individual lines, and utilise heterogeneity in the genetic background across individuals to map hundreds of expression quantitative trait loci that influence expression dynamically during differentiation and across cellular contexts.


Assuntos
Diferenciação Celular/genética , Expressão Gênica/genética , Células-Tronco Pluripotentes Induzidas/citologia , Linhagem Celular , Endoderma/citologia , Feminino , Perfilação da Expressão Gênica , Interação Gene-Ambiente , Estudos de Associação Genética , Heterogeneidade Genética , Humanos , Masculino , Locos de Características Quantitativas , Análise de Célula Única
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...