RESUMEN
The liver is the largest solid organ in the body, yet it remains incompletely characterized. Here we present a spatial proteogenomic atlas of the healthy and obese human and murine liver combining single-cell CITE-seq, single-nuclei sequencing, spatial transcriptomics, and spatial proteomics. By integrating these multi-omic datasets, we provide validated strategies to reliably discriminate and localize all hepatic cells, including a population of lipid-associated macrophages (LAMs) at the bile ducts. We then align this atlas across seven species, revealing the conserved program of bona fide Kupffer cells and LAMs. We also uncover the respective spatially resolved cellular niches of these macrophages and the microenvironmental circuits driving their unique transcriptomic identities. We demonstrate that LAMs are induced by local lipid exposure, leading to their induction in steatotic regions of the murine and human liver, while Kupffer cell development crucially depends on their cross-talk with hepatic stellate cells via the evolutionarily conserved ALK1-BMP9/10 axis.
Asunto(s)
Evolución Biológica , Hepatocitos/metabolismo , Macrófagos/metabolismo , Proteogenómica , Animales , Núcleo Celular/metabolismo , Hígado Graso/genética , Hígado Graso/patología , Homeostasis , Humanos , Macrófagos del Hígado/metabolismo , Antígenos Comunes de Leucocito/metabolismo , Lípidos/química , Hígado/metabolismo , Linfocitos/metabolismo , Ratones Endogámicos C57BL , Modelos Biológicos , Células Mieloides/metabolismo , Obesidad/patología , Proteoma/metabolismo , Transducción de Señal , Transcriptoma/genéticaRESUMEN
Single-cell transcriptomics (scRNA-seq) has greatly advanced our ability to characterize cellular heterogeneity1. However, scRNA-seq requires lysing cells, which impedes further molecular or functional analyses on the same cells. Here, we established Live-seq, a single-cell transcriptome profiling approach that preserves cell viability during RNA extraction using fluidic force microscopy2,3, thus allowing to couple a cell's ground-state transcriptome to its downstream molecular or phenotypic behaviour. To benchmark Live-seq, we used cell growth, functional responses and whole-cell transcriptome read-outs to demonstrate that Live-seq can accurately stratify diverse cell types and states without inducing major cellular perturbations. As a proof of concept, we show that Live-seq can be used to directly map a cell's trajectory by sequentially profiling the transcriptomes of individual macrophages before and after lipopolysaccharide (LPS) stimulation, and of adipose stromal cells pre- and post-differentiation. In addition, we demonstrate that Live-seq can function as a transcriptomic recorder by preregistering the transcriptomes of individual macrophages that were subsequently monitored by time-lapse imaging after LPS exposure. This enabled the unsupervised, genome-wide ranking of genes on the basis of their ability to affect macrophage LPS response heterogeneity, revealing basal Nfkbia expression level and cell cycle state as important phenotypic determinants, which we experimentally validated. Thus, Live-seq can address a broad range of biological questions by transforming scRNA-seq from an end-point to a temporal analysis approach.
Asunto(s)
Supervivencia Celular , Perfilación de la Expresión Génica , Macrófagos , RNA-Seq , Análisis de la Célula Individual , Transcriptoma , Tejido Adiposo/citología , Ciclo Celular/efectos de los fármacos , Ciclo Celular/genética , Diferenciación Celular , Perfilación de la Expresión Génica/métodos , Perfilación de la Expresión Génica/normas , Genoma/efectos de los fármacos , Genoma/genética , Lipopolisacáridos/inmunología , Lipopolisacáridos/farmacología , Macrófagos/citología , Macrófagos/efectos de los fármacos , Macrófagos/inmunología , Macrófagos/metabolismo , Inhibidor NF-kappaB alfa/genética , Especificidad de Órganos , Fenotipo , ARN/genética , ARN/aislamiento & purificación , RNA-Seq/métodos , RNA-Seq/normas , Reproducibilidad de los Resultados , Análisis de Secuencia de ARN/métodos , Análisis de Secuencia de ARN/normas , Análisis de la Célula Individual/métodos , Células del Estroma/citología , Células del Estroma/metabolismo , Factores de Tiempo , Transcriptoma/genéticaRESUMEN
Heterogeneity between different macrophage populations has become a defining feature of this lineage. However, the conserved factors defining macrophages remain largely unknown. The transcription factor ZEB2 is best described for its role in epithelial to mesenchymal transition; however, its role within the immune system is only now being elucidated. We show here that Zeb2 expression is a conserved feature of macrophages. Using Clec4f-cre, Itgax-cre, and Fcgr1-cre mice to target five different macrophage populations, we found that loss of ZEB2 resulted in macrophage disappearance from the tissues, coupled with their subsequent replenishment from bone-marrow precursors in open niches. Mechanistically, we found that ZEB2 functioned to maintain the tissue-specific identities of macrophages. In Kupffer cells, ZEB2 achieved this by regulating expression of the transcription factor LXRα, removal of which recapitulated the loss of Kupffer cell identity and disappearance. Thus, ZEB2 expression is required in macrophages to preserve their tissue-specific identities.
Asunto(s)
Macrófagos del Hígado/citología , Receptores X del Hígado/genética , Caja Homeótica 2 de Unión a E-Box con Dedos de Zinc/genética , Animales , Linaje de la Célula/inmunología , Transición Epitelial-Mesenquimal , Femenino , Regulación Neoplásica de la Expresión Génica , Macrófagos del Hígado/inmunología , Hígado/citología , Receptores X del Hígado/metabolismo , Pulmón/citología , Masculino , Ratones , Ratones Endogámicos C57BL , Ratones TransgénicosRESUMEN
Single-cell RNA sequencing (scRNA-seq) approaches have transformed our ability to resolve cellular properties across systems, but are currently tailored toward large cell inputs (>1,000 cells). This renders them inefficient and costly when processing small, individual tissue samples, a problem that tends to be resolved by loading bulk samples, yielding confounded mosaic cell population read-outs. Here, we developed a deterministic, mRNA-capture bead and cell co-encapsulation dropleting system, DisCo, aimed at processing low-input samples (<500 cells). We demonstrate that DisCo enables precise particle and cell positioning and droplet sorting control through combined machine-vision and multilayer microfluidics, enabling continuous processing of low-input single-cell suspensions at high capture efficiency (>70%) and at speeds up to 350 cells per hour. To underscore DisCo's unique capabilities, we analyzed 31 individual intestinal organoids at varying developmental stages. This revealed extensive organoid heterogeneity, identifying distinct subtypes including a regenerative fetal-like Ly6a+ stem cell population that persists as symmetrical cysts, or spheroids, even under differentiation conditions, and an uncharacterized 'gobloid' subtype consisting predominantly of precursor and mature (Muc2+) goblet cells. To complement this dataset and to demonstrate DisCo's capacity to process low-input, in vivo-derived tissues, we also analyzed individual mouse intestinal crypts. This revealed the existence of crypts with a compositional similarity to spheroids, which consisted predominantly of regenerative stem cells, suggesting the existence of regenerating crypts in the homeostatic intestine. These findings demonstrate the unique power of DisCo in providing high-resolution snapshots of cellular heterogeneity in small, individual tissues.
Asunto(s)
Organoides , Análisis de la Célula Individual , Animales , Diferenciación Celular , Mucosa Intestinal , Ratones , Células MadreRESUMEN
Tissue-resident macrophages can derive from yolk sac macrophages (YS-Macs), fetal liver monocytes (FL-MOs), or adult bone-marrow monocytes (BM-MOs). The relative capacity of these precursors to colonize a niche, self-maintain, and perform tissue-specific functions is unknown. We simultaneously transferred traceable YS-Macs, FL-MOs, and BM-MOs into the empty alveolar macrophage (AM) niche of neonatal Csf2rb(-/-) mice. All subsets produced AMs, but in competition preferential outgrowth of FL-MOs was observed, correlating with their superior granulocyte macrophage-colony stimulating factor (GM-CSF) reactivity and proliferation capacity. When transferred separately, however, all precursors efficiently colonized the alveolar niche and generated AMs that were transcriptionally almost identical, self-maintained, and durably prevented alveolar proteinosis. Mature liver, peritoneal, or colon macrophages could not efficiently colonize the empty AM niche, whereas mature AMs could. Thus, precursor origin does not affect the development of functional self-maintaining tissue-resident macrophages and the plasticity of the mononuclear phagocyte system is largest at the precursor stage.
Asunto(s)
Células de la Médula Ósea/citología , Diferenciación Celular/inmunología , Factor Estimulante de Colonias de Granulocitos y Macrófagos/inmunología , Hígado/citología , Macrófagos Alveolares/citología , Saco Vitelino/citología , Animales , Proliferación Celular , Subunidad beta Común de los Receptores de Citocinas/genética , Hígado/embriología , Hígado/inmunología , Macrófagos Alveolares/inmunología , Ratones , Ratones Endogámicos BALB C , Ratones Endogámicos C57BL , Ratones Noqueados , Transcriptoma/inmunología , Saco Vitelino/inmunologíaRESUMEN
Computational methods that model how gene expression of a cell is influenced by interacting cells are lacking. We present NicheNet (https://github.com/saeyslab/nichenetr), a method that predicts ligand-target links between interacting cells by combining their expression data with prior knowledge on signaling and gene regulatory networks. We applied NicheNet to tumor and immune cell microenvironment data and demonstrate that NicheNet can infer active ligands and their gene regulatory effects on interacting cells.
Asunto(s)
Comunicación Celular , Modelos Teóricos , Animales , Redes Reguladoras de Genes , Humanos , Ligandos , Ratones , Receptores de Superficie Celular/metabolismo , Análisis de Secuencia de ARN/métodos , Transducción de Señal , Análisis de la Célula Individual/métodos , Transcriptoma , Microambiente TumoralRESUMEN
MOTIVATION: During the last decade, trajectory inference (TI) methods have emerged as a novel framework to model cell developmental dynamics, most notably in the area of single-cell transcriptomics. At present, more than 70 TI methods have been published, and recent benchmarks showed that even state-of-the-art methods only perform well for certain trajectory types but not others. RESULTS: In this work, we present TinGa, a new TI model that is fast and flexible, and that is based on Growing Neural Graphs. We performed an extensive comparison of TinGa to five state-of-the-art methods for TI on a set of 250 datasets, including both synthetic as well as real datasets. Overall, TinGa improves the state-of-the-art by producing accurate models (comparable to or an improvement on the state-of-the-art) on the whole spectrum of data complexity, from the simplest linear datasets to the most complex disconnected graphs. In addition, TinGa obtained the fastest execution times, showing that our method is thus one of the most versatile methods up to date. AVAILABILITY AND IMPLEMENTATION: R scripts for running TinGa, comparing it to top existing methods and generating the figures of this article are available at https://github.com/Helena-todd/TinGa.
Asunto(s)
Agentes Nerviosos , Biología ComputacionalRESUMEN
Recent developments in single-cell transcriptomics have opened new opportunities for studying dynamic processes in immunology in a high throughput and unbiased manner. Starting from a mixture of cells in different stages of a developmental process, unsupervised trajectory inference algorithms aim to automatically reconstruct the underlying developmental path that cells are following. In this review, we break down the strategies used by this novel class of methods, and organize their components into a common framework, highlighting several practical advantages and disadvantages of the individual methods. We also give an overview of new insights these methods have already provided regarding the wiring and gene regulation of cell differentiation. As the trajectory inference field is still in its infancy, we propose several future developments that will ultimately lead to a global and data-driven way of studying immune cell differentiation.
Asunto(s)
Diferenciación Celular , Perfilación de la Expresión Génica/métodos , Análisis de la Célula Individual/métodos , Algoritmos , Diferenciación Celular/genética , Biología Computacional , HumanosRESUMEN
BACKGROUND: Interactions among cis-regulatory elements (CREs) play a crucial role in gene regulation. Various approaches have been developed to map these interactions genome-wide, including those relying on interindividual epigenomic variation to identify groups of covariable regulatory elements, referred to as chromatin modules (CMs). While CM mapping allows to investigate the relationship between chromatin modularity and gene expression, the computational principles used for CM identification vary in their application and outcomes. RESULTS: We comprehensively evaluate and streamline existing CM mapping tools and present guidelines for optimal utilization of epigenome data from a diverse population of individuals to assess regulatory coordination across the human genome. We showcase the effectiveness of our recommended practices by analyzing distinct cell types and demonstrate cell type specificity of CRE interactions in CMs and their relevance for gene expression. Integration of genotype information revealed that many non-coding disease-associated variants affect the activity of CMs in a cell type-specific manner by affecting the binding of cell type-specific transcription factors. We provide example cases that illustrate in detail how CMs can be used to deconstruct GWAS loci, assess variable expression of cell surface receptors in immune cells, and reveal how genetic variation can impact the expression of prognostic markers in chronic lymphocytic leukemia. CONCLUSIONS: Our study presents an optimal strategy for CM mapping and reveals how CMs capture the coordination of CREs and its impact on gene expression. Non-coding genetic variants can disrupt this coordination, and we highlight how this may lead to disease predisposition in a cell type-specific manner.
Asunto(s)
Cromatina , Humanos , Cromatina/genética , Cromatina/metabolismo , Genoma Humano , Estudio de Asociación del Genoma Completo , Secuencias Reguladoras de Ácidos Nucleicos , Regulación de la Expresión Génica , Variación GenéticaRESUMEN
Adipose tissue plasticity is orchestrated by molecularly and functionally diverse cells within the stromal vascular fraction (SVF). Although several mouse and human adipose SVF cellular subpopulations have by now been identified, we still lack an understanding of the cellular and functional variability of adipose stem and progenitor cell (ASPC) populations across human fat depots. To address this, we performed single-cell and bulk RNA sequencing (RNA-seq) analyses of >30 SVF/Lin- samples across four human adipose depots, revealing two ubiquitous human ASPC (hASPC) subpopulations with distinct proliferative and adipogenic properties but also depot- and BMI-dependent proportions. Furthermore, we identified an omental-specific, high IGFBP2-expressing stromal population that transitions between mesothelial and mesenchymal cell states and inhibits hASPC adipogenesis through IGFBP2 secretion. Our analyses highlight the molecular and cellular uniqueness of different adipose niches, while our discovery of an anti-adipogenic IGFBP2+ omental-specific population provides a new rationale for the biomedically relevant, limited adipogenic capacity of omental hASPCs.
Asunto(s)
Adipogénesis , Proteína 2 de Unión a Factor de Crecimiento Similar a la Insulina , Epiplón , Células del Estroma , Humanos , Epiplón/metabolismo , Epiplón/citología , Proteína 2 de Unión a Factor de Crecimiento Similar a la Insulina/metabolismo , Proteína 2 de Unión a Factor de Crecimiento Similar a la Insulina/genética , Células del Estroma/metabolismo , Células del Estroma/citología , Femenino , Masculino , Persona de Mediana Edad , Tejido Adiposo/metabolismo , Tejido Adiposo/citología , Adulto , Epitelio/metabolismo , Células Madre/metabolismo , Células Madre/citología , Células Madre Mesenquimatosas/metabolismo , Células Madre Mesenquimatosas/citología , Anciano , AnimalesRESUMEN
Proper differentiation of sperm from germline stem cells, essential for production of the next generation, requires dramatic changes in gene expression that drive remodeling of almost all cellular components, from chromatin to organelles to cell shape itself. Here, we provide a single nucleus and single cell RNA-seq resource covering all of spermatogenesis in Drosophila starting from in-depth analysis of adult testis single nucleus RNA-seq (snRNA-seq) data from the Fly Cell Atlas (FCA) study. With over 44,000 nuclei and 6000 cells analyzed, the data provide identification of rare cell types, mapping of intermediate steps in differentiation, and the potential to identify new factors impacting fertility or controlling differentiation of germline and supporting somatic cells. We justify assignment of key germline and somatic cell types using combinations of known markers, in situ hybridization, and analysis of extant protein traps. Comparison of single cell and single nucleus datasets proved particularly revealing of dynamic developmental transitions in germline differentiation. To complement the web-based portals for data analysis hosted by the FCA, we provide datasets compatible with commonly used software such as Seurat and Monocle. The foundation provided here will enable communities studying spermatogenesis to interrogate the datasets to identify candidate genes to test for function in vivo.
Asunto(s)
Células Madre Adultas , Testículo , Animales , Masculino , Testículo/metabolismo , Drosophila , RNA-Seq , SemenRESUMEN
In this work, we studied the generation of memory precursor cells following an acute infection by analyzing single-cell RNA-seq data that contained CD8 T cells collected during the postinfection expansion phase. We used different tools to reconstruct the developmental trajectory that CD8 T cells followed after activation. Cells that exhibited a memory precursor signature were identified and positioned on this trajectory. We found that these memory precursors are generated continuously with increasing numbers arising over time. Similarly, expression of genes associated with effector functions was also found to be raised in memory precursors at later time points. The ability of cells to enter quiescence and differentiate into memory cells was confirmed by BrdU pulse-chase experiment in vivo. Analysis of cell counts indicates that the vast majority of memory cells are generated at later time points from cells that have extensively divided.
RESUMEN
For more than 100 years, the fruit fly Drosophila melanogaster has been one of the most studied model organisms. Here, we present a single-cell atlas of the adult fly, Tabula Drosophilae, that includes 580,000 nuclei from 15 individually dissected sexed tissues as well as the entire head and body, annotated to >250 distinct cell types. We provide an in-depth analysis of cell type-related gene signatures and transcription factor markers, as well as sexual dimorphism, across the whole animal. Analysis of common cell types between tissues, such as blood and muscle cells, reveals rare cell types and tissue-specific subtypes. This atlas provides a valuable resource for the Drosophila community and serves as a reference to study genetic perturbations and disease models at single-cell resolution.
Asunto(s)
Drosophila melanogaster/citología , Drosophila melanogaster/genética , Transcriptoma , Animales , Núcleo Celular/metabolismo , Bases de Datos Genéticas , Proteínas de Drosophila/genética , Drosophila melanogaster/fisiología , Femenino , Regulación de la Expresión Génica , Redes Reguladoras de Genes , Genes de Insecto , Masculino , RNA-Seq , Caracteres Sexuales , Análisis de la Célula Individual , Factores de Transcripción/genéticaRESUMEN
We present dyngen, a multi-modal simulation engine for studying dynamic cellular processes at single-cell resolution. dyngen is more flexible than current single-cell simulation engines, and allows better method development and benchmarking, thereby stimulating development and testing of computational methods. We demonstrate its potential for spearheading computational methods on three applications: aligning cell developmental trajectories, cell-specific regulatory network inference and estimation of RNA velocity.
Asunto(s)
Simulación por Computador , Redes Reguladoras de Genes , Análisis de la Célula Individual/métodos , Algoritmos , Benchmarking , Biología Computacional/métodos , Perfilación de la Expresión GénicaRESUMEN
Trajectory inference has radically enhanced single-cell RNA-seq research by enabling the study of dynamic changes in gene expression. Downstream of trajectory inference, it is vital to discover genes that are (i) associated with the lineages in the trajectory, or (ii) differentially expressed between lineages, to illuminate the underlying biological processes. Current data analysis procedures, however, either fail to exploit the continuous resolution provided by trajectory inference, or fail to pinpoint the exact types of differential expression. We introduce tradeSeq, a powerful generalized additive model framework based on the negative binomial distribution that allows flexible inference of both within-lineage and between-lineage differential expression. By incorporating observation-level weights, the model additionally allows to account for zero inflation. We evaluate the method on simulated datasets and on real datasets from droplet-based and full-length protocols, and show that it yields biological insights through a clear interpretation of the data.
Asunto(s)
Perfilación de la Expresión Génica , Análisis de Secuencia de ARN , Análisis de la Célula Individual , Animales , Médula Ósea/metabolismo , Simulación por Computador , Bases de Datos Genéticas , Regulación de la Expresión Génica , Ratones , Modelos Estadísticos , Mucosa Olfatoria/metabolismo , Análisis de Componente PrincipalRESUMEN
This protocol explains how to perform a fast SCENIC analysis alongside standard best practices steps on single-cell RNA-sequencing data using software containers and Nextflow pipelines. SCENIC reconstructs regulons (i.e., transcription factors and their target genes) assesses the activity of these discovered regulons in individual cells and uses these cellular activity patterns to find meaningful clusters of cells. Here we present an improved version of SCENIC with several advances. SCENIC has been refactored and reimplemented in Python (pySCENIC), resulting in a tenfold increase in speed, and has been packaged into containers for ease of use. It is now also possible to use epigenomic track databases, as well as motifs, to refine regulons. In this protocol, we explain the different steps of SCENIC: the workflow starts from the count matrix depicting the gene abundances for all cells and consists of three stages. First, coexpression modules are inferred using a regression per-target approach (GRNBoost2). Next, the indirect targets are pruned from these modules using cis-regulatory motif discovery (cisTarget). Lastly, the activity of these regulons is quantified via an enrichment score for the regulon's target genes (AUCell). Nonlinear projection methods can be used to display visual groupings of cells based on the cellular activity patterns of these regulons. The results can be exported as a loom file and visualized in the SCope web application. This protocol is illustrated on two use cases: a peripheral blood mononuclear cell data set and a panel of single-cell RNA-sequencing cancer experiments. For a data set of 10,000 genes and 50,000 cells, the pipeline runs in <2 h.
Asunto(s)
Redes Reguladoras de Genes , Análisis de la Célula Individual/métodos , Flujo de Trabajo , Animales , Línea Celular Tumoral , Humanos , RatonesRESUMEN
Trajectory inference approaches analyze genome-wide omics data from thousands of single cells and computationally infer the order of these cells along developmental trajectories. Although more than 70 trajectory inference tools have already been developed, it is challenging to compare their performance because the input they require and output models they produce vary substantially. Here, we benchmark 45 of these methods on 110 real and 229 synthetic datasets for cellular ordering, topology, scalability and usability. Our results highlight the complementarity of existing tools, and that the choice of method should depend mostly on the dataset dimensions and trajectory topology. Based on these results, we develop a set of guidelines to help users select the best method for their dataset. Our freely available data and evaluation pipeline ( https://benchmark.dynverse.org ) will aid in the development of improved tools designed to analyze increasingly large and complex single-cell datasets.
Asunto(s)
Biología Computacional/métodos , Genoma/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de la Célula Individual/métodos , Benchmarking , Secuenciación de Nucleótidos de Alto Rendimiento/tendencias , Análisis de la Célula Individual/tendenciasRESUMEN
Recent technological breakthroughs in single-cell RNA sequencing are revolutionizing modern experimental design in biology. The increasing size of the single-cell expression data from which networks can be inferred allows identifying more complex, non-linear dependencies between genes. Moreover, the inter-cellular variability that is observed in single-cell expression data can be used to infer not only one global network representing all the cells, but also numerous regulatory networks that are more specific to certain conditions. By experimentally perturbing certain genes, the deconvolution of the true contribution of these genes can also be greatly facilitated. In this chapter, we will therefore tackle the advantages of single-cell transcriptomic data and show how new methods exploit this novel data type to enhance the inference of gene regulatory networks.
Asunto(s)
Perfilación de la Expresión Génica/métodos , Redes Reguladoras de Genes , Modelos Genéticos , Análisis de la Célula Individual/métodos , Biología de Sistemas/métodos , Algoritmos , Perfilación de la Expresión Génica/instrumentación , Secuenciación de Nucleótidos de Alto Rendimiento/instrumentación , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Análisis de Secuencia de ARN , Análisis de la Célula Individual/instrumentación , Biología de Sistemas/instrumentaciónRESUMEN
In computational biology and other sciences, researchers are frequently faced with a choice between several computational methods for performing data analyses. Benchmarking studies aim to rigorously compare the performance of different methods using well-characterized benchmark datasets, to determine the strengths of each method or to provide recommendations regarding suitable choices of methods for an analysis. However, benchmarking studies must be carefully designed and implemented to provide accurate, unbiased, and informative results. Here, we summarize key practical guidelines and recommendations for performing high-quality benchmarking analyses, based on our experiences in computational biology.
Asunto(s)
Biología Computacional/normas , Guías como Asunto , Benchmarking , Conjuntos de Datos como Asunto , Edición , Proyectos de Investigación , Programas InformáticosRESUMEN
A critical step in the analysis of large genome-wide gene expression datasets is the use of module detection methods to group genes into co-expression modules. Because of limitations of classical clustering methods, numerous alternative module detection methods have been proposed, which improve upon clustering by handling co-expression in only a subset of samples, modelling the regulatory network, and/or allowing overlap between modules. In this study we use known regulatory networks to do a comprehensive and robust evaluation of these different methods. Overall, decomposition methods outperform all other strategies, while we do not find a clear advantage of biclustering and network inference-based approaches on large gene expression datasets. Using our evaluation workflow, we also investigate several practical aspects of module detection, such as parameter estimation and the use of alternative similarity measures, and conclude with recommendations for the further development of these methods.