Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
1.
Nat Methods ; 19(4): 445-448, 2022 04.
Article in English | MEDLINE | ID: mdl-35396485

ABSTRACT

Structural variants are associated with cancers and developmental disorders, but challenges with estimating population frequency remain a barrier to prioritizing mutations over inherited variants. In particular, variability in variant calling heuristics and filtering limits the use of current structural variant catalogs. We present STIX, a method that, instead of relying on variant calls, indexes and searches the raw alignments from thousands of samples to enable more comprehensive allele frequency estimation.


Subject(s)
Genome , Genomic Structural Variation , Neoplasms , Algorithms , Genomic Structural Variation/genetics , High-Throughput Nucleotide Sequencing , Humans , Neoplasms/genetics , Software
3.
Sci Rep ; 14(1): 3432, 2024 02 10.
Article in English | MEDLINE | ID: mdl-38341450

ABSTRACT

Many nocturnally active fireflies use precisely timed bioluminescent patterns to identify mates, making them especially vulnerable to light pollution. As urbanization continues to brighten the night sky, firefly populations are under constant stress, and close to half of the species are now threatened. Ensuring the survival of firefly biodiversity depends on a large-scale conservation effort to monitor and protect thousands of populations. While species can be identified by their flash patterns, current methods require expert measurement and manual classification and are infeasible given the number and geographic distribution of fireflies. Here we present the application of a recurrent neural network (RNN) for accurate automated firefly flash pattern classification. Using recordings from commodity cameras, we can extract flash trajectories of individuals within a swarm and classify their species with an accuracy of approximately seventy percent. In addition to its potential in population monitoring, automated classification provides the means to study firefly behavior at the population level. We employ the classifier to measure and characterize the variability within and between swarms, unlocking a new dimension of their behavior. Our method is open source, and deployment in community science applications could revolutionize our ability to monitor and understand firefly populations.


Subject(s)
Fireflies , Sexual Behavior, Animal , Humans , Animals
4.
Cell Genom ; : 100604, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38959898

ABSTRACT

Insulinomas are rare neuroendocrine tumors arising from pancreatic ß cells, characterized by aberrant proliferation and altered insulin secretion, leading to glucose homeostasis failure. With the aim of uncovering the role of noncoding regulatory regions and their aberrations in the development of these tumors, we coupled epigenetic and transcriptome profiling with whole-genome sequencing. As a result, we unraveled somatic mutations associated with changes in regulatory functions. Critically, these regions impact insulin secretion, tumor development, and epigenetic modifying genes, including polycomb complex components. Chromatin remodeling is apparent in insulinoma-selective domains shared across patients, containing a specific set of regulatory sequences dominated by the SOX17 binding motif. Moreover, many of these regions are H3K27me3 repressed in ß cells, suggesting that tumoral transition involves derepression of polycomb-targeted domains. Our work provides a compendium of aberrant cis-regulatory elements affecting the function and fate of ß cells in their progression to insulinomas and a framework to identify coding and noncoding driver mutations.

5.
Genome Biol ; 22(1): 161, 2021 05 25.
Article in English | MEDLINE | ID: mdl-34034781

ABSTRACT

Visual validation is an important step to minimize false-positive predictions from structural variant (SV) detection. We present Samplot, a tool for creating images that display the read depth and sequence alignments necessary to adjudicate purported SVs across samples and sequencing technologies. These images can be rapidly reviewed to curate large SV call sets. Samplot is applicable to many biological problems such as SV prioritization in disease studies, analysis of inherited variation, or de novo SV review. Samplot includes a machine learning package that dramatically decreases the number of false positives without human review. Samplot is available at https://github.com/ryanlayer/samplot .


Subject(s)
Genomic Structural Variation , Software , Automation , Chromosome Inversion , Gene Duplication , Reproducibility of Results , Translocation, Genetic
6.
PLoS One ; 15(4): e0232332, 2020.
Article in English | MEDLINE | ID: mdl-32353042

ABSTRACT

The assay for transposase-accessible chromatin followed by sequencing (ATAC-seq) is an inexpensive protocol for measuring open chromatin regions. ATAC-seq is also relatively simple and requires fewer cells than many other high-throughput sequencing protocols. Therefore, it is tractable in numerous settings where other high throughput assays are challenging to impossible. Hence it is important to understand the limits of what can be inferred from ATAC-seq data. In this work, we leverage ATAC-seq to predict the presence of nascent transcription. Nascent transcription assays are the current gold standard for identifying regions of active transcription, including markers for functional transcription factor (TF) binding. We combine mapped short reads from ATAC-seq with the underlying peak sequence, to determine regions of active transcription genome-wide. We show that a hybrid signal/sequence representation classified using recurrent neural networks (RNNs) can identify these regions across different cell types.


Subject(s)
DNA-Directed RNA Polymerases/metabolism , Sequence Analysis, DNA/methods , Transcription Initiation Site , A549 Cells , HCT116 Cells , Humans , MCF-7 Cells , Neural Networks, Computer , Nucleotide Motifs , Protein Binding , Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL