Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
Add more filters










Publication year range
1.
J Exp Bot ; 2024 May 29.
Article in English | MEDLINE | ID: mdl-38808657

ABSTRACT

Chilling stress threatens plant growth and development, particularly affecting membrane fluidity and cellular integrity. Understanding plant membrane responses to chilling stress is important for unraveling the molecular mechanisms of stress tolerance. Whereas core transcriptional responses to chilling stress and stress tolerance are conserved across species, the associated changes in membrane lipids appear to be less conserved, as which lipids are affected by chilling stress varies by species. Here, we investigated changes in gene expression and membrane lipids in response to chilling stress during one 24 hour cycle in chilling-tolerant foxtail millet (Setaria italica), and chilling-sensitive sorghum (Sorghum bicolor), and Urochloa (browntop signal grass, Urochloa fusca, lipids only), leveraging their evolutionary relatedness and differing levels of chilling-stress tolerance. We show that most chilling-induced lipid changes are conserved across the three species, while we observed distinct, time-specific responses in chilling-tolerant foxtail millet, indicating the presence of a finely orchestrated adaptive mechanism. We detected rhythmicity in lipid responses to chilling stress in the three grasses, which were also present in Arabidopsis (Arabidopsis thaliana), suggesting the conservation of rhythmic patterns across species and highlighting the importance of accounting for time of day. When integrating lipid datasets with gene expression profiles, we identified potential candidate genes that showed corresponding transcriptional changes in response to chilling stress, providing insights into the differences in regulatory mechanisms between chilling-sensitive sorghum and chilling-tolerant foxtail millet.

2.
Bioinformatics ; 40(3)2024 Mar 04.
Article in English | MEDLINE | ID: mdl-38430463

ABSTRACT

MOTIVATION: Large-scale gene expression studies allow gene network construction to uncover associations among genes. To study direct associations among genes, partial correlation-based networks are preferred over marginal correlations. However, FDR control for partial correlation-based network construction is not well-studied. In addition, currently available partial correlation-based methods cannot take existing biological knowledge to help network construction while controlling FDR. RESULTS: In this paper, we propose a method called Partial Correlation Graph with Information Incorporation (PCGII). PCGII estimates partial correlations between each pair of genes by regularized node-wise regression that can incorporate prior knowledge while controlling the effects of all other genes. It handles high-dimensional data where the number of genes can be much larger than the sample size and controls FDR at the same time. We compare PCGII with several existing approaches through extensive simulation studies and demonstrate that PCGII has better FDR control and higher power. We apply PCGII to a plant gene expression dataset where it recovers confirmed regulatory relationships and a hub node, as well as several direct associations that shed light on potential functional relationships in the system. We also introduce a method to supplement observed data with a pseudogene to apply PCGII when no prior information is available, which also allows checking FDR control and power for real data analysis. AVAILABILITY AND IMPLEMENTATION: R package is freely available for download at https://cran.r-project.org/package=PCGII.


Subject(s)
Algorithms , Gene Regulatory Networks , Computer Simulation , Genes, Plant , Sample Size
3.
Research (Wash D C) ; 6: 0099, 2023.
Article in English | MEDLINE | ID: mdl-37223465

ABSTRACT

The real-world vaccine protection rates (VPRs) against the severe acute respiratory syndrome coronavirus 2 (SARS­CoV­2) infection are critical in formulating future vaccination strategies against the virus. Based on a varying coefficient stochastic epidemic model, we obtain 7 countries' real-world VPRs using daily epidemiological and vaccination data, and find that the VPRs improved with more vaccine doses. The average VPR of the full vaccination was 82% (SE: 4%) and 61% (SE: 3%) in the pre-Delta and Delta-dominated periods, respectively. The Omicron variant reduced the average VPR of the full vaccination to 39% (SE: 2%). However, the booster dose restored the VPR to 63% (SE: 1%) which was significantly above the 50% threshold in the Omicron-dominated period. Scenario analyses show that the existing vaccination strategies have significantly delayed and reduced the timing and the magnitude of the infection peaks, respectively, and doubling the existing booster coverage would lead to 29% fewer confirmed cases and 17% fewer deaths in the 7 countries compared to the outcomes at the existing booster taking rates. These call for higher full vaccine and booster coverage for all countries.

4.
Plant Phenomics ; 5: 0052, 2023.
Article in English | MEDLINE | ID: mdl-37213545

ABSTRACT

High-throughput plant phenotyping-the use of imaging and remote sensing to record plant growth dynamics-is becoming more widely used. The first step in this process is typically plant segmentation, which requires a well-labeled training dataset to enable accurate segmentation of overlapping plants. However, preparing such training data is both time and labor intensive. To solve this problem, we propose a plant image processing pipeline using a self-supervised sequential convolutional neural network method for in-field phenotyping systems. This first step uses plant pixels from greenhouse images to segment nonoverlapping in-field plants in an early growth stage and then applies the segmentation results from those early-stage images as training data for the separation of plants at later growth stages. The proposed pipeline is efficient and self-supervising in the sense that no human-labeled data are needed. We then combine this approach with functional principal components analysis to reveal the relationship between the growth dynamics of plants and genotypes. We show that the proposed pipeline can accurately separate the pixels of foreground plants and estimate their heights when foreground and background plants overlap and can thus be used to efficiently assess the impact of treatments and genotypes on plant growth in a field environment by computer vision techniques. This approach should be useful for answering important scientific questions in the area of high-throughput phenotyping.

5.
Biometrics ; 79(2): 1173-1186, 2023 06.
Article in English | MEDLINE | ID: mdl-35044681

ABSTRACT

Partial correlation is a common tool in studying conditional dependence for Gaussian distributed data. However, partial correlation being zero may not be equivalent to conditional independence under non-Gaussian distributions. In this paper, we propose a statistical inference procedure for partial correlations under the high-dimensional nonparanormal (NPN) model where the observed data are normally distributed after certain monotone transformations. The NPN partial correlation is the partial correlation of the normal transformed data under the NPN model, which is a more general measure of conditional dependence. We estimate the NPN partial correlations by regularized nodewise regression based on the empirical ranks of the original data. A multiple testing procedure is proposed to identify the nonzero NPN partial correlations. The proposed method can be carried out by a simple coordinate descent algorithm for lasso optimization. It is easy-to-implement and computationally more efficient compared to the existing methods for estimating NPN graphical models. Theoretical results are developed to show the asymptotic normality of the proposed estimator and to justify the proposed multiple testing procedure. Numerical simulations and a case study on brain imaging data demonstrate the utility of the proposed procedure and evaluate its performance compared to the existing methods. Data used in preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database.


Subject(s)
Brain , Neuroimaging
6.
Biostatistics ; 24(1): 140-160, 2022 12 12.
Article in English | MEDLINE | ID: mdl-36514939

ABSTRACT

The process of identifying and quantifying metabolites in complex mixtures plays a critical role in metabolomics studies to obtain an informative interpretation of underlying biological processes. Manual approaches are time-consuming and heavily reliant on the knowledge and assessment of nuclear magnetic resonance (NMR) experts. We propose a shifting-corrected regularized regression method, which identifies and quantifies metabolites in a mixture automatically. A detailed algorithm is also proposed to implement the proposed method. Using a novel weight function, the proposed method is able to detect and correct peak shifting errors caused by fluctuations in experimental procedures. Simulation studies show that the proposed method performs better with regard to the identification and quantification of metabolites in a complex mixture. We also demonstrate real data applications of our method using experimental and biological NMR mixtures.


Subject(s)
Magnetic Resonance Imaging , Metabolomics , Humans , Proton Magnetic Resonance Spectroscopy , Magnetic Resonance Spectroscopy/methods , Metabolomics/methods , Algorithms
7.
Methods Mol Biol ; 2539: 57-68, 2022.
Article in English | MEDLINE | ID: mdl-35895196

ABSTRACT

It is essential that the scientific community develop and deploy accurate and high-throughput techniques to capture factors that influence plant phenotypes if we are to meet the projected demands for food and energy. In recognition of this fact, multiple research institutions have invested in automated high-throughput plant phenotyping (HTPP) systems designed for use in controlled environments. These systems can generate large amounts of data in relatively short periods of time, potentially allowing researchers to gain insights about phenotypic responses to environmental, biological, and management factors. Reliable inferences about these factors depends on the use of proper experimental design when planning phenotypic studies in order to avoid issues such as lack of power and confounding. In this chapter, the topic of experimental design will be discussed, from basic principles to examples specific to controlled environment plant phenotyping. Examples will be provided based on the package agricolae in the R statistical language.


Subject(s)
Plants , Research Design , Environment, Controlled , Phenotype , Plants/genetics
8.
Plant Phenomics ; 2021: 9805489, 2021.
Article in English | MEDLINE | ID: mdl-34405144

ABSTRACT

High-throughput phenotyping enables the efficient collection of plant trait data at scale. One example involves using imaging systems over key phases of a crop growing season. Although the resulting images provide rich data for statistical analyses of plant phenotypes, image processing for trait extraction is required as a prerequisite. Current methods for trait extraction are mainly based on supervised learning with human labeled data or semisupervised learning with a mixture of human labeled data and unsupervised data. Unfortunately, preparing a sufficiently large training data is both time and labor-intensive. We describe a self-supervised pipeline (KAT4IA) that uses K-means clustering on greenhouse images to construct training data for extracting and analyzing plant traits from an image-based field phenotyping system. The KAT4IA pipeline includes these main steps: self-supervised training set construction, plant segmentation from images of field-grown plants, automatic separation of target plants, calculation of plant traits, and functional curve fitting of the extracted traits. To deal with the challenge of separating target plants from noisy backgrounds in field images, we describe a novel approach using row-cuts and column-cuts on images segmented by transform domain neural network learning, which utilizes plant pixels identified from greenhouse images to train a segmentation model for field images. This approach is efficient and does not require human intervention. Our results show that KAT4IA is able to accurately extract plant pixels and estimate plant heights.

9.
Biom J ; 63(6): 1325-1341, 2021 08.
Article in English | MEDLINE | ID: mdl-33830499

ABSTRACT

In modern high-throughput plant phenotyping, images of plants of different genotypes are repeatedly taken throughout the growing season, and phenotypic traits of plants (e.g., plant height) are extracted through image processing. It is of interest to recover whole trait trajectories and their derivatives at both genotype and plant levels based on observations made at irregular discrete time points. We propose to model trait trajectories using hierarchical functional principal component analysis (HFPCA) and show that the problem of recovering derivatives of the trajectories is reduced to estimating derivatives of eigenfunctions, which is solved by differentiating eigenequations. Based on HFPCA, we also propose a new measure for the broad-sense heritability by allowing it to vary over time during plant growth. Simulation studies show that the proposed procedure performs better than its competitors in terms of recovering both trait trajectories and their derivatives. Interesting characteristics of plant growth and heritability dynamics are revealed in the application to a modern plant phenotyping study.


Subject(s)
Data Analysis , Plants , Genotype , Image Processing, Computer-Assisted , Phenotype , Plants/genetics
10.
Proc Math Phys Eng Sci ; 477(2248): 20200440, 2021 Apr.
Article in English | MEDLINE | ID: mdl-35153551

ABSTRACT

We study epidemiological characteristics of 25 early COVID-19 outbreak countries, which emphasizes on the reproduction of infection and effects of government control measures. The study is based on a vSIADR model which allows asymptomatic and pre-diagnosis infections to reflect COVID-19 clinical realities, and a linear mixed-effect model to analyse the association between each country's control measures and the effective reproduction number R t . It finds significant effects of higher stringency measures in lowering the reproduction, and a significant shortening effect on the time to the epidemic turning point by applying stronger early counter measures. Epidemic projections under scenarios of the counter measures (China and Korea, the USA and the UK) show substantial reduction in the epidemic size and death by taking earlier and forceful actions. The governments' response before and after the start of the second wave epidemics were alarmingly weak, which made the average duration of the second wave more than doubled that of the first wave. We identify countries which urgently need to restore to at least the maximum stringency measures implemented so far in the pandemic in order to avoid even higher infection size and death.

11.
Plant Phenomics ; 2020: 7481687, 2020.
Article in English | MEDLINE | ID: mdl-33313562

ABSTRACT

High-throughput phenotyping system has become more and more popular in plant science research. The data analysis for such a system typically involves two steps: plant feature extraction through image processing and statistical analysis for the extracted features. The current approach is to perform those two steps on different platforms. We develop the package "implant" in R for both robust feature extraction and functional data analysis. For image processing, the "implant" package provides methods including thresholding, hidden Markov random field model, and morphological operations. For statistical analysis, this package can produce nonparametric curve fitting with its confidence region for plant growth. A functional ANOVA model to test for the treatment and genotype effects on the plant growth dynamics is also provided.

12.
Mol Plant ; 13(6): 907-922, 2020 06 01.
Article in English | MEDLINE | ID: mdl-32171733

ABSTRACT

Linking natural genetic variation to trait variation can help determine the functional roles ofdifferent genes. Variations of one or several traits are often assessed separately. High-throughput phenotyping and data mining can capture dozens or hundreds of traits from the same individuals. Here, we test the association between markers within a gene and many traits simultaneously. This genome-phenome wide association study (GPWAS) is both a multi-marker and multi-trait test. Genes identified using GPWAS with 260 phenotypic traits in maize were enriched for genes independently linked to phenotypic variation. Traits associated with classical mutants were consistent with reported phenotypes for mutant alleles. Genes linked to phenomic variation in maize using GPWAS shared molecular, population genetic, and evolutionary features with classical mutants in maize. Genes linked to phenomic variation in Arabidopsis using GPWAS are significantly enriched in genes with known loss-of-function phenotypes. GPWAS may be an effective strategy to identify genes in which loss-of-function alleles produce mutant phenotypes. The shared signatures present in classical mutants and genes identified using GPWAS may be markers for genes with a role in specifying plant phenotypes generally or pleiotropy specifically.


Subject(s)
Arabidopsis/genetics , Evolution, Molecular , Genome, Plant , Genome-Wide Association Study , Phenomics , Zea mays/genetics , Algorithms , Gene Knockout Techniques , Genes, Plant , Genetic Pleiotropy , Genetic Variation , Models, Genetic , Mutation/genetics , Phenotype , Reproducibility of Results
14.
Biostatistics ; 21(4): 641-658, 2020 10 01.
Article in English | MEDLINE | ID: mdl-30596883

ABSTRACT

Alzheimer's disease (AD) is a chronic neurodegenerative disease that changes the functional connectivity of the brain. The alteration of the strong connections between different brain regions is of particular interest to researchers. In this article, we use partial correlations to model the brain connectivity network and propose a data-driven procedure to recover a $c$-level partial correlation graph based on PET data, which is the graph of the absolute partial correlations larger than a pre-specified constant $c$. The proposed procedure is adaptive to the "large p, small n" scenario commonly seen in whole brain studies, and it incorporates the variation of the estimated partial correlations, which results in higher power compared to the existing methods. A case study on the FDG-PET images from AD and normal control (NC) subjects discovers new brain regions, Sup Frontal and Mid Frontal in the frontal lobe, which have different brain functional connectivity between AD and NC.


Subject(s)
Alzheimer Disease , Neurodegenerative Diseases , Alzheimer Disease/diagnostic imaging , Brain/diagnostic imaging , Humans , Magnetic Resonance Imaging , Neuroimaging , Positron-Emission Tomography
15.
Biometrics ; 75(3): 895-905, 2019 09.
Article in English | MEDLINE | ID: mdl-30820943

ABSTRACT

Thresholding is a regularization method commonly used for covariance estimation, which provides consistent estimators if the population covariance satisfies certain sparsity condition (Bickel and Levina, 2008a; Cai and Liu, 2011). However, the performance of the thresholding estimators heavily depends on the threshold level. By minimizing the Frobenius risk of the adaptive thresholding estimator for covariances, we conduct a theoretical study for the optimal threshold level, and obtain its analytical expression. A consistent estimator based on this expression is proposed for the optimal threshold level, which is easy to implement in practice and efficient in computation. Numerical simulations and a case study on gene expression data are conducted to illustrate the proposed method.


Subject(s)
Biometry/methods , Gene Expression , Models, Statistical , Algorithms , Data Interpretation, Statistical , Humans
16.
BMC Proc ; 12(Suppl 9): 46, 2018.
Article in English | MEDLINE | ID: mdl-30275894

ABSTRACT

Obesity is a risk factor for heart disease, stroke, diabetes, high blood pressure, and other chronic diseases. Some drugs, including fenofibrate, are used to treat obesity or excessive weight by lowering the level of specific triglycerides. However, different groups have different drug sensitivities and, consequently, there are differences in drug effects. In this study, we assessed both genetic and nongenetic factors that influence drug responses and stratified patients into groups based on differential drug effect and sensitivity. Our methodology of investigating genetic factors and nongenetic factors is applicable to studying differential effects of other drugs, such as statins, and provides an approach to the development of personalized medicine.

17.
Plant Methods ; 14: 35, 2018.
Article in English | MEDLINE | ID: mdl-29760766

ABSTRACT

BACKGROUND: Image-based plant phenotyping facilitates the extraction of traits noninvasively by analyzing large number of plants in a relatively short period of time. It has the potential to compute advanced phenotypes by considering the whole plant as a single object (holistic phenotypes) or as individual components, i.e., leaves and the stem (component phenotypes), to investigate the biophysical characteristics of the plants. The emergence timing, total number of leaves present at any point of time and the growth of individual leaves during vegetative stage life cycle of the maize plants are significant phenotypic expressions that best contribute to assess the plant vigor. However, image-based automated solution to this novel problem is yet to be explored. RESULTS: A set of new holistic and component phenotypes are introduced in this paper. To compute the component phenotypes, it is essential to detect the individual leaves and the stem. Thus, the paper introduces a novel method to reliably detect the leaves and the stem of the maize plants by analyzing 2-dimensional visible light image sequences captured from the side using a graph based approach. The total number of leaves are counted and the length of each leaf is measured for all images in the sequence to monitor leaf growth. To evaluate the performance of the proposed algorithm, we introduce University of Nebraska-Lincoln Component Plant Phenotyping Dataset (UNL-CPPD) and provide ground truth to facilitate new algorithm development and uniform comparison. The temporal variation of the component phenotypes regulated by genotypes and environment (i.e., greenhouse) are experimentally demonstrated for the maize plants on UNL-CPPD. Statistical models are applied to analyze the greenhouse environment impact and demonstrate the genetic regulation of the temporal variation of the holistic phenotypes on the public dataset called Panicoid Phenomap-1. CONCLUSION: The central contribution of the paper is a novel computer vision based algorithm for automated detection of individual leaves and the stem to compute new component phenotypes along with a public release of a benchmark dataset, i.e., UNL-CPPD. Detailed experimental analyses are performed to demonstrate the temporal variation of the holistic and component phenotypes in maize regulated by environment and genetic variation with a discussion on their significance in the context of plant science.

18.
Gigascience ; 7(2): 1-11, 2018 02 01.
Article in English | MEDLINE | ID: mdl-29186425

ABSTRACT

Background: Maize (Zea mays ssp. mays) is 1 of 3 crops, along with rice and wheat, responsible for more than one-half of all calories consumed around the world. Increasing the yield and stress tolerance of these crops is essential to meet the growing need for food. The cost and speed of plant phenotyping are currently the largest constraints on plant breeding efforts. Datasets linking new types of high-throughput phenotyping data collected from plants to the performance of the same genotypes under agronomic conditions across a wide range of environments are essential for developing new statistical approaches and computer vision-based tools. Findings: A set of maize inbreds-primarily recently off patent lines-were phenotyped using a high-throughput platform at University of Nebraska-Lincoln. These lines have been previously subjected to high-density genotyping and scored for a core set of 13 phenotypes in field trials across 13 North American states in 2 years by the Genomes 2 Fields Consortium. A total of 485 GB of image data including RGB, hyperspectral, fluorescence, and thermal infrared photos has been released. Conclusions: Correlations between image-based measurements and manual measurements demonstrated the feasibility of quantifying variation in plant architecture using image data. However, naive approaches to measuring traits such as biomass can introduce nonrandom measurement errors confounded with genotype variation. Analysis of hyperspectral image data demonstrated unique signatures from stem tissue. Integrating heritable phenotypes from high-throughput phenotyping data with field data from different environments can reveal previously unknown factors that influence yield plasticity.


Subject(s)
Image Processing, Computer-Assisted/statistics & numerical data , Imaging, Three-Dimensional/methods , Quantitative Trait, Heritable , Zea mays/anatomy & histology , Genotype , Inbreeding , Phenotype , Plant Breeding , Zea mays/classification , Zea mays/genetics
19.
Metabolomics ; 14(8): 108, 2018 08 10.
Article in English | MEDLINE | ID: mdl-30830388

ABSTRACT

INTRODUCTION: Failure to properly account for normal systematic variations in OMICS datasets may result in misleading biological conclusions. Accordingly, normalization is a necessary step in the proper preprocessing of OMICS datasets. In this regards, an optimal normalization method will effectively reduce unwanted biases and increase the accuracy of downstream quantitative analyses. But, it is currently unclear which normalization method is best since each algorithm addresses systematic noise in different ways. OBJECTIVE: Determine an optimal choice of a normalization method for the preprocessing of metabolomics datasets. METHODS: Nine MVAPACK normalization algorithms were compared with simulated and experimental NMR spectra modified with added Gaussian noise and random dilution factors. Methods were evaluated based on an ability to recover the intensities of the true spectral peaks and the reproducibility of true classifying features from orthogonal projections to latent structures-discriminant analysis model (OPLS-DA). RESULTS: Most normalization methods (except histogram matching) performed equally well at modest levels of signal variance. Only probabilistic quotient (PQ) and constant sum (CS) maintained the highest level of peak recovery (> 67%) and correlation with true loadings (> 0.6) at maximal noise. CONCLUSION: PQ and CS performed the best at recovering peak intensities and reproducing the true classifying features for an OPLS-DA model regardless of spectral noise level. Our findings suggest that performance is largely determined by the level of noise in the dataset, while the effect of dilution factors was negligible. A minimal allowable noise level of 20% was also identified for a valid NMR metabolomics dataset.


Subject(s)
Algorithms , Data Interpretation, Statistical , Magnetic Resonance Imaging/methods , Magnetic Resonance Spectroscopy/methods , Metabolomics/methods , Discriminant Analysis , Gene Expression Profiling , Humans , Proteome/analysis , Reproducibility of Results , Signal-To-Noise Ratio
20.
Plant Cell ; 29(8): 1938-1951, 2017 Aug.
Article in English | MEDLINE | ID: mdl-28733421

ABSTRACT

Identifying interspecies changes in gene regulation, one of the two primary sources of phenotypic variation, is challenging on a genome-wide scale. The use of paired time-course data on cold-responsive gene expression in maize (Zea mays) and sorghum (Sorghum bicolor) allowed us to identify differentially regulated orthologs. While the majority of cold-responsive transcriptional regulation of conserved gene pairs is species specific, the initial transcriptional responses to cold appear to be more conserved than later responses. In maize, the promoters of genes with conserved transcriptional responses to cold tend to contain more micrococcal nuclease hypersensitive sites in their promoters, a proxy for open chromatin. Genes with conserved patterns of transcriptional regulation between the two species show lower ratios of nonsynonymous to synonymous substitutions. Genes involved in lipid metabolism, known to be involved in cold acclimation, tended to show consistent regulation in both species. Genes with species-specific cold responses did not cluster in particular pathways nor were they enriched in particular functional categories. We propose that cold-responsive transcriptional regulation in individual species may not be a reliable marker for function, while a core set of genes involved in perceiving and responding to cold stress are subject to functionally constrained cold-responsive regulation across the grass tribe Andropogoneae.


Subject(s)
Gene Expression Profiling , Gene Expression Regulation, Plant , Genome, Plant , Plant Proteins/genetics , Sequence Homology, Amino Acid , Sorghum/genetics , Zea mays/genetics , Chromatin/metabolism , Genes, Plant , Phylogeny , Plant Proteins/metabolism , Stress, Physiological/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...