Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 42
Filter
1.
Sci Rep ; 11(1): 22997, 2021 11 26.
Article in English | MEDLINE | ID: mdl-34837000

ABSTRACT

We present a simple and efficient hypothesis-free machine learning pipeline for risk factor discovery that accounts for non-linearity and interaction in large biomedical databases with minimal variable pre-processing. In this study, mortality models were built using gradient boosting decision trees (GBDT) and important predictors were identified using a Shapley values-based feature attribution method, SHAP values. Cox models controlled for false discovery rate were used for confounder adjustment, interpretability, and further validation. The pipeline was tested using information from 502,506 UK Biobank participants, aged 37-73 years at recruitment and followed over seven years for mortality registrations. From the 11,639 predictors included in GBDT, 193 potential risk factors had SHAP values ≥ 0.05, passed the correlation test, and were selected for further modelling. Of the total variable importance summed up, 60% was directly health related, and baseline characteristics, sociodemographics, and lifestyle factors each contributed about 10%. Cox models adjusted for baseline characteristics, showed evidence for an association with mortality for 166 out of the 193 predictors. These included mostly well-known risk factors (e.g., age, sex, ethnicity, education, material deprivation, smoking, physical activity, self-rated health, BMI, and many disease outcomes). For 19 predictors we saw evidence for an association in the unadjusted but not adjusted analyses, suggesting bias by confounding. Our GBDT-SHAP pipeline was able to identify relevant predictors 'hidden' within thousands of variables, providing an efficient and pragmatic solution for the first stage of hypothesis free risk factor identification.


Subject(s)
Cognition Disorders/mortality , Databases, Factual , Life Style , Machine Learning , Mortality/trends , Smoking/mortality , Aged , Cognition Disorders/epidemiology , Cohort Studies , Female , Humans , Male , Middle Aged , Risk Factors , Smoking/epidemiology , United Kingdom/epidemiology
2.
Cancers (Basel) ; 13(21)2021 Oct 27.
Article in English | MEDLINE | ID: mdl-34771551

ABSTRACT

Matrix assisted laser desorption/ionization mass spectrometry imaging (MALDI MSI) can determine the spatial distribution of analytes such as protein distributions in a tissue section according to their mass-to-charge ratio. Here, we explored the clinical potential of machine learning (ML) applied to MALDI MSI data for cancer diagnostic classification using tissue microarrays (TMAs) on 302 colorectal (CRC) and 257 endometrial cancer (EC)) patients. ML based on deep neural networks discriminated colorectal tumour from normal tissue with an overall accuracy of 98% in balanced cross-validation (98.2% sensitivity and 98.6% specificity). Moreover, our machine learning approach predicted the presence of lymph node metastasis (LNM) for primary tumours of EC with an accuracy of 80% (90% sensitivity and 69% specificity). Our results demonstrate the capability of MALDI MSI for complementing classic histopathological examination for cancer diagnostic applications.

3.
Comput Biol Med ; 134: 104433, 2021 07.
Article in English | MEDLINE | ID: mdl-34004575

ABSTRACT

BACKGROUND: Word vectors or word embeddings are n-dimensional representations of words and form the backbone of Natural Language Processing of textual data. This research experiments with algorithms that augment word vectors with lexical constraints that are popular in NLP research and clinical domain constraints derived from the Unified Medical Language System (UMLS). It also compares the performance of the augmented vectors with Bio + Clinical BERT vectors which have been trained and fine-tuned on clinical datasets. METHODS: Word2vec vectors are generated for words in a publicly available de-identified Electronic Health Records (EHR) dataset and augmented by ontologies using three algorithms that have fundamentally different approaches to vector augmentation. The augmented vectors are then evaluated alongside publicly available Bio + Clinical BERT on their correlation with human-annotated lists using Spearman's correlation coefficient. They are also evaluated on the downstream task of Named Entity Recognition (NER). Quantitative and empirical evaluations are used to highlight the strengths and weaknesses of the different approaches. RESULTS: The counter-fitted word2vec vectors augmented with information from the UMLS ontology produced the best correlation overall with human-annotated evaluation lists (Spearman's correlation of 0.733 with mini mayo-doctors' annotation) while Bio + Clinical BERT produces the best results in the NER task (F1 of 0.87 and 0.811 on the i2b2 2010 and i2b2 2012 datasets respectively) in our experiments. CONCLUSION: Clinically adapted word2vec vectors successfully encapsulate concepts of lexical and clinical synonymy and antonymy and to a smaller extent, hyponymy and hypernymy. Bio + Clinical BERT vectors perform better at NER and avoid out-of-vocabulary words.


Subject(s)
Natural Language Processing , Unified Medical Language System , Algorithms , Electronic Health Records , Humans
4.
Diagnostics (Basel) ; 11(3)2021 Mar 19.
Article in English | MEDLINE | ID: mdl-33808677

ABSTRACT

Research into machine learning (ML) for clinical vascular analysis, such as those useful for stroke and coronary artery disease, varies greatly between imaging modalities and vascular regions. Limited accessibility to large diverse patient imaging datasets, as well as a lack of transparency in specific methods, are obstacles to further development. This paper reviews the current status of quantitative vascular ML, identifying advantages and disadvantages common to all imaging modalities. Literature from the past 8 years was systematically collected from MEDLINE® and Scopus database searches in January 2021. Papers satisfying all search criteria, including a minimum of 50 patients, were further analysed and extracted of relevant data, for a total of 47 publications. Current ML image segmentation, disease risk prediction, and pathology quantitation methods have shown sensitivities and specificities over 70%, compared to expert manual analysis or invasive quantitation. Despite this, inconsistencies in methodology and the reporting of results have prevented inter-model comparison, impeding the identification of approaches with the greatest potential. The clinical potential of this technology has been well demonstrated in Computed Tomography of coronary artery disease, but remains practically limited in other modalities and body regions, particularly due to a lack of routine invasive reference measurements and patient datasets.

5.
Br J Cancer ; 125(3): 337-350, 2021 08.
Article in English | MEDLINE | ID: mdl-33927352

ABSTRACT

BACKGROUND: Glioblastoma is the most aggressive type of brain cancer with high-levels of intra- and inter-tumour heterogeneity that contribute to its rapid growth and invasion within the brain. However, a spatial characterisation of gene signatures and the cell types expressing these in different tumour locations is still lacking. METHODS: We have used a deep convolutional neural network (DCNN) as a semantic segmentation model to segment seven different tumour regions including leading edge (LE), infiltrating tumour (IT), cellular tumour (CT), cellular tumour microvascular proliferation (CTmvp), cellular tumour pseudopalisading region around necrosis (CTpan), cellular tumour perinecrotic zones (CTpnz) and cellular tumour necrosis (CTne) in digitised glioblastoma histopathological slides from The Cancer Genome Atlas (TCGA). Correlation analysis between segmentation results from tumour images together with matched RNA expression data was performed to identify genetic signatures that are specific to different tumour regions. RESULTS: We found that spatially resolved gene signatures were strongly correlated with survival in patients with defined genetic mutations. Further in silico cell ontology analysis along with single-cell RNA sequencing data from resected glioblastoma tissue samples showed that these tumour regions had different gene signatures, whose expression was driven by different cell types in the regional tumour microenvironment. Our results further pointed to a key role for interactions between microglia/pericytes/monocytes and tumour cells that occur in the IT and CTmvp regions, which may contribute to poor patient survival. CONCLUSIONS: This work identified key histopathological features that correlate with patient survival and detected spatially associated genetic signatures that contribute to tumour-stroma interactions and which should be investigated as new targets in glioblastoma. The source codes and datasets used are available in GitHub: https://github.com/amin20/GBM_WSSM .


Subject(s)
Brain Neoplasms/diagnostic imaging , Gene Expression Profiling/methods , Gene Regulatory Networks , Glioblastoma/diagnostic imaging , Radiographic Image Interpretation, Computer-Assisted/methods , Brain Neoplasms/genetics , Deep Learning , Gene Expression Regulation, Neoplastic , Glioblastoma/genetics , Humans , Neural Networks, Computer , Single-Cell Analysis , Stem Cell Niche , Survival Analysis , Tumor Microenvironment
6.
J Pers Med ; 10(4)2020 Nov 12.
Article in English | MEDLINE | ID: mdl-33198332

ABSTRACT

In recent years, improved deep learning techniques have been applied to biomedical image processing for the classification and segmentation of different tumors based on magnetic resonance imaging (MRI) and histopathological imaging (H&E) clinical information. Deep Convolutional Neural Networks (DCNNs) architectures include tens to hundreds of processing layers that can extract multiple levels of features in image-based data, which would be otherwise very difficult and time-consuming to be recognized and extracted by experts for classification of tumors into different tumor types, as well as segmentation of tumor images. This article summarizes the latest studies of deep learning techniques applied to three different kinds of brain cancer medical images (histology, magnetic resonance, and computed tomography) and highlights current challenges in the field for the broader applicability of DCNN in personalized brain cancer care by focusing on two main applications of DCNNs: classification and segmentation of brain cancer tumors images.

7.
Neuroscience ; 422: 230-239, 2019 12 01.
Article in English | MEDLINE | ID: mdl-31806080

ABSTRACT

Brain connectivity studies have reported that functional networks change with older age. We aim to (1) investigate whether electroencephalography (EEG) data can be used to distinguish between individual functional networks of young and old adults; and (2) identify the functional connections that contribute to this classification. Two eyes-open resting-state EEG recording sessions with 64 electrodes for each of 22 younger adults (19-37 years) and 22 older adults (63-85 years) were conducted. For each session, imaginary coherence matrices in delta, theta, alpha, beta and gamma bands were computed. A range of machine learning classification methods were utilized to distinguish younger and older adult brains. A support vector machine (SVM) classifier was 93% accurate in classifying the brains by age group. We report decreased functional connectivity with older age in delta, theta, alpha and gamma bands, and increased connectivity with older age in beta band. Most connections involving frontal, temporal, and parietal electrodes, and more than half of connections involving occipital electrodes, showed decreased connectivity with older age. Slightly less than half of the connections involving central electrodes showed increased connectivity with older age. Functional connections showing decreased strength with older age were not significantly different in electrode-to-electrode distance than those that increased with older age. Most of the connections used by the classifier to distinguish participants by age group belonged to the alpha band. Findings suggest a decrease in connectivity in key networks and frequency bands associated with attention and awareness, and an increase in connectivity of the sensorimotor functional networks with aging during a resting state.


Subject(s)
Aging/physiology , Brain Waves/physiology , Neural Pathways/physiology , Adult , Aged , Aged, 80 and over , Electroencephalography , Female , Humans , Machine Learning , Male , Middle Aged , Support Vector Machine , Young Adult
8.
J Neurophysiol ; 120(5): 2532-2541, 2018 11 01.
Article in English | MEDLINE | ID: mdl-29975165

ABSTRACT

Transcranial magnetic stimulation (TMS) is a technique that enables noninvasive manipulation of neural activity and holds promise in both clinical and basic research settings. The effect of TMS on the motor cortex is often measured by electromyography (EMG) recordings from a small hand muscle. However, the details of how TMS generates responses measured with EMG are not completely understood. We aim to develop a biophysically detailed computational model to study the potential mechanisms underlying the generation of EMG signals following TMS. Our model comprises a feed-forward network of cortical layer 2/3 cells, which drive morphologically detailed layer 5 corticomotoneuronal cells, which in turn project to a pool of motoneurons. EMG signals are modeled as the sum of motor unit action potentials. EMG recordings from the first dorsal interosseous muscle were performed in four subjects and compared with simulated EMG signals. Our model successfully reproduces several characteristics of the experimental data. The simulated EMG signals match experimental EMG recordings in shape and size, and change with stimulus intensity and contraction level as in experimental recordings. They exhibit cortical silent periods that are close to the biological values and reveal an interesting dependence on inhibitory synaptic transmission properties. Our model predicts several characteristics of the firing patterns of neurons along the entire pathway from cortical layer 2/3 cells down to spinal motoneurons and should be considered as a viable tool for explaining and analyzing EMG signals following TMS. NEW & NOTEWORTHY A biophysically detailed model of EMG signal generation following transcranial magnetic stimulation (TMS) is proposed. Simulated EMG signals match experimental EMG recordings in shape and amplitude. Motor-evoked potential and cortical silent period properties match experimental data. The model is a viable tool to analyze, explain, and predict EMG signals following TMS.


Subject(s)
Evoked Potentials, Motor , Models, Neurological , Muscle, Skeletal/physiology , Adult , Computer Simulation , Electromyography , Female , Humans , Male , Motor Cortex/cytology , Motor Cortex/physiology , Motor Neurons/physiology , Muscle Contraction , Muscle, Skeletal/innervation , Transcranial Magnetic Stimulation
9.
R Soc Open Sci ; 4(9): 160889, 2017 Sep.
Article in English | MEDLINE | ID: mdl-28989729

ABSTRACT

Suprathreshold stochastic resonance (SSR) is a distinct form of stochastic resonance, which occurs in multilevel parallel threshold arrays with no requirements on signal strength. In the generic SSR model, an optimal weighted decoding scheme shows its superiority in minimizing the mean square error (MSE). In this study, we extend the proposed optimal weighted decoding scheme to more general input characteristics by combining a Kalman filter and a least mean square (LMS) recursive algorithm, wherein the weighted coefficients can be adaptively adjusted so as to minimize the MSE without complete knowledge of input statistics. We demonstrate that the optimal weighted decoding scheme based on the Kalman-LMS recursive algorithm is able to robustly decode the outputs from the system in which SSR is observed, even for complex situations where the signal and noise vary over time.

10.
PLoS Comput Biol ; 13(9): e1005634, 2017 Sep.
Article in English | MEDLINE | ID: mdl-28937977

ABSTRACT

In the brain, the postsynaptic response of a neuron to time-varying inputs is determined by the interaction of presynaptic spike times with the short-term dynamics of each synapse. For a neuron driven by stochastic synapses, synaptic depression results in a quite different postsynaptic response to a large population input depending on how correlated in time the spikes across individual synapses are. Here we show using both simulations and mathematical analysis that not only the rate but the phase of the postsynaptic response to a rhythmic population input varies as a function of synaptic dynamics and synaptic configuration. Resultant phase leads may compensate for transmission delays and be predictive of rhythmic changes. This could be particularly important for sensory processing and motor rhythm generation in the nervous system.


Subject(s)
Action Potentials/physiology , Models, Neurological , Neuronal Plasticity/physiology , Animals , Computational Biology
11.
IEEE Trans Image Process ; 26(10): 4669-4683, 2017 Oct.
Article in English | MEDLINE | ID: mdl-28436874

ABSTRACT

This paper addresses the problem of online tracking and classification of multiple objects in an image sequence. Our proposed solution is to first track all objects in the scene without relying on object-specific prior knowledge, which in other systems can take the form of hand-crafted features or user-based track initialization. We then classify the tracked objects with a fast-learning image classifier, that is based on a shallow convolutional neural network architecture and demonstrate that object recognition improves when this is combined with object state information from the tracking algorithm. We argue that by transferring the use of prior knowledge from the detection and tracking stages to the classification stage, we can design a robust, general purpose object recognition system with the ability to detect and track a variety of object types. We describe our biologically inspired implementation, which adaptively learns the shape and motion of tracked objects, and apply it to the Neovision2 Tower benchmark data set, which contains multiple object types. An experimental evaluation demonstrates that our approach is competitive with the state-of-the-art video object recognition systems that do make use of object-specific prior knowledge in detection and tracking, while providing additional practical advantages by virtue of its generality.

12.
IEEE Trans Biomed Eng ; 64(9): 2219-2229, 2017 09.
Article in English | MEDLINE | ID: mdl-27925583

ABSTRACT

OBJECTIVE: By modeling the cochlear implant (CI) electrode-to-nerve interface and quantifying electrode discriminability in the model, we address the questions of how many individual channels can be distinguished by CI recipients and the extent to which performance might be improved by inserting electrodes deeper into the cochlea. METHOD: We adapt an artificial neural network to model electrode discrimination as well as a commonly used psychophysical measure (four-interval forced-choice) in CI stimulation and predict how well the locations of the stimulating electrodes can be inferred from simulated auditory nerve spiking patterns. RESULTS: We show that a longer electrode leads to better electrode place discrimination in our model. For a simulated four-interval forced-choice procedure, correct classification rates significantly reduce with decreasing distance between the test electrodes and the reference electrodes, and higher correct classification rates may be achieved by the basal electrodes than apical electrodes. CONCLUSION: Our results suggest that enhanced electrode discriminability results from a longer CI electrode array, and the locations where the errors occur along the electrode array are not only affected by the distance between electrodes but also the twirling angle between electrodes. SIGNIFICANCE: Our models and simulations provide theoretical insights into several important clinically relevant problems that will inform future designs of CI electrode arrays and stimulation strategies.


Subject(s)
Auditory Threshold/physiology , Cochlea/physiology , Cochlear Implantation/methods , Cochlear Implants , Electric Stimulation Therapy/instrumentation , Models, Neurological , Computer Simulation , Electric Stimulation Therapy/methods , Equipment Design , Equipment Failure Analysis , Humans
13.
J Comput Neurosci ; 41(2): 193-206, 2016 Oct.
Article in English | MEDLINE | ID: mdl-27480847

ABSTRACT

Neural spike trains are commonly characterized as a Poisson point process. However, the Poisson assumption is a poor model for spiking in auditory nerve fibres because it is known that interspike intervals display positive correlation over long time scales and negative correlation over shorter time scales. We have therefore developed a biophysical model based on the well-known Meddis model of the peripheral auditory system, to produce simulated auditory nerve fibre spiking statistics that more closely match the firing correlations observed in empirical data. We achieve this by introducing biophysically realistic ion channel noise to an inner hair cell membrane potential model that includes fractal fast potassium channels and deterministic slow potassium channels. We succeed in producing simulated spike train statistics that match empirically observed firing correlations. Our model thus replicates macro-scale stochastic spiking statistics in the auditory nerve fibres due to modeling stochasticity at the micro-scale of potassium channels.


Subject(s)
Action Potentials , Cochlear Nerve , Ion Channels/physiology , Models, Neurological , Neurons , Potassium Channels
15.
Proc Math Phys Eng Sci ; 472(2187): 20150748, 2016 Mar.
Article in English | MEDLINE | ID: mdl-27118917

ABSTRACT

Is it possible for a large sequence of measurements or observations, which support a hypothesis, to counterintuitively decrease our confidence? Can unanimous support be too good to be true? The assumption of independence is often made in good faith; however, rarely is consideration given to whether a systemic failure has occurred. Taking this into account can cause certainty in a hypothesis to decrease as the evidence for it becomes apparently stronger. We perform a probabilistic Bayesian analysis of this effect with examples based on (i) archaeological evidence, (ii) weighing of legal evidence and (iii) cryptographic primality testing. In this paper, we investigate the effects of small error rates in a set of measurements or observations. We find that even with very low systemic failure rates, high confidence is surprisingly difficult to achieve; in particular, we find that certain analyses of cryptographically important numerical tests are highly optimistic, underestimating their false-negative rate by as much as a factor of 280.

16.
PLoS One ; 10(8): e0134254, 2015.
Article in English | MEDLINE | ID: mdl-26262687

ABSTRACT

Recent advances in training deep (multi-layer) architectures have inspired a renaissance in neural network use. For example, deep convolutional networks are becoming the default option for difficult tasks on large datasets, such as image and speech recognition. However, here we show that error rates below 1% on the MNIST handwritten digit benchmark can be replicated with shallow non-convolutional neural networks. This is achieved by training such networks using the 'Extreme Learning Machine' (ELM) approach, which also enables a very rapid training time (∼ 10 minutes). Adding distortions, as is common practise for MNIST, reduces error rates even further. Our methods are also shown to be capable of achieving less than 5.5% error rates on the NORB image database. To achieve these results, we introduce several enhancements to the standard ELM algorithm, which individually and in combination can significantly improve performance. The main innovation is to ensure each hidden-unit operates only on a randomly sized and positioned patch of each image. This form of random 'receptive field' sampling of the input ensures the input weight matrix is sparse, with about 90% of weights equal to zero. Furthermore, combining our methods with a small number of iterations of a single-batch backpropagation method can significantly reduce the number of hidden-units required to achieve a particular performance. Our close to state-of-the-art results for MNIST and NORB suggest that the ease of use and accuracy of the ELM algorithm for designing a single-hidden-layer neural network classifier should cause it to be given greater consideration either as a standalone method for simpler problems, or as the final classification stage in deep neural networks applied to more difficult problems.


Subject(s)
Algorithms , Models, Theoretical
17.
Network ; 26(2): 35-71, 2015.
Article in English | MEDLINE | ID: mdl-25760433

ABSTRACT

Stochastic resonance (SR) is said to be observed when the presence of noise in a nonlinear system enables an output signal from the system to better represent some feature of an input signal than it does in the absence of noise. The effect has been observed in models of individual neurons, and in experiments performed on real neural systems. Despite the ubiquity of biophysical sources of stochastic noise in the nervous system, however, it has not yet been established whether neuronal computation mechanisms involved in performance of specific functions such as perception or learning might exploit such noise as an integral component, such that removal of the noise would diminish performance of these functions. In this paper we revisit the methods used to demonstrate stochastic resonance in models of single neurons. This includes a previously unreported observation in a multicompartmental model of a CA1-pyramidal cell. We also discuss, as a contrast to these classical studies, a form of 'stochastic facilitation', known as inverse stochastic resonance. We draw on the reviewed examples to argue why new approaches to studying 'stochastic facilitation' in neural systems need to be developed.


Subject(s)
Computer Simulation , Models, Neurological , Neurons/physiology , Stochastic Processes , Animals , Humans
18.
Neural Comput ; 27(1): 74-103, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25380331

ABSTRACT

In this letter, we provide a stochastic analysis of, and supporting simulation data for, a stochastic model of the generation of gamma bursts in local field potential (LFP) recordings by interacting populations of excitatory and inhibitory neurons. Our interest is in behavior near a fixed point of the stochastic dynamics of the model. We apply a recent limit theorem of stochastic dynamics to probe into details of this local behavior, obtaining several new results. We show that the stochastic model can be written in terms of a rotation multiplied by a two-dimensional standard Ornstein-Uhlenbeck (OU) process. Viewing the rewritten process in terms of phase and amplitude processes, we are able to proceed further in analysis. We demonstrate that gamma bursts arise in the model as excursions of the modulus of the OU process. The associated pair of stochastic phase and amplitude processes satisfies their own pair of stochastic differential equations, which indicates that large phase slips occur between gamma bursts. This behavior is mirrored in LFP data simulated from the original model. These results suggest that the rewritten model is a valid representation of the behavior near the fixed point for a wide class of models of oscillatory neural processes.


Subject(s)
Evoked Potentials/physiology , Gamma Rhythm/physiology , Models, Neurological , Nonlinear Dynamics , Electroencephalography , Humans , Neurons/physiology , Spectrum Analysis , Stochastic Processes
19.
PLoS One ; 9(12): e114503, 2014.
Article in English | MEDLINE | ID: mdl-25486535

ABSTRACT

Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs) and 'functional' (partial subgraphs). Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.


Subject(s)
Algorithms , Caenorhabditis elegans/genetics , Gene Regulatory Networks , Models, Biological , Neural Pathways , Neurons/metabolism , Protein Interaction Mapping , Animals , Cluster Analysis , Computational Biology , Computer Simulation
20.
PLoS One ; 9(11): e113159, 2014.
Article in English | MEDLINE | ID: mdl-25402466

ABSTRACT

When we look at the world--or a graphical depiction of the world--we perceive surface materials (e.g. a ceramic black and white checkerboard) independently of variations in illumination (e.g. shading or shadow) and atmospheric media (e.g. clouds or smoke). Such percepts are partly based on the way physical surfaces and media reflect and transmit light and partly on the way the human visual system processes the complex patterns of light reaching the eye. One way to understand how these percepts arise is to assume that the visual system parses patterns of light into layered perceptual representations of surfaces, illumination and atmospheric media, one seen through another. Despite a great deal of previous experimental and modelling work on layered representation, however, a unified computational model of key perceptual demonstrations is still lacking. Here we present the first general computational model of perceptual layering and surface appearance--based on a boarder theoretical framework called gamut relativity--that is consistent with these demonstrations. The model (a) qualitatively explains striking effects of perceptual transparency, figure-ground separation and lightness, (b) quantitatively accounts for the role of stimulus- and task-driven constraints on perceptual matching performance, and (c) unifies two prominent theoretical frameworks for understanding surface appearance. The model thereby provides novel insights into the remarkable capacity of the human visual system to represent and identify surface materials, illumination and atmospheric media, which can be exploited in computer graphics applications.


Subject(s)
Color Perception/physiology , Contrast Sensitivity/physiology , Models, Theoretical , Humans , Lighting , Perception , Psychophysics
SELECTION OF CITATIONS
SEARCH DETAIL