Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Waste Manag ; 190: 63-73, 2024 Sep 14.
Artigo em Inglês | MEDLINE | ID: mdl-39277917

RESUMO

In recent years, the rapid accumulation of marine waste not only endangers the ecological environment but also causes seawater pollution. Traditional manual salvage methods often have low efficiency and pose safety risks to human operators, making automatic underwater waste recycling a mainstream approach. In this paper, we propose a lightweight multi-scale cross-level network for underwater waste segmentation based on sonar images that provides pixel-level location information and waste categories for autonomous underwater robots. In particular, we introduce hybrid perception and multi-scale attention modules to capture multi-scale contextual features and enhance high-level critical information, respectively. At the same time, we use sampling attention modules and cross-level interaction modules to achieve feature down-sampling and fuse detailed features and semantic features, respectively. Relevant experimental results indicate that our method outperforms other semantic segmentation models and achieves 74.66 % mIoU with only 0.68 M parameters. In particular, compared with the representative PIDNet Small model based on the convolutional neural network architecture, our method can improve the mIoU metric by 1.15 percentage points and can reduce model parameters by approximately 91 %. Compared with the representative SeaFormer T model based on the transformer architecture, our approach can improve the mIoU metric by 2.07 percentage points and can reduce model parameters by approximately 59 %. Our approach maintains a satisfactory balance between model parameters and segmentation performance. Our solution provides new insights into intelligent underwater waste recycling, which helps in promoting sustainable marine development.

2.
Sensors (Basel) ; 24(13)2024 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-39001153

RESUMO

Smoke is an obvious sign of pre-fire. However, due to its variable morphology, the existing schemes are difficult to extract precise smoke characteristics, which seriously affects the practical applications. Therefore, we propose a lightweight cross-layer smoke-aware network (CLSANet) of only 2.38 M. To enhance the information exchange and ensure accurate feature extraction, three cross-layer connection strategies with bias are applied to the CLSANet. First, a spatial perception module (SPM) is designed to transfer spatial information from the shallow layer to the high layer, so that the valuable texture details can be complemented in the deeper levels. Furthermore, we propose a texture federation module (TFM) in the final encoding phase based on fully connected attention (FCA) and spatial texture attention (STA). Both FCA and STA structures implement cross-layer connections to further repair the missing spatial information of smoke. Finally, a feature self-collaboration head (FSCHead) is devised. The localization and classification tasks are decoupled and explicitly deployed on different layers. As a result, CLSANet effectively removes redundancy and preserves meaningful smoke features in a concise way. It obtains the precision of 94.4% and 73.3% on USTC-RF and XJTU-RS databases, respectively. Extensive experiments are conducted and the results demonstrate that CLSANet has a competitive performance.

3.
Waste Manag ; 177: 125-134, 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-38325013

RESUMO

To create a clean living environment, governments around the world have hired a large number of workers to clean up waste on pavements, which is inefficient for waste management. To better alleviate this problem, relevant scholars have proposed several deep learning methods based on RGB images to achieve waste detection and recognition. Considering the limitations of color images, we propose an efficient multi-modal learning solution for pavement waste detection and recognition. Specifically, we construct a high-quality outdoor pavement waste dataset called OPWaste, which is more in line with real needs. Compared to other waste datasets, OPWaste dataset not only has the advantages of rich background and high diversity, but also provides color and depth images. Meanwhile, we explore six different multi-modal fusion methods and propose a novel multi-modal multi-scale network (MM-Net) for RGB-D waste detection and recognition. MM-Net introduces a novel multi-scale refinement module (MRM) and multi-scale interaction module (MIM). MRM can effectively refine critical features using attention mechanisms. MIM can gradually realize information interaction between hierarchical features. In addition, we select several representative methods and perform comparative experiments. Experimental results show that MM-Net based on the image addition fusion method outperforms other deep learning models and reaches 97.3% and 84.4% on mAP0.5 and AR metrics. In fact, multi-modal learning plays an important role in intelligent waste recycling. As a promising auxiliary tool, our solution can be applied to intelligent cleaning robots for automatic outdoor waste management.


Assuntos
Aprendizado Profundo , Gerenciamento de Resíduos , Humanos , Reciclagem
4.
Cancer Causes Control ; 35(4): 605-609, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37975972

RESUMO

BACKGROUND: Head and neck cancer (HNC) has low 5-year survival, and evidence-based recommendations for tertiary prevention are lacking. Aspirin improves outcomes for cancers at other sites, but its role in HNC tertiary prevention remains understudied. METHODS: HNC patients were recruited in the University of Michigan Head and Neck Cancer Specialized Program of Research Excellence (SPORE) from 2003 to 2014. Aspirin data were collected through medical record review; outcomes (overall mortality, HNC-specific mortality, and recurrence) were collected through medical record review, Social Security Death Index, or LexisNexis. Cox proportional hazards models were used to evaluate the associations between aspirin use at diagnosis (yes/no) and HNC outcomes. RESULTS: We observed no statistically significant associations between aspirin and cancer outcome in our HNC patient cohort (n = 1161) (HNC-specific mortality: HR = 0.91, 95% CI = 0.68-1.21; recurrence: HR = 0.94, 95% CI = 0.73-1.19). In analyses stratified by anatomic site, HPV status, and disease stage, we observed no association in any strata examined with the possible exception of a lower risk of recurrence in oropharynx patients (HR = 0.60, 95% CI 0.35-1.04). CONCLUSIONS: Our findings do not support a protective association between aspirin use and cancer-specific death or recurrence in HNC patients, with the possible exception of a lower risk of recurrence in oropharynx patients.


Assuntos
Aspirina , Neoplasias de Cabeça e Pescoço , Humanos , Aspirina/uso terapêutico , Neoplasias de Cabeça e Pescoço/tratamento farmacológico , Modelos de Riscos Proporcionais
5.
Sensors (Basel) ; 22(24)2022 Dec 16.
Artigo em Inglês | MEDLINE | ID: mdl-36560280

RESUMO

Iris localization in non-cooperative environments is challenging and essential for accurate iris recognition. Motivated by the traditional iris-localization algorithm and the robustness of the YOLO model, we propose a novel iris-localization algorithm. First, we design a novel iris detector with a modified you only look once v4 (YOLO v4) model. We can approximate the position of the pupil center. Then, we use a modified integro-differential operator to precisely locate the iris inner and outer boundaries. Experiment results show that iris-detection accuracy can reach 99.83% with this modified YOLO v4 model, which is higher than that of a traditional YOLO v4 model. The accuracy in locating the inner and outer boundary of the iris without glasses can reach 97.72% at a short distance and 98.32% at a long distance. The locating accuracy with glasses can obtained at 93.91% and 84%, respectively. It is much higher than the traditional Daugman's algorithm. Extensive experiments conducted on multiple datasets demonstrate the effectiveness and robustness of our method for iris localization in non-cooperative environments.


Assuntos
Algoritmos , Iris , Pupila
6.
Genes (Basel) ; 13(11)2022 10 27.
Artigo em Inglês | MEDLINE | ID: mdl-36360196

RESUMO

The epigenome likely interacts with traditional and genetic risk factors to influence blood pressure. We evaluated whether 13 previously reported DNA methylation sites (CpGs) are associated with systolic (SBP) or diastolic (DBP) blood pressure, both individually and aggregated into methylation risk scores (MRS), in 3070 participants (including 437 African ancestry (AA) and 2021 European ancestry (EA), mean age = 70.5 years) from the Health and Retirement Study. Nine CpGs were at least nominally associated with SBP and/or DBP after adjusting for traditional hypertension risk factors (p < 0.05). MRSSBP was positively associated with SBP in the full sample (ß = 1.7 mmHg per 1 standard deviation in MRSSBP; p = 2.7 × 10-5) and in EA (ß = 1.6; p = 0.001), and MRSDBP with DBP in the full sample (ß = 1.1; p = 1.8 × 10-6), EA (ß = 1.1; p = 7.2 × 10-5), and AA (ß = 1.4; p = 0.03). The MRS and BP-genetic risk scores were independently associated with blood pressure in EA. The effects of both MRSs were weaker with increased age (pinteraction < 0.01), and the effect of MRSDBP was higher among individuals with at least some college education (pinteraction = 0.02). In AA, increasing MRSSBP was associated with higher SBP in females only (pinteraction = 0.01). Our work shows that MRS is a potential biomarker of blood pressure that may be modified by traditional hypertension risk factors.


Assuntos
Hipertensão , Aposentadoria , Feminino , Humanos , Idoso , Pressão Sanguínea/genética , Hipertensão/genética , Fatores de Risco , Epigênese Genética
7.
Phys Chem Chem Phys ; 24(5): 2997-3006, 2022 Feb 02.
Artigo em Inglês | MEDLINE | ID: mdl-35037923

RESUMO

The self-assembly processes of Pd6L3 coordination prisms consisting of cis-protected Pd(II) complexes and porphyrin-based tetratopic ligands with four 3-pyridyl or 4-pyridyl groups (L) were investigated by experimental and numerical methods, QASAP (quantitative analysis of self-assembly process) and NASAP (numerical analysis of self-assembly process), respectively. It was found that contrary to common intuition macrocyclization takes place faster than the bridging reaction in the prism assembly and that the bridging reaction occurring before the macrocyclization tends to produce kinetically trapped species. A numerical simulation demonstrates that the relative magnitude of the rate constants between the macrocyclization and the bridging reaction is the key factor that determines whether the self-assembly leads to the thermodynamically most stable prism or to kinetically trapped species. Finding the key elementary reactions that largely affect the selection of the major assembly pathway is helpful to rationally control the products under kinetic control via modulation of the energy landscape.

8.
Sensors (Basel) ; 21(13)2021 Jun 29.
Artigo em Inglês | MEDLINE | ID: mdl-34209907

RESUMO

Due to carbon deposits, lean flames, or damaged metal parts, sparks can occur in aero engine chambers. At present, the detection of such sparks deeply depends on laborious manual work. Considering that interference has the same features as sparks, almost all existing object detectors cannot replace humans in carrying out high-precision spark detection. In this paper, we propose a scene-aware spark detection network, consisting of an information fusion-based cascading video codec-image object detector structure, which we name SAVSDN. Unlike video object detectors utilizing candidate boxes from adjacent frames to assist in the current prediction, we find that efforts should be made to extract the spatio-temporal features of adjacent frames to reduce over-detection. Visualization experiments show that SAVSDN can learn the difference in spatio-temporal features between sparks and interference. To solve the problem of a lack of aero engine anomalous spark data, we introduce a method to generate simulated spark images based on the Gaussian function. In addition, we publish the first simulated aero engine spark data set, which we name SAES. In our experiments, SAVSDN far outperformed state-of-the-art detection models for spark detection in terms of five metrics.


Assuntos
Cálcio , Humanos
9.
Comput Math Methods Med ; 2020: 9812019, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32774445

RESUMO

In this paper, we present a parallel framework based on MPI for a large dataset to extract power spectrum features of EEG signals so as to improve the speed of brain signal processing. At present, the Welch method has been wildly used to estimate the power spectrum. However, the traditional Welch method takes a lot of time especially for the large dataset. In view of this, we added the MPI into the traditional Welch method and developed it into a reusable master-slave parallel framework. As long as the EEG data of any format are converted into the text file of a specified format, the power spectrum features can be extracted quickly by this parallel framework. In the proposed parallel framework, the EEG signals recorded by a channel are divided into N overlapping data segments. Then, the PSD of N segments are computed by some nodes in parallel. The results are collected and summarized by the master node. The final PSD results of each channel are saved in the text file, which can be read and analyzed by Microsoft Excel. This framework can be implemented not only on the clusters but also on the desktop computer. In the experiment, we deploy this framework on a desktop computer with a 4-core Intel CPU. It took only a few minutes to extract the power spectrum features from the 2.85 GB EEG dataset, seven times faster than using Python. This framework makes it easy for users, who do not have any parallel programming experience in constructing the parallel algorithms to extract the EEG power spectrum.


Assuntos
Algoritmos , Eletroencefalografia/estatística & dados numéricos , Big Data , Encéfalo/fisiologia , Interfaces Cérebro-Computador/estatística & dados numéricos , Bases de Dados Factuais/estatística & dados numéricos , Eletroencefalografia/instrumentação , Análise de Fourier , Humanos , Reconhecimento Automatizado de Padrão/estatística & dados numéricos , Linguagens de Programação , Processamento de Sinais Assistido por Computador
10.
Sensors (Basel) ; 20(15)2020 Jul 30.
Artigo em Inglês | MEDLINE | ID: mdl-32751620

RESUMO

Palmprint recognition has been widely studied for security applications. However, there is a lack of in-depth investigations on robust palmprint recognition. Regression analysis being intuitively interpretable on robustness design inspires us to propose a correntropy-induced discriminative nonnegative sparse coding method for robust palmprint recognition. Specifically, we combine the correntropy metric and l1-norm to present a powerful error estimator that gains flexibility and robustness to various contaminations by cooperatively detecting and correcting errors. Furthermore, we equip the error estimator with a tailored discriminative nonnegative sparse regularizer to extract significant nonnegative features. We manage to explore an analytical optimization approach regarding this unified scheme and figure out a novel efficient method to address the challenging non-negative constraint. Finally, the proposed coding method is extended for robust multispectral palmprint recognition. Namely, we develop a constrained particle swarm optimizer to search for the feasible parameters to fuse the extracted robust features of different spectrums. Extensive experimental results on both contactless and contact-based multispectral palmprint databases verify the flexibility and robustness of our methods.


Assuntos
Identificação Biométrica , Mãos , Algoritmos , Bases de Dados Factuais , Entropia , Humanos , Análise de Regressão
11.
Sensors (Basel) ; 20(2)2020 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-32284514

RESUMO

Magnetic rings are the most widely used magnetic material product in industry. The existing manual defect detection method for magnetic rings has high cost, low efficiency and low precision. To address this issue, a magnetic ring multi-defect stereo detection system based on multi-camera vision technology is developed to complete the automatic inspection of magnetic rings. The system can detect surface defects and measure ring height simultaneously. Two image processing algorithms are proposed, namely, the image edge removal algorithm (IERA) and magnetic ring location algorithm (MRLA), separately. On the basis of these two algorithms, connected domain filtering methods for crack, fiber and large-area defects are established to complete defect inspection. This system achieves a recognition rate of 100% for defects such as crack, adhesion, hanger adhesion and pitting. Furthermore, the recognition rate for fiber and foreign matter defects attains 92.5% and 91.5%, respectively. The detection speed exceeds 120 magnetic rings per minutes, and the precision is within 0.05 mm. Both precision and speed meet the requirements of real-time quality inspection in actual production.

12.
PLoS One ; 14(4): e0209083, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30986209

RESUMO

An extreme learning machine (ELM) is a novel training method for single-hidden layer feedforward neural networks (SLFNs) in which the hidden nodes are randomly assigned and fixed without iterative tuning. ELMs have earned widespread global interest due to their fast learning speed, satisfactory generalization ability and ease of implementation. In this paper, we extend this theory to hypercomplex space and attempt to simultaneously consider multisource information using a hypercomplex representation. To illustrate the performance of the proposed hypercomplex extreme learning machine (HELM), we have applied this scheme to the task of multispectral palmprint recognition. Images from different spectral bands are utilized to construct the hypercomplex space. Extensive experiments conducted on the PolyU and CASIA multispectral databases demonstrate that the HELM scheme can achieve competitive results. The source code together with datasets involved in this paper can be available for free download at https://figshare.com/s/01aef7d48840afab9d6d.


Assuntos
Dermatoglifia , Aprendizado de Máquina , Adulto , Algoritmos , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Redes Neurais de Computação , Reconhecimento Automatizado de Padrão/métodos , Adulto Jovem
13.
Sensors (Basel) ; 19(2)2019 Jan 09.
Artigo em Inglês | MEDLINE | ID: mdl-30634530

RESUMO

For the past decades, recognition technologies of multispectral palmprint have attracted more and more attention due to their abundant spatial and spectral characteristics compared with the single spectral case. Enlightened by this, an innovative robust L2 sparse representation with tensor-based extreme learning machine (RL2SR-TELM) algorithm is put forward by using an adaptive image level fusion strategy to accomplish the multispectral palmprint recognition. Firstly, we construct a robust L2 sparse representation (RL2SR) optimization model to calculate the linear representation coefficients. To suppress the affection caused by noise contamination, we introduce a logistic function into RL2SR model to evaluate the representation residual. Secondly, we propose a novel weighted sparse and collaborative concentration index (WSCCI) to calculate the fusion weight adaptively. Finally, we put forward a TELM approach to carry out the classification task. It can deal with the high dimension data directly and reserve the image spatial information well. Extensive experiments are implemented on the benchmark multispectral palmprint database provided by PolyU. The experiment results validate that our RL2SR-TELM algorithm overmatches a number of state-of-the-art multispectral palmprint recognition algorithms both when the images are noise-free and contaminated by different noises.

14.
Rev Sci Instrum ; 89(7): 074302, 2018 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-30068128

RESUMO

Brain-computer interface (BCI) systems establish a direct communication channel from the brain to an output device. As the basis of BCIs, recognizing motor imagery activities poses a considerable challenge to signal processing due to the complex and non-stationary characteristics. This paper introduces an optimal and intelligent method for motor imagery BCIs. Because of the robustness to noise, wavelet packet decomposition and common spatial pattern (CSP) methods were implemented to reduce the dimensions of preprocessed signals. And a novel and efficient classifier projection extreme learning machine (PELM) was employed to recognize the labels of electroencephalogram signals. Experiments have been performed on the BCI Competition Dataset to demonstrate the superiority of wavelet-CSP in BCI and the outperformance of the PELM-based method. Results show that the average recognition rate of PELM approaches approximately 70%, while the optimal rate of other methods is 72%, whose training time and classification time are relatively longer as 11.00 ms and 11.66 ms, respectively, compared with 4.75 ms and 4.87 ms obtained by using the proposed BCI system.


Assuntos
Interfaces Cérebro-Computador , Eletroencefalografia , Imaginação/fisiologia , Aprendizado de Máquina , Atividade Motora/fisiologia , Análise de Ondaletas , Encéfalo/fisiologia , Calibragem , Eletroencefalografia/métodos , Humanos , Reconhecimento Automatizado de Padrão/métodos , Análise de Componente Principal , Fatores de Tempo
15.
Sensors (Basel) ; 18(8)2018 Aug 20.
Artigo em Inglês | MEDLINE | ID: mdl-30127275

RESUMO

Steel bars play an important role in modern construction projects and their quality enormously affects the safety of buildings. It is urgent to detect whether steel bars meet the specifications or not. However, the existing manual detection methods are costly, slow and offer poor precision. In order to solve these problems, a high precision quality inspection system for steel bars based on machine vision is developed. We propose two algorithms: the sub-pixel boundary location method (SPBLM) and fast stitch method (FSM). A total of five sensors, including a CMOS, a level sensor, a proximity switch, a voltage sensor, and a current sensor have been used to detect the device conditions and capture image or video. The device could capture abundant and high-definition images and video taken by a uniform and stable smartphone at the construction site. Then data could be processed in real-time on a smartphone. Furthermore, the detection results, including steel bar diameter, spacing, and quantity would be given by a practical APP. The system has a rather high accuracy (as low as 0.04 mm (absolute error) and 0.002% (relative error) of calculating diameter and spacing; zero error in counting numbers of steel bars) when doing inspection tasks, and three parameters can be detected at the same time. None of these features are available in existing systems and the device and method can be widely used to steel bar quality inspection at the construction site.

16.
PLoS One ; 12(5): e0178432, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28558064

RESUMO

Multispectral palmprint recognition has shown broad prospects for personal identification due to its high accuracy and great stability. In this paper, we develop a novel illumination-invariant multispectral palmprint recognition method. To combine the information from multiple spectral bands, an image-level fusion framework is completed based on a fast and adaptive bidimensional empirical mode decomposition (FABEMD) and a weighted Fisher criterion. The FABEMD technique decomposes the multispectral images into their bidimensional intrinsic mode functions (BIMFs), on which an illumination compensation operation is performed. The weighted Fisher criterion is to construct the fusion coefficients at the decomposition level, making the images be separated correctly in the fusion space. The image fusion framework has shown strong robustness against illumination variation. In addition, a tensor-based extreme learning machine (TELM) mechanism is presented for feature extraction and classification of two-dimensional (2D) images. In general, this method has fast learning speed and satisfying recognition accuracy. Comprehensive experiments conducted on the PolyU multispectral palmprint database illustrate that the proposed method can achieve favorable results. For the testing under ideal illumination, the recognition accuracy is as high as 99.93%, and the result is 99.50% when the lighting condition is unsatisfied.


Assuntos
Identificação Biométrica/métodos , Iluminação , Algoritmos , Humanos , Aprendizagem
17.
PLoS One ; 12(5): e0176909, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28472185

RESUMO

Human endogenous retroviruses (HERVs) encode active retroviral proteins, which may be involved in the progression of cancer and other diseases. Matrix protein (MA), in group-specific antigen genes (gag) of retroviruses, is associated with the virus envelope glycoproteins in most mammalian retroviruses and may be involved in virus particle assembly, transport and budding. However, the amount of annotated MAs in ERVs is still at a low level so far. No computational method to predict the exact start and end coordinates of MAs in gags has been proposed yet. In this paper, a computational method to identify MAs in ERVs is proposed. A divide and conquer technique was designed and applied to the conventional prediction model to acquire better results when dealing with gene sequences with various lengths. Initiation sites and termination sites were predicted separately and then combined according to their intervals. Three different algorithms were applied and compared: weighted support vector machine (WSVM), weighted extreme learning machine (WELM) and random forest (RF). G - mean (geometric mean of sensitivity and specificity) values of initiation sites and termination sites under 5-fold cross validation generated by random forest models are 0.9869 and 0.9755 respectively, highest among the algorithms applied. Our prediction models combine RF & WSVM algorithms to achieve the best prediction results. 98.4% of all the collected ERV sequences with complete MAs (125 in total) could be predicted exactly correct by the models. 94,671 HERV sequences from 118 families were scanned by the model, 104 new putative MAs were predicted in human chromosomes. Distributions of the putative MAs and optimizations of model parameters were also analyzed. The usage of our predicting method was also expanded to other retroviruses and satisfying results were acquired.


Assuntos
Biologia Computacional , Retrovirus Endógenos/metabolismo , Proteínas da Matriz Viral/metabolismo , Animais , Humanos
18.
J Theor Biol ; 423: 63-70, 2017 06 21.
Artigo em Inglês | MEDLINE | ID: mdl-28454901

RESUMO

Integrase catalytic domain (ICD) is an essential part in the retrovirus for integration reaction, which enables its newly synthesized DNA to be incorporated into the DNA of infected cells. Owing to the crucial role of ICD for the retroviral replication and the absence of an equivalent of integrase in host cells, it is comprehensible that ICD is a promising drug target for therapeutic intervention. However, annotated ICDs in UniProtKB database have still been insufficient for a good understanding of their statistical characteristics so far. Accordingly, it is of great importance to put forward a computational ICD model in this work to annotate these domains in the retroviruses. The proposed model then discovered 11,660 new putative ICDs after scanning sequences without ICD annotations. Subsequently in order to provide much confidence in ICD prediction, it was tested under different cross-validation methods, compared with other database search tools, and verified on independent datasets. Furthermore, an evolutionary analysis performed on the annotated ICDs of retroviruses revealed a tight connection between ICD and retroviral classification. All the datasets involved in this paper and the application software tool of this model can be available for free download at https://sourceforge.net/projects/icdtool/files/?source=navbar.


Assuntos
Domínio Catalítico , Biologia Computacional , Evolução Molecular , Integrases/química , Retroviridae/classificação , Análise de Sequência de Proteína , Simulação por Computador , Bases de Dados de Proteínas , Anotação de Sequência Molecular , Software
19.
J Theor Biol ; 415: 84-89, 2017 02 21.
Artigo em Inglês | MEDLINE | ID: mdl-27908705

RESUMO

Regulatory single nucleotide polymorphisms (rSNPs), kind of functional noncoding genetic variants, can affect gene expression in a regulatory way, and they are thought to be associated with increased susceptibilities to complex diseases. Here a novel computational approach to identify potential rSNPs is presented. Different from most other rSNPs finding methods which based on hypothesis that SNPs causing large allele-specific changes in transcription factor binding affinities are more likely to play regulatory functions, we use a set of documented experimentally verified rSNPs and nonfunctional background SNPs to train classifiers, so the discriminating features are found. To characterize variants, an extensive range of characteristics, such as sequence context, DNA structure and evolutionary conservation etc. are analyzed. Support vector machine is adopted to build the classifier model together with an ensemble method to deal with unbalanced data. 10-fold cross-validation result shows that our method can achieve accuracy with sensitivity of ~78% and specificity of ~82%. Furthermore, our method performances better than some other algorithms based on aforementioned hypothesis in handling false positives. The original data and the source matlab codes involved are available at https://sourceforge.net/projects/rsnppredict/.


Assuntos
Simulação por Computador , Regulação da Expressão Gênica , Genoma Humano , Polimorfismo de Nucleotídeo Único/genética , Algoritmos , Biologia Computacional/métodos , Humanos , Métodos , Sensibilidade e Especificidade , Aprendizado de Máquina Supervisionado
20.
PLoS One ; 11(10): e0165216, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27755604

RESUMO

[This corrects the article DOI: 10.1371/journal.pone.0161913.].

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA