Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros

Bases de dados
Ano de publicação
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Sensors (Basel) ; 21(24)2021 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-34960407

RESUMO

Models for keyword spotting in continuous recordings can significantly improve the experience of navigating vast libraries of audio recordings. In this paper, we describe the development of such a keyword spotting system detecting regions of interest in Polish call centre conversations. Unfortunately, in spite of recent advancements in automatic speech recognition systems, human-level transcription accuracy reported on English benchmarks does not reflect the performance achievable in low-resource languages, such as Polish. Therefore, in this work, we shift our focus from complete speech-to-text conversion to acoustic similarity matching in the hope of reducing the demand for data annotation. As our primary approach, we evaluate Siamese and prototypical neural networks trained on several datasets of English and Polish recordings. While we obtain usable results in English, our models' performance remains unsatisfactory when applied to Polish speech, both after mono- and cross-lingual training. This performance gap shows that generalisation with limited training resources is a significant obstacle for actual deployments in low-resource languages. As a potential countermeasure, we implement a detector using audio embeddings generated with a generic pre-trained model provided by Google. It has a much more favourable profile when applied in a cross-lingual setup to detect Polish audio patterns. Nevertheless, despite these promising results, its performance on out-of-distribution data are still far from stellar. It would indicate that, in spite of the richness of internal representations created by more generic models, such speech embeddings are not entirely malleable to cross-language transfer.


Assuntos
Redes Neurais de Computação , Fala , Acústica , Curadoria de Dados , Humanos , Idioma
2.
Sensors (Basel) ; 21(16)2021 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-34450735

RESUMO

Despite technological progress, we lack a consensus on the method of conducting automated bowel sound (BS) analysis and, consequently, BS tools have not become available to doctors. We aimed to briefly review the literature on BS recording and analysis, with an emphasis on the broad range of analytical approaches. Scientific journals and conference materials were researched with a specific set of terms (Scopus, MEDLINE, IEEE) to find reports on BS. The research articles identified were analyzed in the context of main research directions at a number of centers globally. Automated BS analysis methods were already well developed by the early 2000s. Accuracy of 90% and higher had been achieved with various analytical approaches, including wavelet transformations, multi-layer perceptrons, independent component analysis and autoregressive-moving-average models. Clinical research on BS has exposed their important potential in the non-invasive diagnosis of irritable bowel syndrome, in surgery, and for the investigation of gastrointestinal motility. The most recent advances are linked to the application of artificial intelligence and the development of dedicated BS devices. BS research is technologically mature, but lacks uniform methodology, an international forum for discussion and an open platform for data exchange. A common ground is needed as a starting point. The next key development will be the release of freely available benchmark datasets with labels confirmed by human experts.


Assuntos
Inteligência Artificial , Gastroenteropatias , Redes Neurais de Computação , Automação , Gastroenteropatias/diagnóstico , Humanos , Som
3.
Sensors (Basel) ; 21(22)2021 Nov 16.
Artigo em Inglês | MEDLINE | ID: mdl-34833679

RESUMO

Automated bowel sound (BS) analysis methods were already well developed by the early 2000s. Accuracy of ~90% had been achieved by several teams using various analytical approaches. Clinical research on BS had revealed their high potential in the non-invasive investigation of irritable bowel syndrome to study gastrointestinal motility and in a surgical setting. This article proposes a novel methodology for the analysis of BS using hybrid convolutional and recursive neural networks. It is one of the first methods of using deep learning to be widely explored. We have developed an experimental pipeline and evaluated our results with a new dataset collected using a device with a dedicated contact microphone. Data have been collected at night-time, which is the most interesting period from a neurogastroenterological point of view. Previous works had ignored this period and instead kept brief records only during the day. Our algorithm can detect bowel sounds with an accuracy >93%. Moreover, we have achieved a very high specificity (>97%), crucial in diagnosis. The results have been checked with a medical professional, and they successfully support clinical diagnosis. We have developed a client-server system allowing medical practitioners to upload the recordings from their patients and have them analyzed online. This system is available online. Although BS research is technologically mature, it still lacks a uniform methodology, an international forum for discussion, and an open platform for data exchange, and therefore it is not commonly used. Our server could provide a starting point for establishing a common framework in BS research.


Assuntos
Algoritmos , Redes Neurais de Computação , Acústica , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA