RESUMEN
BACKGROUND/OBJECTIVES: Obesity is a pressing health concern within the United States (US). Obesity medicine "diplomates" receive specialized training, yet it is unclear if their accessibility and availability adequately serves the need. The purpose of this research was to understand how accessibility has evolved over time and assess the practicality of serving an estimated patient population with the current distribution and quantity of diplomates. METHODS: Population-weighted Census tracts in US counties were mapped to the nearest facility on a road network with at least one diplomate who specialized in adult (including geriatric) care between 2011 and 2019. The median travel time for all Census tracts within a county represented the primary geographic access measure. Availability was assessed by estimating the number of diplomates per 100 000 patients with obesity and the number of facilities able to serve assigned patients under three clinical guidelines. RESULTS: Of the 3371 diplomates certified since 2019, 3036 were included. The median travel time (weighted for county population) fell from 28.5 min [IQR: 13.7, 68.1] in 2011 to 9.95 min [IQR: 7.49, 18.1] in 2019. There were distinct intra- and inter-year travel time variations by race, ethnicity, education, median household income, rurality, and Census region (all P < 0.001). The median number of diplomates per 100 000 with obesity grew from 1 [IQR: 0.39, 1.59] in 2011 to 5 [IQR: 2.74, 11.4] in 2019. In 2019, an estimated 1.7% of facilities could meet the recommended number of visits for all mapped patients with obesity, up from 0% in 2011. CONCLUSIONS: Diplomate geographic access and availability have improved over time, yet there is still not a high enough supply to serve the potential patient demand. Future studies should quantify patient-level associations between travel time and health outcomes, including whether the number of available diplomates impacts utilization.
Asunto(s)
Población Rural , Viaje , Adulto , Anciano , Escolaridad , Etnicidad , Humanos , Obesidad/epidemiología , Estados Unidos/epidemiologíaRESUMEN
BACKGROUND: Reddit is a popular social media platform that has faced scrutiny for inflammatory language against those with obesity, yet there has been no comprehensive analysis of its obesity-related content. OBJECTIVE: We aimed to quantify the presence of 4 types of obesity-related content on Reddit (misinformation, facts, stigma, and positivity) and identify psycholinguistic features that may be enriched within each one. METHODS: All sentences (N=764,179) containing "obese" or "obesity" from top-level comments (n=689,447) made on non-age-restricted subreddits (ie, smaller communities within Reddit) between 2011 and 2019 that contained one of a series of keywords were evaluated. Four types of common natural language processing features were extracted: bigram term frequency-inverse document frequency, word embeddings derived from Bidirectional Encoder Representations from Transformers, sentiment from the Valence Aware Dictionary for Sentiment Reasoning, and psycholinguistic features from the Linguistic Inquiry and Word Count Program. These features were used to train an Extreme Gradient Boosting machine learning classifier to label each sentence as 1 of the 4 content categories or other. Two-part hurdle models for semicontinuous data (which use logistic regression to assess the odds of a 0 result and linear regression for continuous data) were used to evaluate whether select psycholinguistic features presented differently in misinformation (compared with facts) or stigma (compared with positivity). RESULTS: After removing ambiguous sentences, 0.47% (3610/764,179) of the sentences were labeled as misinformation, 1.88% (14,366/764,179) were labeled as stigma, 1.94% (14,799/764,179) were labeled as positivity, and 8.93% (68,276/764,179) were labeled as facts. Each category had markers that distinguished it from other categories within the data as well as an external corpus. For example, misinformation had a higher average percent of negations (ß=3.71, 95% CI 3.53-3.90; P<.001) but a lower average number of words >6 letters (ß=-1.47, 95% CI -1.85 to -1.10; P<.001) relative to facts. Stigma had a higher proportion of swear words (ß=1.83, 95% CI 1.62-2.04; P<.001) but a lower proportion of first-person singular pronouns (ß=-5.30, 95% CI -5.44 to -5.16; P<.001) relative to positivity. CONCLUSIONS: There are distinct psycholinguistic properties between types of obesity-related content on Reddit that can be leveraged to rapidly identify deleterious content with minimal human intervention and provide insights into how the Reddit population perceives patients with obesity. Future work should assess whether these properties are shared across languages and other social media platforms.
Asunto(s)
Medios de Comunicación Sociales , Humanos , Prevalencia , Infodemiología , Psicolingüística , ComunicaciónRESUMEN
Studies investigating the development of tense/aspect in children with developmental disorders have focused on production frequency and/or relied on short spontaneous speech samples. How children with developmental disorders use future forms/constructions is also unknown. The current study expands this literature by examining frequency, consistency, and productivity of past, present, and future usage, using the Speechome Recorder, which enables collection of dense, longitudinal audio-video recordings of children's speech. Samples were collected longitudinally in a child who was previously diagnosed with autism spectrum disorder, but at the time of the study exhibited only language delay [Audrey], and a typically developing child [Cleo]. While Audrey was comparable to Cleo in frequency and productivity of tense/aspect use, she was atypical in her consistency and production of an unattested future form. Examining additional measures of densely collected speech samples may reveal subtle atypicalities that are missed when relying on only few typical measures of acquisition.
Asunto(s)
Trastornos del Desarrollo del Lenguaje/diagnóstico , Semántica , Medio Social , Grabación en Video , Trastorno del Espectro Autista/diagnóstico , Preescolar , Comorbilidad , Discapacidades del Desarrollo/diagnóstico , Femenino , Humanos , Lingüística , Estudios Longitudinales , Valores de Referencia , Medición de la Producción del Habla , VocabularioRESUMEN
RNA-protein complexes underlie numerous cellular processes, including basic translation and gene regulation. The high-resolution structure determination of the RNA-protein complexes is essential for elucidating their functions. Therefore, computational methods capable of identifying the native-like RNA-protein structures are needed. To address this challenge, we thus develop DRPScore, a deep-learning-based approach for identifying native-like RNA-protein structures. DRPScore is tested on representative sets of RNA-protein complexes with various degrees of binding-induced conformation change ranging from fully rigid docking (bound-bound) to fully flexible docking (unbound-unbound). Out of the top 20 predictions, DRPScore selects native-like structures with a success rate of 91.67% on the testing set of bound RNA-protein complexes and 56.14% on the unbound complexes. DRPScore consistently outperforms existing methods with a roughly 10.53-15.79% improvement, even for the most difficult unbound cases. Furthermore, DRPScore significantly improves the accuracy of the native interface interaction predictions. DRPScore should be broadly useful for modeling and designing RNA-protein complexes.
Asunto(s)
Aprendizaje Profundo , Unión Proteica , Modelos Moleculares , Proteínas/metabolismo , ARN/metabolismo , Conformación Proteica , Simulación del Acoplamiento Molecular , AlgoritmosRESUMEN
Recent advances in whole-slide image (WSI) scanners and computational capabilities have significantly propelled the application of artificial intelligence in histopathology slide analysis. While these strides are promising, current supervised learning approaches for WSI analysis come with the challenge of exhaustively labeling high-resolution slides-a process that is both labor-intensive and timeconsuming. In contrast, self-supervised learning (SSL) pretraining strategies are emerging as a viable alternative, given that they don't rely on explicit data annotations. These SSL strategies are quickly bridging the performance disparity with their supervised counterparts. In this context, we introduce an SSL framework. This framework aims for transferable representation learning and semantically meaningful clustering by synergizing invariance loss and clustering loss in WSI analysis. Notably, our approach outperforms common SSL methods in downstream classification and clustering tasks, as evidenced by tests on the Camelyon16 and a pancreatic cancer dataset. The code and additional details are accessible at https://github.com/wwyi1828/CluSiam.
RESUMEN
Human cancers are heterogenous by their cell composition and origination site. Cancer metastasis generates the conundrum of the unknown origin of migrated tumor cells. Tracing tissue of origin and tumor type in primary and metastasized cancer is vital for clinical significance. DNA methylation alterations play a crucial role in carcinogenesis and mark cell fate differentiation, thus can be used to trace tumor tissue of origin. In this study, we employed a novel tumor-type-specific hierarchical model using genome-scale DNA methylation data to develop a multilayer perceptron model, HiTAIC, to trace tissue of origin and tumor type in 27 cancers from 23 tissue sites in data from 7735 tumors with high resolution, accuracy, and specificity. In tracing primary cancer origin, HiTAIC accuracy was 99% in the test set and 93% in the external validation data set. Metastatic cancers were identified with a 96% accuracy in the external data set. HiTAIC is a user-friendly web-based application through https://sites.dartmouth.edu/salaslabhitaic/. In conclusion, we developed HiTAIC, a DNA methylation-based algorithm, to trace tumor tissue of origin in primary and metastasized cancers. The high accuracy and resolution of tumor tracing using HiTAIC holds promise for clinical assistance in identifying cancer of unknown origin.
RESUMEN
BACKGROUND: COVID-19 severity is amplified among individuals with obesity, which may have influenced mainstream media coverage of the disease by both improving understanding of the condition and increasing weight-related stigma. OBJECTIVE: We aimed to measure obesity-related conversations on Facebook and Instagram around key dates during the first year of the COVID-19 pandemic. METHODS: Public Facebook and Instagram posts were extracted for 29-day windows in 2020 around January 28 (the first US COVID-19 case), March 11 (when COVID-19 was declared a global pandemic), May 19 (when obesity and COVID-19 were linked in mainstream media), and October 2 (when former US president Trump contracted COVID-19 and obesity was mentioned most frequently in the mainstream media). Trends in daily posts and corresponding interactions were evaluated using interrupted time series. The 10 most frequent obesity-related topics on each platform were also examined. RESULTS: On Facebook, there was a temporary increase in 2020 in obesity-related posts and interactions on May 19 (posts +405, 95% CI 166 to 645; interactions +294,930, 95% CI 125,986 to 463,874) and October 2 (posts +639, 95% CI 359 to 883; interactions +182,814, 95% CI 160,524 to 205,105). On Instagram, there were temporary increases in 2020 only in interactions on May 19 (+226,017, 95% CI 107,323 to 344,708) and October 2 (+156,974, 95% CI 89,757 to 224,192). Similar trends were not observed in controls. Five of the most frequent topics overlapped (COVID-19, bariatric surgery, weight loss stories, pediatric obesity, and sleep); additional topics specific to each platform included diet fads, food groups, and clickbait. CONCLUSIONS: Social media conversations surged in response to obesity-related public health news. Conversations contained both clinical and commercial content of possibly dubious accuracy. Our findings support the idea that major public health announcements may coincide with the spread of health-related content (truthful or otherwise) on social media.
RESUMEN
Intercensal estimates of access to electricity and clean cooking fuels at policy planning microregions in a country are essential for understanding their evolution and tracking progress towards Sustainable Development Goals (SDG) 7. Surveys are prohibitively expensive to get such intercensal microestimates. Existing works, mainly, focus on electrification rates, make predictions at the coarse spatial granularity, and generalize poorly to intercensal periods. Limited works focus on estimating clean cooking fuel access, which is one of the crucial indicators for measuring progress towards SDG 7. We propose a novel spatio-temporal multi-target Bayesian regression model that provides accurate intercensal microestimates for household electrification and clean cooking fuel access by combining multiple types of earth-observation data, census, and surveys. Our model's estimates are produced for Senegal for 2020 at policy planning microregions, and they explain 77% and 86% of variation in regional aggregates for electrification and clean fuels, respectively, when validated against the most recent survey. The diagnostic nature of our microestimates reveals a slow evolution and significant lack of clean cooking fuel access in both urban and rural areas in Senegal. It underscores the challenge of expanding energy access even in urban areas owing to their rapid population growth. Owing to the timeliness and accuracy of our microestimates, they can help plan interventions by local governments or track the attainment of SDGs when no ground-truth data are available. Supplementary Information: The online version contains supplementary material available at 10.1140/epjds/s13688-022-00371-5.
RESUMEN
Influence, the ability to change the beliefs and behaviors of others, is the main currency on social media. Extant studies of influence on social media, however, are limited by publicly available data that record expressions (active engagement of users with content, such as likes and comments), but neglect impressions (exposure to content, such as views) and lack "ground truth" measures of influence. To overcome these limitations, we implemented a social media simulation using an original, web-based micro-blogging platform. We propose three influence models, leveraging expressions and impressions to create a more complete picture of social influence. We demonstrate that impressions are much more important drivers of influence than expressions, and our models accurately identify the most influential accounts in our simulation. Impressions data also allow us to better understand important social media dynamics, including the emergence of small numbers of influential accounts and the formation of opinion echo chambers.
RESUMEN
We investigated the differential diffusion of all of the verified true and false news stories distributed on Twitter from 2006 to 2017. The data comprise ~126,000 stories tweeted by ~3 million people more than 4.5 million times. We classified news as true or false using information from six independent fact-checking organizations that exhibited 95 to 98% agreement on the classifications. Falsehood diffused significantly farther, faster, deeper, and more broadly than the truth in all categories of information, and the effects were more pronounced for false political news than for false news about terrorism, natural disasters, science, urban legends, or financial information. We found that false news was more novel than true news, which suggests that people were more likely to share novel information. Whereas false stories inspired fear, disgust, and surprise in replies, true stories inspired anticipation, sadness, joy, and trust. Contrary to conventional wisdom, robots accelerated the spread of true and false news at the same rate, implying that false news spreads more than the truth because humans, not robots, are more likely to spread it.