Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 56
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nature ; 610(7930): 120-127, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-36131023

RESUMO

Faculty hiring and retention determine the composition of the US academic workforce and directly shape educational outcomes1, careers2, the development and spread of ideas3 and research priorities4,5. However, hiring and retention are dynamic, reflecting societal and academic priorities, generational turnover and efforts to diversify the professoriate along gender6-8, racial9 and socioeconomic10 lines. A comprehensive study of the structure and dynamics of the US professoriate would elucidate the effects of these efforts and the processes that shape scholarship more broadly. Here we analyse the academic employment and doctoral education of tenure-track faculty at all PhD-granting US universities over the decade 2011-2020, quantifying stark inequalities in faculty production, prestige, retention and gender. Our analyses show universal inequalities in which a small minority of universities supply a large majority of faculty across fields, exacerbated by patterns of attrition and reflecting steep hierarchies of prestige. We identify markedly higher attrition rates among faculty trained outside the United States or employed by their doctoral university. Our results indicate that gains in women's representation over this decade result from demographic turnover and earlier changes made to hiring, and are unlikely to lead to long-term gender parity in most fields. These analyses quantify the dynamics of US faculty hiring and retention, and will support efforts to improve the organization, composition and scholarship of the US academic workforce.


Assuntos
Docentes , Seleção de Pessoal , Universidades , Recursos Humanos , Educação de Pós-Graduação/estatística & dados numéricos , Emprego/estatística & dados numéricos , Docentes/estatística & dados numéricos , Feminino , Humanos , Masculino , Seleção de Pessoal/estatística & dados numéricos , Grupos Raciais/estatística & dados numéricos , Fatores Socioeconômicos , Estados Unidos , Universidades/estatística & dados numéricos , Mulheres , Recursos Humanos/estatística & dados numéricos
4.
Proc Natl Acad Sci U S A ; 118(32)2021 08 10.
Artigo em Inglês | MEDLINE | ID: mdl-34341121

RESUMO

Although it is under-studied relative to other social media platforms, YouTube is arguably the largest and most engaging online media consumption platform in the world. Recently, YouTube's scale has fueled concerns that YouTube users are being radicalized via a combination of biased recommendations and ostensibly apolitical "anti-woke" channels, both of which have been claimed to direct attention to radical political content. Here we test this hypothesis using a representative panel of more than 300,000 Americans and their individual-level browsing behavior, on and off YouTube, from January 2016 through December 2019. Using a labeled set of political news channels, we find that news consumption on YouTube is dominated by mainstream and largely centrist sources. Consumers of far-right content, while more engaged than average, represent a small and stable percentage of news consumers. However, consumption of "anti-woke" content, defined in terms of its opposition to progressive intellectual and political agendas, grew steadily in popularity and is correlated with consumption of far-right content off-platform. We find no evidence that engagement with far-right content is caused by YouTube recommendations systematically, nor do we find clear evidence that anti-woke channels serve as a gateway to the far right. Rather, consumption of political content on YouTube appears to reflect individual preferences that extend across the web as a whole.


Assuntos
Política , Mídias Sociais , Humanos , Mídias Sociais/estatística & dados numéricos , Gravação em Vídeo
5.
Nature ; 551(7681): 457-463, 2017 11 23.
Artigo em Inglês | MEDLINE | ID: mdl-29088705

RESUMO

Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.


Assuntos
Biodiversidade , Planeta Terra , Microbiota/genética , Animais , Archaea/genética , Archaea/isolamento & purificação , Bactérias/genética , Bactérias/isolamento & purificação , Ecologia/métodos , Dosagem de Genes , Mapeamento Geográfico , Humanos , Plantas/microbiologia , RNA Ribossômico 16S/análise , RNA Ribossômico 16S/genética
6.
Proc Natl Acad Sci U S A ; 117(38): 23393-23400, 2020 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-32887799

RESUMO

Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speed up network data collection and improve network model validation. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability varies across methods and networks from different domains, and how close to optimality current methods are. We answer these questions by systematically evaluating 203 individual link predictor algorithms, representing three popular families of methods, applied to a large corpus of 550 structurally diverse networks from six scientific domains. We first show that individual algorithms exhibit a broad diversity of prediction errors, such that no one predictor or family is best, or worst, across all realistic inputs. We then exploit this diversity using network-based metalearning to construct a series of "stacked" models that combine predictors into a single algorithm. Applied to a broad range of synthetic networks, for which we may analytically calculate optimal performance, these stacked models achieve optimal or nearly optimal levels of accuracy. Applied to real-world networks, stacked models are superior, but their accuracy varies strongly by domain, suggesting that link prediction may be fundamentally easier in social networks than in biological or technological networks. These results indicate that the state of the art for link prediction comes from combining individual algorithms, which can achieve nearly optimal predictions. We close with a brief discussion of limitations and opportunities for further improvements.


Assuntos
Aprendizado de Máquina , Redes Neurais de Computação , Humanos , Aprendizado de Máquina/normas , Modelos Estatísticos , Valor Preditivo dos Testes , Rede Social
7.
Proc Natl Acad Sci U S A ; 116(22): 10729-10733, 2019 05 28.
Artigo em Inglês | MEDLINE | ID: mdl-31036658

RESUMO

Faculty at prestigious institutions produce more scientific papers, receive more citations and scholarly awards, and are typically trained at more-prestigious institutions than faculty with less prestigious appointments. This imbalance is often attributed to a meritocratic system that sorts individuals into more-prestigious positions according to their reputation, past achievements, and potential for future scholarly impact. Here, we investigate the determinants of scholarly productivity and measure their dependence on past training and current work environments. To distinguish the effects of these environments, we apply a matched-pairs experimental design to career and productivity trajectories of 2,453 early-career faculty at all 205 PhD-granting computer science departments in the United States and Canada, who together account for over 200,000 publications and 7.4 million citations. Our results show that the prestige of faculty's current work environment, not their training environment, drives their future scientific productivity, while current and past locations drive prominence. Furthermore, the characteristics of a work environment are more predictive of faculty productivity and impact than mechanisms representing preferential selection or retention of more-productive scholars by more-prestigious departments. These results identify an environmental mechanism for cumulative advantage, in which an individual's past successes are "locked in" via placement into a more prestigious environment, which directly facilitates future success. The scientific productivity of early-career faculty is thus driven by where they work, rather than where they trained for their doctorate, indicating a limited role for doctoral prestige in predicting scientific contributions.

8.
BMC Bioinformatics ; 22(1): 157, 2021 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-33765911

RESUMO

BACKGROUND: Large-scale biological data sets are often contaminated by noise, which can impede accurate inferences about underlying processes. Such measurement noise can arise from endogenous biological factors like cell cycle and life history variation, and from exogenous technical factors like sample preparation and instrument variation. RESULTS: We describe a general method for automatically reducing noise in large-scale biological data sets. This method uses an interaction network to identify groups of correlated or anti-correlated measurements that can be combined or "filtered" to better recover an underlying biological signal. Similar to the process of denoising an image, a single network filter may be applied to an entire system, or the system may be first decomposed into distinct modules and a different filter applied to each. Applied to synthetic data with known network structure and signal, network filters accurately reduce noise across a wide range of noise levels and structures. Applied to a machine learning task of predicting changes in human protein expression in healthy and cancerous tissues, network filtering prior to training increases accuracy up to 43% compared to using unfiltered data. CONCLUSIONS: Network filters are a general way to denoise biological data and can account for both correlation and anti-correlation between different measurements. Furthermore, we find that partitioning a network prior to filtering can significantly reduce errors in networks with heterogenous data and correlation patterns, and this approach outperforms existing diffusion based methods. Our results on proteomics data indicate the broad potential utility of network filters to applications in systems biology.


Assuntos
Processamento de Imagem Assistida por Computador , Neoplasias , Algoritmos , Difusão , Humanos , Razão Sinal-Ruído
9.
Proc Natl Acad Sci U S A ; 114(44): E9216-E9223, 2017 10 31.
Artigo em Inglês | MEDLINE | ID: mdl-29042510

RESUMO

A scientist may publish tens or hundreds of papers over a career, but these contributions are not evenly spaced in time. Sixty years of studies on career productivity patterns in a variety of fields suggest an intuitive and universal pattern: Productivity tends to rise rapidly to an early peak and then gradually declines. Here, we test the universality of this conventional narrative by analyzing the structures of individual faculty productivity time series, constructed from over 200,000 publications and matched with hiring data for 2,453 tenure-track faculty in all 205 PhD-granting computer science departments in the United States and Canada. Unlike prior studies, which considered only some faculty or some institutions, or lacked common career reference points, here we combine a large bibliographic dataset with comprehensive information on career transitions that covers an entire field of study. We show that the conventional narrative confidently describes only one-fifth of faculty, regardless of department prestige or researcher gender, and the remaining four-fifths of faculty exhibit a rich diversity of productivity patterns. To explain this diversity, we introduce a simple model of productivity trajectories and explore correlations between its parameters and researcher covariates, showing that departmental prestige predicts overall individual productivity and the timing of the transition from first- to last-author publications. These results demonstrate the unpredictability of productivity over time and open the door for new efforts to understand how environmental and individual factors shape scientific productivity.


Assuntos
Docentes/estatística & dados numéricos , Publicações/estatística & dados numéricos , Editoração/estatística & dados numéricos , Pesquisadores/estatística & dados numéricos , Mobilidade Ocupacional , Eficiência , Humanos , Narração
10.
Multiscale Model Simul ; 15(1): 537-574, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-29046619

RESUMO

Numerous centrality measures have been developed to quantify the importances of nodes in time-independent networks, and many of them can be expressed as the leading eigenvector of some matrix. With the increasing availability of network data that changes in time, it is important to extend such eigenvector-based centrality measures to time-dependent networks. In this paper, we introduce a principled generalization of network centrality measures that is valid for any eigenvector-based centrality. We consider a temporal network with N nodes as a sequence of T layers that describe the network during different time windows, and we couple centrality matrices for the layers into a supra-centrality matrix of size NT × NT whose dominant eigenvector gives the centrality of each node i at each time t. We refer to this eigenvector and its components as a joint centrality, as it reflects the importances of both the node i and the time layer t. We also introduce the concepts of marginal and conditional centralities, which facilitate the study of centrality trajectories over time. We find that the strength of coupling between layers is important for determining multiscale properties of centrality, such as localization phenomena and the time scale of centrality changes. In the strong-coupling regime, we derive expressions for time-averaged centralities, which are given by the zeroth-order terms of a singular perturbation expansion. We also study first-order terms to obtain first-order-mover scores, which concisely describe the magnitude of nodes' centrality changes over time. As examples, we apply our method to three empirical temporal networks: the United States Ph.D. exchange in mathematics, costarring relationships among top-billed actors during the Golden Age of Hollywood, and citations of decisions from the United States Supreme Court.

11.
Proc Natl Acad Sci U S A ; 110(15): 5785-90, 2013 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-23530201

RESUMO

The late pre-Hispanic period in the US Southwest (A.D. 1200-1450) was characterized by large-scale demographic changes, including long-distance migration and population aggregation. To reconstruct how these processes reshaped social networks, we compiled a comprehensive artifact database from major sites dating to this interval in the western Southwest. We combine social network analysis with geographic information systems approaches to reconstruct network dynamics over 250 y. We show how social networks were transformed across the region at previously undocumented spatial, temporal, and social scales. Using well-dated decorated ceramics, we track changes in network topology at 50-y intervals to show a dramatic shift in network density and settlement centrality from the northern to the southern Southwest after A.D. 1300. Both obsidian sourcing and ceramic data demonstrate that long-distance network relationships also shifted from north to south after migration. Surprisingly, social distance does not always correlate with spatial distance because of the presence of network relationships spanning long geographic distances. Our research shows how a large network in the southern Southwest grew and then collapsed, whereas networks became more fragmented in the northern Southwest but persisted. The study also illustrates how formal social network analysis may be applied to large-scale databases of material culture to illustrate multigenerational changes in network structure.


Assuntos
Apoio Social , Arqueologia/métodos , Cerâmica , Bases de Dados Factuais , Sistemas de Informação Geográfica , Geografia , História Medieval , Migração Humana , Humanos , Sudoeste dos Estados Unidos , Fatores de Tempo , Estados Unidos
12.
BMC Bioinformatics ; 15: 220, 2014 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-24965130

RESUMO

BACKGROUND: Community structure is ubiquitous in biological networks. There has been an increased interest in unraveling the community structure of biological systems as it may provide important insights into a system's functional components and the impact of local structures on dynamics at a global scale. Choosing an appropriate community detection algorithm to identify the community structure in an empirical network can be difficult, however, as the many algorithms available are based on a variety of cost functions and are difficult to validate. Even when community structure is identified in an empirical system, disentangling the effect of community structure from other network properties such as clustering coefficient and assortativity can be a challenge. RESULTS: Here, we develop a generative model to produce undirected, simple, connected graphs with a specified degrees and pattern of communities, while maintaining a graph structure that is as random as possible. Additionally, we demonstrate two important applications of our model: (a) to generate networks that can be used to benchmark existing and new algorithms for detecting communities in biological networks; and (b) to generate null models to serve as random controls when investigating the impact of complex network features beyond the byproduct of degree and modularity in empirical biological networks. CONCLUSION: Our model allows for the systematic study of the presence of community structure and its impact on network function and dynamics. This process is a crucial step in unraveling the functional consequences of the structural properties of biological systems and uncovering the mechanisms that drive these systems.


Assuntos
Algoritmos , Biologia Computacional/métodos , Gráficos por Computador , Análise por Conglomerados , Humanos , Modelos Biológicos
13.
Ecol Lett ; 17(2): 211-20, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24304872

RESUMO

Horses (family Equidae) are a classic example of adaptive radiation, exhibiting a nearly 60-fold increase in maximum body mass and a peak taxonomic diversity of nearly 100 species across four continents. Such patterns are commonly attributed to niche competition, in which increased taxonomic diversity drives increased size disparity. However, neutral processes, such as macroevolutionary 'diffusion', can produce similar increases in disparity without increased diversity. Using a comprehensive database of Equidae species size estimates and a common mathematical framework, we measure the contributions of diversity-driven and diffusion-driven mechanisms for increased disparity during the Equidae radiation. We find that more than 90% of changes in size disparity are attributable to diffusion alone. These results clarify the role of species competition in body size evolution, indicate that morphological disparity and species diversity may be only weakly coupled in general, and demonstrate that large species may evolve from neutral macroevolutionary diffusion processes alone.


Assuntos
Evolução Biológica , Peso Corporal/genética , Cavalos/genética , Modelos Genéticos , Animais
14.
PLoS Comput Biol ; 9(10): e1003268, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24130474

RESUMO

The var genes of the human malaria parasite Plasmodium falciparum present a challenge to population geneticists due to their extreme diversity, which is generated by high rates of recombination. These genes encode a primary antigen protein called PfEMP1, which is expressed on the surface of infected red blood cells and elicits protective immune responses. Var gene sequences are characterized by pronounced mosaicism, precluding the use of traditional phylogenetic tools that require bifurcating tree-like evolutionary relationships. We present a new method that identifies highly variable regions (HVRs), and then maps each HVR to a complex network in which each sequence is a node and two nodes are linked if they share an exact match of significant length. Here, networks of var genes that recombine freely are expected to have a uniformly random structure, but constraints on recombination will produce network communities that we identify using a stochastic block model. We validate this method on synthetic data, showing that it correctly recovers populations of constrained recombination, before applying it to the Duffy Binding Like-α (DBLα) domain of var genes. We find nine HVRs whose network communities map in distinctive ways to known DBLα classifications and clinical phenotypes. We show that the recombinational constraints of some HVRs are correlated, while others are independent. These findings suggest that this micromodular structuring facilitates independent evolutionary trajectories of neighboring mosaic regions, allowing the parasite to retain protein function while generating enormous sequence diversity. Our approach therefore offers a rigorous method for analyzing evolutionary constraints in var genes, and is also flexible enough to be easily applied more generally to any highly recombinant sequences.


Assuntos
Biologia Computacional/métodos , Plasmodium falciparum/genética , Proteínas de Protozoários/genética , Análise de Sequência de DNA/métodos , Sequência de Aminoácidos , Simulação por Computador , Variação Genética/genética , Humanos , Malária Falciparum/parasitologia , Dados de Sequência Molecular , Alinhamento de Sequência , Processos Estocásticos
15.
Nature ; 453(7191): 98-101, 2008 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-18451861

RESUMO

Networks have in recent years emerged as an invaluable tool for describing and quantifying complex systems in many branches of science. Recent studies suggest that networks often exhibit hierarchical organization, in which vertices divide into groups that further subdivide into groups of groups, and so forth over multiple scales. In many cases the groups are found to correspond to known functional units, such as ecological niches in food webs, modules in biochemical networks (protein interaction networks, metabolic networks or genetic regulatory networks) or communities in social networks. Here we present a general technique for inferring hierarchical structure from network data and show that the existence of hierarchy can simultaneously explain and quantitatively reproduce many commonly observed topological properties of networks, such as right-skewed degree distributions, high clustering coefficients and short path lengths. We further show that knowledge of hierarchical structure can be used to predict missing connections in partly known networks with high accuracy, and for more general network structures than competing techniques. Taken together, our results suggest that hierarchy is a central organizing principle of complex networks, capable of offering insight into many network phenomena.


Assuntos
Algoritmos , Modelos Biológicos , Probabilidade , Vias Biossintéticas , Cadeia Alimentar , Redes Reguladoras de Genes , Redes e Vias Metabólicas , Ligação Proteica , Sensibilidade e Especificidade , Comportamento Social
16.
Nat Commun ; 15(1): 1364, 2024 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-38355612

RESUMO

Link prediction algorithms are indispensable tools in many scientific applications by speeding up network data collection and imputing missing connections. However, in many systems, links change over time and it remains unclear how to optimally exploit such temporal information for link predictions in such networks. Here, we show that many temporal topological features, in addition to having high computational cost, are less accurate in temporal link prediction than sequentially stacked static network features. This sequential stacking link prediction method uses 41 static network features that avoid detailed feature engineering choices and is capable of learning a highly accurate predictive distribution of future connections from historical data. We demonstrate that this algorithm works well for both partially observed and completely unobserved target layers, and on two temporal stochastic block models achieves near-oracle-level performance when combined with other single predictor methods as an ensemble learning method. Finally, we empirically illustrate that stacking multiple predictive methods together further improves performance on 19 real-world temporal networks from different domains.

17.
Obstet Gynecol ; 143(3): e63-e77, 2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38176019

RESUMO

OBJECTIVE: To determine biomarkers other than CA 125 that could be used in identifying early-stage ovarian cancer. DATA SOURCES: Ovid MEDLINE ALL, EMBASE, Web of Science Core Collection, ScienceDirect, Clinicaltrials.gov , and CAB Direct were searched for English-language studies between January 2008 and April 2023 for the concepts of high-grade serous ovarian cancer, testing, and prevention or early diagnosis. METHODS OF STUDY SELECTION: The 5,523 related articles were uploaded to Covidence. Screening by two independent reviewers of the article abstracts led to the identification of 245 peer-reviewed primary research articles for full-text review. Full-text review by those reviewers led to the identification of 131 peer-reviewed primary research articles used for this review. TABULATION, INTEGRATION, AND RESULTS: Of 131 studies, only 55 reported sensitivity, specificity, or area under the curve (AUC), with 36 of the studies reporting at least one biomarker with a specificity of 80% or greater specificity or 0.9 or greater AUC. CONCLUSION: These findings suggest that although many types of biomarkers are being tested in ovarian cancer, most have similar or worse detection rates compared with CA 125 and have the same limitations of poor detection rates in early-stage disease. However, 27.5% of articles (36/131) reported biomarkers with better sensitivity and an AUC greater than 0.9 compared with CA 125 alone and deserve further exploration.


Assuntos
Tubas Uterinas , Neoplasias Ovarianas , Feminino , Humanos , Neoplasias Ovarianas/diagnóstico , Biomarcadores
18.
bioRxiv ; 2024 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-38352574

RESUMO

Despite ovarian cancer being the deadliest gynecological malignancy, there has been little change to therapeutic options and mortality rates over the last three decades. Recent studies indicate that the composition of the tumor immune microenvironment (TIME) influences patient outcomes but are limited by a lack of spatial understanding. We performed multiplexed ion beam imaging (MIBI) on 83 human high-grade serous carcinoma tumors - one of the largest protein-based, spatially-intact, single-cell resolution tumor datasets assembled - and used statistical and machine learning approaches to connect features of the TIME spatial organization to patient outcomes. Along with traditional clinical/immunohistochemical attributes and indicators of TIME composition, we found that several features of TIME spatial organization had significant univariate correlations and/or high relative importance in high-dimensional predictive models. The top performing predictive model for patient progression-free survival (PFS) used a combination of TIME composition and spatial features. Results demonstrate the importance of spatial structure in understanding how the TIME contributes to treatment outcomes. Furthermore, the present study provides a generalizable roadmap for spatial analyses of the TIME in ovarian cancer research.

19.
Sci Adv ; 9(42): eadi2205, 2023 Oct 20.
Artigo em Inglês | MEDLINE | ID: mdl-37862417

RESUMO

Women remain underrepresented among faculty in nearly all academic fields. Using a census of 245,270 tenure-track and tenured professors at United States-based PhD-granting departments, we show that women leave academia overall at higher rates than men at every career age, in large part because of strongly gendered attrition at lower-prestige institutions, in non-STEM fields, and among tenured faculty. A large-scale survey of the same faculty indicates that the reasons faculty leave are gendered, even for institutions, fields, and career ages in which retention rates are not. Women are more likely than men to feel pushed from their jobs and less likely to feel pulled toward better opportunities, and women leave or consider leaving because of workplace climate more often than work-life balance. These results quantify the systemic nature of gendered faculty retention; contextualize its relationship with career age, institutional prestige, and field; and highlight the importance of understanding the gendered reasons for attrition rather than focusing on rates alone.

20.
Science ; 382(6672): 781-783, 2023 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-37972192

RESUMO

Highlights from the Science family of journals.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA