Computer Science / Digital Libraries Computer Science / Social Networks Physics / physics.soc-ph Statistics / Machine Learning

Science and Technology Advance through Surprise

February 23, 2026

Reading time: 5 minute

...

#Statistics #Machine Learning #Computer Science #Social Networks #Physics #Digital Libraries

📝 Original Info

Title: Science and Technology Advance through Surprise
ArXiv ID: 1910.09370
Date: 2020-01-17
Authors: Researchers from original ArXiv paper

📝 Abstract

Breakthrough discoveries and inventions involve unexpected combinations of contents including problems, methods, and natural entities, and also diverse contexts such as journals, subfields, and conferences. Drawing on data from tens of millions of research papers, patents, and researchers, we construct models that predict next year's content and context combinations with an AUC of 95% based on embeddings constructed from high-dimensional stochastic block models, where the improbability of new combinations itself predicts up to 50% of the likelihood that they will gain outsized citations and major awards. Most of these breakthroughs occur when problems in one field are unexpectedly solved by researchers from a distant other. These findings demonstrate the critical role of surprise in advance, and enable evaluation of scientific institutions ranging from education and peer review to awards in supporting it.

💡 Deep Analysis

Deep Dive into Science and Technology Advance through Surprise.

Breakthrough discoveries and inventions involve unexpected combinations of contents including problems, methods, and natural entities, and also diverse contexts such as journals, subfields, and conferences. Drawing on data from tens of millions of research papers, patents, and researchers, we construct models that predict next year’s content and context combinations with an AUC of 95% based on embeddings constructed from high-dimensional stochastic block models, where the improbability of new combinations itself predicts up to 50% of the likelihood that they will gain outsized citations and major awards. Most of these breakthroughs occur when problems in one field are unexpectedly solved by researchers from a distant other. These findings demonstrate the critical role of surprise in advance, and enable evaluation of scientific institutions ranging from education and peer review to awards in supporting it.

📄 Full Content

Science and Technology Advance through Surprise

Feng Shi, University of North Carolina at Chapel Hill, Knowledge Lab fbillshi@gmail.com

James Evans, Knowledge Lab, University of Chicago, Santa Fe Institute jevans@uchicago.edu

Breakthrough discoveries and inventions involve unexpected combinations of contents including problems, methods, and natural entities, and also diverse contexts such as journals, subfields, and conferences. Drawing on data from tens of millions of research papers, patents, and researchers, we construct models that predict next year’s content and context combinations with an AUC of 95% based on embeddings constructed from high-dimensional stochastic block models, where the improbability of new combinations itself predicts up to 50% of the likelihood that they will gain outsized citations and major awards. Most of these breakthroughs occur when problems in one field are unexpectedly solved by researchers from a distant other. These findings demonstrate the critical role of surprise in advance, and enable evaluation of scientific institutions ranging from education and peer review to awards in supporting it.

19th Century philosopher and scientist Charles Sanders Peirce argued that neither the logics of deduction nor induction alone could characterize the reasoning behind path-breaking new hypotheses in science, but rather their collision through a process he termed abduction. Abduction begins as expectations born of theory or tradition become disrupted by unexpected observations or findings (1). Surprise stimulates scientists to forge new claims that make the surprising unsurprising. Here we empirically demonstrate across the life sciences, physical sciences and patented inventions that, following Peirce, surprising hypotheses, findings and insights are the best available predictor of outsized success. But neither Peirce nor anyone since has specified where the stuff of new hypotheses came from. One account is serendipity or making the most of surprising encounters (2, 3), encapsulated in Pasteur’s oft-quoted maxim “chance favors only the prepared mind” (4), but this poses a paradox. The successful scientific mind must simultaneously know enough within a scientific or technological context to be surprised, and enough outside to imagine why it should not be surprised. Here we show how surprising successes systematically emerge across, rather than within researchers; most commonly when those in one field surprisingly publish problem-solving results to audiences in a distant other. This contrasts with research that focuses on inter- and multi-disciplinarity as primary sources of advance (5–7). We suggest how predictability and surprise in science and technology provide us with new tools to evaluate how scientific institutions ranging from awards and graduate education to peer review facilitate advance.

In order to identify the sources of scientific and technological surprise, we must first identify what is expected with precision. Here we follow others in modeling discovery and invention as combinatorial processes linking previous ideas, phenomena and technologies (8–12). We separate combinations of scientific contents and contexts in order to refine our expectations about normal scientific and technological developments in the future (13). A new scientific or technological configuration of contents—phenomena, concepts, and methods—may surprise because it has never succeeded before, despite having been considered and attempted. A new configuration of contents that cuts across divergent contexts—journals and conferences—may surprise because it has never been imagined. The separate consideration of contents and contexts allows us to contrast scientific discovery with technological search: Fields and their boundaries are clear and ever-present for scientists at all phases of scientific production, publishing and promotion, but largely invisible for technological invention and its certification in legally protected patents.

Virtually all empirical research examining combinatorial discovery and invention has deconstructed new products into collections of pairwise combinations (11), resting on mature analysis tools for simple graphs that define links between entity pairs. Recent research, however, has demonstrated the critical importance of higher-order structure in understanding complex networks, ranging from the hub structure of global transportation networks to clustering in neuronal networks (14) to stabilizing interaction between species (15, 16). Here we develop a method to model the frontiers of science and technology as a complex hypergraph drawn from an embedding of contents and contexts (17) using mixed-membership, high-dimensional stochastic block models, where each discovery or invention can be rendered as a complete set of scientif

…(Full text truncated)…

📄 Read Full PDF on ArXiv

📸 Image Gallery

Reference

This content is AI-processed based on ArXiv data.

Science and Technology Advance through Surprise

📝 Original Info

📝 Abstract

💡 Deep Analysis

📄 Full Content

📸 Image Gallery

Reference

Table of Contents

Table of Contents

📝 Original Info

📝 Abstract

💡 Deep Analysis

📄 Full Content

📸 Image Gallery

Reference

Related Posts

Kronecker Graphs: An Approach to Modeling Networks

A New Clustering Algorithm Based Upon Flocking On Complex Network

Dynamic Policy Programming

Start searching

No results found