Tracing the origin of a scientific legend by Reference Publication Year Spectroscopy (RPYS): the legend of the Darwin finches
In a previews paper we introduced the quantitative method named Reference Publication Year Spectroscopy (RPYS). With this method one can determine the historical roots of research fields and quantify their impact on current research. RPYS is based on the analysis of the frequency with which references are cited in the publications of a specific research field in terms of the publication years of these cited references. In this study, we illustrate that RPYS can also be used to reveal the origin of scientific legends. We selected Darwin finches as an example for illustration. Charles Darwin, the originator of evolutionary theory, was given credit for finches he did not see and for observations and insights about the finches he never made. We have shown that a book published in 1947 is the most-highly cited early reference cited within the relevant literature. This book had already been revealed as the origin of the term Darwin finches by Sulloway through careful historical analysis.
💡 Research Summary
The paper demonstrates how Reference Publication Year Spectroscopy (RPYS), a quantitative bibliometric technique, can be used to uncover the historical origin of a scientific legend—in this case, the “Darwin finches” myth. RPYS works by extracting all references cited in a defined set of publications, counting how often each reference year appears, and visualizing the distribution as a “spectra” where peaks indicate years with unusually high citation frequencies.
To test the method, the authors retrieved from the Science Citation Index (SCI) all 689 papers published since 1974 that contain the term “Darwin’s finch(es)”. From the reference lists of these papers they extracted 4 1961 citations whose publication years are ≤ 1960. The yearly citation counts were then normalized by calculating the absolute deviation from a five‑year moving median, which sharpens the visibility of peaks.
The resulting spectra show several expected peaks: 1859 (Darwin’s On the Origin of Species) and 1871 (Darwin’s The Descent of Man) each account for a large share of citations, reflecting the centrality of Darwin’s classic works to any discussion of finches. A 1937 peak corresponds to Theodosius Dobzhansky’s Genetics and the Origin of Species, an influential synthesis text. However, the most pronounced peak occurs in 1947, where 144 of the 161 citations in that year refer to D. L. Lack’s book Darwin’s Finches.
This bibliometric finding aligns perfectly with the historical analysis of F. J. Sulloway (1982, 1983), who argued that the term “Darwin finches” was not coined by Darwin himself but was popularized by Lack’s 1947 monograph. Sulloway noted that although the phrase had appeared earlier (e.g., Percy Lowe in 1936), Lack’s book gave it widespread acceptance, leading to the persistent myth that Darwin’s observations of Galápagos finches were pivotal to his theory of natural selection.
The authors discuss the strengths and limitations of RPYS. On the positive side, RPYS can rapidly identify seminal works that serve as “historical roots” of a research field without requiring manual, time‑consuming historiography. It is especially useful for detecting the emergence of legends, misconceptions, or eponymous terms that have become entrenched in scientific discourse. On the downside, RPYS is dependent on citation counts; the true originator of an idea may be a low‑cited source (e.g., a conference abstract, a letter, or a non‑indexed publication) that will not generate a visible peak. Consequently, RPYS results must be interpreted by domain experts and complemented with traditional historical scholarship to confirm the actual source.
In conclusion, the case study validates RPYS as an effective tool for making the origins of scientific legends visible. The 1947 Lack book emerges as the most highly cited pre‑1960 reference within the “Darwin finches” literature, confirming the legend’s modern provenance. The authors suggest that future work should apply RPYS to other scientific myths and misattributions, thereby broadening our understanding of how scientific narratives are constructed and propagated.
Comments & Academic Discussion
Loading comments...
Leave a Comment