Sleeping Beauties of Science
July 13, 2015
Most people associate the
Sleeping Beauty tale with the
1959 Disney film, but this
folk tale goes back at least as far as the early
14th century. It was
published as the far less
romantic,
Histoire de Troïlus et de Zellandine, in the 1528 book,
Perceforest. In that version, which is certainly not a
Disney Princess story, the beauty is
raped while in her
deep coma, and she
delivers a child without awakening.
Music lovers might enjoy
Tchaikovsky's score of the
The Sleeping Beauty ballet,[1] but I prefer
The Nutcracker.[2]
Although "sleeping beauties of science" might be a reference to the older
professors who
nod-off during
seminars, a
research team from the
Center for Complex Networks and Systems Research of the
School of Informatics and Computing,
Indiana University (Bloomington, Indiana), have a different definition. The sleeping beauties that they write about in a recent
paper in the
Proceedings of the National Academy of Sciences are published research papers that remain dormant for years, then later are found to be of great importance.[3-4]
Generally, a scientific paper will generate the most interest when it's first published, and then its
citation rate will decline over a few years, finally reaching zero for most papers. As I wrote in an
earlier article (Modeling Scientific Citation, December 16, 2011), this was demonstrated by
physicists from the
University of Fribourg (Fribourg, Switzerland) in an analysis of 450,000 papers published from 1893 to 2009 in American Physical Society journals, as shown below.[5]
Since
searching and retrieving older articles is now easier in our
Internet age, older articles are being cited more often. As I wrote in an
earlier article (Citation in the Electronic Age, December 15, 2014), a 2008 study published in the
Journal of the American Society for Information Science and Technology found that the age of cited articles has increased continuously since the mid-
1960s.[6]
A study by a
Google, Inc. research team confirmed this finding.[7-8] It found, also, that 21% of citations were to articles 15 years old and older in 2013, an increase of 30% over the 1990 level. Thirteen percent of citations were to articles 20 years old or older, an increase of 36% over the 1990 level.[7] In 2013, 36% of citations were for articles at least 10 years old, up from 28% in 1990.[7]
In contrast to citation decay of most papers, a sleeping beauty paper has a long period in which it is rarely cited, followed by a sudden surge in its popularity.[3] The term, "sleeping beauty," was coined by the
Dutch statistician,
Anthony F.J. van Raan, in 2004, and
anecdotal evidence suggested that there were just a few of these. In
physics, the
Einstein, Podolsky, and Rosen (EPR) "paradox" paper is one of these.[3] EPR was published in 1935, but its citations bloomed in 1994. In
statistics,
Karl Pearson's 1901 paper in
Philosophical Magazine wasn't much cited until 2002.[4]
Aside from a few examples such as these, sleeping beauties were thought to be rare. The Indiana University study, however, examined a very large
dataset - 380,000 publications from the
American Physical Society and 22.4 million from the
Web of Science.[3-4] The dataset covered multiple
disciplines in the span of more than a
century.[4]
A "beauty coefficient" was calculated for each paper in this dataset. The value of this
coefficient depended on a comparison of a paper's citation history to others based on publication year, the maximum number of citations received in a year, and the year when maximum citation was achieved.
Analysis found that sleeping beauties are not that rare, as shown in the figure.[3-4]
There's a
continuous spectrum of sleeping beauty behavior with a
power-law behavior, thereby suggesting a common
mechanism working at all
scales.[3] Among the interesting statistics of the
phenomenon, four of the top fifteen sleeping beauties were published more than a century ago.[4] Physics,
chemistry,
multidisciplinary science,
mathematics, and general and
internal medicine were the disciplines with the highest frequency of sleeping beauty behavior, and several papers in these fields weren't noticed for up to 70 years.[4] The Proceedings of the National Academy of Sciences, where these results were published,
Nature, and
Science were the top
journals publishing sleeping beauties.[4]
Future developments in science determine whether or not a paper is destined for sleeping beauty
stardom.
Alessandro Flammini, an
associate professor of
informatics at Indiana University and the corresponding
author of the study, notes the following:[4]
"The potential application of some studies are simply unforeseen at the time'... The second-ranked sleeping beauty in our study, published in 1958, concerns the preparation of graphite oxide, which much later became a compound used to produce graphene, a material hundreds of time more resistant than steel and therefore of great interest to industry."
"This study provides empirical evidence that a paper can truly be 'ahead of its time... A 'premature' topic may fail to attract attention even when it is introduced by authors who have already established a strong scientific reputation."
Statistics, a discipline that's become more relevant with the availability of large
computerized datasets, has had a surge in its number of sleeping beauties. Along with Pearson's hundred year paper, there's one by
Edin Bidwell Wilson that was rediscovered after seventy years.[4] Many sleeping beauties have attained prominence when their results have been found to be applicable to other fields.[4]
The authors write that their study provides evidence against the use of short-term citation
metrics for deciding a paper's
scientific impact.[3] This study was supported by the
National Science Foundation.[4]
References:
- The Sleeping Beauty Ballet, London Symphony Orchestra, André Previn, Conductor, YouTube Video.
- Tchaikovsky, The Nutcracker, Mariinsky Theatre Gergiev, YouTube Video.
- Qing Ke, Emilio Ferrara, Filippo Radicchi, and Alessandro Flammini, "Defining and identifying Sleeping Beauties in science," Proc. Natl. Acad. Sci., Early Edition, doi: 10.1073/pnas.1424329112.
- Like Sleeping Beauty, some research lies dormant for decades, IU study finds, Indiana University Press Release, May 25, 2015.
- Matúê Medo, Giulio Cimini, and Stanislao Gualdi, "Temporal Effects in the Growth of Networks,"Phys. Rev. Lett. vol. 107, no. 23 (December 2, 2011), Document No. 238701. This appears, also as Matúê Medo, Giulio Cimini, and Stanislao Gualdi, "Temporal Effects in the Growth of Networks," arXiv Preprint Server, September 26, 2011.
- Vincent Larivière, Éric Archambault, and Yves Gingras, "Long-term variations in the aging of scientific literature: From exponential growth to steady-state science (1900–2004)," Journal of the American Society for Information Science and Technology, vol. 59, no. 2 (January 15, 2008), pp. 288-296.
- Alex Verstak, Anurag Acharya, Helder Suzuki, Sean Henderson, Mikhail Iakhiaev, Cliff Chiung Yu Lin, and Namit Shetty, "On the Shoulders of Giants: The Growing Impact of Older Articles," arXiv, November 2, 2014.
- John Bohannon, "Older papers are increasingly remembered—and cited," Science, November 4, 2014.