Alisa Surkis, Stuart Spore
doi: http://dx.doi.org/10.5195/jmla.2018.499
Received 01 May 2018; Accepted 01 May 2018
ABSTRACT
Bibliometrics is becoming increasingly prominent in the world of medical libraries. The number of presentations related to research impact at the Medical Library Association (MLA) annual meeting has grown in recent years. Medical centers have been using institutional dashboards to track clinical performance for over a decade, and more recently, these institutional dashboards have included measures of academic performance. This commentary reviews current practices and considers the role for a newer metric, the relative citation ratio.
To judge by the program for MLA ’17, the 2017 Medical Library Association (MLA) annual meeting, bibliometrics is becoming increasingly prominent in the world of medical libraries. The number of presentations related to research impact at the MLA meeting has grown in recent years, more than tripling between 2013 and 2017 (supplemental Appendix A). Medical centers have been using institutional dashboards to track clinical performance for over a decade [1], and more recently these institutional dashboards have included measures of academic performance [2]. Clearly, demand for metrics of all types is on the rise. Given this growing interest in and use of metrics, it is a good time to review current practices and consider the role for a newer metric.
Most medical librarians are familiar with the journal impact factor (JIF). While this metric provides value in specific use cases, it is often used inappropriately. The JIF was devised to measure the influence of journals but has been widely used to judge individual articles. This use is based on the specious assumption that all articles in a journal are equally influential, as reflected by their number of citations [3, 4]. This misuse can lead to weak articles being overvalued and important work being undervalued.
The h-index [5] is another familiar metric. It, too, can be problematic, being in essence the product of a simple, somewhat arbitrary formula that derives a single number from a list of citation counts. While often used to compare researchers, the h-index can be misleading when comparing researchers in different fields because it does not take into account the differing citation practices of those fields [6]. It also disadvantages newer investigators, because the h-index cannot exceed the number of articles an author has published, regardless of how many citations those articles may attract [7].
The relative citation ratio (RCR) is a prominent new metric that is more appropriate for many common use cases. The RCR was developed by the National Institutes of Health (NIH) Office of Portfolio Analysis as part of an effort to improve evaluation of grant outcomes and was presented as a preprint in 2015 [8] and then as a peer-reviewed article in PLOS Biology in 2016 [9]. In contrast to journal-level metrics like the JIF and author-level metrics like the h-index, the RCR is an article-level metric. While there are a number of article-level citation metrics, the RCR has several properties that make it a particularly appealing choice.
The RCR uses a novel approach to normalize the metric for the “discipline” of the article, with the idea that the metric can then be compared between articles in different fields. This technique is known as field normalization. While there are other citation-based, article-level metrics that employ some type of field normalization, previous metrics have mainly done so by assigning an article to a field or set of fields chosen from a fixed set of categories [10–13]. The RCR uses an approach that dynamically determines the field of the article based on the co-citation network of that article, that is, all articles that have been cited by articles citing the target article [9]. As science becomes increasingly interdisciplinary, this approach takes into account that the field of an article may not be well captured by a predefined discipline or set of disciplines and that, as disciplines evolve, the discipline that best captures an article may also evolve.
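To make the idea of a co-citation network concrete, here is a minimal sketch in Python, assuming a toy citation graph stored as a dictionary rather than any real bibliographic data; the article names and the helper function are purely illustrative.

```python
# Minimal sketch of the co-citation network described above, using a toy
# citation graph (each key is a citing article; each value is the set of
# articles it cites). All names are illustrative placeholders.
def co_citation_network(target, citation_graph):
    """Articles cited alongside the target by any article that cites it."""
    network = set()
    for refs in citation_graph.values():
        if target in refs:
            network |= refs - {target}
    return network

citation_graph = {
    "citing_1": {"target", "A", "B"},
    "citing_2": {"target", "B", "C"},
    "citing_3": {"X", "Y"},  # does not cite the target, so it contributes nothing
}
print(co_citation_network("target", citation_graph))  # {'A', 'B', 'C'}
```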
While somewhat complicated, the RCR is transparent (particularly when compared with proprietary metrics). The details of its calculation are clearly spelled out in the Hutchins article [9]. NIH provides access to the RCR of articles through a web interface [14] and an application programming interface (API) [15]. Other popular field-normalized, article-level metrics are proprietary, with less transparency in their calculation, and are available only through licensing agreements.
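As an illustration of that programmatic access, the short Python sketch below queries the iCite application programming interface [15] for the RCRs of a few articles. The endpoint path and JSON field names (e.g., relative_citation_ratio) reflect our reading of the iCite documentation and should be verified against the current API before use.

```python
# Sketch of an iCite API lookup [15]; the endpoint path and JSON field names
# (e.g., "relative_citation_ratio") should be checked against the current
# iCite documentation before relying on them.
import requests

def fetch_rcrs(pmids):
    """Return a {pmid: RCR} mapping; the RCR may be None for very recent articles."""
    response = requests.get(
        "https://icite.od.nih.gov/api/pubs",
        params={"pmids": ",".join(str(p) for p in pmids)},
    )
    response.raise_for_status()
    records = response.json().get("data", [])
    return {record["pmid"]: record.get("relative_citation_ratio") for record in records}

# The two 2005 articles compared in the examples below
print(fetch_rcrs([16370387, 15712779]))
```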
Finally, a key aspect of the derivation of the RCR is that it is benchmarked to NIH R01-funded articles. The calculation of this metric is done such that an article with an RCR value equal to 1.0 is at the median for NIH R01-funded articles for that year. This leads to a clear interpretation of RCR values, in that an RCR value of 1.0 is the demarcation between articles that have a lower citation rate than the median NIH R01-funded publication and those with a higher citation rate. Because the RCR makes it straightforward to communicate how a particular article compares to NIH R01-funded works, it has the potential to be a particularly compelling metric among users focused on NIH funding as a primary measure of impact.
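For readers who want the calculation in symbols, the sketch below is our condensed paraphrase of the procedure spelled out in the Hutchins article [9]; the quantity names (ACR, FCR, ECR) follow that article, but the fitting details are simplified here and should be taken from the original.

```latex
% Condensed paraphrase of the RCR calculation in Hutchins et al. [9];
% the fitting details are simplified here.
% ACR = the article's citations per year since publication
% FCR = the field citation rate derived from the article's co-citation network
% ECR = the expected citation rate for an article with that FCR, obtained by
%       regressing ACR on FCR across the benchmark set of NIH R01-funded
%       articles published in the same year
\[
  \mathrm{RCR} \;=\; \frac{\mathrm{ACR}}{\mathrm{ECR}},
  \qquad
  \mathrm{ECR} \;=\; \hat{\beta}_0 + \hat{\beta}_1\,\mathrm{FCR}
\]
% By construction, an RCR of 1.0 corresponds to the benchmark median, so
% values above 1.0 indicate a higher field- and time-normalized citation
% rate than the median NIH R01-funded article of the same year.
```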
The following examples illustrate some of the pitfalls in assessing research impact that the RCR can help avoid. The first example compares two articles published in 2005:
Mallucci L, Wells V. Potential role of the antiproliferative cytokine beta-galactoside binding protein in cancer therapy. Curr Opin Investig Drugs. 2005 Dec;6(12):1228–33. PMID: 16370387.
Ogedegbe G, Cassells AN, Robinson CM, DuHamel K, Tobin JN, Sox CH, Dietrich AJ. Perceptions of barriers and facilitators of cancer early detection among low-income minority women in community health centers. J Natl Med Assoc. 2005 Feb;97(2):162–70. PMID: 15712779.
The first article appears in a journal with a high JIF (15.333), whereas the second article appears in a journal with a very low JIF (0.182). Conversely, the first article has an RCR of 0.2, significantly below the NIH R01-funded median, whereas the second article has an RCR of 4.0, far above that median. This example illustrates how misleading reliance on the JIF for article-level comparisons can be.
Next, consider two articles published in 2013:
Djiane A, Krejci A, Bernard F, Fexova S, Millen K, Bray SJ. Dissecting the mechanisms of Notch induced hyperplasia. EMBO J. 2013 Jan 9;32(1):60–71. DOI: http://dx.doi.org/10.1038/emboj.2012.326. Epub 2012 Dec 11. PubMed PMID: 23232763.
Browning DJ. Impact of the revised American Academy of Ophthalmology guidelines regarding hydroxychloroquine screening on actual practice. Am J Ophthalmol. 2013 Mar;155(3):418–28.e1. DOI: http://dx.doi.org/10.1016/j.ajo.2012.09.025. Epub 2012 Dec 4. PubMed PMID: 23218706.
Both articles have 27 citations; however, the first article is in the field of genomics where articles tend to be much more heavily cited than in ophthalmology, the field of the second article. The RCR reflects this, as the genomics article has an RCR of 1.3, while the ophthalmology article with the same number of citations has an RCR of 3.7. A comparison of raw citations—or any metric based solely on those raw citation numbers—would have led to a very misleading comparison of the impact of these articles in their respective fields.
A recent example from our institution provides another striking illustration of the difference between assessing an article with the JIF versus the RCR. The New York University (NYU) Office of Science and Research requested a report on 2016 articles by NYU School of Medicine faculty from the database of faculty articles maintained by the library [16], including a section on articles appearing in journals with a JIF greater than 10. Upon discussion, it became clear that the office intended to identify works of particular merit. In response, the authors provided the requested report, along with an additional report listing works with an RCR in the 90th percentile or above relative to NIH R01-funded works. The reports listed 475 articles in journals with a JIF greater than 10 and 666 with RCR values at the 90th percentile or above (supplemental Appendix B), with only 186 of those articles appearing in both lists.
Comparing the 100 highest-ranking items from both reports, we found only 36 articles common to the RCR list and JIF list. While the lowest impact factor in the top 100 list for JIF was over 30, the RCR top 100 list contained 36 articles from journals with a JIF less than 10. The JIF top 100 list contained 12 items with an RCR of less than 1.0 or no calculated RCR at all (due to inadequate numbers of recorded citations). To summarize, the 2 lists agreed on just over a third of their contents. Of the articles that the JIF flagged as most important, 12% showed no evidence of importance according to the RCR. Almost 40% of the articles that the RCR flagged as most impactful were published in journals with a JIF of less than 10.
To further explore the relationship between the 2 metrics, we calculated the correlation between the RCR and the JIF for research articles published by NYU School of Medicine faculty in 2015. We found 4,470 research articles (supplemental Appendix C) and filtered out articles with either no JIF or no RCR. We performed linear regression of the RCR on the JIF for the resulting 3,772 articles; the R-squared value was 0.13 (this statistic ranges from 0 to 1.0 and reflects the proportion of variance in the RCR that is explained by the JIF). So, while the correlation between the JIF and the RCR is highly statistically significant (p < 2 × 10⁻¹⁶), it is not particularly strong (Figure 1). Because only 13% of the variance in the RCR is explained by the JIF, the data points are not tightly clustered around the regression line, suggesting that the JIF serves as a very poor proxy for the impact of an article.
Figure 1 Relative citation ratio (RCR) versus journal impact factor (JIF) for research articles published in 2015 by NYU School of Medicine faculty, for which both an RCR and a JIF are available
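The regression itself is straightforward to reproduce for any set of articles with both metrics. The Python sketch below, which uses made-up (JIF, RCR) pairs rather than our actual dataset, shows one way to do so with SciPy.

```python
# Illustrative sketch of regressing RCR on JIF, using made-up (JIF, RCR) pairs
# in place of the actual NYU dataset; articles missing either metric are dropped.
from scipy import stats

articles = [(15.333, 0.2), (0.182, 4.0), (10.4, 1.3), (4.0, 3.7), (None, 2.1), (7.2, None)]

pairs = [(jif, rcr) for jif, rcr in articles if jif is not None and rcr is not None]
jifs, rcrs = zip(*pairs)

# Ordinary least-squares regression of RCR on JIF
result = stats.linregress(jifs, rcrs)
print(f"R-squared = {result.rvalue ** 2:.2f}, p = {result.pvalue:.3g}")
```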
The above examples provide evidence that using the JIF as a proxy for the impact of an article can be very misleading. This has implications for the culture of faculty publishing. Over the last few decades, faculty who wanted their works to be recognized as important by hiring committees and promotion and tenure committees typically needed to ensure that they appeared in journals with high JIFs. Shifting the focus from the JIF to the RCR moves the emphasis from “where” an article is published to “what” it contributes, making journal choice alone less of a priority.
The RCR is just beginning to acquire “mind share” among practitioners, but bibliometricians have been discussing it intensely [4, 17–21], and there are active projects using the RCR for real-world data analysis [22–27]. Predictably, the RCR has not escaped criticism. As with all metrics, it has limitations and complications. The RCR is currently calculated only for MEDLINE citations from 1995 onward, so it is not a suitable choice for older or non-biomedical literature. Like other bibliometric measures, the RCR is subject to a good deal of latency (it takes time for citation counts to become meaningful), and therefore, like all citation-based metrics, it is of limited utility for newly published articles.
Another limitation is that, because the RCR uses NIH funding as its benchmark, its usefulness might be diminished outside of the United States. It has also been suggested that the RCR’s strength in field normalization might cause it to discount the value of interdisciplinary works [21]. Finally, it is possible for the addition of new citations to cause a decrease in the RCR [18], either because the field of the article changes over time or because the article accrues citations at a slower pace than other articles published in that year. However, analyses show that even a small drop in the RCR occurs very rarely, and such a decrease may accurately reflect the diminishing influence of that article relative to its peers [28]. Both the time of year when an article is published and the length of time it is available online ahead of its print publication date affect the RCR; for newer articles, the difference in time for citations to accrue between an article published in December and one published in January of the same year can be quite significant.
Despite these issues, the RCR is arguably a significant improvement over the most commonly used metrics. There are still many open questions about the practical application of the RCR. For instance, there is no current agreement on how best to extrapolate from a set of RCRs (e.g., for all articles by an author) to a single number suitable for comparative purposes, like the h-index.
We have found that the administration at our institution has been receptive to the RCR as a supplement to or replacement for existing metrics. Our institutional bibliometrics dashboards [2] have been updated to replace displays of raw citation counts with a number of different visualizations incorporating the RCR. We have updated our library’s publication metrics tool to incorporate the RCR. The promotion and tenure reports currently provide the mean and median RCR values for a faculty member’s articles alongside the h-index.
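As a concrete illustration of those report values, the Python sketch below computes the mean and median RCR and the h-index for a hypothetical faculty member; the per-article numbers are invented, and the computation is a simplified stand-in for our actual reporting pipeline.

```python
# Sketch of the summary metrics reported alongside one another in our promotion
# and tenure reports: mean RCR, median RCR, and h-index. The per-article
# (citation count, RCR) values below are invented for illustration.
from statistics import mean, median

articles = [(120, 3.7), (54, 1.3), (27, 0.9), (8, 0.4), (3, 0.2)]
citations = [c for c, _ in articles]
rcrs = [r for _, r in articles]

def h_index(citation_counts):
    """Largest h such that h of the articles each have at least h citations."""
    ranked = sorted(citation_counts, reverse=True)
    return sum(1 for rank, count in enumerate(ranked, start=1) if count >= rank)

print(f"mean RCR = {mean(rcrs):.2f}, median RCR = {median(rcrs):.2f}, h-index = {h_index(citations)}")
```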
As medical librarians devote more time and attention to bibliometrics, we are fortunate to have better tools at our disposal. We strongly urge medical librarians to add the RCR to the other arrows in their bibliometrics quiver.
The authors thank Fred Willie Zametkin LaPolla and Richard McGowan for discussion and comment on the manuscript.
1 Buttigieg SC, Pace A, Rathert C. Hospital performance dashboards: a literature review. J Health Organ Manag. 2017 May 15;31(3):385–406.
2 Tachibana C. New tools for measuring academic performance. Science. 2017 Feb 10;355(6325):651–4.
3 Deciphering impact factors. Nature Neurosci. 2003;6:783. DOI: http://dx.doi.org/10.1038/nn0803-783.
4 Santangelo GM. Article-level assessment of influence and translation in biomedical research. Mol Biol Cell. 2017 Jun 1;28(11):1401–8.
5 Hirsch JE. An index to quantify an individual’s scientific research output. Proc Natl Acad Sci U S A. 2005 Nov 15;102(46):16569–72.
6 Harzing AW, Alakangas S. Google Scholar, Scopus and the Web of Science: a longitudinal and cross-disciplinary comparison. Scientometrics. 2016 Feb 1;106(2):787–804. DOI: http://dx.doi.org/10.1007/s11192-015-1798-9.
7 Ball P. The h-index, or the academic equivalent of the stag’s antlers. Guardian. 2012 Jan 6.
8 Hutchins BI, Yuan X, Anderson JM, Santangelo GM. Relative citation ratio (RCR): a new metric that uses citation rates to measure influence at the article level [Internet]. bioRxiv. 2015 [cited 23 Jul 2018]. <https://www.biorxiv.org/content/biorxiv/early/2015/10/22/029629.full.pdf>.
9 Hutchins BI, Yuan X, Anderson JM, Santangelo GM. Relative citation ratio (RCR): a new metric that uses citation rates to measure influence at the article level. PLoS Biol. 2016 Sep 6;14(9):e1002541.
10 Crespo JA, Herranz N, Li Y, Ruiz-Castillo J. The effect on citation inequality of differences in citation practices at the web of science subject category level. J Assoc Inf Sci Technol. 2014 Jun;65(6):1244–56. DOI: http://dx.doi.org/10.1002/asi.23006.
11 Abramo G, Cicero T, D’Angelo CA. Revisiting the scaling of citations for research assessment. J Informetr. 2012 Oct;6(4):470–9. DOI: http://dx.doi.org/10.1016/j.joi.2012.03.005.
12 Radicchi F, Fortunato S, Castellano C. Universality of citation distributions: toward an objective measure of scientific impact. Proc Natl Acad Sci U S A. 2008 Nov 11;105(45):17268–72. DOI: http://dx.doi.org/10.1073/pnas.0806977105.
13 Li Y, Radicchi F, Castellano C, Ruiz-Castillo J. Quantitative evaluation of alternative field normalization procedures. J Informetr. 2013 Jul;7(3):746–55. DOI: http://dx.doi.org/10.1016/j.joi.2013.06.001.
14 National Institutes of Health, Office of Portfolio Analysis. iCite new analysis [Internet]. The Institutes [cited 23 Jul 2018]. <https://icite.od.nih.gov/analysis>.
15 National Institutes of Health, Office of Portfolio Analysis. iCite API [Internet]. The Institutes [cited 23 Jul 2018]. <https://icite.od.nih.gov/api>.
16 Vieira D, McGowan R, McCrillis A, Lamb I, Larson C, Bakker T, Spore S. The Faculty Bibliography Project at the NYU School of Medicine. J Librariansh Schol Commun. 2014;2(3):eP1161.
17 Bornmann L, Haunschild R. Relative citation ratio (RCR): an empirical attempt to study a new field-normalized bibliometric indicator. J Assoc Inf Sci Technol. 2017;68(4):1064–7. DOI: http://dx.doi.org/10.1002/asi.23729.
18 Janssens ACJW, Goodman M, Powell KR, Gwinn M. A critical evaluation of the algorithm behind the relative citation ratio (RCR). PLoS Biol. 2017 Oct 2;15(10):e2002536. DOI: http://dx.doi.org/10.1371/journal.pbio.2002536.
19 Waltman L. NIH’s new citation metric: a step forward in quantifying scientific impact? Centre for Science and Technology Studies Blog [Internet]. 2 Nov 2015 [cited 23 Jul 2018]. <https://www.cwts.nl/blog?article=n-q2u294>.
20 Ioannidis JPA, Boyack K, Wouters PF. Citation metrics: a primer on how (not) to normalize. PLoS Biol. 2016 Sep 6;14(9):e1002542. DOI: http://dx.doi.org/10.1371/journal.pbio.1002542.
21 Basken P. Better than impact factor? NIH team claims key advance in ranking journal articles. Chron Higher Educ. 2016 Sep 7.
22 Schneider M, Kane CM, Rainwater J, Guerrero L, Tong G, Desai SR, Trochim W. Feasibility of common bibliometrics in evaluating translational science. J Clin Transl Sci. 2017 Feb;1(1):45–52. DOI: http://dx.doi.org/10.1017/cts.2016.8.
23 Gao Y, Stanley DV, Robinson VC, Reid DM, Galis ZS. Angiogenesis research: extramural portfolio supported by the National Heart, Lung, and Blood Institute, 2008–2015. Circ Res. 2017 May 26;120(11):1713–7. DOI: http://dx.doi.org/10.1161/CIRCRESAHA.117.310735.
24 Bornmann L, Williams R. Can the journal impact factor be used as a criterion for the selection of junior researchers? a large-scale empirical study based on ResearcherID data. J Informetr. 2017 Aug;11(3):788–99. DOI: http://dx.doi.org/10.1016/j.joi.2017.06.001.
25 Rock CB, Prabhu AV, Fuller CD, Thomas CR, Holliday EB. Evaluation of the relative citation ratio, a new National Institutes of Health–supported bibliometric measure of research productivity, among academic radiation oncologists. J Am Coll Radiol. 2018 Mar;15(3, pt A):469–74. DOI: http://dx.doi.org/10.1016/j.jacr.2017.11.006.
26 Yu F, Hayes B. Applying data analytics and visualization to assessing the research impact of the Cancer Cell Biology (CCB) Program at the University of North Carolina at Chapel Hill. J eSci Librariansh. 2018;7(1):e1123.
27 Ündar A. The relative citation ratio: measuring impact of publications from an international conference with a new NIH metric. Artif Organs. 2017 Dec;41(12):1085–91. DOI: http://dx.doi.org/10.1111/aor.13079.
28 Hutchins BI, Hoppe TA, Meseroll RA, Anderson JM, Santangelo GM. Additional support for RCR: a validated article-level measure of scientific influence. PLoS Biol. 2017 Oct 2;15(10):e2003552. DOI: http://dx.doi.org/10.1371/journal.pbio.2003552.
Alisa Surkis, PhD, MLS, alisa.surkis@med.nyu.edu, http://orcid.org/0000-0002-9777-2693, Assistant Director for Research Data and Metrics, Health Sciences Library, NYU, 577 1st Avenue, New York, NY 10016,
Stuart Spore, MS, MLIS, stuart.spore@med.nyu.edu, https://orcid.org/0000-0001-5390-4394, Lead, Scholarly Output Assessment, Health Sciences Library, NYU, 577 1st Avenue, New York, NY 10016.
Articles in this journal are licensed under a Creative Commons Attribution 4.0 International License.
This journal is published by the University Library System of the University of Pittsburgh as part of its D-Scribe Digital Publishing Program and is cosponsored by the University of Pittsburgh Press.
Journal of the Medical Library Association, VOLUME 106, NUMBER 4, October 2018