Coverage of open citation data approaches parity with Web of Science and Scopus

Authors

DOI:

https://doi.org/10.3145/thinkepi.2021.e15e04

Keywords:

Infraestructura académica, Transformación digital, Comunicación cientí­fica, Ciencia abierta, Metadatos académicos, Licencias abiertas, índices de Citas, Web of Science, Scopus, Google Scholar, Microsoft Academic, Crossref, 14OC, OpenCitations, COCI, NIH-OCC, Internet Archive, Refcat

Abstract

The information sources that are often used to monitor and to obtain a better understanding of the system of scholarly communication (such as Web of Science, Scopus, and Google Scholar) have historically been distributed under restrictive use licenses. However, in a scenario where science and scientific communication are undergoing a process of digital transformation, these models do not facilitate the development of new infrastructure that is better adapted to current and future needs. At the same time, these models hamper reproducibility. In recent years, a variety of open data sources, such as Microsoft Academic, Crossref, and others, have become available, providing easy access to large collections of metadata that were previously only available from closed sources. Citation data are one type of metadata provided by these open data sources. This study documents the significant growth in coverage of open citation data that has taken place between 2019 and 2021, and the events that have led to this point. These collections of open scholarly metadata have kick-started the development of a new ecosystem of scholarly information services. However, their fragility still poses a risk for downstream applications. Academic libraries could become important allies of open scholarly metadata initiatives.

References

Bilder, Geoffrey; Lin, Jennifer; Neylon, Cameron (2020). The principles of open scholarly infrastructure. https://doi.org/10.24343/C34W2H

Czygan, Martin; Holzmann, Helge; Newbold, Bryan (2021). Refcat: The internet archive scholar Citation graph. arXiv: 2110.06595 [cs]. http://arxiv.org/abs/2110.06595

Heibi, Ivan; Peroni, Silvio; Shotton, David (2019). "Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations". Scientometrics, v. 121, pp. 1213-1228. https://doi.org/10.1007/s11192-019-03217-6

Hendricks, Ginny; Kramer, Bianca; Maccallum, Catriona J.; Manghi, Paolo; Neylon, Cameron; Peroni, Silvio; Shotton, David; Tay, Aaron; Waltman, Ludo (2021). "Now is the time to work together toward open infrastructures for scholarly metadata". Impact of social sciences blog, octubre 27. https://blogs.lse.ac.uk/impactofsocialsciences/2021/10/27/now-is-the-time-to-work-together-toward-open-infrastructures-for-scholarly-metadata

Hutchins, B. Ian; Baker, Kirk L.; Davis, Matthew T.; Diwersy, Mario A.; Haque, Ehsanul; Harriman, Robert M.; Hoppe, Travis A.; Leicht, Stephen A.; Meyer, Payam; Santangelo, George M. (2019). "The NIH open citation collection: A public access, broad coverage resource". PLOS biology, v. 17, n. 10, e3000385. https://doi.org/10.1371/journal.pbio.3000385

Martí­n-Martí­n, Alberto; Thelwall, Mike; Orduna-Malea, Enrique; Delgado-López-Cózar, Emilio (2021). "Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations´ COCI: A multidisciplinary comparison of coverage via citations". Scientometrics, v. 126, n. 1, pp. 871-906. https://doi.org/10.1007/s11192-020-03690-4

Peroni, Silvio; Shotton, David (2020). "OpenCitations, an infrastructure organization for open scholarship". Quantitative science studies, v. 1, n. 1, pp. 428-444. https://doi.org/10.1162/qss_a_00023

Tay, Aaron; Martí­n-Martí­n, Alberto; Hug, Sven E. (2021). "Goodbye, Microsoft Academic - hello, open research infrastructure?". Impact of social sciences blog, mayo 27. https://blogs.lse.ac.uk/impactofsocialsciences/2021/05/27/goodbye-microsoft-academic-hello-open-research-infrastructure/

Published

2021-11-29

How to Cite

Martí­n-Martí­n, A. (2021). Coverage of open citation data approaches parity with Web of Science and Scopus. Anuario ThinkEPI, 15. https://doi.org/10.3145/thinkepi.2021.e15e04

Issue

Section

E. Comunicación cientí­fica, edición y fuentes de información