Coverage of open citation data approaches parity with Web of Science and Scopus




Scholarly infrastructure, Digital transformation, Scholarly communication, Open science, Scholarly metadata, Open licenses, Citation indexes, Web of Science, Scopus, Google Scholar, Microsoft Academic, Crossref, I4OC, OpenCitations, COCI, NIH-OCC, Internet Archive, Refcat


The information sources most often used to monitor and understand the system of scholarly communication (such as Web of Science, Scopus, and Google Scholar) have historically been distributed under restrictive licenses. As science and scholarly communication undergo digital transformation, these licensing models hinder the development of new infrastructure better adapted to current and future needs, and they hamper reproducibility. In recent years, a variety of open data sources, such as Microsoft Academic, Crossref, and others, have become available, providing easy access to large collections of metadata that were previously available only from closed sources. Citation data are one type of metadata provided by these open data sources. This study documents the significant growth in the coverage of open citation data between 2019 and 2021, and the events that led to it. These collections of open scholarly metadata have kick-started the development of a new ecosystem of scholarly information services. However, their fragility still poses a risk for downstream applications. Academic libraries could become important allies of open scholarly metadata initiatives.


Bilder, Geoffrey; Lin, Jennifer; Neylon, Cameron (2020). The principles of open scholarly infrastructure.

Czygan, Martin; Holzmann, Helge; Newbold, Bryan (2021). Refcat: The Internet Archive Scholar citation graph. arXiv:2110.06595 [cs].

Heibi, Ivan; Peroni, Silvio; Shotton, David (2019). "Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations". Scientometrics, v. 121, pp. 1213-1228.

Hendricks, Ginny; Kramer, Bianca; Maccallum, Catriona J.; Manghi, Paolo; Neylon, Cameron; Peroni, Silvio; Shotton, David; Tay, Aaron; Waltman, Ludo (2021). "Now is the time to work together toward open infrastructures for scholarly metadata". Impact of social sciences blog, October 27.

Hutchins, B. Ian; Baker, Kirk L.; Davis, Matthew T.; Diwersy, Mario A.; Haque, Ehsanul; Harriman, Robert M.; Hoppe, Travis A.; Leicht, Stephen A.; Meyer, Payam; Santangelo, George M. (2019). "The NIH open citation collection: A public access, broad coverage resource". PLOS biology, v. 17, n. 10, e3000385.

Martín-Martín, Alberto; Thelwall, Mike; Orduna-Malea, Enrique; Delgado-López-Cózar, Emilio (2021). "Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations' COCI: A multidisciplinary comparison of coverage via citations". Scientometrics, v. 126, n. 1, pp. 871-906.

Peroni, Silvio; Shotton, David (2020). "OpenCitations, an infrastructure organization for open scholarship". Quantitative science studies, v. 1, n. 1, pp. 428-444.

Tay, Aaron; Martín-Martín, Alberto; Hug, Sven E. (2021). "Goodbye, Microsoft Academic - hello, open research infrastructure?". Impact of social sciences blog, May 27.



How to Cite

Martín-Martín, A. (2021). Coverage of open citation data approaches parity with Web of Science and Scopus. Anuario ThinkEPI, 15.



E. Scholarly communication, publishing, and information sources