Coverage of open citation data approaches parity with Web of Science and Scopus
DOI:
https://doi.org/10.3145/thinkepi.2021.e15e04Keywords:
Infraestructura académica, Transformación digital, Comunicación científica, Ciencia abierta, Metadatos académicos, Licencias abiertas, índices de Citas, Web of Science, Scopus, Google Scholar, Microsoft Academic, Crossref, 14OC, OpenCitations, COCI, NIH-OCC, Internet Archive, RefcatAbstract
The information sources that are often used to monitor and to obtain a better understanding of the system of scholarly communication (such as Web of Science, Scopus, and Google Scholar) have historically been distributed under restrictive use licenses. However, in a scenario where science and scientific communication are undergoing a process of digital transformation, these models do not facilitate the development of new infrastructure that is better adapted to current and future needs. At the same time, these models hamper reproducibility. In recent years, a variety of open data sources, such as Microsoft Academic, Crossref, and others, have become available, providing easy access to large collections of metadata that were previously only available from closed sources. Citation data are one type of metadata provided by these open data sources. This study documents the significant growth in coverage of open citation data that has taken place between 2019 and 2021, and the events that have led to this point. These collections of open scholarly metadata have kick-started the development of a new ecosystem of scholarly information services. However, their fragility still poses a risk for downstream applications. Academic libraries could become important allies of open scholarly metadata initiatives.
References
Bilder, Geoffrey; Lin, Jennifer; Neylon, Cameron (2020). The principles of open scholarly infrastructure. https://doi.org/10.24343/C34W2H
Czygan, Martin; Holzmann, Helge; Newbold, Bryan (2021). Refcat: The internet archive scholar Citation graph. arXiv: 2110.06595 [cs]. http://arxiv.org/abs/2110.06595
Heibi, Ivan; Peroni, Silvio; Shotton, David (2019). "Software review: COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations". Scientometrics, v. 121, pp. 1213-1228. https://doi.org/10.1007/s11192-019-03217-6
Hendricks, Ginny; Kramer, Bianca; Maccallum, Catriona J.; Manghi, Paolo; Neylon, Cameron; Peroni, Silvio; Shotton, David; Tay, Aaron; Waltman, Ludo (2021). "Now is the time to work together toward open infrastructures for scholarly metadata". Impact of social sciences blog, octubre 27. https://blogs.lse.ac.uk/impactofsocialsciences/2021/10/27/now-is-the-time-to-work-together-toward-open-infrastructures-for-scholarly-metadata
Hutchins, B. Ian; Baker, Kirk L.; Davis, Matthew T.; Diwersy, Mario A.; Haque, Ehsanul; Harriman, Robert M.; Hoppe, Travis A.; Leicht, Stephen A.; Meyer, Payam; Santangelo, George M. (2019). "The NIH open citation collection: A public access, broad coverage resource". PLOS biology, v. 17, n. 10, e3000385. https://doi.org/10.1371/journal.pbio.3000385
Martín-Martín, Alberto; Thelwall, Mike; Orduna-Malea, Enrique; Delgado-López-Cózar, Emilio (2021). "Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations´ COCI: A multidisciplinary comparison of coverage via citations". Scientometrics, v. 126, n. 1, pp. 871-906. https://doi.org/10.1007/s11192-020-03690-4
Peroni, Silvio; Shotton, David (2020). "OpenCitations, an infrastructure organization for open scholarship". Quantitative science studies, v. 1, n. 1, pp. 428-444. https://doi.org/10.1162/qss_a_00023
Tay, Aaron; Martín-Martín, Alberto; Hug, Sven E. (2021). "Goodbye, Microsoft Academic - hello, open research infrastructure?". Impact of social sciences blog, mayo 27. https://blogs.lse.ac.uk/impactofsocialsciences/2021/05/27/goodbye-microsoft-academic-hello-open-research-infrastructure/