csisc/DOIPrefixAnalysis

DOI Prefix Analysis

List of CrossRef DOI prefixes augmented with semantic information about their holders from Wikidata

Data Collection

This project collects the list of Digital Object Identifier (DOI) prefixes corresponding to journal publishers, as recorded in the CrossRef bibliographic database on 17 January 2022 (98,420,414 DOIs). The dataset includes the number of journals and publications for every DOI prefix. We then use OpenRefine, a data cleaning and reconciliation tool, to align the publisher names of the top 200 DOI prefixes with their corresponding items in Wikidata, an open, collaborative, multilingual, and multidisciplinary knowledge graph. This alignment will later be used to enrich the list of DOI prefixes with detailed information about each publisher, mainly its type, its country, the year it was created, and the year it ceased operating. The matching will also be used to identify the publishers indexed in Beall's List of predatory publishing institutions. Finally, the output is curated by the first two authors to fill gaps in the data and to verify the information automatically added from Wikidata to our dataset.
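The per-prefix counting described above can be illustrated with a minimal sketch. It is not the project's actual pipeline: the function names (`doi_prefix`, `summarize`, `top_prefixes`), the `(doi, journal)` record layout, and the sample DOIs are all hypothetical. Ranking by publication count is also an assumption, since the selection criterion for the 200 prefixes is not stated here.

```python
from collections import Counter, defaultdict


def doi_prefix(doi: str) -> str:
    """Return the registrant prefix of a DOI, e.g. '10.1000' for '10.1000/xyz'."""
    doi = doi.strip().lower()
    # Drop common resolver or scheme prefixes before splitting.
    for lead in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.startswith(lead):
            doi = doi[len(lead):]
    prefix, sep, suffix = doi.partition("/")
    if not sep or not suffix or not prefix.startswith("10."):
        raise ValueError(f"not a DOI: {doi!r}")
    return prefix


def summarize(records):
    """Count publications and distinct journals per DOI prefix.

    `records` is an iterable of (doi, journal_title) pairs (assumed layout).
    """
    publications = Counter()
    journals = defaultdict(set)
    for doi, journal in records:
        prefix = doi_prefix(doi)
        publications[prefix] += 1
        journals[prefix].add(journal)
    return {
        prefix: {"publications": count, "journals": len(journals[prefix])}
        for prefix, count in publications.items()
    }


def top_prefixes(summary, n=200):
    """Return the n prefixes with the most publications (assumed criterion)."""
    ranked = sorted(summary, key=lambda p: summary[p]["publications"], reverse=True)
    return ranked[:n]


# Hypothetical sample records, for illustration only.
sample = [
    ("10.1000/a1", "Journal A"),
    ("10.1000/a2", "Journal A"),
    ("10.1000/b1", "Journal B"),
    ("https://doi.org/10.5555/x9", "Journal C"),
]
summary = summarize(sample)
```

With the sample above, `top_prefixes(summary, n=1)` selects the prefix `10.1000`, which covers three publications across two journals. The publisher names attached to the selected prefixes would then be reconciled against Wikidata in OpenRefine.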

To cite the work

Turki, H., Fraumann, G., Hadj Taieb, M. A., & Ben Aouicha, M. (2022). Digital Object Identifiers (DOIs) for whom? The Digital Divide in Scientometrics and the Publishing Industry. Frontiers in Research Metrics and Analytics (Forthcoming).
