Cited document by URL with wrong DOI #30

Open
essepuntato opened this issue Oct 9, 2017 · 0 comments

@essepuntato

Resource gbr:12642 has 2289 incoming citation links. However, looking at the actual manuscript identified by the associated DOI, I can't really see that many citations.

There is indeed an issue here, and I think I have identified the problem. I’ve just noticed that gbr:12642 has three identifiers associated with it:

Now, I remember this specific resource: it is the one usually used when the R Statistical Framework is cited in a document, which is why there are no metadata associated with it (Crossref did not return anything). However, the last DOI associated with this resource (gid:9489845) looked a bit odd. Thus, I investigated when it was added, and from which source, by accessing its provenance information (https://w3id.org/oc/corpus/id/9489845/prov/se/1). Looking at the actual source XML file (http://www.ebi.ac.uk/europepmc/webservices/rest/PMC5039327/fullTextXML), I found the following reference, from which the basic data were extracted:

<mixed-citation publication-type="book" id="eva12394-cit-0034">
	<collab collab-type="authors">R Development Core Team</collab>
	<year>2014</year>
	<source>R: A Language and Environment for Statistical Computing</source>.
	<publisher-name>R Foundation for Statistical Computing</publisher-name>.
	doi: <ext-link ext-link-type="doi" xlink:href="10.1007/978-3-540-74686-7">
		10.1007/978&#x02010;3&#x02010;540&#x02010;74686&#x02010;7
	</ext-link>.
	<ext-link ext-link-type="uri" xlink:href="http://www.r-project.org">
		http://www.r-project.org
	</ext-link>.
</mixed-citation>

As you can see, this reference specifies both a DOI and a URL. Since the OpenCitations process extracts both and associates them with the same resource, this is where the mistake happened.
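To make the failure mode concrete, here is a minimal Python sketch of how a reference like the one above could be flagged before its identifiers are merged. This is not the OpenCitations code: the function names `extract_links` and `needs_review`, the wrapping `<ref>` element (added only to declare the `xlink` prefix), and the rule itself (flag any reference whose URL does not contain its DOI) are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

XLINK = "http://www.w3.org/1999/xlink"

# The reference from this issue, wrapped in a hypothetical <ref> root that
# declares the xlink prefix (in the full article XML an ancestor declares it).
SAMPLE = """<ref xmlns:xlink="http://www.w3.org/1999/xlink">
<mixed-citation publication-type="book" id="eva12394-cit-0034">
  <collab collab-type="authors">R Development Core Team</collab>
  <year>2014</year>
  <source>R: A Language and Environment for Statistical Computing</source>.
  <publisher-name>R Foundation for Statistical Computing</publisher-name>.
  doi: <ext-link ext-link-type="doi" xlink:href="10.1007/978-3-540-74686-7">10.1007/978&#x02010;3&#x02010;540&#x02010;74686&#x02010;7</ext-link>.
  <ext-link ext-link-type="uri" xlink:href="http://www.r-project.org">http://www.r-project.org</ext-link>.
</mixed-citation>
</ref>"""


def extract_links(citation):
    """Group the xlink:href values of all ext-link elements by ext-link-type."""
    links = {}
    for link in citation.iter("ext-link"):
        kind = link.get("ext-link-type")
        href = link.get(f"{{{XLINK}}}href")
        if kind and href:
            links.setdefault(kind, []).append(href)
    return links


def needs_review(links):
    """Assumed rule: a DOI and a URL that does not contain that DOI may
    identify different things, so they should not be merged blindly."""
    dois = links.get("doi", [])
    uris = links.get("uri", [])
    return any(doi.lower() not in uri.lower() for doi in dois for uri in uris)


citation = ET.fromstring(SAMPLE).find("mixed-citation")
links = extract_links(citation)
print(links, needs_review(links))
```

A check like this would only flag the reference for inspection. Deciding which of the two identifiers actually belongs to the cited resource would still need a lookup or a manual check.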

I’ve just added a new issue in the repository to try to avoid these kinds of issues in the future, but the main point is that the error was propagated from the original source.

This issue has been identified by Daniel Ecer (eLife).

@essepuntato essepuntato added the bug label Oct 9, 2017