Duplicate detection during external source import should escape metadata #3712
Labels
bug
component: configurable entities
related to configurable entities
help wanted
Needs a volunteer to claim to move forward
integration: OpenAIRE
Related to integration with OpenAIRE
tools:import-sources
Related to "Live Import" Sources feature, allowing import of content via external APIs.
Describe the bug
When importing an item from an external source (e.g. OpenAIRE), DSpace now checks if the item already exists locally and alerts the user. However, the strategy for doing this is simply a solr query using the name metadata. This works for most cases, but breaks when certain characters are not escaped in the title. e.g.
:
This means any titles with colons will necessarily create duplicates every time they are imported.
I suspect these other characters may present issues as well:
+
,--
,-
,&&
,||
,!
,(
,)
,"
,~
,*
,?
,:
To Reproduce
Publication
submission to attach a relatedProject
entity.655609
. You should see an item with the title "Adriatic Perspectives: Memory and Identity on a Transnational European Periphery".Expected behavior
During step 5, the original Project should be detected and displayed for the user to select
Related work
TBD
The text was updated successfully, but these errors were encountered: