Encoding issues when interacting with Blazegraph via py4jps #667

Open
markushofmeister opened this issue Mar 16, 2023 · 0 comments
Labels
bug (Something isn't working), python-wrapper (Issues relating to py4jps and pyderivationagent)

Comments

@markushofmeister
Collaborator

Non-ASCII characters are corrupted when Python agents interact with Blazegraph via the RemoteStoreClient of the JPS_BASE_LIB. For example, after uploading unit symbols such as £, °, °C, or µg, the Blazegraph GUI displays odd characters in their place, and queries for those symbols return the same odd characters.
The issue most likely arises at the Python-Java interface: uploading the entire Ontology of Units of Measure RDF file with the RemoteStoreClient natively in Java does not cause any of these problems.
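
To illustrate what such "odd characters" can look like, here is a minimal sketch of one common cause: text encoded as UTF-8 on one side and decoded with a single-byte codec on the other. Both the choice of Latin-1 as the wrong codec and the helper name `simulate_mojibake` are assumptions for illustration; the issue has not confirmed which codecs are actually involved at the py4jps boundary.

```python
def simulate_mojibake(text: str, wrong_codec: str = "latin-1") -> str:
    """Encode text as UTF-8, then decode the bytes with a mismatched codec.

    Hypothetical helper: it only mimics one possible mismatch and does not
    call py4jps or Blazegraph.
    """
    return text.encode("utf-8").decode(wrong_codec)


# The unit symbols mentioned in this issue.
for symbol in ["£", "°", "°C", "µg"]:
    print(repr(symbol), "->", repr(simulate_mojibake(symbol)))
```

If the characters shown in the Blazegraph GUI match this pattern (an extra leading character before each symbol), a UTF-8 versus single-byte mismatch is a plausible candidate; if they differ, another codec pair is involved.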

The current workaround consists of:

  1. uploading all units with special characters at the start of the relevant agents (to minimise instantiation of non-ASCII Literals, as the instantiation seems to be inconsistent and has previously produced different sets of odd characters), e.g. here
  2. decoding unit symbols upon retrieval, e.g. here
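
The second step, decoding symbols upon retrieval, could look roughly like the sketch below. It is not the code linked above: the function name `repair_symbol` is hypothetical, and it assumes the corruption is UTF-8 bytes that were misread as Latin-1, which may not hold for every odd character set observed.

```python
def repair_symbol(text: str, wrong_codec: str = "latin-1") -> str:
    """Try to undo a UTF-8-read-as-single-byte corruption.

    Assumption: the string was originally UTF-8 and was decoded with
    `wrong_codec`. If the round trip fails, the input is returned unchanged.
    """
    try:
        # Recover the original bytes, then decode them as UTF-8.
        return text.encode(wrong_codec).decode("utf-8")
    except UnicodeError:
        # Already correct (or corrupted in a different way): leave as-is.
        return text


# Corrupted and already-correct inputs are both handled.
for retrieved in ["\u00c2\u00b0C", "°C", "kg"]:
    print(repr(retrieved), "->", repr(repair_symbol(retrieved)))
```

Falling back to the unchanged input keeps the function safe to apply to every retrieved literal, although a correct string that happens to form valid UTF-8 when re-encoded would be altered, so it is best limited to fields known to hold unit symbols.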

For the visualisation, this has been addressed by PR #466 (Issue #513).

@markushofmeister markushofmeister added bug Something isn't working python-wrapper Issues relating to py4jps and pyderivationagent labels Mar 16, 2023