-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Need help with whitespace, indent, new lines in XML/Oxygen #53
Comments
@davidamichelson @wlpotter is this a result of oXygen or a result of eXist export? It looks to me like at some point the file was 'pretty-printed' in oXygen, this shouldn't change meaningful whitespace. |
@wsalesky thanks, we would like to use regex to find and correct all cases where there is a missing "." at the end of either //desc() or //desc/quote() but we can't seem to get around the new lines in Oxygen. Any ideas? |
The problem was caused by Oxygen we think |
What is the regex you are trying? |
@wsalesky Let's save this for later. When we do look at it, record 2294 is a good example
|
@wsalesky I think the default settings in Oxygen is doing something odd to the spacing/indent in our files.
See for example: https://github.com/srophe/bethqatraye-data/blob/master/data/places/tei/143.xml#L138-L139
Why is this desc text node splitting into a new line like that?
Or why is the closing /desc on a new line here: https://github.com/srophe/bethqatraye-data/blob/master/data/places/tei/143.xml#L142-L143
Anyway, we would like to write some find and replace scripts using regex to clean this data up, but are having trouble because of the spacing. Any ideas on what is going on?
The text was updated successfully, but these errors were encountered: