You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Indeed, this is a pretty exotic feature request but I happen to have some CSVs where the last column contains mixed text including XML and the text within the XML tags can potentially have newline characters which shouldn't be interpreted as newlines when parsing the file. Two such examples
<PAGE_AUTHORS>
\n
\n
\n
\n
\nHACKETT;Ark. — A sheriff;admin;About the Author</PAGE_AUTHORS>
and
<PAGE_AUTHORS>K G Rana;\nMax Planck Institute of Microstructure Physics;Weinberg 2;D-06120 Halle;Germany;\nMax Planck Institute for Chemical Physics of Solids;Nöthnitzer Str. 40;D-01187 Dresden;O Meshcheriakova;J Kübler;\nInstitut für Festkörperphysik;Technische Universität Darmstadt;D-64289 Darmstadt;B Ernst;J Karel;R Hillebrand;E Pippel;P Werner;A K Nayak;C Felser;S S P Parkin</PAGE_AUTHORS>
The first example is taken from the file 20160810171500.gkg.csv from the GDELT2 dataset
The text was updated successfully, but these errors were encountered:
Are these columns surrounded by quotation marks? If not, we would have to add support for XML to handle this? That seems not like a good idea :) Or am I misunderstanding something here?
Indeed, this is a pretty exotic feature request but I happen to have some CSVs where the last column contains mixed text including XML and the text within the XML tags can potentially have newline characters which shouldn't be interpreted as newlines when parsing the file. Two such examples
and
The first example is taken from the file
20160810171500.gkg.csv
from the GDELT2 datasetThe text was updated successfully, but these errors were encountered: