-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support all parquet types? #9
Comments
The question is what to do with data types that Stata doesn't support natively. These include:
The options are
|
I think that making them doubles is the way to go. |
Brownie points if it parses dates into a stata data format. |
I think the dates are from January 1, 1970, whereas in Stata they're compared to January 1, 1960, so a recomputation might be needed... |
It seems that unix time is 1970, but for whatever reason Stata does 1960 (SAS?) |
I suppose so. I didn't know SAS also had 1960 as epoch. |
It seems that parquet has 8 data primitives; the above are built on those primitives, so the plugin should already be able to read all of these. What ought to happen is that we should keep the formats somehow... |
What do you mean? |
I mean that it would be ideal to keep the display format. |
When writing or reading? |
Both. Not sure if it's automagic when writing if the date tyoe is declared, but for sure that is not the case when reading. Atm it's treated as long or double. |
I don't know how you would keep the display format in either. |
I think when reading it would be more relevant than when writing. For the latter if the target is a date type I suppose that's enough. When reading, Stata could format the variable after the data is in memory. Maybe not super crucial the more I think about it, but possibly convenient. |
Well dates should be possible because there's a date type in Parquet... But you won't be able to retrieve like |
See
arrow/cpp/src/arrow/builder.cc
. For instance,The text was updated successfully, but these errors were encountered: