Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alternatives for IO #38

Open
klmr opened this issue Mar 11, 2015 · 0 comments
Open

Alternatives for IO #38

klmr opened this issue Mar 11, 2015 · 0 comments
Labels

Comments

@klmr
Copy link
Contributor

klmr commented Mar 11, 2015

At the moment the io module exclusively uses R base IO plus xlsx.

There are two or three broad alternatives:

  • data.table::fread. This is a thoroughly terrible idea and I strongly oppose it, because their code base and their API is a mess, and it’s been in development for years without a stable version. Performance is phenomenal, but we need correctness first, performance last.
  • readr. Also still in development. Uses sane defaults (somewhat similar to what we’re already doing with stringsAsFactors), produces dplyr-compatible tbl_df without row names. Not as fast as fread but apparently still an order of magnitude faster than base R.
  • rio. I honestly don’t know exactly what this is; it seems to be higher level, i.e. read data from different format using the same interface (same as io, but way more formats supported). Uses (or rather, will use in the future) readr under the hood.
@klmr klmr added the question label Mar 11, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant