You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It turns out that even in version 2.2.0 of pandas the new string dtype is not up to the same speed for some tasks, and unfortunately one of them is indexing:
object
9.78 µs ± 18.9 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)
string
15.7 µs ± 36.5 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)
string[pyarrow]
17.6 µs ± 66.6 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)
So we might consider switching to store the file index again as object dtype as we do now for the dependencies in audb (audeering/audb#371). The only problem is, that in audb the change is hidden for the user, whereas here it would be a breaking change.
The text was updated successfully, but these errors were encountered:
It turns out that even in version
2.2.0
ofpandas
the newstring
dtype is not up to the same speed for some tasks, and unfortunately one of them is indexing:which returns
So we might consider switching to store the
file
index again asobject
dtype as we do now for the dependencies in audb (audeering/audb#371). The only problem is, that inaudb
the change is hidden for the user, whereas here it would be a breaking change.The text was updated successfully, but these errors were encountered: