Given a Tree and a Branch name, the script provided here will extract the data from that branch and store the content as a .npz archive. This tool can read and decompress ROOT Tree files in parallel, making it a fast option for Machine Learning Feature Engineering.
After making sure you have Python
and pip
installed; you just need to install a few dependencies:
$ pip install --user -r requirements.txt
For basic usage instructions try:
$ ./extract.py --help
- add options for exporting .npx and .hdf
⚠️ This utility is NOT compatible with irregular data structures ⛔️- Although this tool was designed to be used in conjuncture with nsw_data_wrangler's output. It may be adapted to any other use-case requiring parallel extraction of highly compressed ROOT data.