scottblanc/vowpal-wabbit-data
Simple utility class for parsing and vectorizing vowpal wabbit data in Python.

The vowpal wabbit format has a few very nice properties:
* handles text and non-text very naturally
* groups features by namespace
* stores metadata, like importance weight and tags

This class tries to maintain fidelity for these properties and in addition:
* allows easy slicing and dicing of the data by leveraging pandas
* supports direct extraction of large, sparse feature matrices grouped by namespace
* supports simple feature pruning
* provides pass-through processing of text using scipy text analyzers

To see it in action, type "python example.py".
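To make the format properties above concrete, here is a minimal sketch of how one vowpal wabbit line breaks down into a label, an importance weight, a tag, and features grouped by namespace. It is not this repository's class: `parse_vw_line` is a hypothetical helper written only for illustration. It assumes tags start with a single quote and ignores the optional base value and namespace scaling factors.

```python
def parse_vw_line(line):
    """Split one VW-format line into label, importance, tag, and namespaces.

    Hypothetical helper for illustration; not part of this repository.
    """
    header, *sections = line.rstrip("\n").split("|")

    # Header layout assumed here: [label] [importance] ['tag]
    label, importance, tag = None, 1.0, None
    numbers = []
    for token in header.split():
        if token.startswith("'"):
            tag = token[1:]
        else:
            numbers.append(float(token))
    if numbers:
        label = numbers[0]
    if len(numbers) > 1:
        importance = numbers[1]

    # Each "|" starts a namespace; a space right after the bar means
    # the features belong to the default (unnamed) namespace.
    namespaces = {}
    for section in sections:
        tokens = section.split()
        if section[:1].isspace() or not tokens:
            name = ""
        else:
            name = tokens.pop(0).split(":")[0]  # drop any scaling factor
        features = namespaces.setdefault(name, {})
        for token in tokens:
            feature, sep, value = token.partition(":")
            # Features without an explicit value default to 1.0
            features[feature] = float(value) if sep else 1.0

    return {
        "label": label,
        "importance": importance,
        "tag": tag,
        "namespaces": namespaces,
    }


example = parse_vw_line("1 2.0 'ex1|text hello world |stats length:2 score:0.5")
print(example["namespaces"]["text"])
```

Keeping each namespace as its own dictionary mirrors the grouping that the format provides, which is what makes per-namespace feature matrices straightforward to build later.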