Support WARC extension fields from Browsertrix (#76)
* feat: support Browsertrix extension fields in WARC header

  To preserve the case of field-names produced by Browsertrix, this commit adds the fields WARC-Page-ID, WARC-JSON-Metadata and WARC-Resource-Type to the list of known header fields. gowarc can handle unknown fields but normalizes their field-names to title-case. The WARC specification allows for extension fields in the WARC header.

* test: refactor validateHeader test

  Adds some more test cases.

* test: add test for normalizeName
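The effect described in the first commit can be illustrated with a minimal sketch. This is not gowarc's actual code: the `knownFields` table and the `canonicalFieldName` helper are hypothetical names chosen for illustration, and the title-casing rule is assumed to capitalize the first letter of each hyphen-separated part of an ASCII field-name. It shows why an unregistered name such as WARC-Page-ID would lose its original casing, while a registered name keeps it.

```go
package main

import (
	"fmt"
	"strings"
)

// knownFields maps a lowercased field-name to the spelling that should be
// preserved. Hypothetical table; the three entries are the fields named in
// the commit message.
var knownFields = map[string]string{
	"warc-page-id":       "WARC-Page-ID",
	"warc-json-metadata": "WARC-JSON-Metadata",
	"warc-resource-type": "WARC-Resource-Type",
}

// canonicalFieldName returns the registered spelling for known fields and
// falls back to title-case for everything else. Field-names are assumed to
// be ASCII, so slicing the first byte of each part is safe.
func canonicalFieldName(name string) string {
	lower := strings.ToLower(strings.TrimSpace(name))
	if canonical, ok := knownFields[lower]; ok {
		return canonical
	}
	parts := strings.Split(lower, "-")
	for i, p := range parts {
		if p != "" {
			parts[i] = strings.ToUpper(p[:1]) + p[1:]
		}
	}
	return strings.Join(parts, "-")
}

func main() {
	// Known field: registered casing is kept.
	fmt.Println(canonicalFieldName("warc-page-id")) // WARC-Page-ID
	// Unknown extension field: normalized to title-case.
	fmt.Println(canonicalFieldName("x-custom-FIELD")) // X-Custom-Field
}
```

Under these assumptions, removing the `warc-page-id` entry from the table would make the first call return `Warc-Page-Id`, which is the casing loss the commit is meant to avoid.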
Showing 2 changed files with 143 additions and 46 deletions.