PICA: filter out records #137
Some more rules on which records to filter out can be found here:
With this issue resolved in pica-rs, pica-rs can read the filters from a file that also holds comments for documentation.
Reduction of records to level 0 can also be done in this step with filter argument
and use file
Yes
Yes, that's confusing. It's ignored anyway because the filter expression is read from the file. A future version of pica-rs will get rid of this, so the calling syntax would become
For the time being I will not implement the exact syntax, but something that is easier to parse, and I will get back to this feature later. There is an existing feature
@nichtich I am working on this, and it seems that the
I see two solutions:
Which one do you prefer?
Filtering out records should be left to pica-rs as far as possible. It's more reliable to have specialized tools for specific tasks, put together as modules.
Some PICA records should be filtered out because they are internal, deleted, and so on. To keep the filter rules stable, these filters should be defined in the import script and executed with pica-rs.
Example: filter out "mailbox" records as they are only used internally (the example must be run in a script, as bash expands `!` if it is run directly at the command line):
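The original command did not survive in this thread, so the following is not it. It is only a sketch of the quoting issue mentioned above: bash performs history expansion on `!` in interactive shells (also inside double quotes), but not inside single quotes and not, by default, in scripts. The expression `012X.a != "example"` and the temporary file are made-up placeholders, not the actual mailbox rule, and no pica-rs options are assumed here.

```shell
#!/usr/bin/env bash
# Sketch only: FILTER_EXPR is a made-up placeholder, not the rule from this issue.
# Single quotes keep "!" literal, even in an interactive bash with history expansion on.
FILTER_EXPR='012X.a != "example"'

# Store the expression in a file (hypothetical location) so the import script
# can keep the rule in one documented place and reuse it.
FILTER_FILE="$(mktemp)"
printf '%s\n' "$FILTER_EXPR" > "$FILTER_FILE"

# Show what was written; the "!" arrives unchanged.
cat "$FILTER_FILE"
```

Keeping the expression in a file or a script avoids the problem entirely, which fits the suggestion in this thread to put the filters into the import script. In an interactive bash session, `set +H` would also switch history expansion off.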