Automated scheduled cleanup of Paratext projects folder #725

@mmartin9684-sil

Description

Could we run a script over the Paratext > projects folder on a regular schedule to remove PII from the projects, along with other project data that the silnlp tools do not need (graphics, the .hg folder, etc.)?
Removing PII automatically would help us follow good security practices consistently across all projects. Removing unnecessary project artifacts, such as graphics and revision history, would reduce storage demands.
David has a script that he runs occasionally that could be a starting point.
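
To make the request concrete, here is a rough Python sketch of what a scheduled cleanup pass might look like. It is not David's script, and everything in it is an assumption for illustration: the function names `find_removable` and `clean_projects`, the list of graphics extensions, and the idea of passing PII file patterns in as a parameter (this issue does not say which files hold PII, so none are hard-coded). It defaults to a dry run so the list of deletions can be reviewed first.

```python
import shutil
from pathlib import Path

# Assumption: extensions treated as graphics; adjust to what the projects actually contain.
GRAPHICS_SUFFIXES = {".jpg", ".jpeg", ".png", ".tif", ".tiff", ".bmp", ".gif"}
# Directory names not needed by the silnlp tools (.hg is the one named in this issue).
UNNEEDED_DIRS = {".hg"}


def find_removable(project_dir, pii_patterns=()):
    """Return the sorted paths under project_dir that a cleanup would delete."""
    project_dir = Path(project_dir)
    targets = set()
    for path in project_dir.rglob("*"):
        if path.is_dir() and path.name in UNNEEDED_DIRS:
            targets.add(path)
        elif path.is_file() and path.suffix.lower() in GRAPHICS_SUFFIXES:
            targets.add(path)
    # Hypothetical hook: glob patterns for PII files, supplied by the caller.
    for pattern in pii_patterns:
        targets.update(project_dir.rglob(pattern))
    # Skip anything nested inside a directory that is already being removed.
    return sorted(t for t in targets if not any(p in targets for p in t.parents))


def clean_projects(projects_root, pii_patterns=(), dry_run=True):
    """Clean every project folder under projects_root; return the affected paths."""
    affected = []
    for project_dir in sorted(Path(projects_root).iterdir()):
        if not project_dir.is_dir():
            continue
        for target in find_removable(project_dir, pii_patterns):
            if not dry_run:
                if target.is_dir():
                    shutil.rmtree(target)
                else:
                    target.unlink()
            affected.append(target)
    return affected
```

Calling `clean_projects(root)` only reports what would be removed; passing `dry_run=False` performs the deletion. A scheduler such as cron could then run it at whatever interval we settle on.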

Metadata

Labels

enhancement (New feature or request)
pipeline 2: extract (Issue related to extracting parallel corpora)
