Skip to content

Latest commit

 

History

History
83 lines (58 loc) · 2.67 KB

tanagra-clean-entity.adoc

File metadata and controls

83 lines (58 loc) · 2.67 KB

tanagra-clean-entity(1) Manual Page

Name

tanagra-clean-entity - Clean the outputs of all jobs for a single entity, or all entities.

Synopsis

tanagra clean entity [--all] [--dry-run] [--github-dir=<githubDir>] --indexer-config=<name> [--job-executor=<jobExecutor>] [--output-dir=<outputDir>] [--job-filter=<classNames>[, <classNames>…​]]…​ [--names=<names>[,<names>…​]]…​

Description

Clean the outputs of all jobs for a single entity, or all entities.

Options

--all

Include all entities.

Default: false
--dry-run

Do a dry run. No indexing tables will be written and no Dataflow jobs will be kicked off.

Default: false
--github-dir=<githubDir>

Absolute path of the top-level directory for the local clone of the tanagra GitHub repo. Defaults to the current directory.

--indexer-config=<name>

Indexer config name

--job-executor=<jobExecutor>

Executor to use when running jobs: PARALLEL, SERIAL. Recommend serial for debugging, parallel otherwise.

Default: PARALLEL
--job-filter=<classNames>[,<classNames>…​]

Only run jobs with these class names. Specify the class names relative to the IndexingJob class (e.g. bigquery.ValidateDataTypes, not bio.terra.tanagra.indexing.job.bigquery.ValidateDataTypes). Useful for debugging a particular indexing job.

--names=<names>[,<names>…​]

Entity name(s). Comma-separated list if more than one.

--output-dir=<outputDir>

Absolute path to the directory where the html report will be written. Defaults to the current directory.