Commit

Adds spark-submit-parallel to docs
ecurtin committed Oct 17, 2017
1 parent 880777b commit b644e9b
Showing 1 changed file with 34 additions and 6 deletions.
40 changes: 34 additions & 6 deletions docs/_users-guide/spark-submit-config.md
@@ -12,6 +12,7 @@ those spark-submits. The `class` and `jar` parameters are set by the spark-bench


- [Parameters](#parameters)
- [spark-submit-parallel](#spark-submit-parallel)
- [spark-home](#spark-home)
- [spark-args](#spark-args)
- [conf](#conf)
@@ -21,12 +22,31 @@ those spark-submits. The `class` and `jar` parameters are set by the spark-bench

## Parameters

| Name | Required | Description |
| ---------- | ----- | --- |
| spark-home | no | Path to the top level of your Spark installation |
| spark-args | no | Includes master, executor-memory, and other spark-submit arguments |
| conf | no | A series of configuration options for Spark |
| suites-parallel | no | Whether the workload-suites within this spark-submit should run serially or in parallel. Defaults to `false`. |
| Name | Required | Default | Description |
| ------- | -------- | ------- | ----------- |
| spark-submit-parallel | no | false | Controls whether spark-submits are launched in parallel. Defaults to `false`. |
| spark-home | no | $SPARK_HOME | Path to the top level of your Spark installation |
| spark-args | no | master = $SPARK_MASTER_HOST | Includes master, executor-memory, and other spark-submit arguments |
| conf | no | -- | A series of configuration options for Spark |
| suites-parallel | no | false | Whether the workload-suites within this spark-submit should run serially or in parallel. Defaults to `false`. |

## spark-submit-parallel

`spark-submit-parallel` is the only parameter listed here which is set outside of the `spark-submit-config` structure.
If there are multiple spark-submits created by the config file, this boolean option determines whether they are launched
serially or in parallel.
This option defaults to `false`, meaning the spark-submits will run serially.

```hocon
spark-bench = {
spark-submit-parallel = true
spark-submit-config = {
spark-home = //...
}
}
```
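
As a further sketch of the placement rule described above, the two parallelism settings sit at different levels: `spark-submit-parallel` outside `spark-submit-config`, and `suites-parallel` inside it. The values shown are illustrative assumptions, and the structure simply mirrors the example above.

```hocon
spark-bench = {
  // Set outside spark-submit-config: governs how multiple spark-submits are launched
  spark-submit-parallel = true
  spark-submit-config = {
    // Set inside spark-submit-config: governs the workload-suites within this spark-submit
    suites-parallel = false
    // other parameters elided
  }
}
```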

## spark-home

@@ -65,6 +85,14 @@ spark-args = {
master = "mesos://207.184.161.138:7077"
}
```
```hocon
// Since no master is specified here, spark-bench will look for the master
// in the environment variable $SPARK_MASTER_HOST.
spark-args = {
}
```

`master` is the only spark-arg that can also be set through an environment variable. If both `SPARK_MASTER_HOST` and `master` in `spark-args`
are set, the value in the configuration file takes precedence.
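
A minimal sketch of that precedence, assuming the environment also exports `SPARK_MASTER_HOST` (both values here are hypothetical):

```hocon
// Assume the shell environment has SPARK_MASTER_HOST=local[2] (hypothetical value).
// Because master is also set here, the configuration file value is the one used.
spark-args = {
  master = "yarn"
}
```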