Fix typo "queryStr" to "query" for sql
ecurtin committed Oct 4, 2017
1 parent 3b0cfac commit 378773e
Showing 2 changed files with 7 additions and 7 deletions.
6 changes: 3 additions & 3 deletions docs/_workloads/sql.md
@@ -23,7 +23,7 @@ select `0` from input where `0` < -0.9
| name | yes | -- | "sql" |
| input | yes | -- | the input dataset |
| output | no | -- | If users wish to capture the actual results of the SQL query, they can specify an output file here. |
-| queryStr | yes | -- | the sql query to perform. The table name must be "input" as shown in the examples above. |
+| query | yes | -- | the sql query to perform. The table name must be "input" as shown in the examples above. |
| cache | no | false | whether the dataset should be cached after being read from disk |

#### Examples
@@ -33,15 +33,15 @@ select `0` from input where `0` < -0.9
name = "sql"
input = "/tmp/generated-kmeans-data.parquet"
output = "/tmp/sql-query-results.parquet"
-queryStr = "select `0` from input where `0` < -0.9"
+query = "select `0` from input where `0` < -0.9"
}
```

```hocon
{
name = "sql"
input = "/tmp/generated-kmeans-data.parquet"
-queryStr = "select `0` from input where `0` < -0.9"
+query = "select `0` from input where `0` < -0.9"
cache = true
}
```
8 changes: 4 additions & 4 deletions spark-launch/src/test/resources/etc/specific-spark-home.conf
@@ -8,7 +8,7 @@ spark-bench = {
workload-suites = [
{
descr = "Generate a dataset, then take that same dataset and write it out to Parquet format"
-benchmark-output = "/home/dev-user/emily/results-data-gen.csv"
+benchmark-output = "hdfs:///tmp/results-data-gen.csv"
// We need to generate the dataset first through the data generator, then we take that dataset and convert it to Parquet.
parallel = false
workloads = [
@@ -20,22 +20,22 @@ spark-bench = {
},
{
name = "sql"
-queryStr = "select * from input"
+query = "select * from input"
input = "hdfs:///tmp/kmeans-data.csv"
output = "hdfs:///tmp/kmeans-data.parquet"
}
]
},
{
descr = "Run two different SQL queries over the dataset in two different formats"
-benchmark-output = "/home/dev-user/emily/results-sql.csv"
+benchmark-output = "hdfs:///tmp/results-sql.csv"
parallel = false
repeat = 10
workloads = [
{
name = "sql"
input = ["hdfs:///tmp/kmeans-data.csv", "hdfs:///tmp/kmeans-data.parquet"]
-queryStr = ["select * from input", "select `0`, `22` from input where `0` < -0.9"]
+query = ["select * from input", "select `0`, `22` from input where `0` < -0.9"]
cache = false
}
]
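
If other configuration files carry the same `queryStr` typo, the rename this commit makes by hand could be scripted. The following is a minimal sketch, not part of spark-bench: the helper name `rename_query_key` is hypothetical, and it assumes the key always appears at the start of a line (after optional indentation) and is followed by `=` or `:`, as in the files changed above.

```python
import re

# Hypothetical helper, not part of spark-bench.
# Matches `queryStr` only when it is the key at the start of a line,
# so occurrences inside quoted string values are left alone.
_OLD_KEY = re.compile(r"^([ \t]*)queryStr([ \t]*[=:])", flags=re.MULTILINE)


def rename_query_key(conf_text: str) -> str:
    """Return conf_text with every line-leading `queryStr` key renamed to `query`."""
    return _OLD_KEY.sub(r"\1query\2", conf_text)
```

Applied to the text of a file such as `specific-spark-home.conf`, this would rewrite both the single-value and the list-valued `queryStr` entries while leaving the SQL strings themselves untouched. It works on raw text rather than parsing HOCON, so less common spellings of the key (quoted keys, for example) would not be caught.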