Aggregated data into index #506

ashokblend · 2021-10-13T05:20:32Z

ashokblend
Oct 13, 2021

As i see hyperspace supports index creation only if query scans hdfs file. Is there plan to support for below kind of query.

val df = spark.sql("select dim1,dim2,sum(measure1) as measure1 from table group by dim1,dim2")

hs.createIndex(df, IndexConfig("dim1dim2", indexedColumns = Seq("dim1","dim2"), includedColumns = Seq("measure1")))

Answered by sezruby

Oct 13, 2021

There is an issue related to the feature: #186

Since the spark plan of the query can be different depending on spark versions, it's somewhat difficult to support the feature.
Maybe we could try to implement it with a SQL string as a source, like
hs.createIndex(IndexConfig("name", "sql"))

Though it's not a simple item/implementation and not planned yet.

View full answer

sezruby · 2021-10-13T19:08:09Z

sezruby
Oct 13, 2021

There is an issue related to the feature: #186

Since the spark plan of the query can be different depending on spark versions, it's somewhat difficult to support the feature.
Maybe we could try to implement it with a SQL string as a source, like
hs.createIndex(IndexConfig("name", "sql"))

Though it's not a simple item/implementation and not planned yet.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aggregated data into index #506

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment

{{title}}

Select a reply

Aggregated data into index #506

ashokblend Oct 13, 2021

Replies: 1 comment

sezruby Oct 13, 2021

ashokblend
Oct 13, 2021

sezruby
Oct 13, 2021