Verdict TODO

Yongjoo Park edited this page Jul 8, 2017 · 4 revisions

TODO List

In Progress

Supporting Spark SQL and Druid
Making Verdict's metadata location configurable
Letting Verdict read configuration options from a file

Future

Suggesting samples after analyzing query logs
Supporting SSL (for non-Kerberos)
Making confidence interval estimates analytic
Supporting IN (subquery) correctly
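
To illustrate what "analytic" confidence intervals would mean next to bootstrapped ones, here is a minimal sketch for the mean of a sample. It is not Verdict's implementation; the function names `analytic_ci` and `bootstrap_ci`, the 95% level, and the number of bootstrap trials are all assumptions made for the example.

```python
import math
import random
import statistics


def analytic_ci(sample, z=1.96):
    """Closed-form (CLT-based) interval for the mean: mean +/- z * s / sqrt(n)."""
    n = len(sample)
    mean = statistics.fmean(sample)
    half_width = z * statistics.stdev(sample) / math.sqrt(n)
    return mean - half_width, mean + half_width


def bootstrap_ci(sample, trials=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean.

    Resamples the data with replacement `trials` times and reads the
    interval off the empirical distribution of the resampled means.
    """
    rng = random.Random(seed)
    n = len(sample)
    means = sorted(
        statistics.fmean(rng.choices(sample, k=n)) for _ in range(trials)
    )
    lower = means[int(trials * alpha / 2)]
    upper = means[int(trials * (1 - alpha / 2)) - 1]
    return lower, upper


if __name__ == "__main__":
    data_rng = random.Random(42)
    sample = [data_rng.gauss(100.0, 15.0) for _ in range(500)]
    print("analytic :", analytic_ci(sample))
    print("bootstrap:", bootstrap_ci(sample))
```

The analytic version needs a single pass over the sample, while the bootstrap version costs one pass per trial; that cost difference is the usual motivation for preferring closed-form estimates where they are available.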

Finished

Adding confidence intervals by bootstrapping: I have pushed the updates to the dev repo and will merge them into the main repository along with the count-distinct feature.
Supporting COUNT(DISTINCT *)
1. Added universe samplers: by default, universe samples are built for the ten highest-cardinality columns.
2. Currently working on using stratified samples for low-cardinality columns.
3. If no appropriate samples are available, Verdict will fall back to the native HyperLogLog (HLL) implementation provided by Impala, Spark SQL, etc.
Supporting Kerberos
1. JDBC connection over Kerberos confirmed; support will be added shortly.
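
The universe-sample approach to count-distinct listed above can be sketched as follows. This is a generic illustration of hash-based (universe) sampling, not Verdict's code: the helper names, the use of MD5 as the hash, and the 10% sampling ratio are assumptions chosen for the example.

```python
import hashlib


def in_universe(value, ratio):
    """Return True if the value's hash falls in the first `ratio` of hash space.

    The decision depends only on the value, so every row sharing a value
    is either kept or dropped together.
    """
    digest = hashlib.md5(str(value).encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64  # uniform in [0, 1)
    return bucket < ratio


def universe_sample(rows, column, ratio):
    """Keep the rows whose `column` value belongs to the sampled universe."""
    return [row for row in rows if in_universe(row[column], ratio)]


def estimate_count_distinct(sample, column, ratio):
    """Scale the distinct count observed in the sample back up by 1 / ratio."""
    return len({row[column] for row in sample}) / ratio


if __name__ == "__main__":
    # Hypothetical table: 20,000 rows over 5,000 distinct user ids.
    rows = [{"user_id": i % 5000} for i in range(20000)]
    sample = universe_sample(rows, "user_id", ratio=0.1)
    print("rows in sample:", len(sample))
    print("estimated distinct:", estimate_count_distinct(sample, "user_id", 0.1))
```

Because a value is sampled as a whole, duplicates inside the sample do not distort the estimate, which is why this kind of sample suits high-cardinality columns; for low-cardinality columns a small ratio could miss most values entirely, which is consistent with the stratified-sample item above.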