This is a collaberate project between InitialDLab and Huawei, proposed to integrate functionalities from both Simba and CarbonData so that they can work together as a unfied spatial-temporal analysis system. It is supposed to support efficient spatial analytics query on carbon formatted data and peristing Simba's index structure through CarbonData onto distributed file systems.
- Support native connection from Simba to CarbonData.
- CarbonData formated persistent spatial index structure built by Simba.
- Spatial predicates pushdown onto CarbonData formatted file.
Notes: For a detailed integration plan, please refer to the wiki page of this project.
In particular, we are now using Simba standalone-2.1
version and CarbonData 1.1.1, which are both compatible with Apache Spark 2.1.x.
- Dong Xie: dongx [at] cs [dot] utah [dot] edu
- Yanqing Peng: cosineufo [at] gmail [dot] com
- Feifei Li: lifeifei [at] cs [dot] utah [dot] edu
- Kun Li: jacky [dot] likun [at] huawei [dot] com
- Jihong Ma: Jihong [dot] Ma [at] huawei [dot] com
- Qingqing Zhou: Qingqing [dot] Zhou [at] huawei [dot] com