This repository has been archived by the owner on Feb 16, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 1
GDPR right to be forgotten #153
Labels
Comments
The configuration and connection between hbase - phoenix - trino has been estabilished.
The
The hbase discovery configmap needs to be extented the following way
However, with dbeaver we can't see actual data.
|
For release we need the following issues: stackabletech/trino-operator#331 |
We have row level deletes in the trino iceberg demo which demonstrates what we wanted to achieve here, closing |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Scenario:
Event data from your E-Commerce Webshop is stored in hbase.
Details of your data and master data is stored somewhere else.
Questions around this are usual:
Therefore, analyzing the event data can result in an additional ETL job and using up more space. Since event data can get very large you may need to aggregate the data and thus lose details.
Another challenge in context of event data from a webshop results from the GDPR context
A customer enforces his right to be forgotten and wants all his data deleted.
With hbase, you have access to a single row. Thus, a company is able to quickly find the customer, the events connect with the customer and master data.
Customers won't store all there data in hbase. Thus they need to query different sources to get a big picture of the situation. This demo will try to connect event data stored in hbase with data stored in s3 to enrich the events (data federation).
As of now, we need to introduce ACL on the hbase side. Therefore, we can not execute the deletion query. This will be possible with a future enhancement of the demo.
Another future development could be modeling the data as a data vault and applying data warehouse concepts.
Tasks:
The text was updated successfully, but these errors were encountered: