
[Anonymizer] Add ability to anonymize data in reports #8

Open
roma-glushko opened this issue Sep 11, 2019 · 0 comments


roma-glushko commented Sep 11, 2019

Description

Access log anonymization would be a very helpful feature to have in Tango. Access logs often need to be passed or shown to third parties, but it is not always possible to expose customer/visitor data.

Expected behavior

A new CLI command that processes the access log according to the anonymizer parameters passed to it and saves the result in the same raw access log format (not CSV, as Tango normally does).

Benefits

  • near-zero effort to anonymize access log data
  • ability to share access log data with third parties without fear of legal issues

Additional information

The following data seems to be useful to anonymize:

  • visitor IPs
  • access URIs
  • user agents
  • timezone

It may also be useful to add the ability to replace specific strings in the access logs (such as vendor names that are easy to google).
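As a rough illustration of the string replacement idea, here is a minimal Python sketch. It is not Tango code: the function name `redact_terms` and the shape of the replacement table are assumptions made for this example only.

```python
import re


def redact_terms(line: str, replacements: dict[str, str]) -> str:
    """Replace every configured term in a log line with its stand-in.

    Matching is case-insensitive, and the term is escaped so that it is
    treated as a literal string rather than as a regular expression.
    """
    for term, stand_in in replacements.items():
        # A function is used as the replacement so that backslashes in
        # the stand-in are not interpreted as regex group references.
        line = re.sub(
            re.escape(term),
            lambda _match, s=stand_in: s,
            line,
            flags=re.IGNORECASE,
        )
    return line
```

For example, `redact_terms('GET /brands/AcmeGlass/windows HTTP/1.1', {"acmeglass": "vendor1"})` would rewrite the vendor name while leaving the rest of the line intact.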

Also, the anonymized data should reflect visitor actions as closely as possible to the original file. This means it would be great to anonymize each IP, URI, and User-Agent to the same value consistently throughout the whole access log.

Example 1. If we anonymize IP 1.1.1.1 to 250.2.3.50, then 250.2.3.50 should be used everywhere we need to hide IP 1.1.1.1.

Example 2. If we want to hide the URI "/automobile-windows" and anonymize it to "/category100", then "/category100" should be used everywhere the original URI appears.

This strategy preserves the relationships between entries in the access log, so the file can be passed to other parties without losing them.
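The consistent mapping described above could be sketched roughly as follows. This is only an illustration in Python, not a proposed Tango implementation: the class name `ConsistentAnonymizer`, the choice of the 10.0.0.0/8 range for replacement IPs, and the `/categoryN` naming scheme are all assumptions made for the example.

```python
import ipaddress


class ConsistentAnonymizer:
    """Maps each original value to one replacement and reuses it afterwards."""

    def __init__(self) -> None:
        self._ips: dict[str, str] = {}
        self._uris: dict[str, str] = {}

    def ip(self, original: str) -> str:
        # The first time an IP is seen, allocate the next address after
        # 10.0.0.0 (assumed range); later lookups return the same value.
        if original not in self._ips:
            base = ipaddress.IPv4Address("10.0.0.0")
            self._ips[original] = str(base + len(self._ips) + 1)
        return self._ips[original]

    def uri(self, original: str) -> str:
        # The same idea for URIs: one stable placeholder per original URI.
        if original not in self._uris:
            self._uris[original] = f"/category{len(self._uris) + 1}"
        return self._uris[original]


anonymizer = ConsistentAnonymizer()
first = anonymizer.ip("1.1.1.1")
second = anonymizer.ip("8.8.8.8")
again = anonymizer.ip("1.1.1.1")  # same replacement as `first`
```

An in-memory table like this keeps the mapping stable within one run only. Keeping it stable across several log files would need the table to be persisted or derived from a keyed hash, which is outside the scope of this sketch.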
