-
Notifications
You must be signed in to change notification settings - Fork 114
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Detect tracking parameters in URLs #1998
Conversation
I'm unsure about that. As OSM is not cc @jocelynj |
Are you sure about the license? I see on https://github.com/duckduckgo/privacy-configuration/blob/main/LICENSE that it is using Apache license v2, which is an open-source license. CC BY-NC-SA 4.0 seems to be only about DuckDuckGo logos and marks. |
It is not the same as here https://github.com/duckduckgo/privacy-configuration/tree/main#licensing |
So, we have two different information about license :( It would be better to ask which license is applied, and if CC BY-NC-SA is confirmed, to ask for explicit authorization. In the meantime, we could limit to utm_* variables, which can be found in a free license here: https://en.wikipedia.org/wiki/UTM_parameters |
Drafted an email on Matrix. If that's fine, we could leave it for a few weeks on draft until we get a confirmation? |
b3cfcef
to
be2a799
Compare
be2a799
to
eb842c5
Compare
New implementation using the tracking parameter list from https://github.com/mpchadwick/tracking-query-params-registry is ready :) |
Look ok for me. Thank you. |
Implements #1950
This PR detects tracking parameters in URLs, such as fbclid or utm_campaign, and strips them from the URL.
It uses the list of https://github.com/duckduckgo/privacy-configuration/blob/main/features/tracking-parameters.json, which is CC BY-NC-SA 4.0 licensed. Based upon my understanding, we can use this for the detection as long as we refer to its origin (viaresource=*
) as we're not commercial (but mind, I'm not a legally educated person).As the easiest way to parse the URL was to use urlsplit, I also added a warning (class 30933) for broken URLs that give a ValueError in that function. These are however pretty rare, so I suspect it'll barely detect something. So if you prefer, I can also skip over those.