-
Notifications
You must be signed in to change notification settings - Fork 250
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Include URL and page content in the Topics classifier features #118
Comments
This probably needs a separate permission. In many cases the page content or path can be sensitive when the domain is not. (It's possible that a user would be perfectly fine with having the |
Exposing content beyond the origin to the API is tricky from a privacy perspective. Cross-origin iframes today do not have access to the page title or URL or page contents, just the origin. Giving Topics access to that data means it's possible for these third-parties to then learn topics inferred from content they couldn't otherwise access. It's quite limited data, but enough that it gives pause and would likely need some mitigations to protect against abuse. As such, closing for now as we don't intend to do this in the near future. |
@jkarlin it seems the concern you express only applies in a on by default state. By making access to the extra information opt-in via permissions, you wouldn't allow the cross-origin iframe access to information it couldn't gather today through, for example, an opt-in post message orchestrated by the publisher. |
#224 has continued discussion on a possible opt-in mechanism. |
The Topics classifier currently uses only the page hostname to define the corresponding topics.
This leads to large sites, with diverse content, getting a very generic topic with low advertising value.
Taking into account the URL and the content of the website as features in the classifier will allow for much more accurate classification of websites’ pages and shall improve the signaling within Topics.
The text was updated successfully, but these errors were encountered: