Analyze traffic to see if Google ever sends Google-Extended #13

Open
davidmytton opened this issue Sep 13, 2024 · 3 comments

Comments

@davidmytton

Google's AI crawler is Google-Extended

Google-Extended is a standalone product token that web publishers can use to manage whether their sites help improve Gemini Apps and Vertex AI generative APIs, including future generations of models that power those products. Google-Extended does not impact a site's inclusion or ranking in Google Search.

User agent token: Google-Extended

See the full list at https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers
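As a rough illustration of how a product token like this is used, here is a minimal sketch with Python's standard `urllib.robotparser`. The robots.txt content and the `example.com` URL are made up for the example, and `is_allowed` is a hypothetical helper, not part of any project discussed here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: opt out of Google-Extended, allow everything else.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, token: str, url: str) -> bool:
    """Return True if the given robots.txt lets `token` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(token, url)


# The token only matters when evaluating robots.txt rules.
google_extended_allowed = is_allowed(
    ROBOTS_TXT, "Google-Extended", "https://example.com/page"
)
googlebot_allowed = is_allowed(ROBOTS_TXT, "Googlebot", "https://example.com/page")
```

With this file, the `Google-Extended` group blocks that token while `Googlebot` falls through to the wildcard group, which matches the statement above that the token does not affect inclusion in Google Search.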

@blaine-arcjet
Contributor

It looks like ai.robots.txt includes this, so it should also be resolved by #3.

@blaine-arcjet self-assigned this on Sep 16, 2024
@blaine-arcjet
Contributor

The Google docs state:

Google-Extended doesn't have a separate HTTP request user agent string. Crawling is done with existing Google user agent strings; the robots.txt user-agent token is used in a control capacity.

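This means Google-Extended only has an effect as a robots.txt token, similar to Applebot-Extended. It never appears in the User-Agent header of a request, so it cannot be detected from traffic.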
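One way to check this against real traffic is to scan logged User-Agent values for the token. The following is only a sketch: `count_tokens` is a hypothetical helper, the sample strings are illustrative, and a real analysis would read User-Agent values from whatever request logs are available.

```python
from collections import Counter

# Tokens to look for in logged User-Agent headers (case-insensitive).
TOKENS = ("Google-Extended", "Googlebot")


def count_tokens(user_agents, tokens=TOKENS):
    """Count how many User-Agent strings contain each token."""
    counts = Counter({token: 0 for token in tokens})
    for user_agent in user_agents:
        lowered = user_agent.lower()
        for token in tokens:
            if token.lower() in lowered:
                counts[token] += 1
    return counts


# Illustrative sample only; replace with User-Agent values from real logs.
SAMPLE_USER_AGENTS = [
    "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)",
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
]

counts = count_tokens(SAMPLE_USER_AGENTS)
```

If the documentation quoted above is accurate, the `Google-Extended` count should stay at zero across real traffic, while the existing Google user agent strings continue to show up.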

@davidmytton Does this mean we should categorize all of the Google crawler products as AI?

@davidmytton
Author

@davidmytton Does this mean we should categorize all of the Google crawler products as AI?

Started a discussion: #16 (comment)

@blaine-arcjet changed the title from "Add Google-Extended AI crawler" to "Analyze traffic to see if Google ever sends Google-Extended" on Sep 18, 2024
@blaine-arcjet removed their assignment on Sep 18, 2024