René's URL Explorer Experiment


Title: Google Crawler (User Agent) Overview | Google Crawling Infrastructure  |  Crawling infrastructure  |  Google for Developers

Open Graph Title: Google Crawler (User Agent) Overview | Google Crawling Infrastructure  |  Crawling infrastructure  |  Google for Developers

Description: Understand the technical properties of Google crawlers and fetchers, including supported transfer protocols, caching, and file size limits.

Open Graph Description: Understand the technical properties of Google crawlers and fetchers, including supported transfer protocols, caching, and file size limits.

Opengraph URL: https://developers.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers

direct link

Domain: www.google.com


Hey, it has json ld scripts:
  {
    "@context": "https://schema.org",
    "@type": "BreadcrumbList",
    "itemListElement": [{
      "@type": "ListItem",
      "position": 1,
      "name": "Crawling infrastructure",
      "item": "https://developers.google.com/crawling"
    },{
      "@type": "ListItem",
      "position": 2,
      "name": "Google Crawler (User Agent) Overview | Google Crawling Infrastructure",
      "item": "https://developers.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers"
    }]
  }
  

google-signin-client-id721724668570-nbkv1cfusk7kk4eni4pjvepaus73b13t.apps.googleusercontent.com
google-signin-scopeprofile email https://www.googleapis.com/auth/developerprofiles https://www.googleapis.com/auth/developerprofiles.award https://www.googleapis.com/auth/devprofiles.full_control.firstparty
og:site_nameGoogle for Developers
og:typewebsite
theme-color#fff
NoneIE=Edge
og:imagehttps://www.gstatic.com/devrel-devsite/prod/v11431966d26d9f049ef61662c2b798f1cdee8af320f1ba0f77a43eee64301d60/developers/images/opengraph/white.png
og:image:width1200
og:image:height675
og:localeen
twitter:cardsummary_large_image

Links:

Skip to main content http://www.google.com/mobile/adsbot.html#main-content
Crawling infrastructure https://developers.google.com/crawling
Home https://developers.google.com/crawling
Docs https://developers.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers
Crawling infrastructure https://developers.google.com/crawling
Home http://www.google.com/crawling
Docs http://www.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers
Introhttp://www.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers
About Google's web crawlinghttp://www.google.com/crawling/docs/about-crawling
Verify requests from Googlehttp://www.google.com/crawling/docs/crawlers-fetchers/verify-google-requests
Authenticate requests with Web Bot Auth (experimental)http://www.google.com/crawling/docs/crawlers-fetchers/web-bot-auth
Reduce Google's crawl ratehttp://www.google.com/crawling/docs/crawlers-fetchers/reduce-crawl-rate
Create and submit a robots.txt filehttp://www.google.com/crawling/docs/robots-txt/create-robots-txt
How Google interprets the robots.txt specificationhttp://www.google.com/crawling/docs/robots-txt/robots-txt-spec
Update your robots.txt filehttp://www.google.com/crawling/docs/robots-txt/submit-updated-robots-txt
List of useful robots.txt ruleshttp://www.google.com/crawling/docs/robots-txt/useful-robots-txt-rules
Optimize your crawl budgethttp://www.google.com/crawling/docs/crawl-budget
Myths about crawlinghttp://www.google.com/crawling/docs/myths-about-crawling
Improve crawling of faceted navigation URLshttp://www.google.com/crawling/docs/faceted-navigation
Common crawlershttp://www.google.com/crawling/docs/crawlers-fetchers/google-common-crawlers
Special case crawlershttp://www.google.com/crawling/docs/crawlers-fetchers/google-special-case-crawlers
User-triggered fetchershttp://www.google.com/crawling/docs/crawlers-fetchers/google-user-triggered-fetchers
APIs-Googlehttp://www.google.com/crawling/docs/crawlers-fetchers/apis-user-agent
Feedfetcherhttp://www.google.com/crawling/docs/crawlers-fetchers/feedfetcher
Googlebothttp://www.google.com/search/docs/crawling-indexing/googlebot
Google Read Aloudhttp://www.google.com/crawling/docs/crawlers-fetchers/read-aloud-user-agent
HTTP status codeshttp://www.google.com/crawling/docs/troubleshooting/http-status-codes
Network and DNS errorshttp://www.google.com/crawling/docs/troubleshooting/dns-network-errors
Changeloghttp://www.google.com/crawling/docs/changelog
Home https://developers.google.com/
Crawling infrastructure https://developers.google.com/crawling
Docs https://developers.google.com/crawling/docs/crawlers-fetchers/overview-google-crawlers
automatically discover and scan websiteshttp://www.google.com/search/docs/fundamentals/how-search-works#crawling
wgethttps://www.gnu.org/software/wget/
updates to our documentationhttp://www.google.com/crawling/docs/changelog
Common crawlershttp://www.google.com/crawling/docs/crawlers-fetchers/google-common-crawlers
Googlebothttp://www.google.com/search/docs/crawling-indexing/googlebot
Special-case crawlershttp://www.google.com/crawling/docs/crawlers-fetchers/google-special-case-crawlers
User-triggered fetchershttp://www.google.com/crawling/docs/crawlers-fetchers/google-user-triggered-fetchers
Google Site Verifierhttps://support.google.com/webmasters/answer/9008080
HTTP/2https://en.wikipedia.org/wiki/HTTP/2
can send a message to the Crawling teamhttps://www.google.com/webmasters/tools/googlebot-report
RFC959https://datatracker.ietf.org/doc/html/rfc959
RFC4217https://datatracker.ietf.org/doc/html/rfc4217
gziphttps://en.wikipedia.org/wiki/Gzip
deflatehttps://en.wikipedia.org/wiki/Deflate
Brotli (br)https://en.wikipedia.org/wiki/Brotli
like Googlebothttp://www.google.com/search/docs/crawling-indexing/googlebot
reduce the crawl ratehttp://www.google.com/crawling/docs/crawlers-fetchers/reduce-crawl-rate
HTTP response codehttp://www.google.com/search/docs/crawling-indexing/http-network-errors
HTTP caching standardhttps://httpwg.org/specs/rfc9111.html
required by the HTTP standardhttps://www.rfc-editor.org/rfc/rfc9110.html#section-13.1.3
ETaghttps://www.rfc-editor.org/rfc/rfc9110#name-etag
HTTP Caching standardhttps://httpwg.org/specs/rfc9111.html
ETaghttps://www.rfc-editor.org/rfc/rfc9110#name-etag
If-None-Matchhttps://www.rfc-editor.org/rfc/rfc9110#name-if-none-match
HTTP Caching standardhttps://httpwg.org/specs/rfc9111.html
HTTP standardhttps://www.rfc-editor.org/rfc/rfc9110.html
max-age field of the Cache-Control response headerhttps://www.rfc-editor.org/rfc/rfc9111.html#name-max-age-2
Last-Modifiedhttps://www.rfc-editor.org/rfc/rfc9110#name-last-modified
If-Modified-Sincehttps://www.rfc-editor.org/rfc/rfc9110#name-if-modified-since
verify Google's crawlers and fetchershttp://www.google.com/crawling/docs/crawlers-fetchers/verify-google-requests
Creative Commons Attribution 4.0 Licensehttps://creativecommons.org/licenses/by/4.0/
Apache 2.0 Licensehttps://www.apache.org/licenses/LICENSE-2.0
Google Developers Site Policieshttps://developers.google.com/site-policies
Blog http://googledevelopers.blogspot.com
Bluesky https://goo.gle/3FReQXN
Instagram https://www.instagram.com/googlefordevs/
LinkedIn https://www.linkedin.com/showcase/googledevelopers/
X (Twitter) http://twitter.com/googledevs
YouTube http://www.youtube.com/user/GoogleDevelopers
Google Developer Program http://www.google.com/program
Google Developer Groups http://www.google.com/community
Google Developer Experts http://www.google.com/community/experts
Accelerators http://www.google.com/community/accelerators
Google Cloud & NVIDIA http://www.google.com/community/nvidia
Google API Console http://console.developers.google.com
Google Cloud Platform Console http://console.cloud.google.com
Google Play Console http://play.google.com/apps/publish
Firebase Console http://console.firebase.google.com
Actions on Google Console http://console.actions.google.com
Cast SDK Developer Console http://cast.google.com/publish
Chrome Web Store Dashboard http://chrome.google.com/webstore/developer/dashboard
Google Home Developer Console http://console.home.google.com
https://developers.google.com/
Android http://developer.android.com
Chrome http://developer.chrome.com/home
Firebase http://firebase.google.com
Google Cloud Platform http://cloud.google.com
Google AI http://ai.google.dev/
All products http://www.google.com/products
Terms http://www.google.com/terms/site-terms
Privacy http://policies.google.com/privacy
Manage cookies http://www.google.com/mobile/adsbot.html

Viewport: width=device-width, initial-scale=1


URLs of crawlers that visited me.