Googlebot response-code handling — 200 proceeds, 3xx is followed (up to a chain limit), 4xx (incl. 410) is dropped without wasting crawl budget, 5xx slows or pauses crawl, soft-404 confuses everything
Created 2026-06-25
Summary
Claim: Per Google's documentation, Googlebot handles HTTP response codes as follows:
- 200 — eligible to proceed to rendering/indexing.
- 3xx — followed (Google follows redirect chains up to a limit; long chains waste resources).
- 4xx (incl. 410) — dropped from crawl. Google explicitly notes 4xx do not "waste" crawl budget. 404/410 are the clean way to remove content.
- 5xx / timeouts — Google slows or pauses crawling to avoid overloading the server; sustained 5xx can reduce crawl rate site-wide.
- Soft 404s (200 status on a "not found" page) — confuse the crawler and waste resources; Google advises eliminating them.
Faster server response can let Google crawl more — but Google stresses that simply speeding up low-quality pages won't increase crawling.
Source: Google Search Central documentation.
Confidence: High.
Caveat: Soft-404 is the silent killer on WordPress sites — empty category pages, "no results found" search pages returning 200 — and is a frequent Candid-audit finding.
Related entries
Referenced by (3)
- reference Research brief: the lifecycle of a website in Google Search — from launch to mature standing and the perpetual re-evaluation that follows (June 2026) relates-to
- reference Research brief: the launch-build technical foundation — what the technology must get right before a new site can be found (June 2026) relates-to
- reference Google hard gate — only 200-status pages are queued for rendering; non-200 (4xx/5xx) may skip rendering entirely relates-to