Category 2 — EFFICIENCY / HYGIENE: factors that help Google process the site efficiently with modest effect (NOT differentiators)
Created 2026-06-25
Summary
Claim: Six hygiene factors help Google process the site efficiently. They produce a modest effect at most. They are NOT differentiators.
The six hygiene factors:
- Internal linking & site architecture — discovery effect is solid; PageRank-flow magnitude is real but unquantified — Internal linking & site architecture — discovery + crawl-efficiency effects are SOLID; PageRank-flow magnitude is real but UNQUANTIFIED (Google publishes no weighting; survivorship bias plagues case studies).
- XML sitemaps — aids discovery; doesn't guarantee indexing — XML sitemap — aids discovery; Google: keeping sitemaps current is "adequate" for most sites; does NOT guarantee indexing; semantics in Google sitemap tag semantics — uses
<loc>and<lastmod>(when accurate); openly ignores<changefreq>and<priority>(Illyes: "a bag of noise"). - Crawl budget — over-rated for small sites — cross-link to lifecycle: Google explicitly tells sub-few-thousand-URL sites they DO NOT need to think about crawl budget — thresholds for caring are roughly 1M+ pages updated regularly, or 10k+ pages updated daily, Rule: crawl budget is not a concern for SMB-scale sites — sub-few-thousand URLs do not need to think about it, per Google.
- Core Web Vitals / page experience — real but minor; tiebreaker-class; inert without CrUX data — Mueller — "We've been pretty clear that Core Web Vitals are not giant factors in ranking"; "relevance is still by far much more important"; CWV is tiebreaker-class, Google CWV at launch — brand-new sites usually have NO CrUX field data (popularity/traffic threshold); Mueller confirms the signal "is not used" without sufficient field data.
- Canonicalization, redirects, duplicate-content hygiene — Google docs treat these as processing-efficiency tools, NOT ranking boosts — Canonicalization, redirects, duplicate-content hygiene — Google docs treat these as processing-EFFICIENCY tools, NOT ranking boosts.
- Semantic HTML — modest effect; Google can understand non-perfect HTML — Google AI optimization guide — "not required to have perfectly semantic HTML… Google can understand it"; semantic HTML is good practice with modest effect.
Source: Google Search Central documentation; compass_artifact research synthesis.
Confidence: High on the category sort.
Caveat: Hygiene is worth doing because it's cheap and it removes friction. It is not where ranking is won. Selling hygiene work as a growth strategy misrepresents what it can deliver.
Related entries
Related
- reference Google explicitly tells sub-few-thousand-URL sites they DO NOT need to think about crawl budget — thresholds for caring are roughly 1M+ pages updated regularly, or 10k+ pages updated daily
- reference Internal linking & site architecture — discovery + crawl-efficiency effects are SOLID; PageRank-flow magnitude is real but UNQUANTIFIED (Google publishes no weighting; survivorship bias plagues case studies)
- reference XML sitemap — aids discovery; Google: keeping sitemaps current is "adequate" for most sites; does NOT guarantee indexing
- reference Canonicalization, redirects, duplicate-content hygiene — Google docs treat these as processing-EFFICIENCY tools, NOT ranking boosts
- reference Google AI optimization guide — "not required to have perfectly semantic HTML… Google can understand it"; semantic HTML is good practice with modest effect
- reference Mueller — "We've been pretty clear that Core Web Vitals are not giant factors in ranking"; "relevance is still by far much more important"; CWV is tiebreaker-class
- reference Google CWV at launch — brand-new sites usually have NO CrUX field data (popularity/traffic threshold); Mueller confirms the signal "is not used" without sufficient field data