Category 2 — EFFICIENCY / HYGIENE: factors that help Google process the site efficiently with modest effect (NOT differentiators)

Summary

Claim: Six hygiene factors help Google process the site efficiently. They produce a modest effect at most. They are NOT differentiators.

The six hygiene factors:

  1. Internal linking & site architecture: discovery and crawl-efficiency effects are SOLID; PageRank-flow magnitude is real but UNQUANTIFIED (Google publishes no weighting; survivorship bias plagues case studies).
  2. XML sitemaps: aid discovery but do NOT guarantee indexing. Google: keeping sitemaps current is "adequate" for most sites. Tag semantics: Google uses <loc> and <lastmod> (when accurate) and openly ignores <changefreq> and <priority> (Illyes: "a bag of noise").
  3. Crawl budget: over-rated for small sites (cross-link to lifecycle). Google explicitly tells sites with fewer than a few thousand URLs that they do NOT need to think about crawl budget. Thresholds for caring are roughly 1M+ pages updated regularly, or 10k+ pages updated daily. Rule: crawl budget is not a concern for SMB-scale sites.
  4. Core Web Vitals / page experience: real but minor; tiebreaker-class; inert without CrUX data. Mueller: "We've been pretty clear that Core Web Vitals are not giant factors in ranking"; "relevance is still by far much more important". CWV at launch: brand-new sites usually have NO CrUX field data (popularity/traffic threshold), and Mueller confirms the signal "is not used" without sufficient field data.
  5. Canonicalization, redirects, duplicate-content hygiene: Google docs treat these as processing-EFFICIENCY tools, NOT ranking boosts.
  6. Semantic HTML: good practice with modest effect. Google AI optimization guide: "not required to have perfectly semantic HTML… Google can understand it".
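
Illustrative sketch (not from the source): items 2 and 3 reduce to two small rules, shown here in Python. The function names, the cadence labels ("rarely", "regularly", "daily"), and the example URL are hypothetical. The thresholds are the approximate figures quoted in item 3, not an official Google API or cut-off. The sitemap builder emits only <loc> and <lastmod>, on the assumption that the dates passed in are accurate.

```python
from datetime import date
from xml.etree import ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def crawl_budget_matters(url_count: int, update_cadence: str) -> bool:
    """Rough rule of thumb using the approximate thresholds from item 3.

    update_cadence is a hypothetical label: "rarely", "regularly", or "daily".
    """
    # Roughly 1M+ pages that are updated regularly (or more often).
    if url_count >= 1_000_000 and update_cadence in ("regularly", "daily"):
        return True
    # Roughly 10k+ pages that are updated daily.
    if url_count >= 10_000 and update_cadence == "daily":
        return True
    # Everything smaller, including SMB-scale sites: not a concern.
    return False


def build_sitemap(pages: list[tuple[str, date]]) -> str:
    """Build a minimal sitemap containing only <loc> and <lastmod>.

    <changefreq> and <priority> are deliberately omitted, since item 2
    notes that Google ignores them.
    """
    urlset = ET.Element("urlset", {"xmlns": SITEMAP_NS})
    for loc, last_modified in pages:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc
        # Only useful when the date reflects a real content change.
        ET.SubElement(url, "lastmod").text = last_modified.isoformat()
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    print(crawl_budget_matters(2_000, "daily"))
    print(build_sitemap([("https://example.com/", date(2024, 1, 15))]))
```

For a site of a few thousand URLs the first function returns False regardless of cadence, which matches the rule in item 3. Keeping the sitemap current is then the only sitemap-related task left.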

Source: Google Search Central documentation; compass_artifact research synthesis.

Confidence: High on the category sort.

Caveat: Hygiene is worth doing because it's cheap and it removes friction. It is not where ranking is won. Selling hygiene work as a growth strategy misrepresents what it can deliver.