Research brief: the launch-build technical foundation — what the technology must get right before a new site can be found (June 2026)

Summary

TL;DR

The named deliverable: Gate / Hygiene / Overclaimed sort

Every major technical build factor lands in exactly one of three categories. Google does not publish numerical weightings for any of them; where magnitude is unknown, it is stated explicitly. No invented precision.

Headline reframings

Schema is eligibility, not ranking (Mueller (Bluesky, April 13, 2025) — "Structured data won't make your site rank better. It's used for displaying the search features listed in Google's documentation", Mueller (December 2021) — "it's fairly rare that you would be able to provide some structured data on a page which gives us unique information that we don't see from the page itself"; Google "won't do anything with structured data that's not visible on a page", Rule: structured data is RICH-RESULT eligibility + entity/clarity + machine-readability — NOT a ranking lever; do not promise a ranking lift to clients). Implement structured data for the rich-result features your pages genuinely map to. Do not promise a ranking lift.

CSR for indexable content is a gamble (Google two-stage processing — crawl raw HTML first; all 200-status pages queued for rendering ("a headless Chromium renders the page and executes the JavaScript") with delays from seconds to hours per Vercel/MERJ, Google rendering — a noindex in the INITIAL HTML is NEVER overridden by JS; the directive is applied as soon as Googlebot sees it; client-side JS cannot undo a server-rendered noindex, Rule: SSR or SSG is the DEFAULT for any content that must be indexed; CSR for indexable content is a GAMBLE whose downside lands hardest on new low-authority sites). The favourable rendering numbers come from high-authority sites (Vercel + MERJ July 2024 rendering study — analyzed 100,000+ Googlebot fetches; 100% HTML pages rendered; median delay 10s, p75 26s, p90 ~3h, p95 ~6h, p99 ~18h; VENDOR INCENTIVE FLAGGED + high-authority test sites); the punishing numbers come from a zero-authority test subdomain (Onely November 2022 experiment (Ziemek Bućko) — brand-new zero-authority test subdomain; JS-folder page 7 took 313 HOURS vs HTML 36 hours (9x slower); first link 52h vs 25h — different stage (discovery, not render) but the high-authority/low-authority contrast is the point). The downside lands hardest on exactly the sites least able to absorb it. Cross-link to the lifecycle brief: Google queues ALL 200-status pages for rendering, JS or not — Splitt: "you don't really see how long it takes us to render, if we render at all, when we render", "Two waves of indexing" — Google's Martin Splitt now calls it an oversimplification; "pretty much every website, when we see them for the first time, goes to rendering" and the waves "play less and less of a role", Google Web Rendering Service (WRS) runs on evergreen Chromium since 2019 — modern JS (ES6+, Web Components, etc.) is supported; previously frozen at Chrome 41.

llms.txt, AI-specific markup, and content chunking do nothing for Google Search, including its generative AI features (Google AI optimization guide (updated June 15, 2026) — llms.txt is "ignored"; AI-specific markup and content chunking are explicitly listed as unnecessary, Rule: do NOT implement llms.txt, AI-specific markup, or content chunking for Google Search — Google explicitly says these do nothing).

Core Web Vitals are tiebreaker-class, not a major lever (Mueller — "We've been pretty clear that Core Web Vitals are not giant factors in ranking"; "relevance is still by far much more important"; CWV is tiebreaker-class, Mueller on chasing perfect Lighthouse / CWV scores — getting "those last few percent… your site's SEO generally won't change because of that", Rule: do NOT chase perfect Lighthouse / Core Web Vitals scores — Mueller: last few percent "won't change" SEO; once in the "good" range, stop). A brand-new site usually has no CrUX field data and the signal does not apply at all (Google CWV at launch — brand-new sites usually have NO CrUX field data (popularity/traffic threshold); Mueller confirms the signal "is not used" without sufficient field data).

Mobile-first means the mobile version IS the indexed version — content, structured data, internal links, metadata must all be present on mobile (Content/structured-data/link parity between mobile and desktop is required — limiting links on the mobile version "can slow down discovery of new pages", Google mobile-first parity — structured data must be present on the MOBILE version; if it exists only on desktop, Google will NOT use it, Google mobile-first parity — titles, descriptions, robots meta must match between mobile and desktop, Rule: mobile parity is MANDATORY — anything missing on mobile (content, links, structured data, metadata) is missing from Google's index; do NOT trim the mobile version).

A stray staging noindex is the most common launch-killer (Google hard gate — when Googlebot sees noindex (meta or X-Robots-Tag), it "will drop that page entirely from Google Search results"; stray staging noindex is the most common launch-killer, Rule: scan for a stray noindex BEFORE launch — meta tag, X-Robots-Tag header, CMS "discourage search engines" toggle — the most common launch-killer).

You cannot force indexing or rank with technical polish alone (Rule: you cannot reliably force Google to index a page — submission tools AID discovery but do not guarantee inclusion or ranking; "Search is never guaranteed" (Mueller)) — see the sister lifecycle brief for the deeper treatment.

Staged playbook

Genuine unknowns

See Genuine unknowns in the launch-build technical foundation — Google publishes no numerical weightings; PageRank-flow magnitude unquantified; render-queue behavior for new low-authority sites under-measured; AI-surface signal weighting undisclosed; CWV thresholds shift: Google publishes no numerical weightings for any technical factor; internal-linking / PageRank-flow magnitude is unquantified; render-queue behavior for new low-authority sites is under-measured; AI-surface signal weighting is not disclosed; CWV thresholds shift.

Source: compass_artifact research document, June 2026. Anchored in Google's documentation, Search Central Blog, on-record statements from John Mueller, Gary Illyes, Martin Splitt, the July 2024 Vercel/MERJ rendering study, and Onely's November 2022 9x JS-discovery experiment.