Skip to main contentSkip to content

llms.txt and Lighthouse audit for WordPress

The llms.txt conversation changed because two Google-adjacent surfaces now create confusion:

  • Google Search guidance says you do not need llms.txt, AI-specific files, Markdown versions, or special markup to appear in generative AI Search features.
  • Chrome Lighthouse’s experimental Agentic Browsing category checks for llms.txt as a machine-readable summary at the domain root.

For WordPress teams, the correct conclusion is not “llms.txt ranks”.

The correct conclusion is:

llms.txt is becoming part of the agentic-readiness conversation, but it remains a guidance surface, not a ranking guarantee or enforcement mechanism.

What the Lighthouse check actually means

The Lighthouse check is narrow.

It asks whether a site exposes a machine-readable summary at the expected location. Chrome’s documentation describes llms.txt as an emerging convention for summarizing a website’s content for LLMs and AI agents.

That makes the file useful as a routing and orientation layer.

It does not prove:

  • that Google Search uses the file for ranking;
  • that AI Overviews or AI Mode consume the file in a special way;
  • that ChatGPT, Claude, Perplexity, Gemini, or other systems use the file;
  • that an answer changed because the file exists;
  • that crawlers obey the file;
  • that the site is agent-ready.

The file can be valuable without being magical.

Why WordPress sites should still care

WordPress sites often expose messy public surfaces:

  • home page;
  • blog posts;
  • categories;
  • tags;
  • author archives;
  • date archives;
  • attachment pages;
  • search pages;
  • WooCommerce product pages;
  • cart and checkout routes;
  • translation paths;
  • plugin-generated pages;
  • documentation and support pages.

A crawler or agent can eventually explore that structure, but the site may not make its priorities obvious. A well-maintained llms.txt can compress the public intent:

  • what the site is;
  • what product or service it offers;
  • which pages are primary;
  • which pages are support or policy surfaces;
  • where the machine should find the best context;
  • what the file does not authorize or prove.

That is useful even if it is not a ranking factor.

A useful llms.txt structure for WordPress

A good WordPress llms.txt should avoid generic filler. It should route machines toward the best public sources.

SectionPurposeWordPress example
Site summaryExplain the site in one short paragraphProduct, service, publication, local business, documentation site
Primary pagesIdentify canonical pagesHome, features, pricing, service hubs, documentation hubs
Policy surfacesClarify usage and governancerobots.txt, AI usage policy, governance files
Source pagesPoint to evidence-rich pagesComparisons, checklists, guides, cases, definitions
Contact/supportProvide escalation and correction routesContact page, support email, documentation
LimitsPrevent over-reading“This file is guidance, not enforcement.”

For Better Robots.txt itself, the file should point to the product page, the plugin download, the audit documentation, governance pages, crawler-specific pages, and the agentic readiness cluster.

The mistake: treating Lighthouse as the goal

A shallow implementation tries to “pass Lighthouse” by adding a minimal file:

txt
# Example.com

This is our website.

That may satisfy a presence check, but it does not help machines understand the site.

A stronger implementation uses llms.txt to express a maintained routing structure:

txt
# Example.com — machine-readable summary

Canonical site: https://example.com/
Primary product: https://example.com/product
AI usage policy: https://example.com/ai-usage-policy
Robots policy: https://example.com/robots.txt
Source pages:
- https://example.com/definition/topic
- https://example.com/compare/product-a-vs-product-b
- https://example.com/case-study/example

Notice: this file provides guidance. It does not override robots.txt, authentication, legal terms, or runtime access controls.

The difference is not formatting. The difference is governance quality.

How Better Robots.txt helps

Better Robots.txt gives WordPress teams a safer workflow for the file because llms.txt should not be treated as an isolated text blob.

The file should be aligned with:

  • robots.txt;
  • AI crawler rules;
  • sitemap declarations;
  • AI usage signals;
  • governance files;
  • public policy pages;
  • source-page architecture;
  • the site’s real edition or product scope.

When these layers contradict each other, machine readers may over-interpret, under-interpret, or route to the wrong pages.

Better Robots.txt helps by making llms.txt part of the same crawl-governance workflow as robots.txt rather than a separate afterthought.

What Better Robots.txt cannot prove

A WordPress plugin cannot prove that every external AI system will:

  • fetch llms.txt;
  • respect the file;
  • treat it as authoritative;
  • change generated answers because of it;
  • rank the site higher;
  • avoid unsupported claims.

Those are external behaviors. Some require logs, prompt tests, surfaced URL tracking, and direct observations.

The plugin’s role is narrower and stronger:

publish a coherent, inspectable, machine-readable guidance layer for WordPress.

1. Audit the current file

Use the llms.txt checker and the free site audit. Check whether the file exists, whether it is reachable, whether it contains real guidance, and whether it contradicts the rest of the crawl policy.

2. Map the source pages

Before writing the file, decide which pages deserve to be listed:

  • product hub;
  • pricing or plan page;
  • documentation hub;
  • AI visibility hub;
  • crawler-specific guides;
  • policy and governance pages;
  • proof or case-study pages;
  • support page.

3. Remove weak routes

Do not list low-value WordPress archives, utility pages, duplicate pages, thin tags, internal search pages, or pages that are not stable sources.

4. Align with robots.txt

Do not point machines toward pages you also disallow without explaining why. A contradiction between llms.txt and robots.txt is worse than a short file.

5. Keep the file updated

A stale llms.txt can mislead machines. Review it when you change services, pricing, product capabilities, crawler posture, documentation routes, or governance files.

FAQ

Is llms.txt mandatory for WordPress?

No. It is optional. It is useful when it is accurate and maintained.

Does the Lighthouse check make llms.txt an SEO ranking factor?

No. Lighthouse can inspect a browser-facing machine-readability signal. That does not make the file a Google Search ranking factor.

Should I add llms.txt just to avoid an audit warning?

No. Add it when it improves the site’s machine-readable routing. Avoid empty placeholders.

Should llms.txt contain every URL?

No. It should list priority surfaces, not replicate the sitemap.

Does llms.txt replace robots.txt?

No. Read robots.txt vs llms.txt for WordPress and robots.txt, llms.txt, and WebMCP.

Official references

For source verification, start with: