Check whether a site has a valid robots.txt before launch. We parse crawl rules, Disallow and Allow paths, Sitemap directives, and Crawl-delay values so you can catch configuration mistakes before they block your pages from search engines.
Reads every User-agent block and lists disallowed and allowed paths.
Checks for Sitemap: directives so crawlers can find your pages.
Returns practical recommendations for what to fix before going live.
Part of the checklist
This checker is part of our startup launch SEO checklist. After robots.txt, the natural next checks are sitemap validation, metadata, Open Graph previews, and favicon setup.
Read the full launch checklist
A robots.txt file lives at the root of your domain (e.g., yourdomain.com/robots.txt) and tells search engine crawlers which pages they may or may not visit. It is the first file most crawlers fetch when they arrive at a new site.
A misconfigured robots.txt can silently block your entire site from Google. The most common mistake is a stale Disallow: / rule left over from staging that never got removed before launch. Referencing your sitemap here also helps crawlers discover all of your pages faster.
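To make the staging mistake concrete, here is a hypothetical robots.txt that is correct for a staging environment but disastrous if shipped to production unchanged:

```
# Fine on staging, a launch-blocker in production:
User-agent: *
Disallow: /
```

A production file would typically allow crawling and point at the sitemap instead, e.g. `Disallow:` (empty) or targeted rules like `Disallow: /admin/`.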
What is a robots.txt file?
A robots.txt file is a plain text file placed at the root of a website (e.g., example.com/robots.txt) that tells search engine crawlers which pages or sections of the site they are allowed or not allowed to visit. It is part of the Robots Exclusion Protocol.
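As a sketch of how these rules behave, Python's standard-library urllib.robotparser can evaluate a robots.txt against specific URLs (the domain and paths below are illustrative, not from any real site):

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, supplied as a list of lines
rules = [
    "User-agent: *",
    "Disallow: /admin/",
    "Allow: /",
]

rp = RobotFileParser()
rp.parse(rules)

# Ask whether a generic crawler ("*") may fetch each URL
blocked = rp.can_fetch("*", "https://example.com/admin/settings")  # False
allowed = rp.can_fetch("*", "https://example.com/pricing")         # True
```

The same parser is what many checkers build on: fetch the file, parse it, then probe the URLs you care about.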
Why does robots.txt matter for SEO?
robots.txt controls what search engines crawl. If important pages are accidentally blocked, they won't appear in search results. It also lets you point crawlers to your sitemap so they can discover all your pages faster.
What does Disallow: / mean in robots.txt?
Disallow: / tells crawlers they are not allowed to access any page on the site. This effectively blocks your entire site from being indexed by search engines. It is almost always a mistake if found on a live production site.
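You can verify this behavior directly with the same standard-library parser (again using an illustrative domain):

```python
from urllib.robotparser import RobotFileParser

rp = RobotFileParser()
rp.parse(["User-agent: *", "Disallow: /"])

# With Disallow: /, every path is blocked, including the homepage
home_ok = rp.can_fetch("*", "https://example.com/")         # False
page_ok = rp.can_fetch("*", "https://example.com/pricing")  # False
```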
Should I add my sitemap to robots.txt?
Yes. Adding a Sitemap: directive to robots.txt is a best practice. It allows all crawlers — not just Googlebot — to discover your sitemap automatically without needing to submit it manually to every search engine.
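Since Python 3.8, urllib.robotparser also exposes any Sitemap: directives it finds, which is one way a checker can confirm the directive is present (example values are hypothetical):

```python
from urllib.robotparser import RobotFileParser

rp = RobotFileParser()
rp.parse([
    "User-agent: *",
    "Allow: /",
    "Sitemap: https://example.com/sitemap.xml",
])

# site_maps() returns the declared sitemap URLs, or None if there are none
sitemaps = rp.site_maps()
```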
Does robots.txt prevent pages from being indexed?
robots.txt prevents crawlers from visiting those pages, but it does not guarantee they won't appear in search results. If other sites link to a disallowed page, search engines may still list it without content. To fully prevent indexing, use a noindex meta tag instead.
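For a page that must stay out of search results, the noindex directive goes in the page's HTML head, and the page must remain crawlable so search engines can actually see the tag:

```html
<meta name="robots" content="noindex">
```

Blocking the same page in robots.txt would be counterproductive here: crawlers that cannot fetch the page never see the noindex instruction.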