Robots.txt Generator
Generate a robots.txt file for your website. Control search engine crawling with an easy visual builder.
Configure your crawling rules below and copy or download the generated robots.txt file.
Crawling Rules
Generated robots.txt
```
User-agent: *
Allow: /
Disallow: /admin
Disallow: /api
Disallow: /login
Sitemap: https://example.com/sitemap.xml
```
What Is robots.txt?
The robots.txt file is a plain text file at your website's root that tells search engine crawlers which pages they're allowed to visit and which they should skip. It's the first file a well-behaved crawler checks before accessing any page on your site.
Think of it as a "staff only" sign for web crawlers. It's a polite request, not a security measure: crawlers choose to respect it. Googlebot and Bingbot follow the rules reliably, while malicious bots and scrapers typically ignore it entirely. Never use robots.txt as your only defence for sensitive content.
Directive Reference
| Directive | Syntax | What It Does |
|---|---|---|
| User-agent | User-agent: * | Specifies which crawler the rules apply to (* = all) |
| Allow | Allow: /public/ | Permits crawling of a specific path |
| Disallow | Disallow: /admin/ | Blocks crawling of a specific path |
| Sitemap | Sitemap: https://…/sitemap.xml | Points crawlers to your XML sitemap |
| Crawl-delay | Crawl-delay: 10 | Requests N seconds between requests (not respected by Google) |
What this means for you: Google ignores Crawl-delay — use Search Console's crawl rate settings instead. Bing and Yandex do respect it. The Sitemap directive is the most important after Allow/Disallow.
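Putting the directives together, a file for a typical small site might look like the sketch below. The paths and sitemap URL are placeholders; adjust them to your own site structure.

```
# Rules for all crawlers
User-agent: *
Allow: /
Disallow: /admin/
Disallow: /api/

# Bing and Yandex honour this; Google does not
Crawl-delay: 10

# Sitemap location (an absolute URL is required)
Sitemap: https://example.com/sitemap.xml
```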
Common robots.txt Patterns
| Scenario | robots.txt | Notes |
|---|---|---|
| Allow everything | `User-agent: *`<br>`Allow: /` | Default for most sites — let crawlers see everything |
| Block everything | `User-agent: *`<br>`Disallow: /` | Staging/dev sites only — never do this in production |
| Block admin paths | `Disallow: /admin/`<br>`Disallow: /api/` | Standard security hygiene — don't expose backend routes |
| Block a specific bot | `User-agent: AhrefsBot`<br>`Disallow: /` | Blocks aggressive SEO crawlers that waste bandwidth |
Common Mistakes
Blocking CSS and JS Files
Google needs to render your pages to understand them. Blocking CSS/JS files in robots.txt prevents rendering and can hurt your rankings. Only block truly private resources.
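If a broad block is catching assets by accident, a narrow Allow can carve them back out. This sketch assumes a hypothetical `/private/` directory that happens to contain CSS and JS files; the `*` and `$` wildcards are supported by Google and Bing but are not part of the original standard.

```
User-agent: Googlebot
# Broad block on a private area...
Disallow: /private/
# ...but explicitly allow renderable assets site-wide
Allow: /*.css$
Allow: /*.js$
```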
Using robots.txt for Security
Disallow doesn't hide pages — it just asks crawlers not to visit them. The URLs are still visible in the file. Use authentication and proper access controls for sensitive content.
Forgetting to Update After Redesign
Site redesigns often change URL structures. If your old robots.txt blocks paths that are now important, those pages won't get crawled. Review robots.txt after every major change.
Confusing Disallow with Noindex
Disallow prevents crawling. Noindex prevents indexing. A page blocked by robots.txt can still appear in search results if other sites link to it. Use noindex meta tags to prevent indexing.
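The two controls live in different places: Disallow goes in robots.txt, while noindex goes in the page's HTML. Crucially, the page must remain crawlable for the tag to work at all.

```html
<!-- In the page's <head>. The page must NOT be blocked in robots.txt,
     or the crawler will never fetch the page and see this tag. -->
<meta name="robots" content="noindex">
```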
AI Crawlers You Should Know About
| Bot Name | Company | User-Agent | What It Does |
|---|---|---|---|
| GPTBot | OpenAI | GPTBot | Crawls content for training ChatGPT models |
| Google-Extended | Google | Google-Extended | Training data for Gemini/Bard AI models |
| CCBot | Common Crawl | CCBot | Open dataset used by many AI companies |
| anthropic-ai | Anthropic | anthropic-ai | Crawls for Claude model training |
| ClaudeBot | Anthropic | ClaudeBot | Web browsing for Claude responses |
To block all AI training crawlers, add a separate User-agent block with Disallow: / for each bot listed above. Blocking search engine crawlers (Googlebot, Bingbot) is a separate decision — those affect your search rankings, not AI training. You can block AI training while keeping your search presence.
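Covering every crawler in the table above, the per-bot blocks look like this; Googlebot and Bingbot are untouched, so search indexing continues as normal.

```
# Block AI training crawlers while leaving search bots alone
User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: ClaudeBot
Disallow: /
```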
Related Tools
Sitemap Generator
Generate the sitemap.xml referenced in your robots.txt.
Meta Tag Generator
Add noindex tags to pages that need more than robots.txt blocking.
Schema Markup Generator
Add structured data to pages that crawlers are allowed to access.
Google SERP Preview
Preview how your crawlable pages appear in search results.
User Agent Parser
Identify the bot user agents you want to control in robots.txt.
HTTP Status Code Lookup
Understand status codes in your crawl logs and error reports.
How to use this tool
1. Enter your sitemap URL and toggle common blocking rules (admin, API, login)
2. Add custom disallow paths for any additional routes to block
3. Copy or download the generated robots.txt file and upload it to your site root
Common uses
- Blocking admin and login pages from search engine crawling
- Preventing API endpoints from appearing in search results
- Setting up robots.txt for new website deployments
- Adding sitemap references for improved search engine discovery
Share this tool