User-agent: Magpie
Disallow: /

User-agent: magpie-crawler
Disallow: /

# Disallow SEO bots
User-agent: AhrefsBot
Disallow: /

User-agent: MJ12bot
Disallow: /

User-agent: SemrushBot
Disallow: /

User-agent: dotbot
Disallow: /

User-agent: rogerbot
Disallow: /

User-agent: Screaming Frog SEO Spider
Disallow: /

User-agent: cognitiveSEO
Disallow: /

User-agent: OnCrawl
Disallow: /

# Disallow AI bots

# Brandwatch - "AI to discover new trends"
User-agent: magpie-crawler
Disallow: /

# webz.io - they sell data for training LLMs.
User-agent: Omgilibot
Disallow: /

# Items below were sourced from darkvisitors.com
# Categories included: "AI Data Scraper", "AI Assistant", "AI Search Crawler", "Undocumented AI Agent"

# AI Search Crawler
# https://darkvisitors.com/agents/amazonbot
User-agent: Amazonbot
Disallow: /

# Undocumented AI Agent
# https://darkvisitors.com/agents/anthropic-ai
User-agent: anthropic-ai
Disallow: /

# AI Search Crawler
# https://darkvisitors.com/agents/applebot
User-agent: Applebot
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/applebot-extended
User-agent: Applebot-Extended
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/bytespider
User-agent: Bytespider
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/ccbot
User-agent: CCBot
Disallow: /

# AI Assistant
# https://darkvisitors.com/agents/chatgpt-user
User-agent: ChatGPT-User
Disallow: /

# Undocumented AI Agent
# https://darkvisitors.com/agents/claude-web
User-agent: Claude-Web
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/claudebot
User-agent: ClaudeBot
Disallow: /

# Undocumented AI Agent
# https://darkvisitors.com/agents/cohere-ai
User-agent: cohere-ai
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/diffbot
User-agent: Diffbot
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/facebookbot
User-agent: FacebookBot
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/google-extended
User-agent: Google-Extended
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/gptbot
User-agent: GPTBot
Disallow: /

# AI Data Scraper
# https://darkvisitors.com/agents/omgili
User-agent: omgili
Disallow: /

# AI Search Crawler
# https://darkvisitors.com/agents/perplexitybot
User-agent: PerplexityBot
Disallow: /

# AI Search Crawler
# https://darkvisitors.com/agents/youbot
User-agent: YouBot
Disallow: /

# Disallow dynamic content
User-agent: *
Disallow: /*?
Allow: /*?searchItem=*
Disallow: /*/archive?searchItem=*
Allow: /css/*?version=*
Allow: /javascript/*?version=*
Crawl-delay: 10

Sitemap: https://www.heinzelnisse.info/sitemap.xml