281 days ago · Tech · 0 comments

I noticed that a large percentage of traffic to my site comes from AI bots. Even though many of them do not comply with robots.txt or ai.txt, I’ve included the following lists in an attempt to block some of them: robots.txt --- layout: null sitemap: false --- User-agent: * Sitemap: https://jawad.ca/sitemap.xml User-agent: CCBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: GPTBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: Google-CloudVertexBot Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: anthropic-ai Disallow: / User-agent: ClaudeBot Disallow: / User-agent: Omgilibot Disallow: / User-agent: Omgili Disallow: / User-agent: FacebookBot Disallow: / User-agent: Diffbot Disallow: / User-agent: DuckAssistBot Disallow: / User-agent: AI2Bot Disallow: / User-agent: Bytespider Disallow: / User-agent: Kangaroo Bot Disallow: / User-agent: PanguBot Disallow: / User-agent: ImagesiftBot Disallow: / User-agent: PerplexityBot Disallow: /…

No comments yet. Log in to reply on the Fediverse. Comments will appear here.