User-agent: BLEXBot* Disallow: / User-agent: blexbot* Disallow: / User-agent: blex* Disallow: / # Yandex User-agent: yandex* Disallow: / # Yandex User-agent: Yandex* Disallow: / # Baidu User-agent: baidu* Disallow: / User-agent: Scooter Disallow: / User-agent: TurnitinBot Disallow: / User-agent: CRAWLER Disallow: / User-Agent: OmniExplorer_Bot Disallow: / User-Agent: dumbot Disallow: / User-Agent: MJ12bot Disallow: / User-agent: LocalcomBot Disallow: / User-agent: seekbot Disallow: / User-agent: psbot Disallow: / User-agent: e-SocietyRobot Disallow: / User-agent: boitho.com-dc Disallow: / User-agent: ConveraCrawler Disallow: / User-agent: FindLinks Disallow: / User-agent: http://www.almaden.ibm.com/cs/crawler Disallow: / User-agent: dotbot Disallow: / Allhref user-agent: AhrefsBot disallow: / User-agent: XoviBot Disallow: / User-agent: dotbot Disallow: / User-agent: spbot Disallow: / # Sorry, wget in its recursive mode is a frequent problem. # Please read the man page and use it properly; there is a # --wait option you can use to set the delay between hits, # for instance. # User-agent: wget Disallow: / # # The 'grub' distributed client has been *very* poorly behaved. # User-agent: grub-client Disallow: / # # Doesn't follow robots.txt anyway, but... # User-agent: k2spider Disallow: / # # Hits many times per second, not acceptable # http://www.nameprotect.com/botinfo.html User-agent: NPBot Disallow: / # A capture bot, downloads gazillions of pages with no public benefit # http://www.webreaper.net/ User-agent: WebReaper Disallow: / User-agent: LocalcomBot Disallow: * User-agent: BecomeBot Crawl-Delay: 45 User-agent: googlebot Disallow: Crawl-Delay: 45 User-agent: msnbot Crawl-Delay: 45 User-agent: bingbot Crawl-Delay: 45 User-agent: Slurp Crawl-Delay: 60 User-agent: YahooSeeker Crawl-delay: 60 User-agent: googlebot-image Crawl-delay: 60 User-agent: Teoma Crawl-delay: 60 User-agent: ia_archiver Crawl-delay: 60 User-agent: Fbot Crawl-delay: 60 User-agent: * Disallow: /shop_image/product/ Disallow: /orphaned_images/ Request-rate: 1/60 Crawl-delay: 50 Sitemap: http://www.purplecactusbooks.com.au/sitemap1.xml.gz Sitemap: http://www.purplecactusbooks.com.au/sitemapindex.xml.gz