Search:
Home  About  Submit Site    
  
 
Web robots (also known as crawlers or spiders) are programs that traverse the Web automatically, and which are used by search engines to index the Web, or part of it.
Sites [ Submit ]
User Agent String - Tool from ASAP Consulting s.r.o. for detailed user agent string analysis using an online form. Includes databases of browsers and robots. The Web Robots Pages - Information on the robots.txt Robots Exclusion Standard and other articles about writing well-behaved Web robots. ACAP - Automated Content Access Protocol - Standard being developed on behalf of content publishers to communicate permissions information more extensively than is the case with robots.txt. Project documents, implementation and background information. Search Engine Robots and Other User Agents - John A. Fotheringham presents data in tabular form on the robots sent by search engines and other sites to read and index Web pages: their origins, names and IP addresses. About Search Indexing Robots and Spiders - Search Tools Consulting explains how the search engine programs called "robots" or "spiders" work, and reviews related sites. Search Engine IP Addresses - Lists IP addresses of search engine spiders. Can be searched by IP address. Also links to resources on spiders. HTTP User Agent Index - An alphabetical list of user agents and the deployer behind them, compiled by Christoph Rüegg. Bots vs Browsers - This large database lists user agents in categories and distinguishes between robots and browsers. List of User-Agents - A searchable database of user-agents with information about their type, purpose and origin. User-Agents.My-Addr.com - Contains a database of user-agents for crawlers, spiders, browsers; tools for user-agent lookup and tools for user-agent string search.
Click [ Submit ] above to Add a New Site, Update a Site, or Remove a Site from this Category.
This directory is made available through a Creative Commons Attribution license from the DMOZ Organization.

© 2025 - Midnight Design Productions, LLC