Appendix I: List of internet robots, crawlers and spiders


The growing use of internet robots, crawlers and spiders can artificially inflate usage statistics. Only genuine, user-driven usage should be reported in COUNTER usage reports. Usage of full-text articles that is initiated by automatic or semi-automatic bulk download tools, such as Quosa or Pubget, should be recorded only when the user has clicked on the downloaded full-text article in order to open it.

Activity generated by internet robots, crawlers and spiders must be excluded from all COUNTER usage reports.

This list of internet robots, crawlers and spiders was published in April 2016 and updated in July 2016. Please note that it has been rationalised: redundant entries have been removed (e.g. entries containing the text ‘bot’, such as msnbot, awbot, bbot and turnitinbot, are now collapsed into a single entry, ‘bot’).

The list is displayed below and is also available at https://github.com/atmire/COUNTER-Robots.

This page will always show the readme and give potential users and contributors of the list more information on how to integrate the list.

 

For further information on regular expression matching, see: http://www.regular-expressions.info/quickstart.html.
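As a rough illustration of how such a list might be applied, the Python sketch below drops usage records whose user agent matches any pattern. The three patterns are a hypothetical excerpt, not the published list, and the record layout, the function names and the choice of case-insensitive matching are all assumptions made for this example; consult the readme in the repository above for the actual integration guidance.

```python
import re

# Hypothetical excerpt for illustration only; the real patterns are
# maintained in the COUNTER-Robots repository.
SAMPLE_PATTERNS = ["bot", "crawl", "spider"]


def compile_patterns(patterns):
    """Compile each pattern once. Case-insensitive matching is an assumption."""
    return [re.compile(p, re.IGNORECASE) for p in patterns]


def is_robot(user_agent, compiled):
    """Return True if any pattern matches anywhere in the user agent string."""
    return any(rx.search(user_agent) for rx in compiled)


def filter_genuine(records, compiled):
    """Keep only records whose user agent matches no robot pattern.

    Each record is assumed to be a dict with a 'user_agent' key.
    """
    return [r for r in records if not is_robot(r["user_agent"], compiled)]


compiled = compile_patterns(SAMPLE_PATTERNS)
records = [
    {"user_agent": "Mozilla/5.0 (Windows NT 10.0) Firefox/47.0"},
    {"user_agent": "msnbot/2.0b"},
    {"user_agent": "TurnitinBot/3.0"},
]
genuine = filter_genuine(records, compiled)
```

Because the single entry ‘bot’ is searched for anywhere in the string, it covers msnbot, turnitinbot and similar agents without listing each one separately.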

Please let us know of any user agents that should be included in this list or to suggest other amendments.

 


 
 
 