If we want to identify ourselves to Omniture as a "spider" through the User-Agent, what works by default?
There is the botDetection option, which can be set, but which user agents are filtered by default?
I see that you can override getBots, but if we don't do that, what is the default? Is it the specific Googlebot User-Agent, or anything that contains "robot" or "spider", etc.?
Thanks!
-Mark
Using the latest version of either the PHP or Java Measurement Library, if you set the botDetection flag to true, the following list contains the default strings that Omniture looks for in the incoming User-Agent to identify the hit as a bot:
googlebot mediapartners yahooysmcm baiduspider msnbot slurp teoma spider heritrix attentio twiceler irlbot fast crawler fastmobilecrawl jumpbot yahooseeker motionbot mediobot chtml generic
If any of the above strings is found in the User-Agent, the hit is classified as a bot and will not be tracked (if botDetection is set to true).
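To make the matching concrete, here is a minimal sketch of a substring check against a few of the default strings. It is not the library's actual code: the function name looksLikeBot and the sample User-Agent values are hypothetical, and the case-insensitive comparison is an assumption rather than documented behavior.

```php
<?php
// Hypothetical helper, NOT part of the Measurement Library. It only
// illustrates "is any bot string contained in the User-Agent?".
function looksLikeBot(string $userAgent, array $botStrings): bool
{
    foreach ($botStrings as $bot) {
        // stripos() is a case-insensitive substring search.
        // Case-insensitivity is an assumption about the real behavior.
        if (stripos($userAgent, $bot) !== false) {
            return true;
        }
    }
    return false;
}

// A subset of the default strings listed above.
$defaults = array('googlebot', 'msnbot', 'slurp', 'spider', 'heritrix');

// A made-up User-Agent that contains "spider" is flagged.
var_dump(looksLikeBot('MyCompany-spider/1.0', $defaults));        // bool(true)

// A typical browser User-Agent matches none of the strings.
var_dump(looksLikeBot('Mozilla/5.0 (Windows NT 10.0) Firefox/120.0', $defaults)); // bool(false)
```

Under these assumptions, any User-Agent that contains "spider" would be treated as a bot, so it would not need to be the exact Googlebot string.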
As you mentioned, you can override the getBots() method if you wish to modify this list. A common approach is to add to the default list rather than replace it. Here is an example in PHP of how to do so:
class OmnitureMeasurementSubClass extends OmnitureMeasurement {
    ...
    protected function getBots() {
        return array_merge(parent::getBots(), array(
            'otherbotname',
            'anotherbotname'
        ));
    }
    ...
}