Filtering fake hits from AVG in Xlogan
Saturday, June 14th, 2008The new version of AVG is causing some problems for web statistics tools such as Xlogan, as it automatically scans Google search results even if the user never visits your site.
This results in artificially increased statistics for your pages. See The Register article for more information.
Fortunately these fake hits are (currently) easy to filter out in Xlogan. Just click Filter, then tick “Raw log text”. Select “doesn’t contain” and enter “;1813″ in the text box.

This filtering works because the scanner currently uses a fixed User-Agent string ”Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1;1813)”, so it is quite easy to detect these fake hits at the moment.
In my case, for this site I found the effect wasn’t too high. Only around 0.01% of the bandwidth used came from these fake AVG hits, and the total number of visitors was skewed again by only 0.01%.
The biggest effect of the fake hits was causing Xlogan to over-estimate the proportion of users using IE6 and Windows XP, as these elements also appear in the fake User-Agent string. On this site, before filtering, Xlogan reported that 63.4% of users were using Windows XP, and 31.6% were using IE6. The actual figures, as calculated when using the filter, were 61.2% for Windows XP and 27.4% for IE6.


RSS Feed