我正在制作自己的网络统计信息脚本。
成功淘汰了许多机器人。问题只有推特,当我立即发布链接时,我看到 20-30次访问我怀疑是 cURL /未知蜘蛛
检测此类访问者的有效方法是什么?我想改善访客统计数据。
P.S。这些访问者不会在Google Analytics中报告/看到,对于机器人我会从统计信息中删除这些用户代理:
<?php
array(
"Butterfly","Twitturls","Me.dium","Twiceler","facebookexternalhit",
"Teoma", "alexa", "froogle", "Gigabot", "inktomi",
"looksmart", "URL_Spider_SQL", "Firefly", "NationalDirectory",
"Ask Jeeves", "TECNOSEEK", "InfoSeek", "WebFindBot", "girafabot",
"crawler", "www.galaxy.com", "Googlebot", "Scooter", "Slurp",
"msnbot", "appie", "FAST", "WebBug", "Spade", "ZyBorg", "rabaz",
"Baiduspider", "Feedfetcher-Google", "TechnoratiSnoop", "Rankivabot",
"Mediapartners-Google", "Sogou web spider", "WebAlta Crawler","TweetmemeBot"
);
?>
由于