Leave a comment (0) 作者:小项-怪物猪

#!/usr/bin/env bash

LANG=en_US.UTF-8 

if [ -n "$2" ] then 

logpath=~/logs/$1/http/access.log.$2 
else 
logpath=~/logs/$1/http/access.log 
fi 

if [ -n "$1" ] then 
for i in baidu Sogou Googlebot yahoo bingbot YandexBot YoudaoBot  do 
spider=`cat $logpath | grep -E -v 'jpg|gif|png|js|css' | grep -E $i | wc -l` 
echo "$i Spider:$spider" 
done
topip=`cat $logpath | grep -E -v 'jpg|gif|png|js|css' | awk '$1 {print $1,$12,$13,$14,$15,$16,$17,$18,$19}' | sort | uniq -c | sort -rn | head -n 10 | awk '{printf "\\n%-8s %-15s %s %s %s %s %s %s %s",$1,$2,$3,$4,$5,$6,$7,$8,$9}'` 

echo "TOP10 IP:$topip" 
else 
echo "Using #./spider_log hosting(20hotel.com) [`date +%F`]." 
fi
分类:FreeBSD | 标签: apache, 日志, shell, 蜘蛛爬行
打劫!!留下你的评论 »

表情 emoticons

:?::razz::sad::evil::!::smile::oops::grin::eek::shock::???::cool::lol::mad::twisted::roll::wink::idea::arrow::neutral::cry::mrgreen:
  • 关闭

[ 搜狗云输入法 ]