Our blocking srapers system have reported us a IP which browse continuously one of our website.
So I have done a grep on the varnishncsa log, but i don’t see it in realtime.
I’ve done a grep on the all day varnishncsa log, i’ve seen:
91.88.187.67 - - [19/May/2010:00:58:14 +0200] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" 91.88.187.67 - - [19/May/2010:00:39:13 +0200] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" 91.88.187.67 - - [19/May/2010:01:25:15 +0200] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" 91.88.187.67 - - [19/May/2010:04:02:24 +0200] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" 91.88.187.67 - - [19/May/2010:04:28:14 +0200] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)"
When i do varnishncsa in realtime with a varnishncsa | grep “/0_2″, i can see the backend traffic:
server1 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server2 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server1 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server2 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server1 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server1 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server2 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" server3 - - [00/Jan/1900:00:00:00 +0000] "POST http://www.ouest-france.fr/0_2 HTTP/1.1" (null) - "http://www.ouest-france.fr/0_2?page_ref=/0_4" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0)" ... ...
Varnishlog contains:
115 SessionClose c pipe 115 ReqStart c 91.88.187.67 4436 1089132167 115 RxRequest c POST 115 RxURL c /0_2 115 RxProtocol c HTTP/1.1 115 RxHeader c x-requested-with: XMLHttpRequest 115 RxHeader c Accept-Language: fr 115 RxHeader c Referer: http://www.ouest-france.fr/0_2?page_ref=/0_4 115 RxHeader c Accept: application/xml, text/xml, */* 115 RxHeader c Content-Type: application/x-www-form-urlencoded 115 RxHeader c x-requested-handler: ajax 115 RxHeader c Accept-Encoding: gzip, deflate 115 RxHeader c User-Agent: Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; ficeLiveConnector.1.3; OfficeLivePatch.0.0) 115 RxHeader c Host: www.ouest-france.fr 115 RxHeader c Content-Length: 13 115 RxHeader c Connection: Keep-Alive 115 RxHeader c Cache-Control: no-cache 115 RxHeader c Cookie: crm_cookieEnabled=1; __utmc=88432901; xtvrn=$61164$; __utma=88432901.1379090940.1266326763.1274179082.1274180897.682; __utmz=88432901.1274169829.676.57.utmcsr=ofmnewsletter|utmccn=une|utmcmd=lettredinformation 115 VCL_call c recv 115 VCL_return c pipe 115 VCL_call c pipe 115 VCL_return c pipe 209 BackendOpen b server1 192.120.1.182 35846 192.120.1.68 80 115 Backend c 209 director_of server1 209 TxRequest b POST 209 TxURL b /0_2 209 TxProtocol b HTTP/1.1 209 TxHeader b x-requested-with: XMLHttpRequest 209 TxHeader b Accept-Language: fr 209 TxHeader b Referer: http://www.ouest-france.fr/0_2?page_ref=/0_4 209 TxHeader b Accept: application/xml, text/xml, */* 209 TxHeader b Content-Type: application/x-www-form-urlencoded 209 TxHeader b x-requested-handler: ajax 209 TxHeader b Accept-Encoding: gzip, deflate 209 TxHeader b User-Agent: Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; GTB6.4; .NET CLR 2.0.50727; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729; OfficeLiveConnector.1.3; OfficeLivePatch.0.0) 209 TxHeader b Host: www.ouest-france.fr 209 TxHeader b Content-Length: 13 209 TxHeader b Connection: Keep-Alive 209 TxHeader b Cache-Control: no-cache 209 TxHeader b Cookie: crm_cookieEnabled=1; __utmc=88432901; xtvrn=$61164$; __utma=88432901.1379090940.1266326763.1274179082.1274180897.682; __utmz=88432901.1274169829.676.57.utmcsr=ofmnewsletter|utmccn=une|utmcmd=lettredinformation 209 TxHeader b X-Forwarded-For: 91.88.187.67 209 TxHeader b X-Varnish: 1089132167 209 TxHeader b X-Forwarded-For: 91.88.187.67 209 TxHeader b whitelisted: 0 209 TxHeader b X-Scraping: 1 209 TxHeader b X-Hostname: 67.187.88-91.rev.gaoland.net 209 BackendClose b server1 115 ReqEnd c 1089132167 1274251253.994797468 1274251254.165796518 0.000020742 0.001982212 0.169016838 115 StatSess c 91.88.187.67 4436 0 1 1 1 0 0 948 0
I’ve searched on google about something like that. What I found is a problem on a box from a french ISP which loop on an url.
But i don’t understand why i can’t see all the client traffic with varnishncsa. Is it due to the “pipe” command ?
I don’t understand why there isn’t http status in the varnishncsa for the few lines i’ve captured.

