internetarchive / liveweb

Liveweb proxy of the Wayback Machine project
https://web.archive.org/
44 stars 13 forks source link

ARC header should contain the remote IP address #8

Closed anandology closed 12 years ago

anandology commented 12 years ago

The header of each ARC record should contain the IP address of the remote host, not the IP address of the machine where it is run.

Here is a test to confirm it.

$ curl -s --location-trusted http://archive.org/download/forum.nos.org-2007/IAH-20110620101010-00000-ubuntu-8080.arc.gz | gzip -cd |  grep '^http://.* [0-9]' | head
http://forum.nos.nl/robots.txt 145.58.28.182 20110620101010 text/plain 2127
http://forum.nos.nl/ 145.58.28.182 20110620101012 text/html 21844
http://www.react.nl/robots.txt 83.96.164.115 20110620101013 text/html 462
http://forum.nos.nl/forum.php/list_topics/1/1000 145.58.28.182 20110620101013 text/html 14815
http://www.react.nl/ 83.96.164.115 20110620101014 text/html 5390
http://forum.nos.nl/forum.php/list_topics/3/1000 145.58.28.182 20110620101014 text/html 88371
http://react.nl/robots.txt 83.96.164.115 20110620101015 text/html 458
http://www.react.nl/images/private/intro_still-play(1).jpg 83.96.164.115 20110620101015 image/jpeg 37147
http://www.google-analytics.com/robots.txt 74.125.79.102 20110620101015 text/plain 363
http://react.nl/js/mootools.js 83.96.164.115 20110620101016 application/x-javascript 23904
nibrahim commented 12 years ago

Will fix that. Thanks!

~noufal http://nibrahim.net.in

I tripped over a hole that was sticking up out of the ground.