<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for Web Archiving at archive.org</title>
	<atom:link href="http://iawebarchiving.wordpress.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://iawebarchiving.wordpress.com</link>
	<description>Internet Archive Web Team</description>
	<lastBuildDate>Tue, 28 Jun 2011 22:03:50 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by adrolli</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-709</link>
		<dc:creator><![CDATA[adrolli]]></dc:creator>
		<pubDate>Tue, 28 Jun 2011 22:03:50 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-709</guid>
		<description><![CDATA[I found no entries in 2010 and 2011 for most of our (well known) websites, in 2011 there are no entries (even Microsoft.com or Apple.com).

Is this a temporary issue or the end of the waybackmachine?

Regards,
Alf]]></description>
		<content:encoded><![CDATA[<p>I found no entries in 2010 and 2011 for most of our (well known) websites, in 2011 there are no entries (even Microsoft.com or Apple.com).</p>
<p>Is this a temporary issue or the end of the waybackmachine?</p>
<p>Regards,<br />
Alf</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by Vitaliy Kuzmin</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-708</link>
		<dc:creator><![CDATA[Vitaliy Kuzmin]]></dc:creator>
		<pubDate>Tue, 28 Jun 2011 11:02:18 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-708</guid>
		<description><![CDATA[How can I force Wayback Machine to archive entire site and all files on it?]]></description>
		<content:encoded><![CDATA[<p>How can I force Wayback Machine to archive entire site and all files on it?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by gokitalo</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-706</link>
		<dc:creator><![CDATA[gokitalo]]></dc:creator>
		<pubDate>Mon, 20 Jun 2011 09:48:44 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-706</guid>
		<description><![CDATA[I do have a fairly urgent one. I&#039;m not sure how long you intend to have the classic interface around, but there are certain pages and sites that only seem to exist in the classic interface. This message board site, for example:

http://pub17.ezboard.com/bschoolforgiftedyoungsters

Which also went by the URL:

http://p082.ezboard.com/bschoolforgiftedyoungsters

When I type these two URLs in the classic interface, archived versions of the site appear, as you can see below:
http://classic-web.archive.org/web/*/http://pub17.ezboard.com/bschoolforgiftedyoungsters

http://classic-web.archive.org/web/*/http://p082.ezboard.com/bschoolforgiftedyoungsters

When I use these same URL with the current interface, however, no archived versions of the site appear. And while the site has changed URLs since then:

http://schoolforgiftedyoungsters.yuku.com

... both the classic and current versions of the interface say that no versions of the page have been archived. Frankly, I&#039;m worried that if the classic interface is removed, all the older versions of this site will disappear. While the message board does continue to exist, a lot of old threads were deleted when EZBoard was hacked in 2005. However, a lot of these deleted threads still exist in the classic interface of the Internet Archive.

If the classic version of the interface is removed, however... I&#039;m worried that all these old message board threads may be lost for good. This is a roleplaying/writing board, and I don&#039;t think anyone who posted there wants to see some of their best work deleted.]]></description>
		<content:encoded><![CDATA[<p>I do have a fairly urgent one. I&#8217;m not sure how long you intend to have the classic interface around, but there are certain pages and sites that only seem to exist in the classic interface. This message board site, for example:</p>
<p><a href="http://pub17.ezboard.com/bschoolforgiftedyoungsters" rel="nofollow">http://pub17.ezboard.com/bschoolforgiftedyoungsters</a></p>
<p>Which also went by the URL:</p>
<p><a href="http://p082.ezboard.com/bschoolforgiftedyoungsters" rel="nofollow">http://p082.ezboard.com/bschoolforgiftedyoungsters</a></p>
<p>When I type these two URLs in the classic interface, archived versions of the site appear, as you can see below:<br />
<a href="http://classic-web.archive.org/web/*/http://pub17.ezboard.com/bschoolforgiftedyoungsters" rel="nofollow">http://classic-web.archive.org/web/*/http://pub17.ezboard.com/bschoolforgiftedyoungsters</a></p>
<p><a href="http://classic-web.archive.org/web/*/http://p082.ezboard.com/bschoolforgiftedyoungsters" rel="nofollow">http://classic-web.archive.org/web/*/http://p082.ezboard.com/bschoolforgiftedyoungsters</a></p>
<p>When I use these same URL with the current interface, however, no archived versions of the site appear. And while the site has changed URLs since then:</p>
<p><a href="http://schoolforgiftedyoungsters.yuku.com" rel="nofollow">http://schoolforgiftedyoungsters.yuku.com</a></p>
<p>&#8230; both the classic and current versions of the interface say that no versions of the page have been archived. Frankly, I&#8217;m worried that if the classic interface is removed, all the older versions of this site will disappear. While the message board does continue to exist, a lot of old threads were deleted when EZBoard was hacked in 2005. However, a lot of these deleted threads still exist in the classic interface of the Internet Archive.</p>
<p>If the classic version of the interface is removed, however&#8230; I&#8217;m worried that all these old message board threads may be lost for good. This is a roleplaying/writing board, and I don&#8217;t think anyone who posted there wants to see some of their best work deleted.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by kevinff</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-704</link>
		<dc:creator><![CDATA[kevinff]]></dc:creator>
		<pubDate>Thu, 09 Jun 2011 17:28:35 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-704</guid>
		<description><![CDATA[Hello,
I&#039;ve searched everywhere but didn&#039;t get any decent information:
We are using whitelisting to whitelist crawlers, eg: for googlebot we verify that the reverse address ends with google.com and that the reverse address resolves back to the IP. Then we can prevent the site from throwing captcha&#039;s and other stuff at googlebot, bingbot, baidu, yandex and others.. While preventing fake bots from passing through our anti-gathering protection.

However it seems that Archive.org/Alexa are using various IPs from Amazon to collect data.. 
Is there a list of IP that we can whitelist? Is there any other way to be sure that some IPs are from Archive.org/Alexa? (i&#039;m not talking only about the user agent, as we&#039;ve found many fake Googlebots).

Thanks for the help]]></description>
		<content:encoded><![CDATA[<p>Hello,<br />
I&#8217;ve searched everywhere but didn&#8217;t get any decent information:<br />
We are using whitelisting to whitelist crawlers, eg: for googlebot we verify that the reverse address ends with google.com and that the reverse address resolves back to the IP. Then we can prevent the site from throwing captcha&#8217;s and other stuff at googlebot, bingbot, baidu, yandex and others.. While preventing fake bots from passing through our anti-gathering protection.</p>
<p>However it seems that Archive.org/Alexa are using various IPs from Amazon to collect data..<br />
Is there a list of IP that we can whitelist? Is there any other way to be sure that some IPs are from Archive.org/Alexa? (i&#8217;m not talking only about the user agent, as we&#8217;ve found many fake Googlebots).</p>
<p>Thanks for the help</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by mariko</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-703</link>
		<dc:creator><![CDATA[mariko]]></dc:creator>
		<pubDate>Sun, 01 May 2011 14:01:33 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-703</guid>
		<description><![CDATA[I see the Advanced Search is gone- will that be back? I&#039;m interested in searching for text rather than URLs.]]></description>
		<content:encoded><![CDATA[<p>I see the Advanced Search is gone- will that be back? I&#8217;m interested in searching for text rather than URLs.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by glennp000</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-702</link>
		<dc:creator><![CDATA[glennp000]]></dc:creator>
		<pubDate>Fri, 22 Apr 2011 20:07:06 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-702</guid>
		<description><![CDATA[I didn&#039;t see any advanced filters by date range on your new interface.  (I&#039;m interested in entries in the last 12 months, but not just the latest) And if you discussed this in your FAQs, I wouldn&#039;t know, because the FAQ link doesn&#039;t go anywhere except redirect to home.]]></description>
		<content:encoded><![CDATA[<p>I didn&#8217;t see any advanced filters by date range on your new interface.  (I&#8217;m interested in entries in the last 12 months, but not just the latest) And if you discussed this in your FAQs, I wouldn&#8217;t know, because the FAQ link doesn&#8217;t go anywhere except redirect to home.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by siplushwguy</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-701</link>
		<dc:creator><![CDATA[siplushwguy]]></dc:creator>
		<pubDate>Mon, 11 Apr 2011 08:34:19 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-701</guid>
		<description><![CDATA[Hello, Internet Archive!

Could you allow browsing archived versions of http://halflifehq.com/ ? Robots.txt blocking you was placed on this site when it got closed and its domain got parked (most domain parking services block crawling to make parked domains unsearchable), and it contained very valuable information for Half-Life community (the most valuable content is videos such as bullchicken360.avi, he360.avi, alieng.avi, alienslave.avi, burnacle.avi, tentacle.avi and xenome.avi) before closing.

Original owners did not block you, Internet Archive (you can check it by browsing archived versions of halflifehq.com/robots.txt, it was 404 in the year 2002, archive of which I want to see.)

And, if you can&#039;t allow browsing the entire site, could you just send us archived versions of the following files?:
http://www.halflifehq.com/files/downloads/avi/bullchicken360.avi !VERY IMPORTANT
http://www.halflifehq.com/files/downloads/avi/he360.avi !VERY IMPORTANT
http://www.halflifehq.com/files/downloads/avi/xenome.avi
http://www.halflifehq.com/files/downloads/avi/alieng.avi
http://www.halflifehq.com/files/downloads/avi/alienslave.avi
http://www.halflifehq.com/files/downloads/avi/burnacle.avi or /files/downloads/avi/barnacle.avi
http://www.halflifehq.com/files/downloads/avi/headcrab.avi
http://www.halflifehq.com/files/downloads/avi/tentacle.avi
And maybe the following too (if they&#039;re not from HL: Further Data):
http://www.halflifehq.com/files/downloads/mp3/half-life1.mp3
http://www.halflifehq.com/files/downloads/mp3/half-life2.mp3

Thanks in advance]]></description>
		<content:encoded><![CDATA[<p>Hello, Internet Archive!</p>
<p>Could you allow browsing archived versions of <a href="http://halflifehq.com/" rel="nofollow">http://halflifehq.com/</a> ? Robots.txt blocking you was placed on this site when it got closed and its domain got parked (most domain parking services block crawling to make parked domains unsearchable), and it contained very valuable information for Half-Life community (the most valuable content is videos such as bullchicken360.avi, he360.avi, alieng.avi, alienslave.avi, burnacle.avi, tentacle.avi and xenome.avi) before closing.</p>
<p>Original owners did not block you, Internet Archive (you can check it by browsing archived versions of halflifehq.com/robots.txt, it was 404 in the year 2002, archive of which I want to see.)</p>
<p>And, if you can&#8217;t allow browsing the entire site, could you just send us archived versions of the following files?:<br />
<a href="http://www.halflifehq.com/files/downloads/avi/bullchicken360.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/bullchicken360.avi</a> !VERY IMPORTANT<br />
<a href="http://www.halflifehq.com/files/downloads/avi/he360.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/he360.avi</a> !VERY IMPORTANT<br />
<a href="http://www.halflifehq.com/files/downloads/avi/xenome.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/xenome.avi</a><br />
<a href="http://www.halflifehq.com/files/downloads/avi/alieng.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/alieng.avi</a><br />
<a href="http://www.halflifehq.com/files/downloads/avi/alienslave.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/alienslave.avi</a><br />
<a href="http://www.halflifehq.com/files/downloads/avi/burnacle.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/burnacle.avi</a> or /files/downloads/avi/barnacle.avi<br />
<a href="http://www.halflifehq.com/files/downloads/avi/headcrab.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/headcrab.avi</a><br />
<a href="http://www.halflifehq.com/files/downloads/avi/tentacle.avi" rel="nofollow">http://www.halflifehq.com/files/downloads/avi/tentacle.avi</a><br />
And maybe the following too (if they&#8217;re not from HL: Further Data):<br />
<a href="http://www.halflifehq.com/files/downloads/mp3/half-life1.mp3" rel="nofollow">http://www.halflifehq.com/files/downloads/mp3/half-life1.mp3</a><br />
<a href="http://www.halflifehq.com/files/downloads/mp3/half-life2.mp3" rel="nofollow">http://www.halflifehq.com/files/downloads/mp3/half-life2.mp3</a></p>
<p>Thanks in advance</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Wayback Machine &amp; Web Archiving Open Thread, April 2011 by yahudeejay</title>
		<link>http://iawebarchiving.wordpress.com/2011/04/07/wayback-machine-web-archiving-open-thread-april-2011/#comment-700</link>
		<dc:creator><![CDATA[yahudeejay]]></dc:creator>
		<pubDate>Fri, 08 Apr 2011 06:51:52 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=181#comment-700</guid>
		<description><![CDATA[I&#039;m still interested and most of all  interested- WHEN CHANCE TO SEE WAYBACK MACHINE RESULTS FOR  www.djsportal.com - JUNE - SEPTEMBER 2009]]></description>
		<content:encoded><![CDATA[<p>I&#8217;m still interested and most of all  interested- WHEN CHANCE TO SEE WAYBACK MACHINE RESULTS FOR  <a href="http://www.djsportal.com" rel="nofollow">http://www.djsportal.com</a> &#8211; JUNE &#8211; SEPTEMBER 2009</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Updated Wayback Machine in Beta Testing by inkdroid &#8250; xhtml, wayback</title>
		<link>http://iawebarchiving.wordpress.com/2011/01/24/updated-wayback-machine-in-beta-testing/#comment-699</link>
		<dc:creator><![CDATA[inkdroid &#8250; xhtml, wayback]]></dc:creator>
		<pubDate>Wed, 09 Mar 2011 23:59:26 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=171#comment-699</guid>
		<description><![CDATA[[...] Internet Archive gave the Wayback Machine a facelift back in January. It actually looks really nice, but I noticed something kinda odd. I was looking [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Internet Archive gave the Wayback Machine a facelift back in January. It actually looks really nice, but I noticed something kinda odd. I was looking [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Updated Wayback Machine in Beta Testing by edsu</title>
		<link>http://iawebarchiving.wordpress.com/2011/01/24/updated-wayback-machine-in-beta-testing/#comment-698</link>
		<dc:creator><![CDATA[edsu]]></dc:creator>
		<pubDate>Wed, 09 Mar 2011 23:46:29 +0000</pubDate>
		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=171#comment-698</guid>
		<description><![CDATA[I ran into some problems with archives XHTML which I documented &lt;a href=&quot;http://inkdroid.org/journal/2011/03/09/xhtml-wayback/&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;. I&#039;d be interested to hear what you think.]]></description>
		<content:encoded><![CDATA[<p>I ran into some problems with archives XHTML which I documented <a href="http://inkdroid.org/journal/2011/03/09/xhtml-wayback/" rel="nofollow">here</a>. I&#8217;d be interested to hear what you think.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
