<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Web Archiving at archive.org</title>
	<atom:link href="http://iawebarchiving.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://iawebarchiving.wordpress.com</link>
	<description>Internet Archive Web Team</description>
	<lastBuildDate>Thu, 17 Dec 2009 05:38:59 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='iawebarchiving.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/657cd9c70983a4faa299bdcbb9475d0c?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Web Archiving at archive.org</title>
		<link>http://iawebarchiving.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://iawebarchiving.wordpress.com/osd.xml" title="Web Archiving at archive.org" />
		<item>
		<title>Archive-It Partnership with LOCKSS</title>
		<link>http://iawebarchiving.wordpress.com/2009/12/17/archive-it-partnership-with-lockss/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/12/17/archive-it-partnership-with-lockss/#comments</comments>
		<pubDate>Thu, 17 Dec 2009 05:38:59 +0000</pubDate>
		<dc:creator>Kate</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=149</guid>
		<description><![CDATA[We should have announced this back in July, but we are still just as excited about it 6 months later, so we wanted to be sure we got the word out.  We are pleased to announce that data harvested through the Archive-It service was successfully re-harvested into a LOCKSS network for preservation. The transfer [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=149&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>We should have announced this back in July, but we are still just as excited about it 6 months later, so we wanted to be sure we got the word out.  We are pleased to announce that data harvested through the Archive-It service was successfully re-harvested into a LOCKSS network for preservation. The transfer was part of a Andrew W. Mellon foundation project with the University of Rochester.</p>
<p>If you are interested in learning more about the Archive-It/LOCKSS partnership, please contact the LOCKSS team lockss-support (at) lockss (dot) org or the Archive-It team (http://www.archive-it.org/public/contact-us)</p>
<p>The Archive-It team would like to partner with additional preservation systems and needs to hear from our partners.  If your institution is interested in participating in a pilot for the preservation system you use, please contact the Archive-It team and let us know. We have done a pilot with iRODS and are in the middle of a test with CONTENTdm.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/149/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/149/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/149/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/149/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/149/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/149/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/149/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/149/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/149/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/149/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=149&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/12/17/archive-it-partnership-with-lockss/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/c82c23bc80037af16c874e764e432141?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Kate</media:title>
		</media:content>
	</item>
		<item>
		<title>Alaska State Library Archiving Governor Palin&#8217;s End of Term Website</title>
		<link>http://iawebarchiving.wordpress.com/2009/07/28/alaska-state-library-archiving-governor-palins-website-at-end-of-term/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/07/28/alaska-state-library-archiving-governor-palins-website-at-end-of-term/#comments</comments>
		<pubDate>Tue, 28 Jul 2009 18:31:00 +0000</pubDate>
		<dc:creator>lorimd</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[History]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=130</guid>
		<description><![CDATA[The Alaska State Library’s collection Alaska Governor/Lt. Governor Web Sites was originally conceived to archive these government websites over time. Alaska Governor Sarah Palin’s resignation announcement earlier this month and the transition of power to Lieutenant Governor Sean Parnell this past Sunday, July 26, 2009 gave the Alaska State Library a great chance to use [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=130&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The Alaska State Library’s collection <a href="http://www.archive-it.org/collections/1200" target="_blank">Alaska Governor/Lt. Governor Web Sites</a> was originally conceived to archive these government websites over time. Alaska Governor Sarah Palin’s resignation announcement earlier this month and the transition of power to Lieutenant Governor Sean Parnell this past Sunday, July 26, 2009 gave the Alaska State Library a great chance to use the crawl on demand feature of Archive-It to preserve information on the announcement and the end of Governor Palin’s term.</p>
<p>By crawling Governor Palin and Lt. Governor Parnell&#8217;s websites on the eve of the transition of power, the Alaska State Library was able to capture information that is now offline.  Once Sarah Palin left office, the <a href="http://gov.state.ak.us/" target="_blank">governor’s website</a> changed to reflect Sean Parnell as governor, and the <a href="http://ltgov.state.ak.us/" target="_blank">lieutenant governor’s website</a><strong> </strong> changed to reflect Craig Campbell as lieutenant governor. The information from former Governor Palin’s website as well as speeches and press releases from Sean Parnell’s time as lieutenant governor are no longer available on the live web. The foresight of the staff of the Alaska State Library and on-demand crawling through Archive-It made it possible to preserve the final changes to these websites before they were taken offline.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/130/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/130/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/130/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/130/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/130/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/130/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/130/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/130/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/130/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/130/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=130&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/07/28/alaska-state-library-archiving-governor-palins-website-at-end-of-term/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/06e7923d36d2d2434037deaa722d9949?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lorimd</media:title>
		</media:content>
	</item>
		<item>
		<title>Join the K-12 Web Archiving Program!</title>
		<link>http://iawebarchiving.wordpress.com/2009/07/22/join-the-k-12-web-archiving-program/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/07/22/join-the-k-12-web-archiving-program/#comments</comments>
		<pubDate>Wed, 22 Jul 2009 20:56:51 +0000</pubDate>
		<dc:creator>waybackmolly</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[History]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=124</guid>
		<description><![CDATA[ 
Apply to be part of the Internet Archive K-12 program, and your school can help to capture and archive today&#8217;s primary source materials on the Web. 
A growing number of individuals and institutions recognize the importance of archiving and preserving the often transitory digital cultural artifacts that are distributed over the Web. But so far, the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=124&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p> </p>
<p><a href="http://www.loc.gov/teachers/newsevents/news/">Apply</a> to be part of the Internet Archive K-12 program, and your school can help to capture and archive today&#8217;s primary source materials on the Web. </p>
<p>A growing number of individuals and institutions recognize the importance of archiving and preserving the often transitory digital cultural artifacts that are distributed over the Web. But so far, the vast majority of decisions about what Web sites will live into the future have been made by adults, and reflect adults&#8217; sensibilities about what constitutes the important records of history. We want and need to hear from students. </p>
<p>The Internet Archive, the Library of Congress and California Digital Library collaborated on a pilot in the spring of 2008 and a full-year program for the 2008/2009 school year, working with a total of 10 elementary, middle and high schools. We are looking to expand this program to new schools in the coming year. You can explore the collections created during the 2008/2009 school year on the Archive-It website at: <a href="http://www.archive-it.org/k12">http://www.archive-it.org/k12/</a>. </p>
<p>Find a complete project description and the brief application here: <a href="http://www.loc.gov/teachers/newsevents/news/">http://www.loc.gov/teachers/newsevents/news/</a>  Apply by <strong>August 14 </strong>for full consideration.</p>
<p> </p>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;"> &lt;a href=&#8217;http://www.loc.gov/teachers/&#8217;&gt;Apply&lt;/a&gt; to be part of the Internet Archive K-12 program, and your school can help to capture and archive today&#8217;s primary source materials on the Web.  &lt;br&gt;&lt;br&gt; A growing number of individuals and institutions recognize the importance of archiving and preserving the often transitory digital cultural artifacts that are distributed over the Web. But so far, the vast majority of decisions about what Web sites will live into the future have been made by adults, and reflect adults&#8217; sensibilities about what constitutes the important records of history. We want and need to hear from students.  &lt;br&gt;&lt;br&gt; The Internet Archive, the Library of Congress and California Digital Library collaborated on a pilot in the spring of 2008 and a full-year program for the 2008/2009 school year, working with a total of 10 elementary, middle and high schools. We are looking to expand this program to new schools in the coming year. You can explore the collections created during the 2008/2009 school year on the &lt;a href=&#8217;http://www.archive-it.org&#8217;&gt;Archive-It&lt;/a&gt; website at: http://www.archive-it.org/k12/.  &lt;br&gt;&lt;br&gt; Find a complete project description and the brief application in the &#8220;Featured Resources&#8221; section at http://www.loc.gov/teachers/. Apply by &lt;b&gt;August 14&lt;/b&gt; for full consideration.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">&lt;/p</div>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/124/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/124/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/124/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/124/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/124/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/124/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/124/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/124/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/124/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/124/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=124&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/07/22/join-the-k-12-web-archiving-program/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5382cc685df6cff2892d8560e05eb1e8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">waybackmolly</media:title>
		</media:content>
	</item>
		<item>
		<title>Archive-It and LOCKSS Interoperability!</title>
		<link>http://iawebarchiving.wordpress.com/2009/07/21/archive-it-and-lockss-interoperability/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/07/21/archive-it-and-lockss-interoperability/#comments</comments>
		<pubDate>Tue, 21 Jul 2009 19:24:19 +0000</pubDate>
		<dc:creator>waybackmolly</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=120</guid>
		<description><![CDATA[The Archive-It team is excited to announce that a successful transfer of Archive-It data moved from the Internet Archive data center into the LOCKSS network.  The transfer was part of a Andrew W. Mellon foundation project with the University of Rochester.   
We are excited to be able to provide these and other preservation options to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=120&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The <a href="http://www.archive-it.org">Archive-It</a> team is excited to announce that a successful transfer of Archive-It data moved from the Internet Archive data center into the LOCKSS network.  The transfer was part of a <a href="http://www.mellon.org/">Andrew W. Mellon foundation</a> project with the <a href="http://www.rochester.edu/">University of Rochester</a>.   </p>
<p>We are excited to be able to provide these and other preservation options to <a href="http://www.archive-it.org/public/partners">Archive-It partners</a> as we increase the interoperability of the Archive-It service.  If you are interested in learning more, please <a href="http://www.archive-it.org/public/contact-us">contact the Archive-It team</a>. More information about the LOCKSS system can be found at <a href="http://www.lockss.org/">www.lockss.org</a></p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/120/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/120/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/120/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/120/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/120/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/120/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/120/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/120/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/120/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/120/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=120&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/07/21/archive-it-and-lockss-interoperability/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5382cc685df6cff2892d8560e05eb1e8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">waybackmolly</media:title>
		</media:content>
	</item>
		<item>
		<title>K-12 Web Archiving Program, 2008-2009</title>
		<link>http://iawebarchiving.wordpress.com/2009/07/07/k-12-web-archiving-program-2008-2009/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/07/07/k-12-web-archiving-program-2008-2009/#comments</comments>
		<pubDate>Tue, 07 Jul 2009 18:36:15 +0000</pubDate>
		<dc:creator>lorimd</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=115</guid>
		<description><![CDATA[The website for the first full year of our K-12 Web Archiving Program is now available online.
For the 2008/2009 school year, Internet Archive, the Library of Congress and California Digital Library collaborated on a program that explores archiving the Web from the perspective of students in elementary, middle and high school.
Using the Archive-It service, students [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=115&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The website for the first full year of our <a href="http://www.archive-it.org/k12/">K-12 Web Archiving Program</a> is now available online.</p>
<p>For the 2008/2009 school year, <a href="http://www.archive.org">Internet Archive</a>, the <a href="http://www.loc.gov">Library of Congress</a> and <a href="http://www.cdlib.org">California Digital Library</a> collaborated on a program that explores archiving the Web from the perspective of students in elementary, middle and high school.</p>
<p>Using the Archive-It service, students from ten different schools selected born digital content from the Web to create &#8220;time capsules&#8221; to represent their world. By allowing students to identify sites that will be preserved for the long-term, the program gives teens and younger students a chance to identify and document their cultural history and the world that&#8217;s important to them. Unlike time capsules of tangible objects, which usually remain hidden for decades or centuries, the resulting Web collections are immediately visible and publicly accessible on the Archive-it website, with full text search for study and analysis.</p>
<p>For the 2009/2010 school year we hope to broaden the program&#8217;s outreach to additional schools around the country. To get involved and/or learn more, please send us your information through <a href="http://www.archive-it.org/public/contact-us">this request form</a>. Applications will be available mid to late July.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/115/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/115/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/115/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/115/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/115/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/115/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/115/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/115/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/115/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/115/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=115&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/07/07/k-12-web-archiving-program-2008-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/06e7923d36d2d2434037deaa722d9949?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">lorimd</media:title>
		</media:content>
	</item>
		<item>
		<title>Check Us Out on Free Government Information</title>
		<link>http://iawebarchiving.wordpress.com/2009/06/16/check-us-out-on-free-government-information/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/06/16/check-us-out-on-free-government-information/#comments</comments>
		<pubDate>Tue, 16 Jun 2009 22:38:55 +0000</pubDate>
		<dc:creator>waybackmolly</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=112</guid>
		<description><![CDATA[The Archive-It team are guest blogging throughout the month of June on the Free Government Information blog.  Please come on over and take a look at our posts or follow our feed. 
The entire Free Gov Info blog is an excellent resource for news and information, please check it out!
       [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=112&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>The <a href="http://www.archive-it.org">Archive-It</a> team are guest blogging throughout the month of June on the <a href="http://freegovinfo.info/">Free Government Information</a> blog.  Please come on over and take a look at <a href="http://freegovinfo.info/blog/175">our posts</a> or follow <a href="http://freegovinfo.info/blog/175/feed">our feed</a>. </p>
<p>The entire <a href="http://freegovinfo.info/">Free Gov Info</a> blog is an excellent resource for news and information, please check it out!</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/112/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/112/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/112/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/112/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/112/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/112/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/112/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/112/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/112/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/112/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=112&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/06/16/check-us-out-on-free-government-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5382cc685df6cff2892d8560e05eb1e8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">waybackmolly</media:title>
		</media:content>
	</item>
		<item>
		<title>University of Melbourne&#8217;s Award Winning Web Archiving Program</title>
		<link>http://iawebarchiving.wordpress.com/2009/06/11/university-of-melbournes-award-winning-web-archiving-program/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/06/11/university-of-melbournes-award-winning-web-archiving-program/#comments</comments>
		<pubDate>Thu, 11 Jun 2009 00:45:01 +0000</pubDate>
		<dc:creator>waybackmolly</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=106</guid>
		<description><![CDATA[The University of Melbourne  were recently recognized for their excellent web archiving program at the Sir Rupert Hamer Records Management Awards.

Each year the Public Records Advisory Council (PRAC) of Victoria, Australia offers the Sir Rupert Hamer Records Management Awards, recognizing excellence and innovation in records management in the Victorian public sector.  The Awards are named after Sir [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=106&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p style="text-align:left;">The <a href="http://www.unimelb.edu.au/">University of Melbourne</a>  were recently recognized for their excellent <a href="http://www.unimelb.edu.au/records/web-archiving/">web archiving program</a> at the Sir Rupert Hamer Records Management Awards.</p>
<blockquote>
<p style="text-align:left;">Each year the Public Records Advisory Council (PRAC) of Victoria, Australia offers the Sir Rupert Hamer Records Management Awards, recognizing excellence and innovation in records management in the Victorian public sector.  The Awards are named after Sir Rupert Hamer who was the Victorian Premier when the Public Records Act was passed in 1973 and when Public Record Office Victoria opened its first office and repository in 1975. </p>
<p style="text-align:left;">The ceremony was held at Queens Hall at Parliament House on Thursday 28th of May.  The Web Archiving Program run by Records Services (team included Lucinda Davies &#8211; Ptrogram Coordinator, Silvia Paparozzi &#8211; Team Member, Mahesh Sundar &#8211; Team leader and Catherine Nicholls &#8211; Project Manager,) was awarded a &#8220;Certificate of Commendation&#8221; in the large agency category.  </p>
</blockquote>
<p style="text-align:left;">The University of Melbourne has been an Archive-It partner since January 2008.  Overall they have collected over 5 million URLs and 500 gb of data.</p>
<p style="text-align:left;">Please take a look at their now award winning <a href="http://www.unimelb.edu.au/records/web-archiving/">program</a> including this wonderful <a href="http://www.unimelb.edu.au/records/web-archiving/overview.html">video</a> (featuring puppets!!) they put together at the end of last year.</p>
<p style="text-align:left;">Congratulations Team Melbourne!  </p>
<p style="text-align:left;"> </p>
<p style="text-align:left;"> </p>
<p style="text-align:left;"> </p>
<p style="text-align:left;"> </p>
<p style="text-align:left;"> </p>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">Each year the Public Records Advisory Council (PRAC) of Victoria,</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">Australia, offers the Sir Rupert Hamer Records Management Awards,</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">recognising excellence and innovation in records management in the</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">Victorian public sector. The Awards are named after Sir Rupert Hamer who</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">was the Victorian Premier when the Public Records Act was passed in 1973</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">and when Public Record Office Victoria opened its first office and</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">repository in 1975.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">The ceremony was held at Queens Hall at Parliament House on Thursday</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">28th May. The Web Archiving Program run by Records Services (team</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">included Lucinda Davies &#8211; Program Coordinator, Silvia Paparozzi &#8211; Team</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">Member, Mahesh Sundar &#8211; Team Leader and me &#8211; Project Manager,) was</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;">awarded a &#8220;Certificate of Commendation&#8221; in the large agency category.</div>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/106/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/106/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/106/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/106/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/106/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/106/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/106/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/106/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/106/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/106/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=106&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/06/11/university-of-melbournes-award-winning-web-archiving-program/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5382cc685df6cff2892d8560e05eb1e8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">waybackmolly</media:title>
		</media:content>
	</item>
		<item>
		<title>WARC File Format Published as an International Standard</title>
		<link>http://iawebarchiving.wordpress.com/2009/06/03/warc-file-format-published-as-an-international-standard/</link>
		<comments>http://iawebarchiving.wordpress.com/2009/06/03/warc-file-format-published-as-an-international-standard/#comments</comments>
		<pubDate>Wed, 03 Jun 2009 23:16:46 +0000</pubDate>
		<dc:creator>waybackmolly</dc:creator>
				<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[Heritrix]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://iawebarchiving.wordpress.com/?p=92</guid>
		<description><![CDATA[An exciting announcement from the International Internet Preservation Consortium regarding the preservation file format generated using the Heritrix web crawler (used for all Archive-It and Internet Archive crawls for partners):
The International Internet Preservation Consortium is pleased to
announce the publication of the WARC file format as an international
standard: ISO 28500:2009, Information and documentation &#8212; WARC file
format.
[http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=44717]
For [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=92&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p style="text-align:left;">An exciting announcement from the <a href="http://www.netpreserve.org">International Internet Preservation Consortium</a> regarding the preservation file format generated using the <a href="http://crawler.archive.org">Heritrix</a> web crawler (used for all <a href="http://www.archive-it.org">Archive-It</a> and Internet Archive crawls for partners):</p>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">The International Internet Preservation Consortium is pleased to</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">announce the publication of the WARC file format as an international</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">standard: ISO 28500:2009, Information and documentation &#8212; WARC file</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">format.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">[http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=44717]</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">For many years, heritage organizations have tried to find the most</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">appropriate ways to collect and keep track of World Wide Web material</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">using web-scale tools such as web crawlers. At the same time, these</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">organizations were concerned with the requirement to archive very large</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">numbers of born-digital and digitized files. A need was for a container</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">format that permits one file simply and safely to carry a very large</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">number of constituent data objects (of unrestricted type, including many</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">binary types) for the purpose of storage, management, and exchange.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">Another requirement was that the container need only minimal knowledge</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">of the nature of the objects.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">The WARC format is expected to be a standard way to structure, manage</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">and store billions of resources collected from the web and elsewhere. It</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">is an extension of the ARC format</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">[http://www.archive.org/web/researcher/ArcFileFormat.php ], which has</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">been used since 1996 to store files harvested on the web. WARC format</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">offers new possibilities, notably the recording of HTTP request headers,</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">the recording of arbitrary metadata, the allocation of an identifier for</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">every contained file, the management of duplicates and of migrated</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">records, and the segmentation of the records. WARC files are intended to</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">store every type of digital content, either retrieved by HTTP or another</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">protocol.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">The motivation to extend the ARC format arose from the discussion and</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">experiences of the International Internet Preservation Consortium [</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">http://netpreserve.org/ ], whose core mission is to acquire, preserve</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">and make accessible knowledge and information from the Internet for</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">future generations. IIPC Standards Working Group put forward to ISO</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">TC46/SC4/WG12 a draft presenting the WARC file format. The draft was</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">accepted as a new Work Item by ISO in May 2005.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">Over a period of four years, the ISO working group, with the</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">Bibliothèque nationale de France [http://www.bnf.fr/ ] as convener,</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">collaborated closely with IIPC experts to improve the original draft.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">The WG12 will continue to maintain [http://bibnum.bnf.fr/WARC/ ] the</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">standard and prepare its future revision.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">Standardization offers a guarantee of durability and evolution for the</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">WARC format. It will help web archiving entering into the mainstream</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">activities of heritage institutions and other branches, by fostering the</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">development of new tools and ensuring the interoperability of</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">collections. Several applications are already WARC compliant, such as</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">the Heritrix [http://crawler.archive.org/ ] crawler for harvesting, the</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">WARC tools [http://code.google.com/p/warc-tools/ ] for data management</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">and exchange, the Wayback Machine</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">[http://archive-access.sourceforge.net/projects/wayback/ ], NutchWAX</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">[http://archive-access.sourceforge.net/projects/nutch/ ] and other</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">search tools [http://code.google.com/p/search-tools/ ] for access. The</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">international recognition of the WARC format and its applicability to</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">every kind of digital object will provide strong incentives to use it</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">within and beyond the web archiving community.</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">A press release is available on the IIPC website:</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">http://netpreserve.org/press/pr20090601.php</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">General information about the IIPC can be found at:</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">http://netpreserve.org</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">—&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">Abbie Grotke</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">Library of Congress</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">IIPC Communications Officer</div>
<div id="_mcePaste" style="position:absolute;left:-10000px;top:0;width:1px;height:1px;text-align:left;">netpreserve.org</div>
<blockquote>
<p style="text-align:left;"><span style="font-family:Helvetica;line-height:normal;">The International Internet Preservation Consortium is pleased to<br />
announce the publication of the WARC file format as an international<br />
standard: ISO 28500:2009, Information and documentation &#8212; WARC file<br />
format.</span></p>
<p>[<a href="http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=44717">http://www.iso.org/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=44717</a>]</p>
<p style="text-align:left;">For many years, heritage organizations have tried to find the most<br />
appropriate ways to collect and keep track of World Wide Web material<br />
using web-scale tools such as web crawlers. At the same time, these<br />
organizations were concerned with the requirement to archive very large<br />
numbers of born-digital and digitized files. A need was for a container<br />
format that permits one file simply and safely to carry a very large<br />
number of constituent data objects (of unrestricted type, including many<br />
binary types) for the purpose of storage, management, and exchange.<br />
Another requirement was that the container need only minimal knowledge<br />
of the nature of the objects.</p>
<p>The WARC format is expected to be a standard way to structure, manage<br />
and store billions of resources collected from the web and elsewhere. It<br />
is an extension of the ARC format<br />
[<a href="http://www.archive.org/web/researcher/ArcFileFormat.php">http://www.archive.org/web/researcher/ArcFileFormat.php</a> ], which has<br />
been used since 1996 to store files harvested on the web. WARC format<br />
offers new possibilities, notably the recording of HTTP request headers,<br />
the recording of arbitrary metadata, the allocation of an identifier for<br />
every contained file, the management of duplicates and of migrated<br />
records, and the segmentation of the records. WARC files are intended to<br />
store every type of digital content, either retrieved by HTTP or another<br />
protocol.</p>
<p>The motivation to extend the ARC format arose from the discussion and<br />
experiences of the International Internet Preservation Consortium [<br />
<a href="http://netpreserve.org/">http://netpreserve.org/</a> ], whose core mission is to acquire, preserve<br />
and make accessible knowledge and information from the Internet for<br />
future generations. IIPC Standards Working Group put forward to ISO<br />
TC46/SC4/WG12 a draft presenting the WARC file format. The draft was<br />
accepted as a new Work Item by ISO in May 2005.</p>
<p>Over a period of four years, the ISO working group, with the<br />
Bibliothèque nationale de France [<a href="http://www.bnf.fr/">http://www.bnf.fr/</a> ] as convener,<br />
collaborated closely with IIPC experts to improve the original draft.<br />
The WG12 will continue to maintain [<a href="http://bibnum.bnf.fr/WARC/">http://bibnum.bnf.fr/WARC/</a> ] the<br />
standard and prepare its future revision.</p>
<p>Standardization offers a guarantee of durability and evolution for the<br />
WARC format. It will help web archiving entering into the mainstream<br />
activities of heritage institutions and other branches, by fostering the<br />
development of new tools and ensuring the interoperability of<br />
collections. Several applications are already WARC compliant, such as<br />
the Heritrix [<a href="http://crawler.archive.org/">http://crawler.archive.org/</a> ] crawler for harvesting, the<br />
WARC tools [<a href="http://code.google.com/p/warc-tools/">http://code.google.com/p/warc-tools/</a> ] for data management<br />
and exchange, the Wayback Machine<br />
[<a href="http://archive-access.sourceforge.net/projects/wayback/">http://archive-access.sourceforge.net/projects/wayback/</a> ], NutchWAX<br />
[<a href="http://archive-access.sourceforge.net/projects/nutch/">http://archive-access.sourceforge.net/projects/nutch/</a> ] and other<br />
search tools [<a href="http://code.google.com/p/search-tools/">http://code.google.com/p/search-tools/</a> ] for access. The<br />
international recognition of the WARC format and its applicability to<br />
every kind of digital object will provide strong incentives to use it<br />
within and beyond the web archiving community.</p>
<p>A press release is available on the IIPC website:<br />
<a href="http://netpreserve.org/press/pr20090601.php">http://netpreserve.org/press/pr20090601.php</a></p>
<p>General information about the IIPC can be found at:<br />
<a href="http://netpreserve.org/">http://netpreserve.org</a></p>
<p>—&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;<br />
Abbie Grotke<br />
Library of Congress<br />
IIPC Communications Officer<br />
netpreserve.org</p></blockquote>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/92/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/92/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/92/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=92&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2009/06/03/warc-file-format-published-as-an-international-standard/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5382cc685df6cff2892d8560e05eb1e8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">waybackmolly</media:title>
		</media:content>
	</item>
		<item>
		<title>Searching the dawn of the 21st Century</title>
		<link>http://iawebarchiving.wordpress.com/2008/10/07/searching-the-dawn-of-the-21st-century/</link>
		<comments>http://iawebarchiving.wordpress.com/2008/10/07/searching-the-dawn-of-the-21st-century/#comments</comments>
		<pubDate>Tue, 07 Oct 2008 21:38:20 +0000</pubDate>
		<dc:creator>gojomo</dc:creator>
				<category><![CDATA[History]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Wayback Machine]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://wa.archive.org/blog/2008/10/07/searching-the-dawn-of-the-21st-century/</guid>
		<description><![CDATA[What was the web of the past really like?
Last Tuesday, Google unveiled a unique new web search, 2001 Google, as part of their 10th birthday celebration.
Using an actual archived version of their search engine index from January 2001, the service answers queries more-or-less how Google did back then &#8212; same results, same ranking, same summary [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=37&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>What was the web of the past really like?</p>
<p>Last Tuesday, Google <a href="http://googleblog.blogspot.com/2008/09/2001-search-odyssey.html">unveiled</a> a unique new web search, <a href="http://www.google.com/search2001.html">2001 Google</a>, as part of their 10th birthday celebration.</p>
<p>Using an actual archived version of their search engine index from January 2001, the service answers queries more-or-less how Google did back then &#8212; same results, same ranking, same summary &#8217;snippets&#8217;.</p>
<p>But of course, many of those result pages have changed or disappeared entirely since then &#8212; and that&#8217;s where the Internet Archive&#8217;s <a href="http://web.archive.org">Wayback Machine</a> comes in. For many of the 2001 search results, the best or only view comes from the Wayback Machine, which Google has helpfully provided in lieu of the usual &#8216;cached version&#8217; links.</p>
<p>The combination of authentic Google search and the Wayback&#8217;s giant web archive is more powerful than either alone: finding needles lost in the Wayback haystack, showing actual prior rankings/popularity of pages for real queries, and highlighting material that would have been lost forever without purposeful public-interest archiving.</p>
<p>We thank Google for this chance to work together and highlight our web archive. Google plans to leave the 2001 search up for one month, and we&#8217;ll talk more about what we&#8217;ve learned from this service in a future blog post.</p>
<p>In the meantime, try the <a href="http://www.google.com/search2001.html">2001 Google Search</a>!</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/37/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/37/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/37/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/37/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/37/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/37/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/37/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/37/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/37/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/37/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=37&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2008/10/07/searching-the-dawn-of-the-21st-century/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/7543a4ca55d870656a3961ae17f0f9a5?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">gojomo</media:title>
		</media:content>
	</item>
		<item>
		<title>Seeking Schools for K-12 Web Archiving Program</title>
		<link>http://iawebarchiving.wordpress.com/2008/09/11/seeking-schools-for-k-12-web-archiving-program/</link>
		<comments>http://iawebarchiving.wordpress.com/2008/09/11/seeking-schools-for-k-12-web-archiving-program/#comments</comments>
		<pubDate>Thu, 11 Sep 2008 17:51:46 +0000</pubDate>
		<dc:creator>waybackmolly</dc:creator>
				<category><![CDATA[Archive-It]]></category>
		<category><![CDATA[Digital Stewardship]]></category>
		<category><![CDATA[Web Archiving Community]]></category>

		<guid isPermaLink="false">http://wa.archive.org/blog/2008/09/11/seeking-schools-for-k-12-web-archiving-program/</guid>
		<description><![CDATA[Apply to be part of the Internet Archive k-12 project!
Could your school be one of 10 middle or high schools helping to
capture and archive today&#8217;s primary source materials on the Web?
A small number of individuals and institutions recognize the importance of archiving and preserving the often transitory digital cultural artifacts that are distributed over the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=36&subd=iawebarchiving&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><a href="http://www.loc.gov/teachers/">Apply</a> to be part of the Internet Archive k-12 project!</p>
<p>Could your school be one of 10 middle or high schools helping to<br />
capture and archive today&#8217;s primary source materials on the Web?</p>
<p>A small number of individuals and institutions recognize the importance of archiving and preserving the often transitory digital cultural artifacts that are distributed over the Web. But so far, the vast majority of decisions about what Web sites will live into the future have been made by adults, and reflect adults&#8217; sensibilities about what constitutes the important stuff of history.</p>
<p>The <a href="http://www.archive.org">Internet Archive</a>, the <a href="http://www.loc.gov">Library of Congress</a> and <a href="http://www.cdlib.org/">California Digital Library</a> are collaborating on a project that explores archiving the Web from the perspective of adolescents.</p>
<p>Find a complete project description and the brief application in the &#8220;Featured Resources&#8221; section at http://www.loc.gov/teachers/. Apply by September 30 for full consideration.</p>
<p>A pilot of the <a href="http://www.archive-it.org/k12/">K-12 web archiving program</a> took place in the Spring of 2008.  Three high schools from across the country participated and the resulting collection represent a broad range of interests and points of view.  You can learn more about the pilot and view the collections on the <a href="http://www.archive-it.org/k12/">Archive-It website</a>.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/iawebarchiving.wordpress.com/36/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/iawebarchiving.wordpress.com/36/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/iawebarchiving.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/iawebarchiving.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/iawebarchiving.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/iawebarchiving.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/iawebarchiving.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/iawebarchiving.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/iawebarchiving.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/iawebarchiving.wordpress.com/36/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/iawebarchiving.wordpress.com/36/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/iawebarchiving.wordpress.com/36/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=iawebarchiving.wordpress.com&blog=7170684&post=36&subd=iawebarchiving&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://iawebarchiving.wordpress.com/2008/09/11/seeking-schools-for-k-12-web-archiving-program/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/5382cc685df6cff2892d8560e05eb1e8?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">waybackmolly</media:title>
		</media:content>
	</item>
	</channel>
</rss>