End of Term Archive

@eotarchive

Tweets from the project team of the End of Term Web Archive - preserving U.S. government websites during transitions in government.

http://eotarchive.cdlib.org/

Tweets

EOT Harvest, part II: How we got involved (http://t.co/42z7ZT2a)
I became aware of the EOT Harvest through meetings and presentations at the Depository Library Council conference, so when I saw that the project was seeking volunteers to nominate content for the ...
8:42 AM - 22 Oct 12

Mark P: 4.5 million gov PDFs are lurking in the 16 terabytes of End of Term Harvest. Even if only 1/2 in scope for FDLP that's a lot

In my experience, QA of crawls takes more human time than the actual crawl. Let's talk about crowd-sourcing of web crawls

Introducing the End of Term Harvest project at Pratt SILS (http://t.co/Rk36Rem1)
I am excited to be contributing this semester to the End of Term (EOT) harvest project. What, you may rightly ask, is the EOT harvest? So here’s the short answer: The EOT harvest archives web infor...
5:24 AM - 1 Oct 12

RT : On Oct 15 at noon, learn how to use the Wayback Machine for research. Presentation , Madison Building,...

Just spoke to federal web managers about EOT & the 2008 site -- Glad to spread the word to agencies we're archiving!

I can't believe the IRS redesigned its website and didn't put redirects in place.

& Kris are about to talk to some Pratt students who will be helping identify social media for the EOT
