Wayback Machine at 240 Billion URLs

Internet Archive - Wayback Machine

Good news from the Internet Archive – its Wayback Machine now contains 240 billion URLs for archived web pages, captured from late 1996 to early December 2012. This adds up to 5 petabytes of data. The Wayback Machine is an excellent tool for historical research.

Wayback Machine Now Has 240 Billion URLs by Gary Price at Search Engine Land (Jan 14)

Other points:

  • Some of the oldest content is still in the old system and must be searched separately until everything is brought into the new system.
  • The Internet Archive has received $1 million to buy more storage.
  • A portion of the archive is keyword searchable through the fee-based Archive-It, which has 5 billion URLs from public collections.