Comment restaurer le contenu d'un site web en utilisant archive.org

Disclaimer: the following steps can help gather/rebuild from cached static images of your website from the Internet Archive. This procedure may give you a starting point in rebuilding your website.

Also, there is no guarantee that the Internet Archive will have cached your website files. Following the below steps should only ever be an alternative to restoring an actual backup of your website.

This procedure applies to static websites only (i.e. It will not work for CMS platforms such as WordPress, etc).

What is the Internet Archive?

The Wayback Machine (web.archive.org) is a digital archive of the World Wide Web. Since its launch in 2001, over 452 billion pages have been added to the archive. Users can enter a URL to view and interact with past versions of any website contained in the Archive, even if the site no longer exists on the "live" web.

Procedure:

  1. In a browser, navigate to the Internet Archive
  2. Enter the full URL of your website (e.g. example.com/index.html)
  3. Press enter or click the Browse History button
  4. On the next page, you’ll see a calendar displaying all cached copies of your webpage
    Archive
  5. You can click on a date to open a cached version of your webpage, then click the time from the available snapshots
    Archive
  6. Your cached page will open so you can obtain the source code (in most browsers simply right-click and select View Page Source). Copy the code and paste it into a text editor, then save it as an HTML file and upload to your server (see Publish a Site From your Computer), ensuring you have renamed the file as the page you are replacing.
There are several tools that allow you to conveniently download an entire website backup for restoration, such as Wayback Machine Downloader (open source), Warrick, and Wayback Downloader (paid).
Est-ce que cet article vous a aidé?