7 upvotes, 3 direct replies (showing 3)
For anyone like me that wants to save a local copy but is having trouble saving the source directly from the Wayback Machine, I got you.
When you save a local copy directly from the Wayback Machine, the source code contains links that mean queries still get routed through the Wayback Machine. In my case this caused errors, although I could still view comments and posts by clicking on the API link and reading them in plain text directly from Pushshift. For me that's not a good solution, so I looked around and found something much better. Just follow these steps:
1. Go to https://rubyinstaller.org/downloads/[1] and install the latest build of Ruby with Devkit. Install it with all additional options selected.
1: https://rubyinstaller.org/downloads/
2. From your start menu, select "Start Command Prompt With Ruby". This will bring up a CMD window.
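3. In the CMD window, type "gem install wayback_machine_downloader" (no quotes) and wait for the install to finish.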
4. Once the downloader is installed, type in the CMD window "wayback_machine_downloader https://camas.github.io/reddit-search[2]" (no quotes). This will download the original website source code to a folder named "websites" in your user directory.
2: https://camas.github.io/reddit-search
5. In that folder, double-click index.html and you have a locally hosted version of Reddit Search.
Edit: Refined base URL to avoid downloading extraneous data.
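To check whether a page you already saved by hand has the link problem described at the top of this post, something like the following Python sketch could help. It assumes the rewritten links follow the usual Wayback pattern of a `/web/<timestamp>/` prefix (optionally with a two-letter modifier such as `js_`) in front of the original URL. The function names and the `main.js` filename in the usage note are made up for illustration.

```python
import re

# Assumed shape of a Wayback-rewritten link: an optional web.archive.org host,
# then /web/<digits>, an optional two-letter modifier (js_, cs_, im_, id_, ...),
# a slash, and finally the original absolute URL.
WAYBACK_PREFIX = re.compile(
    r"(?:https?://web\.archive\.org)?/web/\d{4,14}(?:[a-z]{2}_)?/(?=https?://)"
)


def count_wayback_links(html: str) -> int:
    """Return how many rewritten Wayback links appear in the saved HTML."""
    return len(WAYBACK_PREFIX.findall(html))


def strip_wayback_prefixes(html: str) -> str:
    """Remove the Wayback prefix so each link points at its original URL."""
    return WAYBACK_PREFIX.sub("", html)
```

For example, `strip_wayback_prefixes('<script src="/web/20220501043233js_/https://camas.github.io/reddit-search/main.js"></script>')` returns the same tag with `src="https://camas.github.io/reddit-search/main.js"`. This only rewrites links in the text you pass in; it does not remove any toolbar or scripts the Wayback Machine may have injected into the page.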
Comment by [deleted] at 02/05/2022 at 23:01 UTC
4 upvotes, 1 direct replies
I may have installed it incorrectly, but presently step 4 returns:
Getting snapshot pages. found 0 snapshots to consider.
No files to download.
Possible reasons:
* Site is not in Wayback Machine Archive.
I have a working offline version already, but was curious about your instructions. Thanks for posting this regardless. I probably installed it incorrectly.
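One way to narrow down a "found 0 snapshots" result is to ask the Wayback Machine's availability endpoint whether it knows the URL at all. The sketch below is an independent check using only the Python standard library; it is not necessarily the same lookup that wayback_machine_downloader performs, so a hit here does not guarantee the downloader will find snapshots. The function names are illustrative.

```python
import json
import urllib.parse
import urllib.request

AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def build_availability_url(target: str) -> str:
    """Build the query URL for the Wayback availability endpoint."""
    return AVAILABILITY_ENDPOINT + "?" + urllib.parse.urlencode({"url": target})


def closest_snapshot(response_text: str):
    """Return the closest snapshot URL from a JSON response, or None."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def check(target: str):
    """Query the endpoint over the network and return a snapshot URL or None."""
    with urllib.request.urlopen(build_availability_url(target), timeout=30) as resp:
        return closest_snapshot(resp.read().decode("utf-8"))


# Usage (needs network access):
# print(check("camas.github.io/reddit-search"))
```

If `check` returns a snapshot URL but step 4 still reports 0 snapshots, the cause is more likely the install or the exact URL typed than a missing archive.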
Comment by digwhoami at 04/05/2022 at 18:00 UTC
2 upvotes, 1 direct replies
When you save a local copy directly from the Wayback Machine, the source code contains links that mean queries still get routed through the Wayback Machine [...]
I was googling about the Wayback Machine and rewriting links after the site went down and stumbled upon this:
https://superuser.com/a/828908
In short: just slap an **id_** at the end of the date string, like this:
https://web.archive.org/web/20220501043233id_/https://camas.github.io/reddit-search/
Save the static html, profit.
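The `id_` trick above can be applied mechanically. Here is a minimal Python sketch, assuming the snapshot URL has the form `https://web.archive.org/web/<timestamp>/<original URL>`; the function name is illustrative.

```python
import re

# Groups: the archive prefix, the timestamp digits, and the "/<original URL>" tail.
# Any existing two-letter modifier (e.g. js_ or id_) is matched but discarded.
_SNAPSHOT_RE = re.compile(
    r"^(https?://web\.archive\.org/web/)(\d{4,14})(?:[a-z]{2}_)?(/.+)$"
)


def to_raw_snapshot_url(url: str) -> str:
    """Insert id_ after the timestamp so the archive serves the unmodified page."""
    match = _SNAPSHOT_RE.match(url)
    if match is None:
        raise ValueError(f"not a Wayback Machine snapshot URL: {url}")
    prefix, timestamp, rest = match.groups()
    return f"{prefix}{timestamp}id_{rest}"
```

Calling `to_raw_snapshot_url("https://web.archive.org/web/20220501043233/https://camas.github.io/reddit-search/")` gives the `id_` URL shown above, and passing in a URL that already has `id_` returns it unchanged.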
Comment by capfan67 at 04/05/2022 at 17:00 UTC
1 upvotes, 1 direct replies
Worked perfectly. I am in your debt.