Comment by Olnium on 02/05/2022 at 22:15 UTC

7 upvotes, 3 direct replies (showing 3)

View submission: Camas reddit-search "has been disabled by GitHub Staff due to a violation of GitHub's Terms of Service."

For anyone like me that wants to save a local copy but is having trouble saving the source directly from the Wayback Machine, I got you.

When you save a local copy directly from the Wayback Machine, the source code contains links that still route queries through the Wayback Machine. In my case that caused errors, although I could still view comments and posts by clicking on the API link and reading them in plain text directly from Pushshift. For me that's not a good solution, so I looked around and found something much better. Just follow these steps:

1. Go to https://rubyinstaller.org/downloads/[1] and install the latest build of Ruby with Devkit. Install it with all additional options selected.

1: https://rubyinstaller.org/downloads/

2. From your start menu, select "Start Command Prompt With Ruby". This will bring up a CMD window.

3. In the CMD window, type "gem install wayback_machine_downloader" (no quotes). This will install a gem package as explained here: https://github.com/hartator/wayback-machine-downloader

4. Once the downloader is installed, type in the CMD window "wayback_machine_downloader https://camas.github.io/reddit-search[2]" (no quotes). This will download the original website source code to a folder named "websites" in your user directory.

2: https://camas.github.io/reddit-search

5. In that folder, double-click index.html and you have a local copy of Reddit Search running in your browser.

If you get stuck, read the documentation for the Ruby Gem here: https://github.com/hartator/wayback-machine-downloader

Edit: Refined base URL to avoid downloading extraneous data.
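
For anyone curious what those Wayback links look like, here is a minimal Python sketch that strips the Wayback Machine prefix from a link. It assumes the usual replay URL shape (https://web.archive.org/web/, a timestamp, an optional two-letter modifier such as js_ or id_, then the original URL). The function name strip_wayback_prefix is my own, and this is only an illustration, not what the downloader gem actually does internally.

```python
import re

# Assumed replay URL shape:
#   https://web.archive.org/web/<timestamp><optional modifier>/<original URL>
WAYBACK_PREFIX = re.compile(
    r"^https?://web\.archive\.org/web/\d{1,14}(?:[a-z]{2}_)?/"
)


def strip_wayback_prefix(url: str) -> str:
    """Return the original URL if `url` is a Wayback replay URL.

    URLs that do not match the assumed shape are returned unchanged.
    """
    return WAYBACK_PREFIX.sub("", url, count=1)


if __name__ == "__main__":
    archived = (
        "https://web.archive.org/web/20220501043233/"
        "https://camas.github.io/reddit-search/"
    )
    print(strip_wayback_prefix(archived))
```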

Replies

Comment by [deleted] at 02/05/2022 at 23:01 UTC

4 upvotes, 1 direct replies

I may have installed it incorrectly, but presently step 4 returns:

Getting snapshot pages. found 0 snapshots to consider.

No files to download.

Possible reasons:

* Site is not in Wayback Machine Archive.

I have a working offline version already, but was curious about your instructions. Thanks for posting this regardless. I probably installed it incorrectly.

Comment by digwhoami at 04/05/2022 at 18:00 UTC

2 upvotes, 1 direct replies

When you save a local copy directly from the Wayback Machine, the source code contains links that mean queries still get routed through the Wayback Machine [...]

Was googling about the Wayback Machine and rewriting links after the site went down and stumbled upon this:

https://superuser.com/a/828908

In short: just slap an **id_** at the end of the date string, like this:

https://web.archive.org/web/20220501043233id_/https://camas.github.io/reddit-search/

Save the static html, profit.
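
If you want to do that for more than one snapshot, here is a rough Python sketch of the same trick. It assumes a 14-digit timestamp right after /web/, and the helper name raw_snapshot_url is made up for this example. It only builds the URL; fetching and saving the page is left to you.

```python
import re

# Assumed shape: https://web.archive.org/web/<14-digit timestamp>[modifier]/<original URL>
SNAPSHOT = re.compile(
    r"^(https?://web\.archive\.org/web/)(\d{14})(?:[a-z]{2}_)?/(.+)$"
)


def raw_snapshot_url(url: str) -> str:
    """Return the snapshot URL with `id_` appended to the timestamp.

    Any existing two-letter modifier (for example `js_`) is replaced by `id_`.
    """
    match = SNAPSHOT.match(url)
    if match is None:
        raise ValueError(f"not a Wayback snapshot URL: {url!r}")
    prefix, timestamp, original = match.groups()
    return f"{prefix}{timestamp}id_/{original}"


if __name__ == "__main__":
    snapshot = (
        "https://web.archive.org/web/20220501043233/"
        "https://camas.github.io/reddit-search/"
    )
    print(raw_snapshot_url(snapshot))
```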

Comment by capfan67 at 04/05/2022 at 17:00 UTC

1 upvotes, 1 direct replies

Worked perfectly. I am in your debt.