💾 Archived View for bbs.geminispace.org › s › AskGemini › 22269 captured on 2024-12-17 at 15:09:11. Gemini links have been rewritten to link to archived content

I have been travelling with spotty Internet today and I got the idea that you might want to download a whole site for offline viewing.

What archiving format would be good for downloading a whole site?

My personal choice would be ARJ, since there is an open source version, it doesn't need much computing power, and it can split archives into volumes (nostalgia). Another good feature would be the ability to pluck single files from the archive without unpacking the whole thing. I think ARJ can do that, but I need to check.
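
To make the "pluck one file" idea concrete, here is a minimal Python sketch. It uses the standard-library zipfile module as a stand-in, because Python has no built-in ARJ support; the function name and the archive and member names are hypothetical.

```python
import zipfile


def pluck(archive_path, member, dest_dir):
    """Extract a single member from a ZIP archive without unpacking the rest."""
    with zipfile.ZipFile(archive_path) as zf:
        # ZIP keeps a central directory, so only the requested entry is read
        # and decompressed; the other members are left untouched.
        return zf.extract(member, path=dest_dir)
```

Called as pluck("capsule.zip", "posts/a.gmi", "out"), this should write only out/posts/a.gmi and return its path.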

Posted in: s/AskGemini

🚂 MrSVCD

Nov 27 · 3 weeks ago

7 Comments ↓

🍀 gritty · Nov 28 at 00:22:

There's a format in Gemini for this, and Lagrange supports it. There's also Offpunk, which does this.

🍀 gritty · Nov 28 at 00:23:

ah disregard, you meant the web

🔭 thewanderer · Nov 28 at 08:53:

I can recommend two options. For both the small web and the normal web, Offpunk works well; I'm using it now. Another good option for larger sites is Kiwix. They already have a large catalogue of websites available for download, and they also have a website you can use to download sites they don't have.

👺 daruma · Nov 28 at 14:33:

Offpunk ftw! I was hoping the cache for Lagrange could also be used as offline material. I also download a page as an ebook when I find a good article and send it to my ebook reader through Syncthing.

🚂 MrSVCD [OP] · Nov 28 at 15:44:

I meant gemini capsules but I was really tired when I wrote the post.

For https/html it is not really a problem: either the site is interactive and would not work offline anyway, or it is a static site and a single large zip file would suffice, since https can handle resumes.

I am going to look into offpunk.
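
The resume behaviour mentioned above for HTTPS downloads relies on the standard HTTP Range header. A minimal sketch with Python's urllib follows; the function names and the example URL are hypothetical, and it assumes the server honours range requests.

```python
import os
import urllib.request


def resume_request(url, partial_path):
    """Build a request that asks only for the bytes we do not have yet."""
    have = os.path.getsize(partial_path) if os.path.exists(partial_path) else 0
    req = urllib.request.Request(url)
    if have:
        # Open-ended range: everything from byte offset `have` to the end.
        req.add_header("Range", f"bytes={have}-")
    return req, have


def resume_download(url, partial_path):
    """Continue a partial download, or start over if the server ignores Range."""
    req, have = resume_request(url, partial_path)
    with urllib.request.urlopen(req) as resp:
        # 206 means the range was honoured; 200 means the whole file is coming.
        mode = "ab" if have and resp.status == 206 else "wb"
        with open(partial_path, mode) as out:
            while chunk := resp.read(64 * 1024):
                out.write(chunk)
```

If the partial file is already complete, a server will typically answer 416, which urlopen raises as an HTTPError; a real script would want to catch that.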

🍀 gritty · Nov 29 at 02:57:

Gempub format

Here it is: Gempub. It's an ebook format for Gemini that can also archive capsules. Lagrange supports it out of the box.

For example, in BBS you can download all your data, which is provided in gempub format (from settings).
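
As far as I understand the Gempub specification, a Gempub file is an ordinary ZIP archive of gemtext files plus a metadata.txt made of "key: value" lines. The sketch below packs a capsule directory that way. The function name is hypothetical, and the metadata field names (title, gpubVersion, index) are written from memory, so check them against the spec before relying on this.

```python
import zipfile
from pathlib import Path


def write_gempub(capsule_dir, out_path, title):
    """Pack a directory of gemtext files into a Gempub-style ZIP archive."""
    root = Path(capsule_dir)
    with zipfile.ZipFile(out_path, "w", zipfile.ZIP_DEFLATED) as zf:
        # Assumed metadata layout: one "key: value" pair per line.
        zf.writestr(
            "metadata.txt",
            f"title: {title}\ngpubVersion: 1.0.0\nindex: index.gmi\n",
        )
        for path in sorted(root.rglob("*")):
            rel = path.relative_to(root).as_posix()
            # Skip directories and any metadata.txt already in the capsule,
            # so the archive does not end up with two entries of that name.
            if path.is_file() and rel != "metadata.txt":
                zf.write(path, rel)
```

Write the output file outside the capsule directory, otherwise the archive would try to include itself.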

🚀 lanterm · Nov 30 at 06:23:

I've used wget (with, I think, the --mirror and --convert-links flags) to download offline copies of sites, e.g. for developer docs.
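
For reference, a small Python wrapper around that kind of wget call might look like the sketch below. --mirror and --convert-links are the flags named above; --page-requisites and --directory-prefix are additions of mine rather than part of the original setup, and the function names and URL are hypothetical.

```python
import subprocess


def mirror_command(url, dest_dir):
    """Build a wget command line for an offline copy of a site."""
    return [
        "wget",
        "--mirror",            # recursive download with timestamping
        "--convert-links",     # rewrite links to point at the local copies
        "--page-requisites",   # my addition: also fetch images, CSS and scripts
        f"--directory-prefix={dest_dir}",  # my addition: where to store files
        url,
    ]


def mirror(url, dest_dir):
    """Run the mirror; raises CalledProcessError if wget exits non-zero."""
    subprocess.run(mirror_command(url, dest_dir), check=True)
```

For example, mirror("https://docs.example.org/", "mirror") would store the copy under mirror/docs.example.org/.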

You can typically just open these as-is in your browser, but you'll hit issues if the site tries to load assets in a way that's not friendly to file:// URLs.

I have a script that starts a simple proxy server and opens a Firefox window with a preloaded profile to route all traffic through the proxy server. The proxy server takes care of serving the offline content. It's a nice setup because it's very similar to browsing the live site.

This setup is a bit ad-hoc and it's not optimizing how sites are stored by any means, but it works for me.
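
A rough idea of what such a proxy could look like (this is not lanterm's actual script, just a sketch under assumptions): it maps each requested URL onto a wget-style mirror directory laid out as mirror/<host>/<path>. The names MIRROR_ROOT, local_path_for, OfflineProxy and serve are hypothetical. It only handles plain http:// requests and ignores query strings; https traffic arrives at a proxy as CONNECT and would need more work.

```python
import mimetypes
import posixpath
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer
from pathlib import Path
from urllib.parse import unquote, urlsplit

MIRROR_ROOT = Path("mirror")  # hypothetical location of the mirrored sites


def local_path_for(url, root=MIRROR_ROOT):
    """Map a proxied URL to a file under root, or None if it cannot be mapped."""
    parts = urlsplit(url)
    if not parts.hostname:
        return None  # not an absolute URL, so not a proxy-style request
    path = posixpath.normpath(unquote(parts.path) or "/")
    if parts.path.endswith("/") or path == "/":
        path = posixpath.join(path, "index.html")  # directory URL
    candidate = (Path(root) / parts.hostname / path.lstrip("/")).resolve()
    # Refuse anything that would resolve outside the mirror directory.
    if Path(root).resolve() not in candidate.parents:
        return None
    return candidate


class OfflineProxy(BaseHTTPRequestHandler):
    def do_GET(self):
        # A browser configured to use an HTTP proxy sends the full URL here.
        target = local_path_for(self.path)
        if target is None or not target.is_file():
            self.send_error(404, "Not in offline mirror")
            return
        body = target.read_bytes()
        ctype = mimetypes.guess_type(target.name)[0] or "application/octet-stream"
        self.send_response(200)
        self.send_header("Content-Type", ctype)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port=8080):
    """Serve the mirror on localhost until interrupted."""
    ThreadingHTTPServer(("127.0.0.1", port), OfflineProxy).serve_forever()
```

Running serve() and pointing the browser's HTTP proxy setting at 127.0.0.1:8080 should then answer requests from the mirror instead of the network.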