💾 Archived View for bbs.geminispace.org › u › Acidus › 6253 captured on 2023-11-04 at 15:03:58. Gemini links have been rewritten to link to archived content

-=-=-=-=-=-=-

Comment by 🧇 Acidus

Re: "[gemini link] - gemi.dev/gemlog/2023-10-15-numbering-..."

In: u/Acidus

Sorry for not being clearer. Threading is not the problem. I'm using the Message-Id, In-Reply-To, and References headers to build the threads. I'm also using JWZ's threading algorithm from Netscape, which handles threading when those headers are missing or wrong, as is the case for a lot of threads in the mailing list.
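For readers unfamiliar with header-based threading, here is a minimal sketch of how a parent link can be derived from those headers. It is not JWZ's full algorithm (which also handles missing messages and conflicting references) and not necessarily how the code described above works; the function name `parent_id` is hypothetical.

```python
from email import message_from_string
from email.message import Message
from typing import Optional


def parent_id(msg: Message) -> Optional[str]:
    """Return the Message-Id of the most likely parent, or None for a thread root."""
    # References lists ancestors oldest-first, so the last entry is the direct parent.
    refs = (msg.get("References") or "").split()
    if refs:
        return refs[-1]
    # Fall back to In-Reply-To; use the first token if several are present.
    in_reply_to = (msg.get("In-Reply-To") or "").split()
    return in_reply_to[0] if in_reply_to else None


if __name__ == "__main__":
    reply = message_from_string(
        "Message-Id: <c@example>\n"
        "References: <a@example> <b@example>\n"
        "\n"
        "body\n"
    )
    print(parent_id(reply))
```

A message with neither header is treated as a thread root here; the full algorithm would go further and try to group such messages by subject.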

The problem is linking the "message number" (i.e., the 6-digit number used as the HTML filename) that Orbital Fox assigned to each message in the mailing list's HTML interface to the message content in the mbox files.

🧇 Acidus

2023-10-16 · 3 weeks ago

2 Later Comments ↓

☕️ mozz

Check the source code :)

https://github.com/Koumbit/mailman2/blob/437d44b2258726fe150308d5fe5686f34c70d069/Mailman/Archiver/pipermail.py#L582

Looks like the index is a "sequence" number, and the missing numbers are emails that were discarded by the mail server (detected as spam or otherwise thrown away).

Seems like you could probably:

1. Pull all of the sequence numbers from the wayback archive yearly pages (sorted by date).

2. Sort the emails in your mailbox by UTC date.

3. Line up the numbers with the emails.
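The three steps above can be sketched roughly as follows. This is an illustration under assumptions, not a tested solution for the real archive: the helper names `to_utc` and `line_up` and the `(message_id, date_header)` tuple shape are hypothetical, and the sequence numbers are assumed to have been scraped already, in the order the date-sorted index pages list them.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def to_utc(date_header: str) -> datetime:
    """Parse an RFC 2822 Date header and normalize it to UTC."""
    dt = parsedate_to_datetime(date_header)
    if dt.tzinfo is None:
        # Assumption: a header with no usable offset ("-0000") is already UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc)


def line_up(seq_numbers, messages):
    """Pair sequence numbers with messages sorted by UTC date.

    seq_numbers: numbers in the order the date-sorted index lists them.
    messages: iterable of (message_id, date_header) tuples.
    Returns a dict mapping sequence number -> message_id.
    """
    ordered = sorted(messages, key=lambda m: to_utc(m[1]))
    numbers = list(seq_numbers)
    if len(numbers) != len(ordered):
        # A mismatch means the mbox and the HTML index disagree on which
        # messages exist, so a blind positional pairing would be wrong.
        raise ValueError("sequence numbers and messages differ in count")
    return {n: mid for n, (mid, _) in zip(numbers, ordered)}
```

Gaps in the numbering do not matter for this pairing, since only numbers that actually appear in the index are used. The count check is the main safeguard: if the mbox holds a message the index never listed (or the reverse), every pairing after that point would be shifted by one.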

💀 anisotropus

There is just enough craziness in the numbers that I can't assume they increment by 1, and then go back and manually fix a few weird ones. With over 7700 messages, that just isn't reasonable.

I'll help you with the manual work if you can't fully automate it. When you're ready, make a zip of the workspace and provide a link here. What you're doing is too important to leave half-done.

Original Post

🧇 Acidus

gemi.dev/gemlog/2023-10-15-numbering-madness.gmi

Could definitely use help trying to recover the numbering scheme used by messages on the Gemini Mailing List. It should be easy, but I can't crack it. More details here:

💬 4 comments · 2023-10-15 · 3 weeks ago