💾 Archived View for bbs.geminispace.org › u › Acidus › 6244 captured on 2023-11-14 at 09:51:56. Gemini links have been rewritten to link to archived content

-=-=-=-=-=-=-

Could definitely use help trying to recover the numbering scheme used for messages on the Gemini Mailing List. It should be easy, but I can't crack it. More details here:

gemi.dev/gemlog/2023-10-15-numbering-madness.gmi

🧇 Acidus

Oct 15 · 4 weeks ago

4 Comments ↓

👤 jdcard · Oct 15 at 20:58:

Could you use the original e-mail threading scheme based on the Message-Id, In-Reply-To, and References headers? See

— https://datatracker.ietf.org/doc/html/rfc5322#page-25
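A minimal sketch of what header-based threading could look like, using only Python's standard `email` module. The helper names `parent_id` and `build_children` and the sample messages are hypothetical, made up for illustration. It only links each message to its direct parent and does nothing to recover threads when the headers are missing or wrong.

```python
from email import message_from_string


def parent_id(msg):
    """Return the Message-Id this message most likely replies to, or None."""
    # In-Reply-To normally names the direct parent.
    in_reply_to = (msg.get("In-Reply-To") or "").split()
    if in_reply_to:
        return in_reply_to[0]
    # Otherwise fall back to the last entry of References, which by
    # convention is the parent of the message.
    references = (msg.get("References") or "").split()
    return references[-1] if references else None


def build_children(messages):
    """Map each known Message-Id to the list of Message-Ids replying to it."""
    children = {}
    for msg in messages:
        mid = (msg.get("Message-Id") or "").strip()
        if not mid:
            continue  # no usable id, so header-based threading cannot place it
        children.setdefault(mid, [])
        pid = parent_id(msg)
        if pid is not None:
            children.setdefault(pid, []).append(mid)
    return children


# Hypothetical sample messages, not taken from the mailing list.
raw_messages = [
    "Message-Id: <root@example.org>\nSubject: hello\n\nbody\n",
    "Message-Id: <reply1@example.org>\nIn-Reply-To: <root@example.org>\n\nbody\n",
    "Message-Id: <reply2@example.org>\n"
    "References: <root@example.org> <reply1@example.org>\n\nbody\n",
]
threads = build_children([message_from_string(raw) for raw in raw_messages])
```

Messages whose parent never shows up in the mailbox still get an entry under the parent's id, so orphaned replies stay grouped together.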

🧇 Acidus · Oct 16 at 00:37:

Sorry for not being more clear. Threading is not the problem. I'm using the Message-Id, In-Reply-To, and References headers to build the threads. I'm also using JWZ's threading algorithm from Netscape, which handles threading when those headers are missing or wrong, as is the case for a lot of threads in the mailing list.

The problem is linking the "message number" (a.k.a. the six-digit number used as the HTML filename) that Orbital Fox assigned to each message in the mailing list's HTML interface to the message content in the mbox files.

☕️ mozz · Oct 16 at 02:58:

Check the source code :)

— https://github.com/Koumbit/mailman2/blob/437d44b2258726fe150308d5fe5686f34c70d069/Mailman/Archiver/pipermail.py#L582

Looks like the index is a "sequence" number, and the missing numbers are emails that were discarded by the mail server (detected as spam or otherwise thrown away).

Seems like you could probably:

1. Pull all of the sequence numbers from the wayback archive yearly pages (sorted by date).

2. Sort the emails in your mailbox by UTC date.

3. Line up the numbers with the emails.
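The three steps above could be sketched roughly like this in Python. It is only a sketch under a big assumption: that the mbox holds exactly one email for every number that survives in the archive, and that the archive lists its numbers in date order. The names `utc_date` and `line_up` and the sample data are hypothetical.

```python
from datetime import timezone
from email.utils import parsedate_to_datetime


def utc_date(date_header):
    """Parse an RFC 5322 Date header and normalise it to UTC."""
    dt = parsedate_to_datetime(date_header)
    if dt.tzinfo is None:
        # A "-0000" zone parses as a naive datetime; assume it means UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc)


def line_up(sequence_numbers, emails):
    """Pair archive sequence numbers with emails sorted by UTC date.

    sequence_numbers: numbers in the order the archive lists them (by date).
    emails: list of (message_id, date_header) tuples from the mbox.
    """
    if len(sequence_numbers) != len(emails):
        # A count mismatch means the one-to-one assumption is broken.
        raise ValueError("counts differ; cannot line up one-to-one")
    by_date = sorted(emails, key=lambda item: utc_date(item[1]))
    return {
        number: message_id
        for number, (message_id, _) in zip(sequence_numbers, by_date)
    }


# Hypothetical sample data. Number 7 is absent, as a discarded email would be.
numbers = [6, 8]
emails = [
    ("<b@example.org>", "Mon, 16 Oct 2023 02:58:00 +0000"),
    ("<a@example.org>", "Sun, 15 Oct 2023 20:58:00 -0400"),
]
mapping = line_up(numbers, emails)
```

Normalising to UTC matters here: the second sample email carries the earlier calendar date in its own zone, and comparing the raw header strings would not order the two messages reliably. Emails with identical timestamps keep their mbox order, which may or may not match the archive.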

💀 anisotropus · Oct 16 at 05:10:

There is just enough craziness in the numbers that I can't just assume they increment by 1 and then go back and manually fix a few weird ones. With over 7700 messages, that just isn't reasonable.

I'll help you with the manual work if you can't fully automate it. When you're ready, make a zip of the workspace and provide a link here. What you're doing is too important to leave half-done.