I’m sitting on the sofa and outside it sounds like a war zone. I have luckily never been in a war zone, but it’s a constant stream of flashes and explosions, and on the school yard around the corner they’re aiming those little shooting star sticks at each other.
Anyway, I heard that the Gemini mailing list archive was down. I got the archives from the Wayback Machine, and nytpu offered the same files plus the remaining emails from his own archives. Very cool!
gemini-mailing-list-archive.tar.gz
I installed Crabmail and processed the mails I had. The files from the Wayback Machine used email addresses like “alex at gnu.org” instead of “alex@gnu.org” and so I had to change those files. The Perl one-liner I used fixes the “From” and “From:” lines.
zcat 2019.txt.gz 2020.txt.gz 2021.txt.gz \
 | perl -pne 's/\sat\s/@/g if /^From/' > mbox
zcat 2021.new.mbox.gz >> mbox
Now I’m wondering, should I set up the mail account, the mail fetching, and the archive regeneration? It would mean some work, and in a way I think the world is better off without the mailing list! 😈 (evil laugh!) but … not really. No mailing list? Where would we announce stuff? It does seem like a sorry state of affairs after a while. I mean, it’s fine not to have a mailing list right now.
So perhaps I’ll take a look. Here’s what we have right now:
Archive of the Gemini Mailing List
In the end it wasn’t too hard to set up a new mail account, set up mpop to fetch mail from there and append it to the archive, and regenerate the site.
I’d say all I have to do is clean up the directories used and all that.
And it’s done! 😀
Update: timmc also published an archive, at least temporarily. Sadly, I can’t get these two archives to agree. Here’s what I did:
zcat timmc/2021.txt.gz | grep -i "^Message-ID:" \
 | cut -d ' ' -f 2 | sort > timmc-from.txt
zcat nyptu/2021.txt.gz nyptu/2021.new.mbox.gz | grep -i "^Message-ID:" \
 | cut -d ' ' -f 2 | sort > nyptu-from.txt
diff nyptu-from.txt timmc-from.txt
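If the diff output gets hard to read, `comm` can list the Message-IDs that appear in only one of the two archives. This is a sketch on made-up data, not what was actually run: the `message_ids` helper, the temporary file names, and the example IDs are all assumptions. It also assumes each Message-ID sits on the same line as its header name, which is not guaranteed when headers are folded.

```shell
# Hypothetical helper: read mbox text on stdin, print sorted unique Message-IDs.
message_ids() {
  grep -i '^Message-ID:' | cut -d ' ' -f 2 | sort -u
}

# Made-up sample data standing in for the two archives.
dir=$(mktemp -d)
printf 'Message-ID: <a@example.org>\nMessage-ID: <b@example.org>\n' \
  | message_ids > "$dir/one.txt"
printf 'Message-ID: <b@example.org>\nMessage-ID: <c@example.org>\n' \
  | message_ids > "$dir/two.txt"

# comm needs sorted input: -23 keeps lines found only in the first file,
# -13 keeps lines found only in the second file.
only_in_one=$(comm -23 "$dir/one.txt" "$dir/two.txt")
only_in_two=$(comm -13 "$dir/one.txt" "$dir/two.txt")
echo "only in one: $only_in_one"
echo "only in two: $only_in_two"
rm -r "$dir"
```

With the real files, the same two `comm` calls on timmc-from.txt and nyptu-from.txt would show which messages each archive is missing.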
timmc said on IRC:
I happen to have a copy of the mailman gzip exports with a most recent message of Fri, 24 Dec 2021 16:21:07 +0100
This makes me think that I should use timmc’s archive up until December 24, 2021, and then do some manual shuffling for the rest.
So here’s what I did:
It looks OK, and is about 10MiB smaller!
#Gemini