gtalug / infrastructure

GTALUG infrastructure (the server).
0 stars 0 forks source link

Transfer message from the old mailing list to the new one. #22

Closed myles closed 8 years ago

myles commented 10 years ago

Mailman has a mbox import script so we could import the old mailing list messages into the new archive.

cbbrowne commented 10 years ago

Note that the following repo https://github.com/cbbrowne/TLUG-Gmane-backup can pull the old messages into an mbox file. (The result can't be pushed back to GitHub, as they now have a limit on file size, and the mbox generated is way too large.)

I suspect we might want to import the messages into a non-subscribable "Legacy Mailing List" list so there's no confusion as to whether the messages came from "old daze Majordomo" versus "tied to the new mailing list."

The mbox file seems to load readily (assuming we have a list called "tlug-archives") via the command:

/usr/lib/mailman/bin/arch tlug-archives /tmp/tlug_archive.txt

What do you think? New list, or attach all the old traffic to the "talk" list retroactively?

mgjk commented 10 years ago

I think attaching the old traffic to 'talk' could be confusing. Old messages coming up in search will have reply-to and subscribe urls for a dead list. The nice part of a separate list archive is that we could make it clear in the pages that this is an archived discussion for a list which migrated. This issue doesn't sound to me like it's urgent for migration though, unless I'm missing something?

avolkov commented 10 years ago

I ran my script again, and it seems that indeed the list was started back in 2003, and that's as far as the archive goes.

I tweaked the source a bit, but there's nothing major the script pulls the messages in mbox format, in fact you can pull individual messages as mbox files giving the range as url params i.e. http://download.gmane.org/gmane.org.user-groups.linux.tolug/62000/64000

I ran the script today -- this is complete GTALUG archive at this point, here's the full archive in xz on the off chance it might be useful. --- http://data.flamy.ca/tlug_data.txt.xz

cbbrowne commented 10 years ago

I reckon that we should, after the old list is shut down, do a single big run to...

0) Create a mailing list named something like "tlug-legacy-list". Make sure it is not subscribable, but that the archives are visible. 1) Pull all the old messages using Alex's script, from start to finish 2) Push all those messages into "tlug-legacy-list" via the mailman import script 3) I'll run the mailing list backup script against tlug-legacy-list, once. Never need to touch it again.

avolkov commented 9 years ago

I pulled all the old messages on November 5, The file can be found here-- http://data.flamy.ca/tlug_archive.mbox.xz 27.6MB