WikiTeam / wikiteam

Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to the tiniest wikis. As of 2023, WikiTeam has preserved more than 350,000 wikis.
https://github.com/WikiTeam
GNU General Public License v3.0
705 stars 147 forks

Confluence import/export #108

Open emijrp opened 10 years ago

emijrp commented 10 years ago

From nemow...@gmail.com on April 04, 2014 21:42:34

This is for Confluence import, but it may also be interesting for the reverse direction and for understanding what's in their dumps: https://www.ohloh.net/p/wikiproc

Original issue: http://code.google.com/p/wikiteam/issues/detail?id=108

emijrp commented 10 years ago

From nemow...@gmail.com on April 21, 2014 02:51:21

https://migrations.atlassian.net/wiki/display/UWC/Universal+Wiki+Converter
https://marketplace.atlassian.com/plugins/com.atlassian.uwc

The Universal Wiki Converter (UWC) appears to be under the Apache license, so it's worth looking into. It's focused on server-side conversions, but if we figure out the Confluence format it could still help us bring some order to the chaos of wiki data structures.

nemobis commented 9 years ago

Paul Boddie, 26/01/2015 18:09:

On Monday 26. January 2015 17.56.26 Federico Leva (Nemo) wrote:

Barry Warsaw, 26/01/2015 16:10:

Our very huge thanks go to Paul Boddie who did the amazing conversion work, almost single-handedly reverse engineering the Confluence dump through several iterations.

Hurray! Well done. Can you publish the dump somewhere on your site, or on archive.org? WikiTeam is very interested in Confluence dumps, any writing/braindump/code on the matter is very welcome. https://github.com/WikiTeam/wikiteam/issues/108

I'll leave this for the others to decide, although it's not that difficult to obtain dumps directly from Confluence yourself. So, even though one might have bad things to say about proprietary services, credit is due to Atlassian for allowing their users to download most of their data (the profile information isn't available through the same mechanism) for deployment in other ways.

The conversion software itself is described here:

https://moinmo.in/ConfluenceConverter

My advice to anyone considering migrating from Confluence is to do so in a short timeframe (and as soon as possible if you want the above code to work!) in order to avoid changes in the product disrupting the activity and necessitating more work. For example, while tackling this task, Confluence 4 came out with a completely different page text representation which then had to be parsed and converted.

If you need to know more about this, feel free to ask me directly.