dernorberto / confluenceDumpWithPython

Download Confluence pages including attachments and emoticons using Atlassian API and Python
MIT License

Confluence Dump With Python

Dump Confluence pages using Python (requests) in HTML and RST format, including embedded pictures and attachments. References to downloaded files will be updated to their local relative path.
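
The reference update described above can be illustrated with a minimal sketch. It is not the project's actual code: the function names `local_name` and `rewrite_links`, the `attachments` subfolder, and the regex-based approach are all assumptions made for illustration.

```python
import posixpath
import re
from urllib.parse import unquote, urlparse


def local_name(url):
    """Return the file name part of a URL, ignoring any query string."""
    return posixpath.basename(unquote(urlparse(url).path))


def rewrite_links(html, downloaded, subdir="attachments"):
    """Point src/href attributes at local copies of downloaded files.

    `downloaded` is the set of remote URLs that were saved locally.
    `subdir` is a hypothetical folder name, relative to the HTML file.
    """
    def swap(match):
        attr, quote, url = match.group(1), match.group(2), match.group(3)
        if url in downloaded:
            # Replace the remote URL with a relative local path.
            url = posixpath.join(subdir, local_name(url))
        return f"{attr}={quote}{url}{quote}"

    # Matches src="..." or href='...' with either quote style.
    return re.sub(r'(src|href)=(["\'])(.*?)\2', swap, html)
```

URLs that were not downloaded are left untouched, so external links keep working in the dumped page.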

Description

The refactored version will require only two files and accept command-line arguments:

For CSS styling, it uses the confluence.css file from Confluence, which can be obtained with the workaround described in https://jira.atlassian.com/browse/CONFSERVER-40907. The site.css file included with HTML exports from the Confluence UI is less complete than confluence.css.

Folder and file structure:

What it does

Requirements

Dependencies

Installing

Executing program

confluenceDumpWithPython.py -m single -S <site Name> -p <ID of page to dump> [<output folder>] [--sphinx]
confluenceDumpWithPython.py -m pageprops -S <site Name> -p <ID of page properties report page> [<output folder>] [--sphinx]
confluenceDumpWithPython.py -m space -S <site Name> -s <space KEY> [<output folder>]
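
For readers who want to see how the three invocations above fit together, here is a minimal `argparse` sketch of a command line with the same shape. It is an illustration only, not the script's actual parser: the destination names (`mode`, `site`, `page`, `space`, `outdir`), the default output folder `output`, and the validation rules are assumptions.

```python
import argparse


def build_parser():
    """Build a parser mirroring the usage lines (hypothetical names)."""
    parser = argparse.ArgumentParser(prog="confluenceDumpWithPython.py")
    parser.add_argument("-m", dest="mode", required=True,
                        choices=["single", "pageprops", "space"],
                        help="what to dump")
    parser.add_argument("-S", dest="site", required=True, help="site name")
    parser.add_argument("-p", dest="page", help="page ID")
    parser.add_argument("-s", dest="space", help="space KEY")
    # Optional positional; the default folder name is an assumption.
    parser.add_argument("outdir", nargs="?", default="output",
                        help="output folder")
    parser.add_argument("--sphinx", action="store_true")
    return parser


def parse_args(argv=None):
    """Parse arguments and check that each mode has the ID it needs."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if args.mode in ("single", "pageprops") and not args.page:
        parser.error("-p is required for modes 'single' and 'pageprops'")
    if args.mode == "space" and not args.space:
        parser.error("-s is required for mode 'space'")
    return args
```

Note that `-S` (site) and `-s` (space key) differ only in case, so they are easy to mix up when typing a command.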

Help

No special advice other than:

Authors

Contributor names and contact info

@dernorberto

Improvements

Issues

Version History

legacy/ folder with previous version of scripts

Purpose of the files:

  1. confluenceExportHTMLrequestsByLabel.py: download a set of pages based on one (or more) page Labels.
  2. confluenceExportHTMLrequestsSingle.py: download a single page by supplying the page ID as an argument.
  3. confluenceExportHTMLrequestsPagePropertiesReport.py: download page properties and all the pages in the report by supplying the page ID as an argument.
  4. confluenceExportHTMLrequestsPagesInSpace.py: download all pages from a space.
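
As a rough illustration of what three of these scripts need to request, the sketch below builds Confluence REST API URLs for a single page, a label search, and the pages of a space. It makes no network calls. The helper names are hypothetical, the base URL assumes a Confluence Cloud site at `<site>.atlassian.net`, and the legacy scripts may well use different endpoints or parameters. The page properties report case is left out.

```python
from urllib.parse import urlencode


def api_base(site):
    """Base REST URL; assumes a Confluence Cloud site name."""
    return f"https://{site}.atlassian.net/wiki/rest/api"


def page_url(site, page_id):
    """URL for one page, asking for its rendered HTML body."""
    query = urlencode({"expand": "body.view"})
    return f"{api_base(site)}/content/{page_id}?{query}"


def label_search_url(site, label):
    """URL for a CQL search that returns pages carrying a label."""
    query = urlencode({"cql": f'type=page and label="{label}"'})
    return f"{api_base(site)}/content/search?{query}"


def space_pages_url(site, space_key, start=0, limit=25):
    """URL for one batch of pages in a space (results are paginated)."""
    query = urlencode({"spaceKey": space_key, "type": "page",
                       "start": start, "limit": limit})
    return f"{api_base(site)}/content?{query}"
```

A caller would pass each URL to `requests.get` with its own credentials and, for the space listing, keep increasing `start` until a batch comes back empty.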

License

This project is licensed under the MIT License. See the LICENSE.txt file for details.

Acknowledgments