This utility assists in converting Confluence content to a Jupyter Book project. Specifically, once you have a Confluence HTML export that has been converted to Markdown, run this utility against the converted source to move documents into a directory structure that mirrors the confluence page hierarchy as well as generating the Jupyter Book-isms necessary to correctly publish back to confluence.
To run:
./con2jupyterbook -i <source-dr> -o <target-dir>
When exporting from confluence, an index.md
file is generated that holds the exported page hierarchy in a
nested list. This utility relies on that format, so the index.md
file should not be edited prior to running
this utility.
The HTML export from Confluence dumps the whole tree into a single directory. The con2jb will create a directory tree to mirror the page hierarchy from confluence and do some requisite fixup automatically. Additionally, it will generate the jupyter book table of contents to ensure that the correct page hierarchy is restored upon publication.
There are several other miscellaneous fixes that are performed including: referencing attachment and image links from the root rather than the current directory and removal of some sections that are added to a given page that are not present in the Confluence page source, e.g. Comments, Attachment Lists, Change History.
- Prerequisites: Node.js installed, Python installed
- Use Confluence to export all or a part of your wiki via HTML. (Export Content...)
- Create a working directory and unzip the Confluence export into that directory (sub-dir html)
- Clone Confluence to Markdown
- From within the cloned directory run
npm run start <working-dir>/html <working-dir>/markdown
pip install migcon
- Run con2jb to convert the exported markdown into a MyST, sphinx friendly markdown/dir structure
con2jb -i <working-dir>/markdown -o <working-dir>/sphinx
- Copy the resultant jupyter book friendly markdown into the project of your choice
- Use jupyter book to build your desired output. Note: this will result in a few warnings that need to be addressed.