An OWL ontology documentation tool using Python and templating, based on LODE.
In addition to making human-readable forms of ontologies/taxonomies, pyLODE encourages ontology annotation best practice by only producing good results for well documented inputs! pyLODe defines what it considers w'well documented' in sections below, such as Profiles & What pyLODE understands.
- Quick Intro
- Examples
- Installation
- Use
- What pyLODE understands
- Profiles
- Differences from LODE
- Releases
- License
- Citation
- Collaboration
- Contacts
The Live OWL Documentation Environment tool (LODE) is a well-known (in Semantic Web circles) Java & XSLT-based tool used to generate human-readable HTML documents for OWL and RDF ontologies. That tool is now a bit dated (old-style HTML, use of older technologies like XSLT) and it's (online version) is not always online.
This tool is a complete re-implementation of LODE's functionality using Python and Python's RDF manipulation module, rdflib. An ontology to be documented is parsed and inspected using rdflib and HTML or Markdown is generated using basic Python scripting and Python's Jinja2 templating.
The tool can be run as in these ways:
- Python command line
- cli.py in the main folder
- BASH command line
- bin/ directory, uses cli.py
- as-a-service locally
- via the popular Falcon framework.
- see server.py in the main folder
- as-a-service online
- hosted at https://pylode.surroundaustralia.com
pyLODE has been tested with all of the 30+ ontologies in pylode/examples/ and we are trying to ensure it captures all of their annotations. For each example, there is the original RDF file and the corresponding output, in HTML & Markdown. There are some examples of ADOC too.
For example, Epimorphic's's Registry Ontology is:
- reg.ttl - source file
- reg.html - HTML output
- this is a couple of releases ahead of the online version
- reg.md - Markdown output
Another, the Australian Government's Records Interoperability Framework (AGRIF) Ontology:
- agrif.ttl - source file
- agrif.html - HTML output
- agrif.md - Markdown output
- see this rendered online by GitHub
- agrif.md - ASCIIDOC output
- agrif.skos.html - HTMl output, "vocpub" profile
- see this rendered online by GitHack
You can build all of the example outputs locally by running pylode/examples/_make_examples.py which also serves as a good demonstration of calling pyLODE from a Python file.
- Australia's Department of Finance's's AGRIF ontology - http://linked.data.gov.au/def/agrif
- see the Markdown version
- National Archives of Australia's Commonwealth Records Series ontology - http://linked.data.gov.au/def/crs
- see the Markdown version
- CSIRO's's LocI ontology - http://linked.data.gov.au/def/loci
- Geological Survey of Queensland's Boreholes Profile - http://linked.data.gov.au/def/borehole
- Geoscience Australia's Place Names Profile - http://linked.data.gov.au/def/placenames
- Epimorphic's Registry Ontology - https://epimorphics.com/public/vocabulary/Registry.html
- Semantic Web for Earth and Environmental Terminology (SWEET)
- a series of ontologies that are live rendered into HTML via pyLODE being called by a Falcon server
See pairs of RDF & HTML files in the pylode/examples/ directory for other, preprocessed examples.
This tool can be used either as a command line utility (Linux, Mac or Windows, see below) or as a Python module in other Python code. It can also be used via a hosted, online, service or even as a local web server that you can run. This repo contains executable files for Mac & Windows (soon Linux!) that you can use without any installation too.
The most important dependency to get correct when using this as a Python script of a command line program is the package rdflib
which must be v5.0.0 or greater (see requirements.txt).
Do this to use pyLODE as a Python command line program.
This tool is available on PyPI, the Python Package Index, at https://pypi.org/project/pyLODE/ and can be installed for use as a Python module via pip:
pip install pylode
To use pyLODE within Python, try something like this:
import pylode html = pylode.MakeDocco( input_data_file=input_file_path, outputformat="html", profile="ontdoc" ).document()
You will now have the HTML content within the variable html
.
For desktop command line use, just clone this repository and either use cli.py
as per the command line instructions below or use makedocco.py as a Python script directly.
pyLODE presents natively as a Python command-line utility, pylode/cli.py and there are also a BASH, Windows & Mac OS options available for command line use:
- pylode/bin/pylode.sh - BASH script
- Linux executable coming soon!
- pylode/bin/pylode.app - MAC OS command line executable script
- pylode/bin/pylode.exe - Windows command line executable
All use the same command line arguments.
Additionally, there is a Falcon framework local HTTP server option.
Also, a web UI for pyLODE is available at <https://pylode.surroundaustralia.com>.
These are the command line arguments to run pyLODE as a BASH or Python script on Linux, Mac etc. or via the Windows executable:
-i
or--inputfile
, required if-u
not used- The RDF ontology file you wish to generate HTML for Must be in either Turtle, RDF/XML, JSON-LD or N-Triples formats indicated by the file type extensions .rdf, .owl, .ttl, .n3, .nt, .json respectively
-u
or--url
, required if-i
not used- The RDF ontology you wish to generate HTML for, online. Must be an absolute URL that can be resolved to RDF, preferably via Content Negotiation.
-c
or--css
, optional, default 'true'- Whether (true) or not (false) to include CSS in an HTML output.
-o
or--outputfile
, optional- A name you wish to assign to the output file. Will be postfixed with .html, .md or .adoc. If not specified, the name of the input file or last segment of RDF URI will be used, + .html/.md/.adoc.
-f
or--outputformat
, optional, default 'html'- The output format of the documentation. 'html', 'md' or 'adoc' accepted.
-p
or--profile
, optional, default 'ontdoc'- The profile (specification) for ontology documentation used. "ontdoc" (for OWL Ontologies), "vocpub" (for Simple Knowledge Organization System (SKOS)) vocabularies or SKOS versions of OWL ontologies, "prof" for Profiles Vocabularies profiles. See
-lp
for all profiles supported.
- The profile (specification) for ontology documentation used. "ontdoc" (for OWL Ontologies), "vocpub" (for Simple Knowledge Organization System (SKOS)) vocabularies or SKOS versions of OWL ontologies, "prof" for Profiles Vocabularies profiles. See
-lp
or--listprofiles
, optional, no arguments- Lists all the profiles (specifications) for ontology documentation supported by pyLODE
This basic call to the BASH script in pylode/bin/ will
print to standard out an HTML document for an ontology called placenames.html
.
./pylode -i ../example/prof.ttl
An output file could be specified by using -o, rather than printing to standard out.
An online version of pyLODE is now available at https://pylode.surroundaustralia.com/.
Install locally by first building the container
docker build -t pylode:latest --build-arg PYTHON_VERSION=3.8-slim .
Then run the container
docker run -it -d -p 8000:8000 -e GTAGID=${Google TagID} pylode
N.B. The Google TagID is NOT required unless Google Analytics is required. It looks as follows GTAGID=UA-168806395-1.
You can now access the service on localhost
curl localhost:8000/lode?url=http://sweetontology.net/sweetAll.ttl
You can run pyLODE using your own, local, HTTP server like this:
gunicorn --chdir /path/to/pyLODE/pylode server:api
The server is then available at localhost:8000 and localhost:8000/pylode for the active endpoint. Note that the server must be fed a URL to an ontology to document supplied by a server capable of responding to Content Negotiation, i.e. it must supply RDF according to an HTTP Accept request for text/turtle, application/rdf+xml etc.
curl localhost:8000/pylode?url=http://sweetontology.net/sweetAll.ttl
In pylode/bin/
, a Windows executable, pylode.exe
is available for command line use.
Be sure to add pylode.exe
to your Windows user's PATH variable so you can run pylode.exe
from any folder. Just run the Registry Editor (search for "Regedit" in the Start menu) and then navicate to HKEY_CURRENT_USER > Environment > Path. Adding the full path of the folder pylode/bin/
to Path will ensure you can run pylode.exe
within the Windows command prompt, regardless of what folder you are in.
You can rebuild the pylode.exe
file from the source code, if you like. Use the Python program Pyinstaller as per
its instructions. The created pylode.exe
will have the same characteristics as the Linux/Mac
CLI program.
Pyinstaller uses a .spec
file to make the binary and that is included in this repository: pylode-cli.spec
.
See the PyInstaller installation guide for info on how to install PyInstaller for Windows.
Once you have PyInstaller, use pyinstaller
to generate the pyLODE.exe
CLI file like so:
cd pylode pyinstaller pylode-cli.spec
This will output pylode.exe
in the dist
directory in pylode
. The .exe file in bin/
is just the latest copy of this.
You can now run the pyLODE Command Line utility via pylode.exe
.
See above for the pyLODE command line util usage instructions.
In pylode/bin/
, there is a Mac executable, pylode.app
.
As per instructions for PyInstaller use on Windows, you can rebuild the file pylode.app
using pylode.spec
, if you wish.
In pylode/bin/
, there is a shell script pylode.sh
. You can run this on the command line. It just pushes queries to the Python command line cli.py
.
pyLODE understands Ontologies, Taxonomies & Profiles and handles them based on the Ontology Document, Vocabulary Publication and PROF profiles that it contains. These three profiles share understanding of basic annotation properties.
pyLODE understands the following ontology constructs:
- ontology/taxonomy/profile metadata
- imports -
owl:imports
- title -
rdfs:label
,skos:prefLabel
,dct:title
ordc:title
- description -
rdfs:comment
,skos:definition
,dct:description
ordc:description
- inline HTML & Markdown are supported
- description -
- historyNote -
skos:historyNote
- inline HTML & Markdown are supported
- historyNote -
- version URI -
owl:versionIRI
as a URI - version info -
owl:versionInfo
as a string - preferred namespace prefix -
vann:preferredNamespacePrefix
as a token - preferred namespace URI -
vann:preferredNamespaceUri
as a URI
- preferred namespace prefix -
- version info -
- agents: publishers, creators, contributors
- see Agent Formatting below for details
- see the pylode/examples/ directory for examples!
- dates: created, modified, issued -
dct:created
etc., all asxsd:date
orxsd:dateTime
datatype properties - rights: license -
dct:license
as a URI & rights -dct:rights
as a string - code respository -
schema:codeRepository
as a literal of typexsd:anyURI
- source -
dcterms:source
as a literal of typexsd:anyURI
or text
- imports -
- ontology classes
- per
rdfs:Class
orowl:Class
- title -
rdfs:label
orskos:prefLabel
ordct:title
- description -
rdf:comment
,skos:definition
,dct:description
as a string or using inline HTML or Markdown - scope note - a
skos:scopeNote
as a literal - inline HTML & Markdown are supported
- scope note - a
- example - a
skos:example
- see Example Handling below
- example - a
- super classes - by declaring a class to be
owl:subClassOf
something - sub classes - pyLODE will work these out itself
- restrictions - by declaring a class to be
owl:subClassOf
of anowl:Restriction
with any of the normal cardinality or property existence etc. restrictions - in domain/range of - pyLODE will auto-calculate these
- per
- ontology properties
- per
owl:ObjectProperty
,owl:DatatypeProperty
orowl:AnnotationProperty
- title -
rdfs:label
orskos:prefLabel
ordct:title
string literal - description -
rdf:comment
,skos:definition
,dct:description
string literal - inline HTML & Markdown are supported
- description -
- scope note - a
skos:scopeNote
string literal - inline HTML & Markdown are supported
- scope note - a
- example - a
skos:example
- see Example Handling below
- example - a
- super properties - by declaring a class to be
owl:subPropertyOf
something - sub properties - pyLODE will work these out itself
- equivalent properties - by declaring a class to be
owl:equivalentProperty
something - inverse of - by declaring a class to be
owl:inverseOf
something - domains -
rdfs:domain
orschema:domainIncludes
- ranges -
rdfs:range
orschema:rangeIncludes
- per
- namespaces
- pyLODE will honour any namespace prefixes you set and look up others in http://prefix.cc
- it will either read your ontology's default/base URI in annotations or guess it using a number of methods
- named individuals
- as per class but also
owl:sameAs
- as per class but also
pyLODE can handle many forms of examples for Classes & Properties and can handle multiple examples per class/property. In all cases, the example value is indicated with a skos:example
property like this:
<x> a owl:Class ; skos:example {Literal, Blank Node or URI} .
Simple Literals
The most basic form is an example that is a literal with no format type indicated. This will be printed out in monospaced text, e.g. the Class Fish
in the Examples Ontology has a plain Turtle example like this:
<x> a eggs:Fish ; skos:prefLabel "Fish X"@en ; eggs:livesInFreshWater true ; ... .
If you indicate one of the RDF built-in formats (rdf:HTML
, rdf:XMLLiteral
or rdf:JSON
), it will be interpreted in the markup form specified, which means, in practice, that HTML will be rendered where as XML or JSON will be monospaced. The Examples Ontology has this HTML example for the property has scale colour
:
scale colour:
- blue
- orange
- white
You can use Markdown in example literals too, but to do so, you must set the format to text/markdown
so see the Resource Descriptor method below.
URIs
If you put a URI in the example field like this: <x> skos:example <...> ;
or like this <x> skos:example "..."^^xsd:anyURI ;
then pyLODE will render it as a clickable hyperlink in HTML, Markdown or ASCIIDOC, as per your chose output format.
Images
You can use images in the example field. To do so, either use a URI to an image on the web or a relative URI to a local image file. pyLODE will render either form as an inline image. See the Fish Food class example that looks like this:
"Resource Descriptor" Examples
To do more you can use a Profiles Vocabulary (PROF) ResourceDescriptor
to define multiple properties for an example resource. This involves defining a ResourceDescriptor
either as a Blank Node or a URI node like this, the Examples Ont 'eats' property:
:eats skos:example :eats-example . :eats-example a prof:ResourceDescriptor ; dcterms:format "text/turtle" ; dcterms:conformsTo <https://example.com> ; prof:hasArtifact """<x> a :Creature ; :eats <y> ; . <y> a :Food .""" ; .
Here the ResourceDescriptor
says that this example is in the text/turtle
format, has an inline artifact (the actual example text) and conforms to something, in this case the profile defined by <https://example.com>
.
You can use this ResourceDescriptor
method to create multiple examples for a class or property that conform to different things (perhaps profiles of your ontology).
Agents, individual persons or organisations, should be associated with ontologies/taxonomies/profiles to indicate authors, creators, publishers etc. There are 2 ways to do this that pyLODE understands: datatype & object type.
A simple literal value for an agent that a human can read but not a machine can't understand:
<ONTOLOGY_URI> dc:creator "AGENT NAME" .
- the range value is a string literal, either string typed (
^^xsd:string
) or language typed (@en
or@de
) - the following Dublin Core Elements 1.1 properties may be used:
dc:contributor
dc:creator
dc:publisher
- the following schema.org properties may be used:
schema:author
schema:contributor
schema:creator
schema:editor
schema:funder
schema:publisher
schema:translator
- the range value is a string literal, either string typed (
<ontology_x> dc:creator "Nicholas J. Car" ;
An RDF object is used for the agent and can contain multiple details. A Blank Node or a URI can be used. Best case, a persistent agent URI!
<ONTOLOGY_URI> dct:creator [...] .
or
<ONTOLOGY_URI> dct:creator <SOME_URI> .
- the range value is a Blank Node or a URI of type:
schema:Person
schema:Organization
foaf:Person
foaf:Organization
- the properties of the Blank Node or the URI are as below
- the following Dublin Core Terms properties may be used:
dct:contributor
dct:creator
dct:publisher
dct:rightsHolder
- the following schema.org properties may be used:
schema:author
schema:contributor
schema:creator
schema:editor
schema:funder
schema:publisher
schema:translator
- the following FOAF properties may be used:
foaf:maker
e.g. (Blank Node):
<ontology_x> schema:editor [ a schema:Organization ; ... ] ;
or (URI):
<ontology_x> schema:editor <https://orcid.org/0000-0002-8742-7730> ; ... <https://orcid.org/0000-0002-8742-7730> a foaf:Person ; ...
foaf:name
/schema:name
foaf:mbox
/schema:email
foaf:homepage
/schema:url
schema:identifier
e.g.:
<ontology_x> dct:creator [ schema:name "Nicholas J. Car" ; schema:identifier <http://orcid.org/0000-0002-8742-7730> ; schema:email <mailto:[email protected]> ; ] ;
Use schema:member
, schema:affiliation
(there is no FOAF Person -> Group/Org property):
e.g.:
<ontology_x> dct:creator [ schema:name "Nicholas J. Car" ; schema:identifier <http://orcid.org/0000-0002-8742-7730> ; schema:email <mailto:[email protected]> ; schema:affiliation [ schema:name "SURROUND Australia Pty Ltd" ; schema:url <https://surroundaustralia.com> ; ] ; ] ;
The ontology's HTML representation linking back to the RDF: generated automatically
Indicating to readers where the 'live' version of the ontology/taxonomy is managed:
This should not be done for profiles, instread, create a prof:ResourceDescriptor
instance with role:repository
to indicate a profile's repository.
Code repositories that house an ontology can be indicated either using schema.org's codeRepository property or a combination of the Description of a Project and PROV:
@prefix schema: <https://schema.org/> . <ONTOLOGY_URI> schema:codeRepository <REPO_URI> ; ...
or
@prefix doap: <http://usefulinc.com/ns/doap#> . @prefix prov: <http://www.w3.org/ns/prov#> . <ONTOLOGY_URI> prov:wasGeneratedBy [ a doap:Project , prov:Activity ; doap:repository <REPO_URI> ] ...
e.g., for the ontology version on ISO 19160-1:
<http://linked.data.gov.au/def/iso19160-1-address> prov:wasGeneratedBy [ a doap:Project , prov:Activity ; doap:repository <https://github.com/AGLDWG/iso19160-1-address-ont> ] ; ...
This tool generates HTML that is shamelessly similar to LODE's styling. That's because we want things to look familiar and LODE's outputs look great. The Markdown's pretty vanilla.
Also, pyLODE generates and uses only static HTML + CSS, no JavaScript, live loading Google Fonts etc. This is to ensure that all you need for nice display is within a couple of static, easy to use and maintain, files. Prevents documentation breaking over time.
Feel free to extend your styling with your own CSS.
pyLODE can document ontologies, taxonomies and profiles according to different profiles which are specifications. The basic, default, profile is pyLODE's so-called Ontology Documentation profile, which is a profile of OWL and a few other bits and pieces. See What pyLODE understands section.
pyLODE can tell you what profiles it supports: just run ~$ pylode -lp
("list profiles") or, if calling from Python:
m = MakeDocco(input_data_file="examples/data-access-rights.ttl", profile="vocpub") print(m.list_profiles())
Currently pyLODE supports its OWL profile, as described above, and a profile of SKOS. For full details of what the profiles include, see the profiles' definitions at:
Token | URI |
---|---|
prof | https://www.w3.org/TR/dx-prof/ |
ontdoc | https://w3id.org/profile/ontdoc |
vocpub | https://w3id.org/profile/vocpub |
In the folder pylode/profiles/
, you will see an __init__.py
file containing the BaseProfile
class which all
profiles must inherit from. The existing OntDoc
, Prof
& VocPub
profile classes are in files ontdoc.py
,
prof.py
& vocpub.py
respectively. They do all the things profiles need to do and are listed in
pylode/profiles/__init__.py
for pyLODE to know about with both a profile declaration and an entry in the PROFILES
list. The profile declaration for PROF is:
PROF_PROFILE = Profile( "https://www.w3.org/TR/dx-prof/", "The Profiles Vocabulary", "The Profiles Vocabulary is an RDF vocabulary created to allow the machine-readable description of profiles of " "specifications for information resources.", [HTML_MEDIA_TYPE, "text/markdown"], HTML_MEDIA_TYPE, languages=["en"], default_language="en" )
See the Profile
class in pylode/profiles/__init__.py
for mor details.
The PROFILES
object currently contains:
PROFILES = { "prof": PROF_PROFILE, "ontdoc": ONT_DOC_PROFILE, "vocpub": VOC_PUB_PROFILE, }
Profiles also contain templates in pylode/templates/FOLDER
and need to be imported into pylode/__init.py
and
added to that file's document()
finction to be made accessible.
So, to create your own profile:
- create a class to inherit from
BaseProfile
- do the work of profileing in your class, following the prof, ontdoc & vocpub examples
- list your profile with a profile declaration and an entry in PROFILES in
pylode/profiles/__init__.py
- place your templates in
pylode/templates/FOLDER
(FOLDER being your profile's folder name) - make your profile work with pyLODE by importing it into
pylode/__init.py
and adding a call to its constructor indocument()
We hope to simplify this with profile auto-discovery soon!
You can, of course, document an OWL ontology using the owldoc profile or a SKOS taxonomy using the vocpub profile however, you can also document an OWL ontology using the vocpub profile! This is because SKOS is conceptually a subset of OWL - whatever you can express in SKOS you can express in OWL.
pyLODE performs an OWL > SKOS transformation on OWL ontologies to produce a taxonomy document. The following conversions are made:
owl:Ontology
>skos:ConceptScheme
- and all the ontology metadata is used with the ConceptScheme
owl:Class
>skos:Concept
- and other class annotation properties used with Concept
owl:subClassOf
>skos:broader
- and the inverses,
skos:narrower
- and the inverses,
To see the full list of transformations, see the function _expand_graph_for_skos()
in makedocco.py.
Examples of a small taxonomies documented using the vocpub profile are:
An example of a large one:
An example of a vocpub-documented OWL ontology and the corresponding owldoc original is AGRIF:
- command line access
- you can use this on your own desktop so you don't need me to maintain a live service for use
- use of more modern & simpler HTML
- catering for a wider range of ontology options such as:
- schema.org
domainIncludes
&rangeIncludes
for properties
- schema.org
- better Agent linking
foaf:Agent
orschema:Person
objects for creators, contributors & publishers- you can still use simple string peoperties like
dc:contributor "Nicholas J. Car"
too if you really must!
<ontology_x> dct:creator [ sdo:name "Nicholas J. Car" ; sdo:identifier <http://orcid.org/0000-0002-8742-7730> ; ] ;
- smarter CURIES
- pyLODE caches and looks up well-known prefixes to make more/better CURIES
- it tries to be smart with CURIE presentation by CURIE-ising all URIs it finds, rather than printing them
- active development
- this software is in use and will be improved for the foreseeable future so we will cater for more and more things
- recent ontology documentation initiatives such as the MOD Ontology will be handled, if requested
pyLODE is under continual and constant development. The current developers have a roadmap for enhancements in mind, which is given here, however, since this is an open source project, new developers may join the pyLODE dev community and change/add development priorities.
The current release, as of May, 2021, is 2.10.0.
Version | Date | Description |
---|---|---|
3.0 | ? | Will include pre-testing inputs with SHACL |
2.10.0 | 24 May 2021 | Update Windows EXE build process, simplified versioning |
2.9.1 | 28 Apr 2021 | Support for ASCIIDOC format (OntDoc profile only) |
2.8.11 | 28 Apr 2021 | Further changes for PyPI only |
2.8.10 | 27 Apr 2021 | Further changes for PyPI only |
2.8.9 | 27 Apr 2021 | PyPI enhancements only |
2.8.8 | 27 Apr 2021 | Several small bugs fixed, auto-generation of version no. from Git tag |
2.8.6 | 23 Feb 20201 | Fixing char encoding issues, updated examples, new test files style - per issue |
2.8.5 | 5 Jan 20201 | Small enhancements to the Falcon server deployment option |
2.8.3 | 3 July 2020 | Packaging bugfixes only |
2.7 | 1 July 2020 | Much refactoring for new profile creation ease |
2.6 | June 2020 | Supports PROF profiles as well as taxonomies & ontologies |
2.4 | 27 May 2020 | Small improvements over 2.0 |
2.0 | 18 Apr 2020 | Includes multiple profiles - OWP & vocpub |
1.0 | 15 Dec 2019 | Initial working release |
Expected to handle
- pre-documentation graph shape testing using SHACL
- you will be able to see what pyLODE-recommended annotation and design patterns your inputs do/don't handle
- "modp", a documentation profile based on the MOD Ontology
- handles complex Examples (
skos:example
) - can export to ASCIIDOC format
- includes 3rd-party-created profile: NMPF
- handles Named Individuals in OWL ontologies
- implements "owldoc" & "vocpub" documentation profiles for OWL, SKOS and OWL-as-SKOS results
- implements "prof" profile for documentation of Profiles Vocabulary profiles
- has a more modular structure than 1.0 to assist with the creation or more profiles
Initial pyLODE release. Generated HTML documentation for OWL ontologies, missed quite a few expected ontology elements, such as Named Individuals.
This code is licensed using the GPL v3 licence. See the LICENSE file for the deed. Note Citation below though for attribution.
If you use pyLODE, please leave the pyLODE logo with a hyperlink back here in the top left of published HTML pages.
The maintainers welcome any collaboration.
If you have suggestions, please email the contacts below or leave Issues in this repository's Issue tracker.
But the very best thing you could do is create a Pull Request for us to action!