This README gives an overview of how to build and contribute to the documentation of Apache Flink.
The documentation is included with the source of Apache Flink in order to ensure that you always have docs corresponding to your checked out version. The online documentation at https://flink.apache.org/ is also generated from the files found here.
The Flink documentation uses Hugo to generate HTML files. More specifically, it uses the extended version of Hugo with Sass/SCSS support.
To build the documentation, you can install Hugo locally or use a Docker image.
Both methods require you to execute commands in the directory of this module (docs/
). The built site is served at http://localhost:1313/.
$ git submodule update --init --recursive
$ ./setup_docs.sh
$ docker pull jakejarvis/hugo-extended:latest
$ docker run -v $(pwd):/src -p 1313:1313 jakejarvis/hugo-extended:latest server --buildDrafts --buildFuture --bind 0.0.0.0
Make sure you have installed Hugo on your system.
$ ./setup_hugo.sh
Then build the docs from source:
$ ./build_docs.sh
The shell ./build_docs.sh
will integrate external connector docs, referencing setup_docs.sh#integrate_connector_docs
.
This process involves cloning the repo of some external connectors, which can be time-consuming and prone to network issues.
So, if the connector docs have been synced before, and you wish to skip this step, you can do so by adding the following arg:
$ ./build_docs.sh --skip-integrate-connector-docs
The site can be viewed at http://localhost:1313/
With the ongoing efforts to move Flink's connectors from this repository to individual, dedicated repositories, this also requires the documentation to be hosted outside this repo. However, we still want to serve all documentation as a whole on the Flink documentation website.
Adding new externally hosted documentation requires the following steps to be taken:
-
(If necessary) Move the existing documentation to the new repository
-
In the Flink repository, edit the
docs/setup_docs.sh
file and add a reference to your now externally hosted documentation. The reference will look likeintegrate_connector_docs <connector_name> <branch_or_tag>
.
Replace <connector_name> with the name of your connector, e.g., elasticsearch
for flink-connector-elasticsearch
.
Configuration descriptions are auto generated from code. To trigger the generation, you need to run a command in the project root (see Configuration documentation.)
The resulting html files will be written to layouts/shortcodes/generated
.
The documentation pages are written in Markdown. It is possible to use GitHub flavored syntax and intermix plain html.
In addition to Markdown, every page contains a Jekyll front matter, which specifies the title of the page and the layout to use. The title is used as the top-level heading for the page. The default layout is plain
(found in _layouts
).
---
title: "Title of the Page"
---
---
title: "Title of the Page" <-- Title rendered in the side nave
weight: 1 <-- Weight controls the ordering of pages in the side nav.
type: docs <-- required
aliases: <-- Alias to setup redirect from removed page to this one
- /alias/to/removed/page.html
---
All documents are structured with headings. From these headings, you can automatically generate a page table of contents (see below).
# Level-1 Heading <- Used for the title of the page
## Level-2 Heading <- Start with this one for content
### Level-3 heading
#### Level-4 heading
##### Level-5 heading
Please stick to the "logical order" when using the headlines, e.g. start with level-2 headings and use level-3 headings for subsections, etc. Don't use a different ordering, because you don't like how a headline looks.
Table of contents are added automatically to every page, based on heading levels 2 - 4. The ToC can be omitted by adding the following to the front matter of the page:
---
bookToc: false
---
Flink uses shortcodes to add custom functionality to its documentation markdown. The following are available for use:
{{< artifact flink-streaming-scala withScalaVersion >}}
This will be replaced by the maven artifact for flink-streaming-scala that users should copy into their pom.xml file. It will render out to:
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-streaming-scala_2.12</artifactId>
<version><!-- current flink version --></version>
</dependency>
It includes a number of optional flags:
- withScalaVersion: Includes the scala version suffix to the artifact id
- withTestScope: Includes
<scope>test</scope>
to the module. Useful for marking test dependencies. - withTestClassifier: Includes
<classifier>tests</classifier>
. Useful when users should be pulling in Flink tests dependencies. This is mostly for the test harnesses and probably not what you want.
You can also use the shortcodes (with same flags) instead:
artifact_gradle
to show the Gradle syntaxartifact_tabs
to create a tabbed view, showing both Maven and Gradle syntax
{{< connector_artifact flink-connector-elasticsearch 3.0.0 >}}
This will be replaced by the maven artifact for flink-connector-elasticsearch that users should copy into their pom.xml file. It will render out to:
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-connector-elasticsearch</artifactId>
<version>3.0.0</version>
</dependency>
{{< top >}}
This will be replaced by a back to top link. It is recommended to use these links at least at the end of each level-2 section.
{{< hint info >}}
Some interesting information
{{< /hint >}}
The hint will be rendered in a blue box. This hint is useful when providing additional information for the user that does not fit into the flow of the documentation.
{{< hint warning >}}
Something to watch out for.
{{< /hint >}}
The hint will be rendered in a yellow box. This hint is useful when highlighting information users should watch out for to prevent errors.
{{< hint danger >}}
Something to avoid
{{< /hint >}}
The hint will be rendered in a red box. This hint is useful when highlighting information users need to know to avoid data loss or to point out broken functionality.
{{< label "My Label" >}}
The label will be rendered in an inlined blue box. This is useful for labeling functionality such as whether a SQL feature works for only batch or streaming execution.
{{< version >}}
Interpolates the current Flink version
{{< scala_version >}}
Interpolates the default scala version
{{< stable >}}
Some content
{{< /stable >}}
This shortcode will only render its content if the site is marked as stable.
{{< unstable >}}
Some content
{{< /unstable >}}
This shortcode will only render its content if the site is marked as unstable.
{{< query_state_warning >}}
Will render a warning the current SQL feature may have unbounded state requirements.
{{< tabs "sometab" >}}
{{< tab "Java" >}}
```java
System.out.println("Hello World!");
```
{{< /tab >}}
{{< tab "Scala" >}}
```scala
println("Hello World!");
```
{< /tab >}}
{{< /tabs }}
Prints the content in tabs. IMPORTANT: The label in the outermost "tabs" shortcode must be unique for the page.
{{< github_repo >}}
Renders a link to the apache flink repo.
{{< gh_link file="/some/file.java" name="Some file" >}}
Renders a link to a file in the Apache Flink repo with a given name.
{{< javadoc file="some/file" name="Some file" >}}
Renders a link to a file in the Apache Flink Java Documentation.
{< pythondoc file="some/file" name="Some file" >}}
Renders a link to a file in the Apache Flink Python Documentation.
{{< downloads >}}
Renders a link to the apache flink download page.