This check monitors Amazon Managed Streaming for Apache Kafka (Amazon MSK) through the Datadog Agent.
Follow the instructions below to install and configure this check for an Agent running on a host. For containerized environments, see the Autodiscovery Integration Templates for guidance on applying these instructions.
- Create a client machine if one does not already exist.
- Ensure the client machine has been granted the permission policy arn:aws:iam::aws:policy/AmazonMSKReadOnlyAccess, or that equivalent credentials are available.
- Enable open monitoring with Prometheus on the MSK cluster to turn on the JmxExporter and the NodeExporter.
- Install the Datadog Agent.
- Edit the amazon_msk.d/conf.yaml file, in the conf.d/ folder at the root of your Agent's configuration directory, to start collecting your Amazon MSK performance data. See the sample amazon_msk.d/conf.yaml for all available configuration options.
  Note: If you reuse the sample file, change the file name from conf.yaml.example to conf.yaml.
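As a rough illustration of what the edited file might end up containing, the sketch below renders a minimal single-instance configuration as text. The cluster_arn option name and the example ARN are assumptions, not taken from this page; use_openmetrics is the option referred to in the service checks. Confirm both against the sample amazon_msk.d/conf.yaml before relying on them.

```python
def render_msk_conf(cluster_arn: str, use_openmetrics: bool = False) -> str:
    """Return YAML text for a single-instance amazon_msk.d/conf.yaml.

    Assumption: the instance option is named `cluster_arn`; confirm this
    against the sample configuration file shipped with the Agent.
    """
    lines = [
        "init_config:",
        "",
        "instances:",
        f'  - cluster_arn: "{cluster_arn}"',
    ]
    if use_openmetrics:
        # Opts in to the Agent 7+ OpenMetrics implementation.
        lines.append("    use_openmetrics: true")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical ARN, used only for illustration.
    print(render_msk_conf(
        "arn:aws:kafka:us-east-1:123456789012:cluster/demo/abc",
        use_openmetrics=True,
    ))
```

The printed text can be compared with, or pasted into, conf.d/amazon_msk.d/conf.yaml; restart the Agent after changing the file.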
Run the Agent's status subcommand and look for amazon_msk under the Checks section.
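The validation step can be scripted. A minimal sketch, assuming the datadog-agent binary is on PATH and the caller is allowed to run it (on Linux this often requires sudo); it does a plain substring search rather than parsing the Checks section, and the helper names are hypothetical.

```python
import subprocess


def check_is_listed(status_output: str, check_name: str = "amazon_msk") -> bool:
    """Return True if the check name appears anywhere in the status text."""
    return check_name in status_output


def agent_status() -> str:
    """Run the Agent's status subcommand and return its standard output.

    Assumption: `datadog-agent` is on PATH and runnable by this user.
    """
    result = subprocess.run(
        ["datadog-agent", "status"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


if __name__ == "__main__":
    print("amazon_msk listed:", check_is_listed(agent_status()))
```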
See metadata.csv for a list of metrics provided by this check.
aws.msk.can_connect:
Returns CRITICAL if the Agent is unable to discover nodes of the MSK cluster. Otherwise, returns OK.
aws.msk.prometheus.health:
Returns CRITICAL if the check cannot access a metrics endpoint. Otherwise, returns OK.
When using the Agent 7+ implementation by setting use_openmetrics to true:
aws.msk.openmetrics.health:
Returns CRITICAL if the Agent is unable to connect to the OpenMetrics endpoint. Otherwise, returns OK.
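Both health checks come down to whether a broker's metrics endpoints respond. The sketch below probes them by hand when troubleshooting a CRITICAL status. It assumes the default MSK open monitoring ports (11001 for the JMX exporter, 11002 for the node exporter), a /metrics path, and a hypothetical broker hostname. It is not the check's own implementation.

```python
import urllib.error
import urllib.request

# Assumed defaults for MSK open monitoring; adjust if your setup differs.
EXPORTER_PORTS = {"jmx_exporter": 11001, "node_exporter": 11002}


def metrics_urls(broker_host: str, path: str = "/metrics") -> dict:
    """Build one metrics URL per exporter for a single broker host."""
    return {
        name: f"http://{broker_host}:{port}{path}"
        for name, port in EXPORTER_PORTS.items()
    }


def probe(url: str, timeout: float = 5.0) -> str:
    """Return "OK" if the endpoint answers with HTTP 200, else "CRITICAL"."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return "OK" if resp.status == 200 else "CRITICAL"
    except (urllib.error.URLError, OSError):
        # Covers refused connections, DNS failures, HTTP errors, and timeouts.
        return "CRITICAL"


if __name__ == "__main__":
    # Hypothetical broker hostname, used only for illustration.
    host = "b-1.example.kafka.us-east-1.amazonaws.com"
    for name, url in metrics_urls(host).items():
        print(name, url, probe(url))
```

Run it from the client machine, since the broker ports are typically reachable only from inside the cluster's network.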
The Amazon MSK check does not include any events.
Need help? Contact Datadog support.