This dashboard is a monitoring and alerting solution for Qumulo clusters. This solution uses the Qumulo OpenMetrics API with a Prometheus time-series database and Grafana monitoring software and includes a set of dashboards and alerts that you can customize or use as templates.
For detailed information about available metrics, see Qumulo OpenMetrics API Specification on the Qumulo Documentation Portal.
Table of Contents
- Initial Configuration
- Prerequisites
- Step 1: Clone This Repository to Your Docker Host
- Step 2: Create a Service Account and Access Token on your Qumulo Clusters
- Step 3: Configure Prometheus
- Step 4: Start Prometheus and Grafana
- Step 5: Verify Your Prometheus Configuration
- Step 6: Verify Your Grafana Configuration
- Step 7: Configure Grafana Alert Notifications
- Updating Your Qumulo Monitoring Dashboard Configuration
This section explains the initial configuration of the Qumulo Monitoring Dashboard.
Before you begin, ensure that you have the following minimum software versions:
- Git
- Docker Engine 1.13
- Docker Compose 1.11
- Qumulo Core 5.3.0
-
Log in to your Docker host.
-
Use the
git
CLI to clone this repository.For more information, see Cloning a repository in the GitHub documentation.
-
Navigate to the
qumulo-monitoring-dashboard
directory.
For this section, follow Working with Qumulo Access Tokens on the Qumulo Documentation Portal.
-
Create a service account.
-
Assign a role with only
PRIVILEGE_METRICS_READ
to the service account. -
Create an access token for the service account.
-
Save the bearer token temporarily.
A bearer token is an item in the
Authorization
HTTP header which acts as the authentication mechanism for the Qumulo REST API.
-
To let Prometheus read metrics from your clusters, update the Prometheus configuration in
prometheus.yml
.⚠️ Important: Perform the following step for each of your clusters. -
Into the
scrape_configs
section, copy thequmulo-cluster
job and fill in the following:-
job_name
: A unique name for labeling the cluster's metrics. -
In the
static_configs
block, fortargets
: A list that contains the cluster's DNS name or IP address.ℹ️ Notes:
-
To specify a port, you must append
:8000
to the DNS name or IP address. -
Because the cluster's metrics are labeled with the target variable, use a DNS name rather than an IP address.
-
To allow monitoring to continue to work if a node goes offline, using floating IP addresses rather than DHCP or static IP addresses.
-
-
In the
authorization
block, forcredentials
: The bearer token for the service account. -
In the
tls_config
block, forinsecure_skip_verify
: If the Qumulo cluster uses the default, self-signed SSL certificate, set this value totrue
.
-
-
(Optional) Before you start Prometheus and Grafana, change the default administrator credentials in
docker-compose.yml
.-
For
ADMIN_USER
andADMIN_PASSWORD
, specify credentials that conform to your security policies.⚠️ Important: If you don't specify your credentials, Docker uses the default values. -
For
ADMIN_PASSWORD_HASH
, specify the password hash. To generate the hash for your password, install and run the version ofcaddy
that matches the version of the container that the Qumulo Monitoring Dashboard uses (2.6.2).
ℹ️ Note: Alternatively, you can set your credentials as environment variables:
-
On Linux:
a. To set the environment variables, use the
export
command.export ADMIN_USER='<username>' export ADMIN_PASSWORD='<password>' export ADMIN_PASSWORD_HASH='<password-hash>'
b. To verify the environment variables, use the
printenv
command. -
On Windows:
a. To set the environment variables, use the
setx
command.setx ADMIN_USER "<username>" setx ADMIN_PASSWORD <password> setx ADMIN_PASSWORD_HASH <password-hash>
b. To verify the environment variables, use the
set
command.
-
To start Prometheus and Grafana on the Docker host, run the following command.
ℹ️ Note: The -d
flag runs the container in the background.
docker-compose up -d
This section explains how to verify that Prometheus can gather metrics from your Qumulo clusters.
-
Connect to the Prometheus server at
http://<docker-host-ip>:9090
. -
Log in with the
admin
username andadmin
password. -
On the top menu bar, select Status > Targets.
-
On the Targets page, find job name that you defined in the
prometheus.yml
file and then confirm that that the State is Up.If the State isn't Up, check the Error column.
Common problems include:
-
In the
static_configs
block, a mistake intargets
, in a DNS name or IP address. -
Inability to connect to the cluster from the machine that runs the containers.
Test the connection by using the
qq
CLI from that machine. -
In the
static_configs
block, fortargets
, a missing:8000
port specification. -
In the
tls_config
blockinsecure_skip_verify
not set totrue
when using a self-signed SSL certificate on a Qumulo cluster.
-
This section explains how to verify that Grafana can query Prometheus and display metrics:
-
Connect to the Grafana server at
http://<docker-host-ip>:3000
. -
Log in with the
admin
username andadmin
password. -
On the Welcome to Grafana page, enter a new, secure password and click Submit.
-
On the Dashboards page, click the Qumulo folder and then click Cluster Overview.
-
On the Qumulo / Cluster Overview Page, next to the cluster label filter, select your cluster from the list.
Metrics for your cluster begin to populate graphs.
ℹ️ Note: The Cluster Info and Node Info panels may take up to 24 hours to fully populate after initial setup.
This section explains how to configure Grafana alerts to notify you through email, Slack, or an alerting tool.
-
On the Alerting page, click Contact Points.
-
In the Contact Points section, on the right, click + New contact point.
-
On the New contact point page, do the following.
a. Enter a Name for the contact point.
b. Select a Contact point type and fill out the fields that appear depending on the contact point.
c. To test the contact point click Test.
ℹ️ Note: The test message might take a few minutes to arrive.
-
Click Save contact point.
Grafana begins to use the contact point to deliver alerts.
For SMTP email server configuration, you can add your email server info in the grafana.ini
.
SMTP configuration can be referenced here Configuring Grafana - SMTP configuration in the Grafana documentation.
This section explains how to update the configuration of the Qumulo Monitoring Dashboard for your system.
This section explains how to update the Prometheus configuration for your system.
While Prometheus runs, it doesn't apply configuration changes automatically. To reload the configuration, you must do one of the following:
-
To stop and restart the container in which Prometheus runs on your Docker host, run the following commands.
ℹ️ Note: The
-d
flag runs the container in the background.docker-compose down docker-compose up -d
-
To make an HTTP
POST
call, use thecurl
command. For example:curl -X POST http://admin:[email protected]:9090/-/reload
This section explains how to update the Grafana configuration for your system. To update the built-in Grafana alerts, you must modify their configuration files. To create new alerts, use the Grafana web UI.
low_disk_endurance
) alert notifies when 20% endurance remains. For endurance information, check your disks' vendor documentation.
grafana.ini
before making any changes.
For information about working with Grafana dashboards, see Create a dashboard in the Grafana documentation.
For information about working with Grafana configuration using the grafana.ini
, see Configuring Grafana in the Grafana documentation.
While Grafana runs, it doesn't apply alert or any other configuration changes automatically. To reload the configuration, you must do one of the following:
-
To stop and restart the container in which Grafana runs on your Docker host, run the following commands.
ℹ️ Note: The
-d
flag runs the container in the background.docker-compose down docker-compose up -d
-
To make an HTTP
POST
call, use thecurl
command. For example:curl -X POST --user admin:admin http://203.0.113.1:3000/api/admin/provisioning/alerting/reload