Introduction

If you’re managing a number of applications across multiple servers then having easy access to their logs can make discovering and debugging issues much more efficient. Deploying a system to aggregate and visualise this information greatly reduces the need to identify and log into the relevant server(s) directly, and enables more sophisticated analysis such as error tracing across systems (see 5 Good Reasons to Use a Log Server for more).

The Elastic (ELK) Stack is commonly used for this purpose: centralising logs from multiple machines and providing a single point of access to the aggregated messages. Its components are open source and can be self-hosted, albeit with some operational complexity. In this post we show how to set up a simple deployment, focused on logging from applications deployed in one or more Docker containers. We’ll do this by describing our setup and explaining a few of the choices that we’ve made so far: we’re still in the early days of exploring how Elasticsearch and Kibana can help to monitor applications and identify issues in our code.

Method

The unofficial docker-elk project provides a very helpful setup for running ELK itself in Docker, effectively reducing deployment to running docker-compose up. The key components are the log indexer (Elasticsearch) and web interface (Kibana).

The architecture we’re aiming for is:

The deployment steps are:

Clone docker-elk
Optional: make any desired changes to the configuration, such as those described below
Perform some initialisation, as described in README.md:

docker-compose up elasticsearch -d
docker-compose exec -T elasticsearch bin/elasticsearch-setup-passwords auto --batch
# Make a note of these passwords
docker-compose down

Bring up Elasticsearch and Kibana:

export KIBANA_PASSWORD=…
docker-compose up -d

Create an index pattern, telling Kibana which Elasticsearch indices to explore:

docker-compose exec kibana curl -XPOST -D- 'http://localhost:5601/api/saved_objects/index-pattern' \
  -H 'Content-Type: application/json' \
  -u elastic:${ELASTIC_PASSWORD?} \
  -d '{"attributes":{"title":"filebeat-*,journalbeat-*","timeFieldName":"@timestamp"}}'

Send some logs to Elasticsearch (see below)
Visit http://localhost:5601 (assuming you’re not running a proxy - also see below)
Log in using username elastic and the password you noted earlier
View logs, create visualisations/alerts/dashboards etc

We make some changes to the default configuration as follows:

We don’t run an Elasticsearch cluster (yet) so we don’t need to expose port 9300
We’re not currently using the Elasticsearch keystore, so we’ve removed the bootstrap password
We’re using Filebeat and Journalbeat in their default configurations to send logs directly to Elasticsearch, so we don’t need to run Logstash - which reduces operational complexity. However, Logstash can easily be re-enabled or replaced e.g. with Fluentd.
We ensure that Elasticsearch and Kibana restart automatically if the Docker host is rebooted
We expose the Kibana web interface via SSL using a separate Nginx proxy container, so we don’t need to expose port 5601 to the host
As we use Vault for password management we pass the Kibana password as an environment variable rather than embedding it in the configuration file
Docker Compose automatically creates a bridged network for the relevant project, so we remove the redundant definition
We only use the non-commercial features of the stack

Once you’ve got Elasticsearch up and running you can set up one or more Beats agents to forward logs from your containers. These too can be run in containers. Our applications use the Docker json-file and journald logging drivers, so we use Filebeat and Journalbeat - our setup is documented here. Again, we use Vault for credential storage, but for a simple deployment you could store the secrets in a suitably secured Docker Compose .env file. These agents talk directly to Docker (via volumes and/or socket) so don’t need to be on the same Docker network as the relevant containers.

For reference our complete setup is described in reside-ic/logs, which includes docker-elk as a submodule.

Next steps

This will give you a minimal setup - it’s worth ensuring the following for a robust, scalable and secure system:

Users/roles with access to Kibana are suitably restricted
Your Elasticsearch and Kibana Docker volumes are backed-up
Sensitive information is either not logged by your applications or is shipped with suitable security (e.g. using SSL) and appropriately access controlled. Obviously information such as credentials should never be logged.
Log messages are suitably structured, so they can be easily searched/filtered
Related containers/services share a common Docker Compose project name, Docker container tag etc, so that they can be grouped as one unit within Elasticsearch/Kibana

Acknowledgements

Thanks to Elasticsearch and the docker-elk contributors for making their respective projects freely available.

Reside-IC

Introduction

Method

Next steps

Acknowledgements