Aggregating logs from services deployed with Docker
Feb 18, 2021
Mark Woodbridge
4 minute read

Introduction

If you’re managing a number of applications across multiple servers then having easy access to their logs can make discovering and debugging issues much more efficient. Deploying a system to aggregate and visualise this information greatly reduces the need to identify and log into the relevant server(s) directly, and enables more sophisticated analysis such as error tracing across systems (see 5 Good Reasons to Use a Log Server for more).

The Elastic (ELK) Stack is commonly used for this purpose: centralising logs from multiple machines and providing a single point of access to the aggregated messages. Its components are open source and can be self-hosted, albeit with some operational complexity. In this post we show how to set up a simple deployment, focused on logging from applications deployed in one or more Docker containers. We’ll do this by describing our setup and explaining a few of the choices that we’ve made so far: we’re still in the early days of exploring how Elasticsearch and Kibana can help to monitor applications and identify issues in our code.

Method

The unofficial docker-elk project provides a very helpful setup for running ELK itself in Docker, effectively reducing deployment to running docker-compose up. The key components are the log indexer (Elasticsearch) and web interface (Kibana).

The architecture we’re aiming for is:

Logging architecture diagram

The deployment steps are:

  1. Clone docker-elk
  2. Optional: make any desired changes to the configuration, such as those described below
  3. Perform some initialisation, as described in README.md:
docker-compose up -d elasticsearch
docker-compose exec -T elasticsearch bin/elasticsearch-setup-passwords auto --batch
# Make a note of these passwords
docker-compose down
  4. Bring up Elasticsearch and Kibana:
export ELASTIC_PASSWORD=... # the elastic password noted in step 3 (used again in the next step)
export KIBANA_PASSWORD=...  # the kibana password noted in step 3
docker-compose up -d
  5. Create an index pattern, telling Kibana which Elasticsearch indices to explore:
docker-compose exec kibana curl -XPOST -D- 'http://localhost:5601/api/saved_objects/index-pattern' \
  -H 'Content-Type: application/json' \
  -u elastic:${ELASTIC_PASSWORD?} \
  -d '{"attributes":{"title":"filebeat-*,journalbeat-*","timeFieldName":"@timestamp"}}'
  6. Send some logs to Elasticsearch (see below)
  7. Visit http://localhost:5601 (assuming you’re not running a proxy - also see below)
  8. Log in using username elastic and the password you noted earlier
  9. View logs, create visualisations/alerts/dashboards etc

We make some changes to the default configuration as follows (a sketch of the resulting Compose configuration follows this list):

  • We don’t run an Elasticsearch cluster (yet) so we don’t need to expose port 9300
  • We’re not currently using the Elasticsearch keystore, so we’ve removed the bootstrap password
  • We’re using Filebeat and Journalbeat in their default configurations to send logs directly to Elasticsearch, so we don’t need to run Logstash - which reduces operational complexity. However, Logstash can easily be re-enabled or replaced e.g. with Fluentd.
  • We ensure that Elasticsearch and Kibana restart automatically if the Docker host is rebooted
  • We expose the Kibana web interface via SSL using a separate Nginx proxy container, so we don’t need to expose port 5601 to the host
  • As we use Vault for password management we pass the Kibana password as an environment variable rather than embedding it in the configuration file
  • Docker Compose automatically creates a bridged network for the relevant project, so we remove the redundant definition
  • We only use the non-commercial features of the stack
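For illustration, the relevant fragments of a Compose configuration after these edits might look roughly like this. It is a sketch only: the elasticsearch and kibana service names follow docker-elk, but the proxy service, its image and the mounted paths are assumptions rather than a copy of our files:

# Sketch of Compose fragments reflecting some of the changes above (not our exact files)
version: "3.2"

services:
  elasticsearch:
    restart: always                  # come back up if the Docker host reboots
    ports:
      - "9200:9200"                  # 9300 (cluster transport) no longer published

  kibana:
    restart: always
    environment:
      ELASTICSEARCH_PASSWORD: ${KIBANA_PASSWORD}   # injected at deploy time (we read it from Vault)
    # no published ports: Kibana is only reachable via the proxy on the Compose network

  proxy:                             # hypothetical Nginx container terminating SSL
    image: nginx:stable
    restart: always
    ports:
      - "443:443"
    volumes:
      - ./nginx.conf:/etc/nginx/conf.d/default.conf:ro
      - ./certs:/etc/nginx/certs:ro

# no explicit network definition: Compose creates a bridged network for the project

The Nginx configuration itself simply proxies HTTPS traffic through to kibana:5601 on the internal Compose network.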

Once you’ve got Elasticsearch up and running you can set up one or more Beats agents to forward logs from your containers. These too can be run in containers. Our applications use the Docker json-file and journald logging drivers, so we use Filebeat and Journalbeat - our setup is documented here. Again, we use Vault for credential storage, but for a simple deployment you could store the secrets in a suitably secured Docker Compose .env file. These agents talk directly to Docker (via volumes and/or socket) so don’t need to be on the same Docker network as the relevant containers.
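To illustrate the agent side, a minimal filebeat.yml for shipping Docker json-file logs straight to Elasticsearch might look something like the following. This is a sketch under assumptions rather than our actual configuration: the Elasticsearch host name is made up, and the credentials would come from Vault (or a .env file) in practice:

# filebeat.yml (sketch): read Docker json-file logs and send them directly to Elasticsearch
filebeat.inputs:
  - type: container
    paths:
      - /var/lib/docker/containers/*/*.log

processors:
  - add_docker_metadata: ~           # attach container name, image and labels to each event

output.elasticsearch:
  hosts: ["https://elasticsearch.example.com:9200"]   # hypothetical host
  username: elastic
  password: "${ELASTICSEARCH_PASSWORD}"               # injected via the environment

When the agent itself runs in a container it needs the host’s /var/lib/docker/containers directory mounted read-only, plus the Docker socket if you use add_docker_metadata; a Journalbeat container similarly needs access to the journal directory.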

For reference our complete setup is described in reside-ic/logs, which includes docker-elk as a submodule.

Next steps

This will give you a minimal setup - it’s worth ensuring the following for a robust, scalable and secure system:

  • Users/roles with access to Kibana are suitably restricted
  • Your Elasticsearch and Kibana Docker volumes are backed up
  • Sensitive information is either not logged by your applications or is shipped with suitable security (e.g. using SSL) and appropriately access controlled. Obviously information such as credentials should never be logged.
  • Log messages are suitably structured, so they can be easily searched/filtered
  • Related containers/services share a common Docker Compose project name, Docker container tag etc, so that they can be grouped as one unit within Elasticsearch/Kibana (see the sketch below)
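As an illustration of the last two points, the hypothetical fragment below shows a service that writes one JSON object per log line and carries a label that can be used for grouping in Kibana; the image, label and project names are all made up:

# Hypothetical application service whose logs end up in Elasticsearch via Filebeat
services:
  api:
    image: example/api:1.2.3        # assumed to log one JSON object per line, e.g.
                                    # {"level": "error", "msg": "timeout", "request_id": "abc123"}
    labels:
      org.example.project: "my-project"   # container labels are searchable once the
                                          # Docker metadata reaches Elasticsearch
    logging:
      driver: json-file

Starting related services under a shared project name (for example docker-compose -p my-project up -d) also gives their containers a common com.docker.compose.project label to group on.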

Acknowledgements

Thanks to Elasticsearch and the docker-elk contributors for making their respective projects freely available.
