
Microservices with Nomad and Consul


Companies want to deliver software faster and deploy different parts of their systems autonomously, so they are splitting their existing monoliths into smaller services. Splitting a monolith is an ambitious project: it requires identifying the correct service boundaries and cross-cutting concerns. And as the number of microservices grows, operational problems surface that did not exist before. Some of these problems are:

  • How to dynamically distribute running services while using the available hardware resources as efficiently as possible?
  • How to get resource isolation (CPU, memory, disk I/O, network, etc.) when multiple services are running on the same host?
  • How to handle service discovery and failure detection?
  • How to share configuration between multiple services?

To get a better overview of the first two problems, we have illustrated them in the picture below, assuming that we want to deploy three new instances of the microservice ‘D’. The new microservice ‘D’ requires 1 GB RAM and at least 500 MHz CPU (see picture below). We currently have 4 available nodes, each already running a different number of microservices, so the 3 new instances have to be placed taking the currently available CPU and RAM on each node into account. We also need to be able to limit the resources each microservice uses: a misbehaving microservice could otherwise consume all of the memory or CPU of its node. So we have an additional requirement, resource isolation on the service level.

Example of resource deployment manager

A possible solution for these two problems is to use an orchestration and resource manager, which helps us bring some order into the chaos. The ‘usual suspects’ for this job are Kubernetes, Docker Swarm, or a combination of Mesos, Marathon, and DC/OS. If we decide not to use Docker for packaging and deploying the microservices, or realize that most of our hosts run a kernel version not suitable for Docker at all, then we must be able to run binaries or fat JARs that are not packaged as Docker containers. In this case, the selection is reduced to Mesos and Marathon. Setting up and maintaining any of these solutions in a private datacenter is a challenging task for any organization. Besides these well-known products, there is an additional solution on the market: HashiCorp’s cluster manager Nomad.

What is Nomad?

Nomad’s primary task is to manage a cluster of machines and run different types of applications on them. It integrates very well with Consul, HashiCorp’s service discovery and configuration tool, and the two provide a complementary feature set. Let’s focus on the key features that actually distinguish Nomad from most other platforms:

  1. Flexible workloads: Nomad’s most interesting feature is its ability to run different kinds of applications: Docker containers, ‘normal Linux processes’, Java applications, Apache Spark jobs, rkt containers, and even virtual machine images (e.g. qcow, img, iso) with the QEMU driver.
  2. Simplicity: Like all HashiCorp products, Nomad ships as a single binary, for both clients and servers.
  3. Distributed and highly available, even across multiple datacenters: For leader election and state replication, Nomad uses the Raft consensus protocol; for cluster membership and failure detection, it relies on Serf, HashiCorp’s lightweight gossip protocol. Multiple datacenters can be managed as part of a larger region, and jobs can be scheduled across datacenters.

Nomad provides additional features and we could continue the list, but the greatest advantage is the flexible workload support: we are not limited to Docker containers, as we are on some other platforms. Let’s dig a little deeper and see how this is realized and implemented in Nomad.

It’s all about Jobs, Allocations, and Drivers

Nomad is deployed as a single binary; depending on the configuration, it is started in either client or server mode. Servers are responsible for managing the cluster and for task scheduling.

Nomad Client/Server

The client is a very lightweight process that registers with the servers. Its primary job is to execute the tasks assigned to it by the servers. A regular Nomad cluster setup has at least 3 to 5 servers, which can manage up to thousands of clients. Tasks are defined in a so-called job specification file, written declaratively in HCL (HashiCorp Configuration Language), a vendor-specific format. The job specification contains all the information necessary to run a Nomad job: it declares the tasks to execute, specifies the required resources, and limits job execution to the defined constraints.

This is an example of a job specification file for a Python app:

job "python-app" {
 
  # Run this job as a "service" type. Each job type has different properties
  type = "service"
 
  # A group defines a series of tasks that should be co-located on the same client (host)
  group "server" {
    count = 1
 
    # Create an individual task (unit of work)
    task "python-app" {
      driver = "exec"
 
      # Specifies what should be executed when starting the job
      config {
        command = "/bin/sh"
        args = ["/local/install_run.sh"]
      }
 
      # Defines the source of the artifact which should be downloaded
      artifact {
        source = "https://github.com/tomiloza/nomad-consul-demo/raw/master/apps/python/app.tgz"
      }
 
      # The service block tells Nomad how to register this service with Consul for service discovery and monitoring.
      service {
        name = "python-app"
        port = "http"
 
        check {
          type = "http"
          path = "/"
          interval = "10s"
          timeout = "2s"
        }
      }
 
      # Specify the maximum resources required to run the job, including CPU, memory, and network bandwidth
      resources {
        cpu = 500
        memory = 256
 
        network {
          mbits = 5
 
          port "http" {
            static = 9080
          }
        }
      }
    }
  }
}
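Once saved, such a specification can be submitted with the Nomad CLI. A minimal sketch, assuming the nomad binary is on the path, the demo cluster is running, and the specification above has been saved as python-app.nomad (the file name is an assumption, not something the article prescribes):

```shell
# Assumed file name; save the job specification above into it first
job_file=python-app.nomad

if command -v nomad >/dev/null 2>&1 && [ -f "$job_file" ]; then
  nomad validate "$job_file"   # check the specification for errors
  nomad plan "$job_file"       # dry run: preview the scheduler's action plan
  nomad run "$job_file"        # submit the job to the cluster
fi
```

The plan step is optional but useful: it shows what the scheduler would do without changing anything.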

When we submit a job specification to the Nomad server, we declare a desired state: the job specification defines that we want to run a task. A task is an individual unit of work, such as a Docker container, a web application, or a batch process. This unit of work is deployed a certain number of times and under specific constraints, as in the first example above, where we wanted to deploy 3 new instances of service ‘D’ requiring 1 GB RAM and at least 500 MHz CPU each. The jobs define the desired state; the Nomad server analyzes the actual state and triggers the evaluation process. The picture below shows the evaluation and scheduling process.

Nomad scheduling process

If the current state changes, either through a desired event (deployment of a new job) or an emergent one (a node failure), the evaluation process is triggered. This means that Nomad must evaluate the current state and compare it with the desired state defined in the job specifications. The evaluation is queued into the evaluation broker, which manages the pending evaluations. Processing the evaluations is the responsibility of the schedulers. There are three basic types of schedulers: batch, service, and system. The service scheduler processes service jobs, which are usually long-lived services; the batch scheduler, as expected, processes batch jobs; and the system scheduler processes so-called system jobs, which run on every node. The outcome of the scheduling process is an action plan: it defines the exact actions to be executed to reach the desired state, for example that an allocation should be created, updated, or deleted. An allocation is an isolated environment that is created for a particular job on a Nomad client node. Let’s see how an allocation is constructed and what its building blocks are.
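Which scheduler processes a job is selected with the type attribute in its specification, just like the service type in the example above. A sketch with hypothetical job names:

```hcl
# Handled by the batch scheduler: runs to completion, not restarted on success
job "nightly-report" {
  type = "batch"
  # group/task definitions as in the example above
}

# Handled by the system scheduler: one instance on every eligible client node
job "log-shipper" {
  type = "system"
  # group/task definitions as in the example above
}
```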

Allocations (chroot + cgroups)

We will explain the allocation concept with a real example. For this purpose, we will set up a Vagrant Ubuntu box with Nomad and Consul. The Ansible configuration can be found on my GitHub account: github-tomiloza.

After we provision the VM with Vagrant, we will have Nomad and Consul up and running. The Nomad/Consul agents are started in bootstrap mode, which is basically a hybrid client-server mode and should only be used for testing purposes. Additionally, we will install Hashi-UI, a user interface for Nomad. When the virtual machine is provisioned, we will also have 4 different Nomad jobs running (the Nomad job definitions are located under the jobs directory). The Nomad UI is available on port 3000 on the IP address we defined in the Vagrantfile under the config.vm.network parameter. Navigating to the Nomad UI (link when the VM is started), we can see that we currently have 4 different Nomad jobs running.

Nomad UI

We could also get the same information on the command line in the VM by typing nomad status. In the VM we have triggered 3 different types of jobs: a Docker job, a Java job, and a so-called “isolated fork/exec” job, which is basically an isolated Linux process. Now that we have Nomad with the jobs up and running, we can analyze how these jobs are actually run in the VM. As already mentioned, Nomad creates a corresponding allocation for every job. The directory where Nomad creates these allocations is defined by the data_dir property in the client configuration. In our provisioned VM these allocations are stored under /data/nomad/data/alloc.
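A sketch of that inspection, using the paths and job names from this demo setup (the guards keep it harmless on machines without Nomad):

```shell
# Allocation directory from the demo VM's client configuration (data_dir)
alloc_dir=/data/nomad/data/alloc

if command -v nomad >/dev/null 2>&1; then
  nomad status             # list all registered jobs and their status
  nomad status python-app  # allocations and recent events for one job
fi

# Each allocation gets its own working directory under data_dir
[ -d "$alloc_dir" ] && ls -l "$alloc_dir"

true  # keep the exit status clean when Nomad is not running locally
```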

To get a better overview of how a Nomad job is created and where it stores the necessary files, we will analyze the running Java job. First, we need to get the PID of the running Java process:

ps -axf | grep java | grep -v grep

Once we have found the PID of the Java process, we can cd to the /proc/<pid> directory. By listing the /proc/<pid>/ directory structure, we can see that the root of the process points to the location defined by the data_dir property.

Nomad Chroot Process

This means that the Java process is chrooted. The Java process (like the Python process) is isolated from the rest of the system. The basic idea behind this concept is to copy the system files necessary for running the process to a specific location and then use chroot to change the root directory of the process. The referenced directory becomes the new root of the process; from the process’s point of view, however, the root directory is seemingly still at /. A program run in such a modified environment cannot access files outside the designated directory tree.
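The chroot can be verified directly through the proc filesystem: /proc/<pid>/root is a symlink to the root directory a process actually sees. A sketch, assuming a Linux system (pgrep -f java is an assumption about how the task's process can be found):

```shell
# For a non-chrooted process, such as this shell, the root link points to /
self_root=$(readlink /proc/self/root 2>/dev/null || echo unsupported)
echo "$self_root"

# For the chrooted Java task, the link points into Nomad's allocation directory
pid=$(pgrep -f java 2>/dev/null | head -n 1)
[ -n "$pid" ] && ls -ld "/proc/$pid/root"

true  # keep the exit status clean when no Java process is running
```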

Chroot directory tree

We also have the option to narrow down which binaries are copied into the chroot with Nomad’s chroot_env configuration parameter. In the Nomad job definition, we can add resource restrictions for each particular job: under the resources section it is possible to define memory, IOPS, CPU, and network requirements. To enforce these resource requirements, Nomad uses the Linux cgroups feature. Linux control groups (cgroups) allow us to limit the resources available to a certain process; when we put the chrooted process under a cgroup, we limit the resources the process and its children have access to. For the Nomad Java job, we have defined a memory limit of 256 MB RAM.
We can find out which cgroup is applied to the Java process by taking a look at the /proc/<pid>/cgroup file. With the retrieved cgroup id we can display the value of the memory.limit_in_bytes file for this Java process.

cgroups memory

As expected, the cgroup memory limit in bytes corresponds to the value defined in megabytes in the job specification. With these two basic Linux features, cgroups and chroot, Nomad creates isolated environments (allocations), which allow us to dedicate memory and CPU to a process and jail the process to its own set of binaries. This gives us resource isolation for plain Java or Python apps.
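The numbers are easy to check by hand: the 256 in the job's resources block is in megabytes, while memory.limit_in_bytes reports bytes. A sketch of the lookup (Linux only; the pgrep pattern is an assumption):

```shell
# 256 MB from the job specification, converted to bytes
limit_bytes=$((256 * 1024 * 1024))
echo "$limit_bytes"   # 268435456, the value memory.limit_in_bytes should show

# Which cgroups the Java task belongs to
pid=$(pgrep -f java 2>/dev/null | head -n 1)
[ -n "$pid" ] && cat "/proc/$pid/cgroup"

true  # keep the exit status clean when no Java process is running
```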

Consul integration

Navigating to the Consul UI (link when the VM is started), we can see that all jobs are registered in Consul as services. In the job specification, the service block defines the values needed for the Consul registration. With the services registered in Consul, we get Consul features such as service discovery, failure detection, and configuration sharing.
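The registered services can also be queried through Consul's standard local interfaces, the HTTP API on port 8500 and the DNS interface on port 8600. A sketch against the demo VM (the service name python-app comes from the jobs above; the guards make it a no-op without Consul):

```shell
service=python-app

# HTTP API: catalog entry with node, address, and port of each instance
command -v curl >/dev/null 2>&1 && \
  curl -s "http://127.0.0.1:8500/v1/catalog/service/$service"

# DNS interface: SRV records for all healthy instances of the service
command -v dig >/dev/null 2>&1 && \
  dig @127.0.0.1 -p 8600 "$service.service.consul" SRV

true  # keep the exit status clean when Consul is not running locally
```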

Conclusion

Together, Nomad and Consul are a powerful combination for solving many of the operational problems that come with a microservices architecture. One of the key advantages is the ability to run microservices as plain Java, Python, or Go apps and still get the necessary service and resource isolation. Just to be clear: it is also possible to run fully Dockerized apps, using Nomad’s Docker driver.

Tomislav Lozancic

Tomislav is an IT consultant in codecentric’s Munich office. He feels at home on the JVM, and his areas of expertise are CI/CD, Infrastructure as Code, and DevOps. Currently, Tomislav is particularly interested in cloud computing and cluster platforms such as Nomad and Kubernetes.

