What is benchmarking and why should we do that?
If there are multiple ways to implement a feature or if we have serious doubts about performance while using a certain technology, special implementation patterns or a new “cutting edge” library, we have to make decisions. There might be assumptions about performance effects of a certain way of implementing something, but in the end – if we do not measure and compare the different approaches – we will not be sure if our decision was correct. This is true for the big architectural topics, but also for smaller-scoped concerns such as preferring a certain API method although multiple alternatives exist. So we should stop guessing and start measuring performance! In other words, we should use benchmarks! This article introduces Java Microbenchmark Harness (JMH), an OpenJDK project which aims to ease setting up a benchmark environment for Java performance tests.
Benchmark == Benchmark?
To categorize benchmarks in a more fine-grained manner, categories such as “micro”, “macro”, or even “meso” have been coined, which separate performance tests by scope – roughly by the affected layers or the complexity level of the code under test.
Microbenchmarks are performance metrics on the lowest level. You can compare them to unit tests, which means they invoke single methods or execute small pieces of business logic without “more (cross-cutting) stuff” around.
Macrobenchmarks are the opposite of that. They test entire applications similar to end-to-end tests.
Mesobenchmarks represent something in between: they measure entire actions (features, workflows) related to bigger parts of our applications, using different layers in interaction with each other, without spinning up the entire application. This could be a single feature which uses authentication/authorization, reads data from a database, calls external services and so on. Mesobenchmarks can be ranked alongside integration tests.
In this post I will put the focus on the smallest kind of these. So let’s concentrate on the microbenchmarks.
How to implement microbenchmarks
If we want to know which methods are performing better than others, we should give it a try and compare them under equal conditions. A naive approach would be to call the different methods within some kind of common simple unit test and take a look at the time the execution takes, maybe implemented using
java.lang.System.currentTimeMillis(). Then we could compute the difference between start and stop timestamps, and at first glance that seems sufficient to get an idea of the code’s performance – but taking a second look, it’s not. We should take into account how the JVM executes and, especially, optimizes code. From this point of view, results based on a single execution would not be reliable. There are many JVM-related optimization topics to keep in mind, and I will give some further hints below. For now, what matters is that the more often a line of code is executed, the more information the JVM gathers about it, and it will optimize it (if possible). So if we want to measure code which will be invoked frequently in production (and that crucial part of our software is exactly what we should measure), we should also measure it after some warmup iterations to simulate “real” production conditions. And now it’s getting interesting (some people would rather say “complicated”).
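The naive approach could look like the following sketch. The concatenation loop is a made-up stand-in for the code under test; the point is that a single, cold execution is being timed:

```java
// Naive timing: measure a single, cold execution of the code under test.
// This is exactly the approach the text warns about.
public class NaiveTiming {

    // Hypothetical "code under test": build a String from a counter loop.
    static String buildString(int n) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < n; i++) {
            sb.append(i);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        long start = System.currentTimeMillis();
        String result = buildString(100_000);
        long elapsed = System.currentTimeMillis() - start;
        // A single cold run: the JIT has not optimized anything yet,
        // so this number says little about steady-state performance.
        System.out.println("length=" + result.length() + ", elapsed=" + elapsed + "ms");
    }
}
```

Running this twice in the same JVM would already give noticeably different numbers, which is precisely the problem described above.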
The question now is: How should the warmup be implemented? Use a boolean flag which separates warmup iterations from measurement iterations and switch that flag after some time? Maybe, but doing that again and again would be a cumbersome, error-prone task.
Using JMH for benchmarks
Luckily, there is the Java Microbenchmark Harness, an OpenJDK project which aims to ease setting up a benchmark environment for Java performance tests. If you are already familiar with JUnit tests (and you should be), getting started with JMH will feel very comfortable.
Set up the JMH environment
To create a Maven benchmark project, just use the Maven archetype and provide your preferred groupId, artifactId and version (the values in the last three lines below are placeholders):
mvn archetype:generate \
    -DinteractiveMode=false \
    -DarchetypeGroupId=org.openjdk.jmh \
    -DarchetypeArtifactId=jmh-java-benchmark-archetype \
    -DgroupId=com.example \
    -DartifactId=benchmarks \
    -Dversion=1.0
That command creates a skeleton project which can execute your benchmarks. After you have written your benchmarks (as described below), build the project with
mvn clean install. The build produces a benchmarks.jar in the target folder which should be used to run the measurements:
java -jar target/benchmarks.jar
Although you could use your IDE to run the tests, you should prefer this standalone JAR. It provides great portability – you can execute it on different machines – and there is no performance penalty due to any IDE overhead.
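The runnable JAR also accepts command-line options to adjust the benchmark run without rebuilding; a few commonly used flags are sketched below (run with -h for the full, authoritative list):

```shell
# List the benchmarks contained in the JAR
java -jar target/benchmarks.jar -l

# Override defaults: 1 fork, 5 warmup iterations, 5 measurement iterations
java -jar target/benchmarks.jar -f 1 -wi 5 -i 5

# Print all supported command-line options
java -jar target/benchmarks.jar -h
```

This is handy for quick local runs; for reproducible results you should of course stick to one fixed configuration.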
Writing benchmarks is as simple as writing JUnit tests. The main difference is that you annotate a benchmark method with
@Benchmark instead of
@Test. Just use the archetype-generated class (MyBenchmark), rename it or write your own class, and invoke the code you want to measure within a
@Benchmark method. JMH does everything else and generates a performance report for you.
As with JUnit, it is also possible to use parameterized tests. This is the purpose of the
@Param annotation. There are a lot of examples for a bunch of use cases available on the project site.
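A minimal benchmark class using both annotations might look like the following sketch. Class, method and parameter names are made up, and it assumes the JMH dependencies from the archetype-generated pom.xml are on the classpath:

```java
import java.util.concurrent.TimeUnit;

import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;
import org.openjdk.jmh.annotations.Param;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.State;

// Sketch of a parameterized JMH benchmark (names are illustrative).
@State(Scope.Benchmark)
public class MyBenchmark {

    // JMH runs the benchmark once per @Param value.
    @Param({"10", "1000"})
    public int size;

    @Benchmark
    @BenchmarkMode(Mode.AverageTime)
    @OutputTimeUnit(TimeUnit.NANOSECONDS)
    public String concatenate() {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < size; i++) {
            sb.append(i);
        }
        // Returning the result keeps it "alive" (see the dead code section below).
        return sb.toString();
    }
}
```

JMH discovers the annotated method at build time, so no runner boilerplate is needed in the class itself.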
Long story short: to try it out, I created a benchmark which compares different approaches to check if a String represents a valid Integer value. It compares the following implementations:
Check out the example project on GitHub. With that benchmark, we could find out which approach produces the best performance result.
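To give an idea of what such candidates could look like, here are three illustrative stand-ins; the actual implementations in the linked example project may differ:

```java
import java.util.regex.Pattern;

// Hypothetical candidates for "does this String represent a valid Integer?".
public class IntegerChecks {

    // 1) Exception-based: let Integer.parseInt do the work.
    public static boolean viaParseInt(String s) {
        try {
            Integer.parseInt(s);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    private static final Pattern INT_PATTERN = Pattern.compile("-?\\d+");

    // 2) Regex-based: checks the shape only, ignores Integer overflow.
    public static boolean viaRegex(String s) {
        return s != null && INT_PATTERN.matcher(s).matches();
    }

    // 3) Manual scan: shape only as well, but no allocation and no exceptions.
    public static boolean viaCharScan(String s) {
        if (s == null || s.isEmpty()) return false;
        int i = s.charAt(0) == '-' ? 1 : 0;
        if (i == s.length()) return false;
        for (; i < s.length(); i++) {
            if (!Character.isDigit(s.charAt(i))) return false;
        }
        return true;
    }
}
```

Note that the three variants are not perfectly equivalent (only the first rejects out-of-range values), which is exactly the kind of trade-off a benchmark comparison should be read against.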
By default, JMH executes 10 forks (separate execution environments), 20 warmup cycles (without measurement, giving the JVM the opportunity to optimize the code before the measurement starts) and 20 real measurement iterations for every test. Of course, this behavior can be overridden on an annotation basis (e.g. @Fork, @Warmup, @Measurement).
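A sketch of such per-benchmark overrides could look like this (the counts and the summing loop are chosen arbitrarily for illustration):

```java
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.Fork;
import org.openjdk.jmh.annotations.Measurement;
import org.openjdk.jmh.annotations.Warmup;

// Sketch: overriding the default fork/warmup/measurement counts.
public class TunedBenchmark {

    @Benchmark
    @Fork(2)                       // 2 separate JVM forks instead of 10
    @Warmup(iterations = 5)        // 5 warmup iterations instead of 20
    @Measurement(iterations = 10)  // 10 measured iterations instead of 20
    public long sum() {
        long total = 0;
        for (int i = 0; i < 1_000; i++) {
            total += i;
        }
        return total; // returned so the loop is not optimized away
    }
}
```

Fewer forks and iterations shorten the run considerably, at the cost of statistical confidence in the results.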
The results will vary depending on the configuration of the target machine they are running on. After the runs are finished, a report will be generated. The following output is a result report generated by the mentioned comparison benchmark, running on my i7 laptop (7700HQ, 4 x 2.8 GHz, 32 GB RAM):
Since you can execute your benchmarks in different
@BenchmarkModes, you have to read the results differently. E.g. in
Mode.AverageTime a lower score is better, while in
Mode.Throughput a higher value points to better performance.
Beware of the JVM optimizations
As mentioned before, the JVM will optimize code based on collected information during execution. Usually this is a good thing we should appreciate, at least for production environments. But under artificial conditions (our microbenchmark definitely is one) this could cause problems. Here are some topics you should be aware of:
The first big obstacle is conquered by JMH itself – it delivers the warmup cycles out of the box. So the JVM can collect some information about the code under test, and the effectively executed code will be more “production-like” than a once-executed method could ever be.
Always read computed results:
If you don’t use a computed value (e.g. if you never read a private variable), the JVM is free to discard the code that produced it during compilation. This is called “dead code elimination”, which means that the entire computation of such dead results will probably be eliminated if no one is interested in them. This will definitely distort your benchmark results and can lead to false conclusions. So take an interest (or at least pretend to) and read your computation results, even if they are not relevant for your test. This can be done either by returning result variables or by throwing them into a so-called
Blackhole, which is injected by declaring it as an input parameter of your benchmark method.
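Both variants could be sketched like this (again assuming JMH on the classpath; the computations are placeholders):

```java
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.infra.Blackhole;

// Sketch: two ways to keep computed results "alive" so the JIT
// cannot eliminate them as dead code.
public class DeadCodeBenchmark {

    // Option 1: return the computed value.
    @Benchmark
    public double returnResult() {
        return Math.log(42.0);
    }

    // Option 2: sink one or more values into the injected Blackhole.
    @Benchmark
    public void consumeResults(Blackhole blackhole) {
        blackhole.consume(Math.log(42.0));
        blackhole.consume(Math.sqrt(42.0));
    }
}
```

The Blackhole variant is useful when a benchmark produces several intermediate results that cannot all be returned.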
Differences to production code:
We’re done with this short introduction to JMH. Although we talked about reliable results, please be aware that code in tests will always behave differently from the same code executed in production. There are a lot of optimizations the JVM will perform depending on, e.g., how often methods are invoked (hot code), call hierarchies and stack depth. So performance tests are at most a good hint, but no guarantee. The best thing you can do is measure performance in production, using metrics or profiling.