LANGUAGE

Why good metrics values do not equal good quality

3.10.2011 | 7 minutes of reading time

Quite regularly, codecentric’s experts perform reviews and quality evaluations of software products. For example, clients may want to get an independent assessment of a program they had a contractor develop. In other cases, they request an assessment of software developed in-house to get an understanding of their its current level of quality.

There often is an implicit assumption that by just using automatic analysis tools you can get a reliable impression of the quality and maintainability, saving the cost and effort for a manual review. Using a simplified example we are going to explain, why this is a fallacy and why an automatically derived set of metrics cannot be a viable replacement for the manual process.

Metrics and Tools

In fact, at the beginning of most analyses there is a step of collecting some base metrics automatically, to get a first superficial impression of the software under inspection. Usually at this early stage one uses simple counts – e. g. to get an idea of the product’s size (number of packages, classes, methods, lines of code) – as well as common quality metrics, for example the cyclomatic complexity.

These values can be quickly calculated using several free or commercial tools and are based on the source code and compiled Java classes.

Once these metrics have been measured, they can be compared to well-known references, e. g. those of Carnegie Mellon University for cyclomatic complexity.

Cyclomatic Complexity

The purpose of this metric s to get an assessment of the complexity – and therefore indirectly the maintainability – of a piece of software.

The aforementioned reference values from Carnegie Mellon define four rough ranges for cyclomatic complexity values:

methods between 1 and 10 are considered simple and easy to understand and test
values between 10 and 20 indicate more complex code, which may still be comprehensible; however testing becomes more difficult due to the greater number of possible branches the code can take
values of 20 and above are typical of code with a very large number of potential execution paths and can only be fully grasped and tested with great difficulty and effort
methods going even higher, e. g. >50, are certainly unmaintainable

Often complexity increases gradually with the life-time of a code base as new features are added and existing code is modified. Over time new code is introduced into the system, but the individual “small” changes regularly do not convey the impression of being complex enough to warrant refactoring the affected sections of the code.

In effect the risk of introducing new bugs increases proportionally with the code’s complexity as undesirable side-effects cannot be foreseen. Theoretically this could be alleviated with a sufficient level of test coverage, but unfortunately coming up with useful test code also becomes more difficult and time-consuming for complex code. This regularly leads to test coverage becoming worse, making future changes even more error prone. This is a vicious circle that is hard to break out from.

All this leads to a simple and unsurprising conclusion: Lower complexity eases maintenance, writing meaningful tests and consequently reduces the chances of introducing new bugs. It can therefore be used as an indicator for good quality.

Let’s assume the following result of a complexity analysis of a code base with 10.000 methods:

96% – 9600 methods: CC < 17 : acceptable
3% – 300 methods: 17 < CC < 20 : borderline
1% – 100 methods: 20 <= CC : too high

Does this mean that complexity is not a critical issue in this code base?

The answer has to be: No.

The statement of “only” 1% of all methods being reported as too complex does not carry much meaning in and of itself. There is no way to tell if those 100 methods contain central and mission critical business logic and are disproportionately important for the overall application’s quality.

However, the complexity metric alone does not say anything about the possibly great test coverage of this critical portion of code. Thorough testing could have been deliberately introduced to verify the correctness and guard and against regressions despite high complexity values. But we can get more information on that topic with more tools…

Test Coverage

Several tools are available to determine test coverage, a few popular ones being Clover, Cobertura or Emma. They monitor the execution of unit tests and report on which parts of the code under test are exercised. This allows a reasonable evaluation of which percentage of a software product is covered by automated tests.

While it is difficult to proclaim a generally valid minimum degree of test coverage, because it partly depends on the application at hand – e. g. completely covering trivial bean setters and getters is not usually very useful – values of 80% or above are advised to be sufficiently confident that refactorings and modifications will not break existing functionality.

Assuming an average test coverage of 85% – esp. including the 100 complex (and allegedly important) methods mentioned above – would that not imply a reasonably good code quality, because the source code is covered by tests for the most part?

Again, the answer must be: No.

Even high levels of test coverage only prove that the execution paths that are exercised by the tests are run at least once and with a particular set of test data. Even though the coverage tools do record the number of times each branch gets executed, for it to be “covered” just requires a single execution.

Moreover, 85% of coverage leave 15% uncovered – there is no immediate indication of which parts comprise that 15%. Not seldom this is code for error conditions or exception handling, which can have especially nasty consequences when there are bugs lurking around here.

and so on…

Everything that has been said so far can be applied to virtually all calculated metrics: Every automated analysis process can at most produce hints as to which parts of the code should be targeted for a manual review. They provide starting points and allow a directed approach of large projects, but just looked at in isolation is never sufficient and can even be misleading.

In a recent case, good or sometimes even very good results of the initial automated metrics analysis runs, including – among others – cyclomatic complexity and Robert C. Martin’s metrics about levels of coupling and abstraction, conveyed a rather positive first impression of the subject project.

Even further diagnostics using static analysis tools like Checkstyle , FindBugs or Sonar did not report unusually high numbers of problems, relative to the overall size of the software product, and those issues that were reported would mostly have been rather easy to fix.

But despite the seemingly uncritical results of all tool runs, at the end of the review process we had found a number of severe problems in the code base that clearly prohibited the customer from going live with the new product. Some of – but not limited to – these problems were fundamental issues with concurrency, useless caches, severe flaws in error- and exception handling and obvious performance problems (unnecessary, but frequent calls to remote services in tight loops) etc.

Bottom Line

Judging the quality of a software product – and consequently the risk when using it in production – by tool-based measurements and metrics alone can easily lead to false conclusions.

Too many factors that influence the actual quality of a solution cannot reliably, if at all, be evaluated automatically. Despite lots of great and proven tools being readily available and even free to use, their results still require careful evaluation – they must be seen as the indicators that they are, not comprehensive and final statements about quality. They can only lead to the way and hint at where it might be sensible to focus a manual review.

In the case mentioned above, using the software in production would have had far-reaching and potentially critical consequences, because data could have been corrupted silently or the system might have crashed completely.

Though manual reviews and checks cannot guarantee error-free software, even in the IT business experience and intuition – luckily – still cannot be replaced with tools.

Was this post helpful?

LANGUAGE

Likes

Blog author

Daniel Schneller

Do you still have questions? Just send me a message.

fromDaniel Schneller

XFS: Possible Memory Allocation Deadlock in kmem_alloc

A few weeks ago we were surprised by seemingly random I/O hangs on several virtual machines. Any attempt to write to their data volumes blocked, making the load average rise into the stratosphere, and — slightly more consequentially — make Elasticsearch...

Cloud
DevOps
Infrastructure

10.4.2017 | 10 Minuten Lesezeit

Daniel Schneller

True KVM Live Migration with OpenStack Icehouse and Ceph based VM storage

Intro As mentioned before — for example in Fabian’s The CenterDevice Cloud Architecture Revisited post from December 2014) — our document management product CenterDevice runs on top of infrastructure virtualized by OpenStack. Where that older post...

Cloud

16.3.2015 | 12 Minuten Lesezeit

Daniel Schneller

Rate Limiting based on HTTP headers with HAProxy

Recently we had a problem with a buggy update to a piece of 3rd party client software. It produced lots and lots of valid, but nonsensical requests, targeting our system. This post details how we added a dynamic rate limiting to our HAProxy load balancers...

3.12.2014 | 7 Minuten Lesezeit

Daniel Schneller

Localizing Mobile Apps

What do the acronyms I18N or L10N stand for? What do they mean for developers of mobile applications in particular? I hosted a session about localizing mobile applications at Developer Week 2014 in Nuremberg. It covers — among other things — text, numbers...

26.8.2014 | 1 Minuten Lesezeit

Daniel Schneller

Jinja2 for better Ansible playbooks and templates

There have been posts about Ansible on this blog before, so this one will not go into Ansible basics again, but focus on ways to improve your use of variables, often, but not only used together with the template module, showing some of the more involved...

24.8.2014 | 11 Minuten Lesezeit

Daniel Schneller

Ansible: Simple yet powerful automation

Automatic provisioning of infrastructure as well as deployment is a cornerstone of DevOps. It brings the benefits of version control, reproducibility, and a central place to consolidate (executable) knowledge about infrastructure setups. Best known provisioning...

CI/CD
DevOps
Infrastructure

22.6.2014 | 14 Minuten Lesezeit

Daniel Schneller

SSH Two-Factor Authentication with Duo Security

An ever increasing number of services start offering (and recommending) additional means of securing access to your accounts: Instead of just asking users to identify and authenticate themselves with a simple set of username and password, a second piece...

10.3.2014 | 7 Minuten Lesezeit

Daniel Schneller

Pseudo-Localization for Cocoa Apps

Locali… what? Simply speaking, localizing an application means translating all output it produces on the screen (and printouts etc.) to the language of the people using it. There is more to it, though, than a simple translation of messages. You should...

Java
iOS
Software development

23.10.2013 | 14 Minuten Lesezeit

Daniel Schneller

SSL: Man in the middle? – No, thank you!

At DWX Developer Week I recently gave a talk on SSL and man in the middle attacks. Due to the popular demand (and some internal scheduling issues) I repeated it again internally. However, the recording of that is available on the codecentric YouTube ...

2.7.2013 | 1 Minuten Lesezeit

Daniel Schneller

Easier JBehave steps with variants

In an earlier post we offered an introduction to the JBehave project for automatic acceptance testing. While that article focused on setup and general use of the framework, this time I will concentrate on a recent addition I wrote and contributed to...

Agile
Java

1.4.2012 | 4 Minuten Lesezeit

Daniel Schneller

SOAP Webservices mit iOS

Betrachtet man APIs für aktuelle Web-Plattformen wie Soziale Netzwerke, die Amazon Web Services, Fotodienste à la Flickr oder Instagram und zahllose mehr, so könnte der Eindruck entstehen, REST hätte als der Kommunikation mit entfernten Diensten zu ...

Java
API

2.1.2012 | 5 Minuten Lesezeit

Daniel Schneller

Using JMeter to measure binary protocols

In a recent project I developed a bridge component to connect a backend web service with a credit-card terminal. The terminal can only speak a binary protocol. The bridge needs to map the binary messages to the corresponding backend calls. If you are...

Java
APM

9.5.2011 | 6 Minuten Lesezeit

Daniel Schneller

droidcon 2011

Vom 23. bis 24. März fand in der Urania in Berlin die droidcon.2011 statt. Neben zahlreichen Ausstellern im Expo Bereich, die bislang teilweise noch nicht (in Deutschland) erhältliche Produkte, darunter z. B. Motorola mit dem Xoom Tablet und Android...

Android
Community
Mobile

5.4.2011 | 4 Minuten Lesezeit

Daniel Schneller

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Test Fixtures mit JUnit 5

Wir Softwareentwickler leben in einem ständigen Dilemma. Jede Funktionalität der Software sollte durch Unit-Tests und Integrationstest abgesichert werden. Es sollten dabei so viel Tests wie nötig, aber nur so wenige wie möglich geschrieben werden. Schreiben...

Java
Testing
Framework
Softwareentwicklung

25.3.2024 | 7 Minuten Lesezeit

Jens Kaiser

Charge your APIs Volume 23: REST vs. gRPC

APIs dienen als Verbindungsstück zwischen Daten und Verarbeitung und erlauben uns damit, Daten im richtigen Kontext als Informationen zu interpretieren. Passende fachliche Themen sind dabei präsenter denn je und erreichen bald auch den Endverbraucher...

Java
Softwareentwicklung
Spring
Softwarearchitektur
API
Data

11.2.2024 | 7 Minuten Lesezeit

Sebastian Tiemann

Reactive Programming mit Spring Webflux

In diesem Artikel geben wir einen Überblick über Reactive Programming, erläutern, welche Prinzipien diesem zugrunde liegen und wann ein Einsatz sinnvoll sein kann. Anschließend zeigen wir, wie mithilfe des Spring-Webflux-Projekts eine reaktive Anwendung...

Spring
Java
Reactive Programming

11.12.2023 | 13 Minuten Lesezeit

Christian Franzen

Ferdinand Ade

Test-Fixtures: Wozu denn überhaupt?

Für uns Softwareentwickler ist der ultimative Endgegner immer die Komplexität. Wir haben zahlreiche, teils ziemlich mächtige Waffen gesammelt, um in diesen Kämpfen bestehen zu können: Dinge wie Modularisierung, Abstraktion, Lean Development, iteratives...

Testing
Java
Test Driven Development

12.5.2023 | 19 Minuten Lesezeit

Rüdiger zu Dohna

Microstream – das Ende der O/R-Mapper?

Über eine Suche nach Alternativen zu O/R-Mappern und Persistenz-Frameworks für NoSQL-Datenbanken bin ich auf Microstream aufmerksam geworden und war ziemlich schnell interessiert. Zum einen, weil Microstream wie ich aus der Oberpfalz kommt, aber haupts...

Java
Datenbank
Softwarearchitektur

29.9.2022 | 13 Minuten Lesezeit

Felix Rieß

Streaming Wikipedia mit Apache Kafka

Apache Kafka ist in aller Munde und entwickelt sich im Kontext von verteilten Systemen zum De-facto-Standard als Plattform für Event Streaming. Im Rahmen unserer OffProject Time (Weiterbildungszeit) haben wir uns die Plattform auch näher angeschaut und...

Kotlin
Data
Java
Messaging
Spring

15.8.2022 | 10 Minuten Lesezeit

Christoph Metzger

Felix Rieß

Die Zukunft der IDEs – aus Sicht eines „Java-EE-Entwicklers“

Bei unseren Kunden und auch bei codecentric dreht sich alles um den besten und schnellsten Weg, die richtige Software zu entwickeln – und das natürlich in hoher Qualität. Von daher bin ich auch ein fleißiger Leser des „State of DevOps“-Report (hier zum...

Cloud
Java
Remote Work

16.5.2022 | 11 Minuten Lesezeit

Rainer Vehns

Keycloak.X, aber sicher – ohne bekannte Sicherheitslücken!

TLDR: Wie man die bekannten CVEs (Common Vulnerabilities and Exposures) mit einer eigenen Keycloak-Distribution auf null* reduziert.EinführungKeycloak (s. Website) wird durch die Umstellung auf Quarkus einfacher und robuster, so das Versprechen. Wie...

Java
IT-Security
Keycloak

9.5.2022 | 9 Minuten Lesezeit

Sebastian Rose

Thomas Darimont

Wie man Java-Klassen in Python benutzt

Generell sollte man zwar für jedes Problem das passende Werkzeug nutzen. Aber oftmals wird man gezwungen, den Hammer Java zu nutzen, weil der Rest des Hauses mit diesem Hammer gebaut wurde. Eine moderne Lösung dieses Problems ist natürlich die Microservice...

Künstliche Intelligenz
Java
Python

15.11.2021 | 8 Minuten Lesezeit

Hendrik Schawe

Effizient mit Text, Code und IDEs arbeiten

Hast du dich schon immer gefragt, warum andere Leute ihre Entwicklungsumgebung (Integrated Development Environment, IDE) anders nutzen als du? Ist dir aufgefallen, dass andere beim Programmieren deutlich langsamer oder schneller sind? Kennst du auch ...

Softwareentwicklung
Java

6.10.2021 | 12 Minuten Lesezeit

Jonas Verhoelen

Serverless Java mit AWS – Zwei Jahre Cloud-Native

Vor zwei Jahren haben wir angefangen, ein Kundenprodukt Cloud-Native auf Basis von Serverless, Java und AWS Managed Services umzusetzen. Im Folgenden möchte ich beschreiben, was wir in dieser Zeit gemeinsam gelernt haben und was wir heute besser machen...

Softwarearchitektur
Cloud
Java
Microservices
Serverless
Softwareentwicklung

2.12.2020 | 9 Minuten Lesezeit

Felix Massem

BPMN im Smart Home: Camunda und openHAB

Geschäftsprozessmodellierung und einhergehende Sprachen wie BPMN und DMN sind Begriffe, denen man normalerweise im beruflichen Umfeld begegnet und die im privaten Raum keine Rolle spielen. Natürlich kann man die Prozesse eines Haushalts (aka kleines,...

Java
BPM
Smart Home
IoT

6.4.2020 | 8 Minuten Lesezeit

Stephan Köninger

State Management in Svelte

Teil der Webentwicklung in 2020 sind nicht nur komponentenbasierte Ansätze, sondern ebenso die Nutzung von State-Management-Lösungen. Diese orientieren sich in der Regel an der Flux-Architektur und ihrem prominentesten Vertreter, Redux . Und so ist es...

JavaScript
React
Java

25.2.2020 | 3 Minuten Lesezeit

Daniel Zenzes

Gleich und doch anders: Einführung in Svelte

Verglichen mit den letzten Jahren ist es im JavaScript-Umfeld in letzter Zeit verhältnismäßig ruhig geworden. Gerade im Frontend sind React, Angular und, mit etwas Abstand, Vue etabliert und erfreuen sich einer wachsenden Nutzerbasis. Komponentenbasierte...

JavaScript
Java

18.2.2020 | 4 Minuten Lesezeit

Daniel Zenzes

Synchroner Batch mit Mule 4

Während in Mule 3 der Batch noch eine eigenständige Komponente war und Batches sich in der Konfiguration auf der gleichen Ebene wie Flows befanden, ist der Batch in Mule 4 zu einem sogenannten Scope geworden, der jetzt innerhalb eines Flows lebt. Auf...

Java
APM
JavaScript
Integration

28.1.2020 | 5 Minuten Lesezeit

Roger Butenuth

Was ist GraalVM?

Als ich anfing, mich genauer mit GraalVM zu beschäftigen, hatte ich nur eine grobe Vorstellung davon, was sich hinter der Bezeichnung eigentlich verbirgt. Beim Lesen der ersten Artikel zum Thema war ich geradezu verwirrt. Was ist GraalVM denn nun? Ein...

Java

23.1.2020 | 5 Minuten Lesezeit

Timo

Schnelles Entwickeln mit Kubernetes in Azure

Kubernetes ist die de facto Deployment-Umgebung für moderne Microservice-Architekturen. Alle großen Cloud-Anbieter haben daher Angebote für Kubernetes, die durch zahlreiche Features ergänzt werden, die Ressourcen des jeweiligen Anbieters intelligent ...

Cloud
Java
Microservices
Azure
Kubernetes

31.7.2019 | 5 Minuten Lesezeit

Christian Sauer

RESTful Webservices mit Quarkus

Im ersten Artikel zu Quarkus wurde beschrieben, wie man es nutzen kann und was die theoretischen Hintergründe sind. In diesem Artikel wird beleuchtet, wie mit Quarkus eine vollständige REST-Anwendung erstellt werden kann. In der Anwendung werden verschiedene...

Java
Cloud
Microservices
API

3.6.2019 | 7 Minuten Lesezeit

Enno Lohmann

Quarkus macht Java fit für die Cloud

Vor über zwanzig Jahren wurde Java vorgestellt, und es ist bis heute eine der erfolgreichsten Programmiersprachen. Durch sein Alter ist Java jedoch nicht auf die Cloud optimiert und fällt hier hinter anderen Sprachen zurück. Java ist eher auf ein monolithisches...

Cloud
Java
Microservices

9.4.2019 | 7 Minuten Lesezeit

Enno Lohmann

Springfox Swagger Extensions für Spring Security

Eine populäre Methode, um REST APIs zu dokumentieren ist Swagger 2. Für Spring(-Boot)-Projekte bietet sich Springfox an. Springfox integriert sich recht nahtlos in ein Spring-Projekt und stellt für konfigurierte REST Endpoints eine Browser-basierte ...

Java
IT-Security
Spring

1.11.2018 | 8 Minuten Lesezeit

Henning Waack

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Du stehst vor einer großen IT-Herausforderung? Wir sorgen für eine maßgeschneiderte Unterstützung. Informiere dich jetzt.

Hilf uns, noch besser zu werden.

Wir sind immer auf der Suche nach neuen Talenten. Auch für dich ist die passende Stelle dabei.

Contact

Send

Why good metrics values do not equal good quality

Metrics and Tools

Cyclomatic Complexity

Test Coverage

and so on…

Was this post helpful?

Ja

Blog author

Get in contact

Get in contact

More articles

XFS: Possible Memory Allocation Deadlock in kmem_alloc

True KVM Live Migration with OpenStack Icehouse and Ceph based VM storage

Rate Limiting based on HTTP headers with HAProxy

Localizing Mobile Apps

Jinja2 for better Ansible playbooks and templates

Ansible: Simple yet powerful automation

SSH Two-Factor Authentication with Duo Security

Pseudo-Localization for Cocoa Apps

SSL: Man in the middle? – No, thank you!

Easier JBehave steps with variants

SOAP Webservices mit iOS

Using JMeter to measure binary protocols

droidcon 2011

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

View Job

More articles in this subject area

Test Fixtures mit JUnit 5

Charge your APIs Volume 23: REST vs. gRPC

Reactive Programming mit Spring Webflux

Test-Fixtures: Wozu denn überhaupt?

Microstream – das Ende der O/R-Mapper?

Streaming Wikipedia mit Apache Kafka

Die Zukunft der IDEs – aus Sicht eines „Java-EE-Entwicklers“

Keycloak.X, aber sicher – ohne bekannte Sicherheitslücken!

Wie man Java-Klassen in Python benutzt

Effizient mit Text, Code und IDEs arbeiten

Serverless Java mit AWS – Zwei Jahre Cloud-Native

BPMN im Smart Home: Camunda und openHAB

State Management in Svelte

Gleich und doch anders: Einführung in Svelte

Synchroner Batch mit Mule 4

Was ist GraalVM?

Schnelles Entwickeln mit Kubernetes in Azure

RESTful Webservices mit Quarkus

Quarkus macht Java fit für die Cloud

Springfox Swagger Extensions für Spring Security

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Unsere Leistungen

Hilf uns, noch besser zu werden.

Zu den Jobangeboten