Can you win the stacking challenge? An example of heuristic optimization

27.3.2019 | 9 minutes of reading time

I have come across an interesting optimization problem. The task is to stack the items of a given set of boxes of different sizes, weights, and stabilities onto as few pallets as possible. Moreover, there is a multitude of additional conditions that have to be met. To name only a few:

Each pallet must consist of no more than four non-overlapping stacks, one in each quarter of the pallet.
The additional weight that can be stacked onto a box is limited. Each box has a specific stacking weight it can carry.
To ensure transport safety, a pallet must not exceed a certain degree of top-heaviness.
Some types of boxes are only allowed to be stacked onto one another if a certain order is kept. So, for example, you can stack box A onto box B, but not the other way around.
The maximum height of each stack is limited and the limit should be exploited wherever possible.

Optimization problems of this kind first of all have to be translated into a mathematical expression. The result of the translation process is a (multivariate) function \(f\left(\boldsymbol{x}\right), \boldsymbol{x}=\left[x_{1}, x_{2},…,x_{n}\right]\). Since all common optimization algorithms calculate the minimum of a function, the function \(f\) must be formulated accordingly. Unfortunately, optimization problems like the one mentioned above usually result in extremely high-dimensional functions that are neither continuous nor unimodal. Therefore, classical, derivative-based optimization algorithms are not very effective at this point. In such cases, so-called heuristic optimization methods represent a good alternative. In a nutshell, heuristic optimization algorithms start out by generating a set of random inputs to the function to be minimized. Depending on the associated outputs, new inputs are generated, which hopefully yield a function output that is lower than the lowest one of the previous iteration. The generation of new inputs is often inspired by phenomena in nature and in particular in the animal world (particle swarm algorithm, genetic algorithm, ant colony algorithm, grey wolf algorithm, etc.) and physics (harmony search, wind-driven optimization, simulated annealing).

The following YouTube video shows how the traveling salesman problem is solved by using the simulated annealing algorithm. In this problem, a list of cities that the traveling salesman should visit is given. At the end of the journey, he has to return to the first city and no other city may be visited twice. The goal is to determine the shortest possible route.

By loading the video, you agree to YouTube’s privacy policy.
Learn more

Load video

Always unblock YouTube

The complexity of such optimization problems usually leads to the circumstance that a global optimum can hardly be found. Therefore, the aim is rather to find a decent solution within a predetermined time period.

To give an example how problems of the above kinds can be tackled, I have made up a little example problem (which is much harder than the traveling salesman problem 🙂 ): the stacking challenge. (Python sourcecode: The stacking challenge ).

The stacking challenge

Five types of boxes are given, blue (0), red (1), yellow (2), green (3) and orange (4) in which the colors will be coded as indices. The vector of existing box types reads: \(B = \left[0, 1, 2, 3, 4\right]\).
We have a certain number of boxes of each type: \(N = \left[5, 10, 7, 9, 10\right]\).
Each box type has a height: \(H = \left[50, 70, 100, 110, 150\right]\).
Each box type has a weight: \(W = \left[50, 80, 30, 10, 100\right]\).
Each box type can carry a certain weight on top: \(T = \left[500, 1000, 2000, 600, 600\right]\).

Some stacking combinations are forbidden and will be coded as vectors. An example: If a red box cannot be stacked onto a blue box, we have \(f=\left[0,1\right]\). If a yellow box cannot be stacked onto a green box, we have \(f=\left[3,2\right]\). The forbidden combinations will be summarized as follows: \(F = \left[f_{1}, f_{2},..\right]\).
For the challenge, the forbidden combinations read: \(F = \left[\left[1, 3\right] (\text{no green on red}), \left[4, 1\right] (\text{no orange on red}), \left[0, 2\right] (\text{no yellow on blue}), \left[3, 0\right] (\text{no blue on green})\right]\).

The challenge: Build a stack as tall as possible, making sure that no box carries too much weight on top and that no forbidden combinations occur. Additionally, at least 10% of the boxes must be red.

Simplified version

We start out by investigating a reduced version of the problem:
\(B = \left[0, 1, 2, 3, 4\right]\)
Number of boxes of each box type: \(N = \left[2, 1, 1, 1, 1\right]\).
Height: \(H = \left[10, 20, 30, 40, 50\right]\).
Weight: \(W = \left[20, 30, 20, 50, 10\right]\).
Weight a box type can carry on top: \(T = \left[30, 20, 40, 40, 50\right]\).
Forbidden combinations: \(F = \left[\left[1, 3\right]\right]\) (no green on red).
Additionally: At least 10% of the boxes must be red.

In order to code this information, we first generate the so-called items vector, which holds all boxes we can choose from, ordered by index type. The items vector reads:
\(\text{Items_vector} = \left[0, 0, 1, 2, 3, 4\right]\), because we have two boxes of the type 0 and one box of each of the remaining types. Putting all constraints aside, a stack consists of an arbitrary subset of the elements of the items vector. Here, of course, the case of taking all elements is also included. Furthermore, the elements of this subset can be ordered in any manner. Here are some example stacks, in which the first element represents the box at the bottom: \(\left[0, 2, 3, 4\right]\) (blue, yellow, green, orange), \(\left[0, 0, 1\right]\) (blue, blue, red), \(\left[0, 0, 4, 3, 2, 1\right]\) (blue, blue, orange, green, yellow, red), \(\left[4\right]\) (orange).

1. Constructing a stack

To code this, we draw six random numbers between 0 and 1 (or some other range):
\(\text{Rand_vect}=\left[0.65, 0.76, 0.11, 0.52, 0.43, 0.90\right]\).
Now, we compute the indices that sort this vector,
\(\left[2, 4, 3, 0, 1, 5\right]\), and arange the elements of the items vector accordingly:
\(\text{Items_vector_ordered} = \left[1, 3, 2, 0, 0, 4\right]\) (red, green, yellow, blue, blue, orange).
Now we draw another random number between 1 and the the number of boxes which we can choose from, which is 6:
\(\text{Rand_num} = 3.2\). Rounding the number yields \(\text{Rand_num} = 3\).
Therefore, we build our stack from the first 3 elements of the ordered items vector:
\(\text{Stack} = \left[1, 3, 2\right]\). Using these two steps enables us to compute every stack possible. Moreover, the height of a stack is calculated as \(\text{Height} = \text{sum}\left(H\left[\text{Stack}\right]\right)\) which is \(\text{Height} = \text{sum}\left(\left[20, 40, 30\right]\right)=90\) in this example.

2. Avoiding forbidden orderings

At first, it makes sense to take some complexity out of the optimization procedure. In order to achieve this, we do not penalize for forbidden orderings, but we reorder our stack if necessary. As can be seen, the forbidden order \(\left[1, 3\right]\) is part our stack. Therefore, we exchange these two elements and arrive at the reordered stack: \(\text{Stack_reordered} = \left[3, 1, 2\right]\).
Reordering a stack is always possible, as long as no rules exists, so that two elements can never be stacked onto one another (Example: \(F = \left[\left[1, 3\right], \left[3, 1\right]\right]\)).

3. Checking for overweight

Now we check if a box carries excess weight. From the reordered stack, which holds the used box type indices, we can compute the weights used in the stack and the allowed top weights of the used boxes. Because the bottom box does not add top weight to any other box, the first element can be discarded: \(\text{Stack_weights} = W\left[\text{Stack_reordered}\left[1:\text{end}\right]\right] = \left[30, 20\right]\), \(\text{Stack_weights_allowed} = T\left[\text{Stack_reordered}\left[1:\text{end}\right]\right] = \left[20, 40\right]\). The cumulative weights computed from the vector \(\text{Stack_weights}\) are gathered in the vector \(\text{Stack_cumulative_weights} = \left[50, 20\right]\). So, the first box carries a weight of 50, the second a weight of 20 and the third, of course, no weight. Now we calculate \(\text{Precentage_overweight} = \left[\text{Stack_cumulative_weights}/\text{Stack_weights_allowed}-1\right]\) and keep only the results that are greater than 0 \((^{+})\). With these elements, we then compute \(\text{Weights_tester} = \text{mean}\left(\text{Precentage_overweight}^{+}\right)\). If no elements greater than 0 exist, it is \(\text{Weights_tester} = 0\). In the case where a stack has only one element, the whole procedure is skipped and we also have \(\text{Weights_tester} = 0\). In our example case, it is \(\text{Weights_tester} = 1.5\).

4. Checking for the percentage amount of red boxes

To test for the percentage amount of red boxes in our stack, we simply define:
\(\text{color_tester} = 1-\text{number_of_red_boxes_in_stack}/\text{number_of_boxes_in_stack}\). In the worst case, we have \(\text{Color_tester}=1\). As the percentage amount of red boxes approaches 10%, \(\text{Color_tester}\) approaches 0.9. From that point on, we set \(\text{Color_tester}=0\), as we then have fulfilled the minimum requirement. In our case, we have 33.33% of red boxes and so it is \(\text{Color_tester}=0\).

5. Decoupled optimization

Steps 1. – 4. constitute the function \(f\) which has the input variables \(\text{Rand_vect}\) and \(\text{Rand_num}\) as well as the outputs \(\text{Height}, \text{Weights_tester}\) and \(\text{Color_tester}\). We set \(\text{Test}=\text{Weights_tester} + \text{Color_tester}\) and define the overall result-function to be:

\(
\text{Result} =
\begin{cases}
\phantom{-} \text{Test} & \text{if}\ \ \text{Test} > 0 \\
-\text{Height} & \text{else}
\end{cases}
\).

It is very unlikely that the heuristic optimizer will generate a stack in the first iterations that satisfies the complex constraints. The used result-function decouples the constraints from the overall goal of maximizing the stack height. This means that the heuristic optimizer is first forced to satisfy the constraints, regardless of the height of the stack. From the moment the result-function is zero, it is ensured that the constraints are fulfilled. From then on, only the height of the stack plays a role. In this example the result function yields \(
\text{Result}=1.5\), which indicates that the constraints are not fulfilled yet.

Data

Below the data for the actual challenge is repeated:

\(B = \left[0, 1, 2, 3, 4\right]\)
Number of boxes of each box type: \(N = \left[5, 10, 7, 9, 10\right]\).
Height: \(H = \left[50, 70, 100, 110, 150\right]\).
Weight: \(W = \left[50, 80, 30, 10, 100\right]\).
Weight a box type can carry on top: \(T = \left[500, 1000, 2000, 600, 600\right]\).
Forbidden combinations: \(F = \left[\left[1, 3\right] (\text{no green on red}), \left[4, 1\right] (\text{no orange on red}), \left[0, 2\right] (\text{no yellow on blue}), \left[3, 0\right] (\text{no blue on green})\right]\).
Additionally: At least 10% of the boxes must be red.

As optimizer I have chosen the particle swarm algorithm (Python package Pyswarm). The resulting stack satisfies all constraints, has a height of 2770 and a percentage amount of red boxes of 15.38% (computation time approximately one minute). Can you construct a higher stack? If so, let me know. 🙂

Was this post helpful?

Likes

Blog author

Dominik Ballreich

Do you still have questions? Just send me a message.

fromDominik Ballreich

Better time series forecasting using expert knowledge

Methods for time series forecasting have become more and more powerful in recent decades, ranging form simple linear models to complex machine learning algorithms. Nevertheless, not only the quality of the forecasts is important, but also their acceptance...

Data
Machine Learning
Data Science

15.2.2019 | 6 Minuten Lesezeit

Dominik Ballreich

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Green Cloud: Daten und Emissionen sparen

Das Internet produziert jährlich 900 Millionen Tonnen CO₂ – das ist deutlich mehr als Deutschland insgesamt emittiert. Hauptverantwortlich ist der immer weiter steigende Stromverbrauch beim Transport und der Speicherung von Daten. Wenn ihr kurz darüber...

Cloud
Green IT
Softwarearchitektur
Data

11.3.2024 | 5 Minuten Lesezeit

Dennis

Charge your APIs Volume 23: REST vs. gRPC

APIs dienen als Verbindungsstück zwischen Daten und Verarbeitung und erlauben uns damit, Daten im richtigen Kontext als Informationen zu interpretieren. Passende fachliche Themen sind dabei präsenter denn je und erreichen bald auch den Endverbraucher...

Java
Softwareentwicklung
Spring
Softwarearchitektur
API
Data

11.2.2024 | 7 Minuten Lesezeit

Sebastian Tiemann

Eine Einführung in Federated Learning im industriellen Kontext: Fortgeschritten

Im Bereich des maschinellen Lernens wurde eine lange Zeit angenommen, dass die Eingabedaten von Modellen und Gewichten sicher sei und nicht extrahiert werden könnten. In den letzten Jahren veröffentlichte Forschung hat diese Annahme in Frage gestellt...

Machine Learning
Big Data
Data Science
Data

18.9.2023 | 8 Minuten Lesezeit

Ihsan Kisi

Eine Einführung in Federated Learning im industriellen Kontext: Grundlagen

Mithilfe von Daten können Unternehmen fundiertere Entscheidungen treffen, ihre Arbeitsabläufe optimieren und mit der Kraft des maschinellen Lernens (ML) einen Vorteil in der wettbewerbsintensiven Geschäftswelt erlangen. Allerdings ist der Umgang mit ...

Machine Learning
Data Science
Data
Big Data

25.8.2023 | 7 Minuten Lesezeit

Ihsan Kisi

Bessere SQL-Datenpipelines mit dbt

SQL ist weiterhin aus der Datenanalyse nicht wegzudenken – es ist vergleichsweise einfach zu lernen und Anwender können es ohne zusätzliche Werkzeuge auf einer Datenbank ausführen. Entsprechend ist es bei vielen Datenanalysten und Engineers beliebt. ...

Data

22.2.2023 | 2 Minuten Lesezeit

Matthias Niehoff

Streaming Wikipedia mit Apache Kafka

Apache Kafka ist in aller Munde und entwickelt sich im Kontext von verteilten Systemen zum De-facto-Standard als Plattform für Event Streaming. Im Rahmen unserer OffProject Time (Weiterbildungszeit) haben wir uns die Plattform auch näher angeschaut und...

Kotlin
Data
Java
Messaging
Spring

15.8.2022 | 10 Minuten Lesezeit

Christoph Metzger

Felix Rieß

Einführung in die Welt der Tourenoptimierung – Echte Routen und realistischere...

In diesem Artikel möchte ich euch mit einem Python Jupyter Notebook zeigen, wie ihr Anwendungsfälle der Tourenoptimierung inklusive Nebenbedingungen lösen und visualisieren könnt. Außerdem zeige ich euch, wie ihr mit OpenStreetMaps die Route zwischen...

Data

21.6.2022 | 7 Minuten Lesezeit

Lukas Heidemann

Einführung in die Welt der Tourenoptimierung – Visualisierung und Lösungsverfahren...

In diesem Artikel möchte ich euch zeigen, wie ihr Probleme der Tourenoptimierung in einem Python Jupyter Notebook lösen und visualisieren könnt. Am Beispiel eines Fahrradkurierdienst zeige ich außerdem, wie das Grundproblem um gängige Nebenbedingungen...

Data

16.6.2022 | 9 Minuten Lesezeit

Lukas Heidemann

Einführung in die Welt der Tourenoptimierung (1/3)

In vielen Unternehmen fallen täglich verschiedene Transportprozesse an. Klassische Beispiele sind die Optimierung von Warenein- und ausgängen, die Einsatzplanung von Servicetechnikern oder die optimale Reihenfolge der Auslieferung bei Lieferdiensten....

Data

12.6.2022 | 8 Minuten Lesezeit

Lukas Heidemann

Machine-Learning-Modelle bewerten – Quality Gates etablieren

Die Qualität bzw. Nützlichkeit von Machine-Learning-Modellen lässt sich mit Hilfe von Testdaten und Metriken bewerten. Allerdings in welchem Umfang? Manuell, automatisiert, einmalig, regelmäßig? Manuell lassen sich die ersten Modelle als Ergebnis eines...

Data
Machine Learning
Softwareentwicklung
CI/CD

7.12.2021 | 7 Minuten Lesezeit

Berthold Schulte

Schnelles Training eines Recommendation-Modells durch BigQuery ML

Machine Learning (ML) kann nur durch Modelle in der Produktion Business Value erzeugen. Allerdings kann die Zeitspanne zwischen der Entwicklung der nächsten Iteration eines Modells und dessen Einsatz in einer Produktionsumgebung massiv sein. Dies gilt...

Accelerate
Cloud
Data
Google Cloud
Machine Learning

26.7.2021 | 11 Minuten Lesezeit

Niklas Haas

Timo Böhm

KI, Daten und Infrastruktur – ML-Systeme schnell Ende-zu-Ende verproben...

Heutzutage steht fast alles, was mit den Labels „künstliche Intelligenz (KI)“ oder „Machine Learning (ML)“ versehen ist, für Fortschritt. Seltsamerweise schließt diese Assoziation jedoch häufig die Themen Daten und Dateninfrastruktur nicht ausreichend...

Kultur
Data
Machine Learning

21.6.2021 | 12 Minuten Lesezeit

Marcel Mikl

Schnelles KI-Prototyping mit Google Cloud AutoML Vision

Bei klassischen Machine-Learning-(ML-)Projekten beschäftigen sich Data Scientists häufig längere Zeit (mehrere Monate) mit der Entwicklung eines ML-Modells. Dabei werden hohe Kosten verursacht und die Zeit, bis ein erstes Modell zur Verfügung steht, ...

Cloud
Computer Vision
Data
Künstliche Intelligenz
Google Cloud
Machine Learning

17.5.2021 | 5 Minuten Lesezeit

Nils Bauroth

Sven Rediske

The Good, the Bad and the Ugly: Daten effektiv visualisieren und kommunizieren

Dieser Artikel begleitet meinen Vortrag The Good, the Bad and the Ugly: Daten effektiv visualisieren und kommunizieren, den ich am 20.10.2020 auf der data2day gehalten habe.Datenvisualisierung ist ausschlaggebend für Verständnis und KommunikationDatenvisualisierung...

Data
Data Science

19.10.2020 | 11 Minuten Lesezeit

Shirin Elsinghorst

KI in der Praxis: Fehlerhafte Bauteile mit Rekognition auf AWS identifizieren

Noch vor kurzer Zeit mussten für den Einsatz von künstlicher Intelligenz (KI) unter großem Aufwand eigene KI-Modelle erstellt werden. Heute ist für viele Anwendungsfälle die Einstiegshürde in die Welt der KI durch Cloud-Computing-Dienste stark gesunken...

Cloud
Computer Vision
Data
Künstliche Intelligenz
Machine Learning
Python

29.7.2020 | 11 Minuten Lesezeit

Marcel Mikl

Nico Axtmann

KI in der Praxis: Fehlerhafte Bauteile mit AutoML in der Google Cloud ...

Noch vor kurzer Zeit war der Einsatz von künstlicher Intelligenz (KI) nur mit großem Aufwand und Konstruktion eigener neuronaler Netze möglich. Heute ist die Einstiegshürde in die Welt der KI durch Cloud-Computing-Dienste stark gesunken. So kann man ...

Cloud
Computer Vision
Data
Python
Machine Learning
Google Cloud
Künstliche Intelligenz

8.7.2020 | 11 Minuten Lesezeit

Nico Axtmann

Marcel Mikl

KI für KMU: (Teil-)Automatisierung der Qualitätskontrolle von Bauteilen

Noch vor kurzer Zeit war der Einsatz von künstlicher Intelligenz (KI) nur mit großem Aufwand und ausreichend Spezialwissen möglich. Hauptsächlich große Internet-Konzerne wie Google, Apple und Facebook hatten das Geld, die Daten und die Expertise, um ...

Data
Machine Learning
Künstliche Intelligenz

6.7.2020 | 7 Minuten Lesezeit

Marcel Mikl

Nico Axtmann

Machine Learning in der Praxis. Eine Mate mit … Matthias Niehoff #EineMateMit

Machine Learning und künstliche Intelligenz sind aktuell in aller Munde und versprechen vielfältige Einsatzmöglichkeiten im Unternehmen. Trotzdem tun sich viele Unternehmen aktuell noch schwer, das Potential der Technologie zu nutzen. „Der Fokus liegt...

Künstliche Intelligenz
Data
Community
Machine Learning

27.5.2020 | 1 Minuten Lesezeit

Matthias Niehoff

Process Mining mit bupaR

Process Mining schafft Transparenz darüber, was wirklich in Unternehmen geschieht. Im Prozessmanagement werden die Idealvorstellungen eines Prozesses meist langwierig definiert. In der Praxis ist die Qualität dieser Beschreibungen jedoch oft nicht eindeutig...

Open Source
Data
Process Management

5.5.2020 | 9 Minuten Lesezeit

Anna Lukas

Wie man Data-Science-Projekte nicht in die PoC-Sackgasse manövriert

Warum gelingt es Data-Science-Initiativen häufig nicht, einen echten Mehrwert zu schaffen? Wir haben einige Ursachen dafür ausgemacht. In diesem Blogpost stellen wir vier typische Fallen für Data-Science-Projekte vor und geben Tipps, wie Du sie umschiffen...

Machine Learning
Data
Künstliche Intelligenz
Softwareentwicklung

27.3.2020 | 11 Minuten Lesezeit

Marcel Mikl

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Du stehst vor einer großen IT-Herausforderung? Wir sorgen für eine maßgeschneiderte Unterstützung. Informiere dich jetzt.

Hilf uns, noch besser zu werden.

Wir sind immer auf der Suche nach neuen Talenten. Auch für dich ist die passende Stelle dabei.

Contact

Send

Can you win the stacking challenge? An example of heuristic optimization

The stacking challenge

Simplified version

1. Constructing a stack

2. Avoiding forbidden orderings

3. Checking for overweight

4. Checking for the percentage amount of red boxes

5. Decoupled optimization

Data

Was this post helpful?

Ja

Blog author

Get in contact

Get in contact

More articles

Better time series forecasting using expert knowledge

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

View Job

More articles in this subject area

Green Cloud: Daten und Emissionen sparen

Charge your APIs Volume 23: REST vs. gRPC

Eine Einführung in Federated Learning im industriellen Kontext: Fortgeschritten

Eine Einführung in Federated Learning im industriellen Kontext: Grundlagen

Bessere SQL-Datenpipelines mit dbt

Streaming Wikipedia mit Apache Kafka

Einführung in die Welt der Tourenoptimierung – Echte Routen und realistischere...

Einführung in die Welt der Tourenoptimierung – Visualisierung und Lösungsverfahren...

Einführung in die Welt der Tourenoptimierung (1/3)

Machine-Learning-Modelle bewerten – Quality Gates etablieren

Schnelles Training eines Recommendation-Modells durch BigQuery ML

KI, Daten und Infrastruktur – ML-Systeme schnell Ende-zu-Ende verproben...

Schnelles KI-Prototyping mit Google Cloud AutoML Vision

The Good, the Bad and the Ugly: Daten effektiv visualisieren und kommunizieren

KI in der Praxis: Fehlerhafte Bauteile mit Rekognition auf AWS identifizieren

KI in der Praxis: Fehlerhafte Bauteile mit AutoML in der Google Cloud ...

KI für KMU: (Teil-)Automatisierung der Qualitätskontrolle von Bauteilen

Machine Learning in der Praxis. Eine Mate mit … Matthias Niehoff #EineMateMit

Process Mining mit bupaR

Wie man Data-Science-Projekte nicht in die PoC-Sackgasse manövriert

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Unsere Leistungen

Hilf uns, noch besser zu werden.

Zu den Jobangeboten