DVC dependency management – a guide

26.8.2019 | 10 minutes of reading time

This post is a follow-up to A walkthrough of DVC that deals with managing dependencies between DVC projects. In particular, this follow-up is about importing specific versions of an artifact (e.g. a trained model or a dataset) from one DVC project into another.

We discuss two recently added DVC features, namely dvc get and dvc import.

Intuitively, dvc get downloads an artifact from a DVC project (where DVC takes care of resolving the project’s DVC cache and its remotes). For example, dvc get is a very helpful tool in delivering your model to production in a CI/CD pipeline.
Think of dvc import as dvc get plus managing version information for the downloaded artifact. That is, using dvc import you can manage artifacts from another DVC project like you would manage a software library dependency in a software engineering project. For example, dvc import helps you when organizing dataset registries.

You might want to browse through the walkthrough first, so that you can get the most out of this post. For those of you in a hurry, we provide a quick recap of the original walkthrough . In the first part of this article we create a playground project which we use as a DVC dependency hands-on in the second part. If you’re in a hurry, still, note that part one can be skipped. A publicly accessible playground project is provided, so that you can step right into the hands-on fun in part two.

A real-world application of dvc get and dvc import will be discussed in an upcoming post. So stay tuned for more 😀

Recap of the original walkthrough

In A walkthrough of DVC , we trained a classifier for hand-written numbers. The walkthrough showed how implementing a DVC-pipeline makes all of data loading, preprocessing, training, performance evaluation, etc. fully reproducible. The gist is that DVC versions training data, (hyper-)parameters, code and trained models together. In particular, a DVC project resides in a Git repository, which implements all necessary versioning (see the following figure). Binary data, such as e.g. training data and trained models, are located in DVC’s so-called cache. In particular, for each version of the pipeline, the cache contains different versions of all binary data. However, cache data is not stored in the Git repository itself, but in a separate so-called remote (e.g. an Amazon S3 bucket). When checking out a specific version of the pipeline from the Git repository, DVC takes care of fetching cache data that matches the current pipeline version from the remote.

DVC dependency management: Creating the playground

As for the original walkthrough, the GitHub repository for the post you are reading now provides a readily usable working environment. In this environment, you can interactively create the playground number classifier project (or let a script perform all actions for you). Compared to the original walkthrough, the following extensions were implemented. First, the playground project is pushed to GitHub so that it can easily be referenced as a DVC dependency. Secondly, the playground’s DVC cache is now pushed to a remote in an Amazon S3 bucket, so that binary data (e.g. trained models) can also be accessed from the internet.

To prepare the working environment (see the following code block), clone the GitHub repository , change into the cloned directory, and start the working environment using ./start_environment.sh bash. You will be ‘logged in’ to a newly created container. From the prompt in the container, configure variables at the top of the file /home/dvc/scripts/walkthrough_extended.sh to match your GitHub repository and S3 bucket, where both must be empty and writable. (Details of the bucket configuration are discussed in section Configuring the S3 remote .) Then, to automatically perform all steps from the extended walkthrough, you might want to execute /home/dvc/scripts/walkthrough_extended.sh. After the script is finished, the GitHub repository and the S3 bucket will now contain the playground DVC project and its cache data, respectively.

1# $ is the host prompt
2# $$ is the working environment container prompt
3
4$ git clone https://github.com/bbesser/dvc-deps-management
5$ cd dvc-deps-management
6$ vi scripts/walkthrough_extended.sh # configure GitHub and S3 at the top
7$ ./start_environment.sh bash # create and 'log in' to working environment
8$$ /home/dvc/scripts/walkthrough_extended.sh # creates the playground DVC project and its cache

Configuring the S3 remote

In this section, we take a quick look at the part of the scripts/walkthrough_extended.sh script that sets up an S3 bucket as the DVC cache’s remote.

To access an S3 bucket, DVC relies on the boto3 library¹, i.e., boto3 must be given access to the bucket. Therefore, AWS credentials can be provided as environment variables like in the following code block. (For the provided working environment, this configuration has to be done at the top of the scripts/walkthrough_extended.sh script.)

1$$ export AWS_ACCESS_KEY_ID=<YOUR_PLAYGROUND_KEY>
2$$ export AWS_SECRET_ACCESS_KEY=<YOUR_PLAYGROUND_SECRET>
3$$ # from here on, DVC can interact with your bucket

Other means of configuring S3 bucket access for DVC/boto3 are documented here .

Adding an S3 bucket as a remote to a DVC project is the same as adding any other type of remote (see the following code block or scripts/walkthrough_extended.sh). Since the given URI starts with s3://, DVC knows that the remote should reside in a bucket. The -d flag tells DVC that this remote should be used by default. Once the bucket is added, the DVC pipeline’s configuration in .dvc/config should be saved, by committing the changes to Git.

1$$ dvc remote add -d playground_remote s3://<YOUR_BUCKET_NAME>
2$$ git add .dvc/config # save the configuration of the newly added remote

Using a DVC project as a DVC dependency

We discuss how to access data from a DVC project. To be more precise, we want to access some (possibly binary) file processed and/or created by the pipeline defined in the project. E.g., we want to access some “source” data like a set of training images, or the model created by the training stage. Recall that the pipeline and all data is versioned. Consequently, any version of any data can be accessed.

We present two ways to access data, namely dvc get and dvc import. Intuitively, dvc get simply downloads data. dvc import downloads data and additionally considers what version was fetched.

Setup

The working environment given in section Creating the playground should also be used to follow along interactively with the commands presented in this section. If you did not create your own playground project when setting up the environment, you can ignore all steps regarding scripts/walkthrough_extended.sh.

All commands discussed from here on are also available in /home/dvc/scripts/deps_management.sh in the working environment.

To enable DVC to access the playground project, recall that DVC needs to know its GitHub repository’s URL. Also, cache data is located in an S3 bucket remote and read access to that bucket must be provided. Configure your environment as follows:

1$$ export GIT_REPO=<YOUR_PLAYGROUND_REPO_URL>
2$$ export AWS_ACCESS_KEY_ID=<YOUR_PLAYGROUND_KEY>
3$$ export AWS_SECRET_ACCESS_KEY=<YOUR_PLAYGROUND_SECRET>

If you did not create your own playground project, you can use this publicly available playground project . Its URL and credentials are readily configured in /home/dvc/scripts/deps_management.sh. (Note that the given AWS credentials provide no permissions except read access to the S3 bucket.)

dvc get

When getting data from a DVC project, DVC takes care of ‘downloading’ the desired version from the file. In the following example, we download version 0.3 of the trained model from the playground project.

1$$ # replace with your playground GitHub repo (if any)
2$$ GIT_REPO=https://github.com/bbesser/dvc-deps-management-playground.git
3
4$$ dvc get --rev 0.1 $GIT_REPO model/model.h5
5$$ ls
6model.h5 # no other file was downloaded

In particular, observe that no file was downloaded besides model.h5. Consequently, there is no way for DVC to tell which version of model.h5 is in your file system. Hence, issuing the same dvc get command again, model.h5 will be downloaded again.

Recall that the file model.h5 is not part of the Git repository of the DVC project. To download the file, DVC transparently resolves the cache remote of your project, see the following figure. In our example, DVC inspects the following files in the tag 0.3 of $GIT_REPO:

1# file .dvc/cache
2['remote "playground_remote"']
3url = s3://dvc-deps-management.bertatcodecentric.de/dvc-deps-management-playground
4[core]
5remote = playground_remote
6
7# file train.dvc
8# [...]
9outs:
10- md5: 1042d7fd78dd740019699eaaefd1822f
11  path: model/model.h5
12  cache: true
13  metric: false
14  persist: false

From there, DVC deducts that model.h5 is a file with md5 sum 1042d7fd78dd740019699eaaefd1822f in the given remote S3 bucket.

dvc import

dvc import adds version control to dvc get, i.e., dvc import is meant to manage a software project’s dependencies to some DVC project’s data. The project that receives an import must itself be a DVC project (although it does not have to be an ML project). This way, DVC is able to track the desired version of the imported data as the software project evolves over time.

Here is an example of creating a new project that imports data (model.h5) from another DVC project.

1$$ git init
2$$ dvc init
3$$ git add .
4$$ git commit -m 'initialize repository'
5$$ dvc import --rev 0.1 $GIT_REPO model/model.h5
6
7$$ # besides downloading the data,
8$$ # dvc also stores some versioning information ...
9$$ ls
10model.h5  model.h5.dvc
11
12$$ # ... which contains the 'source' of the data
13$$ cat model.h5.dvc
14[...]
15deps:
16- repo:
17    url: https://github.com/bbesser/dvc-deps-management-playground.git
18    [...]
19    rev: '0.1'
20  path: model/model.h5
21[...]

Observe that, following the DVC approach, the imported binary file model.h5 will not be committed to the Git repository. Instead, the import is managed by committing the DVC file model.h5.dvc to our project. Following the naming convention that .dvc files represent stages of a pipeline, a .dvc file for an import is called an import stage.

1$$ git status # model.h5 is not listed as untracked ...
2    .gitignore
3    model.h5.dvc
4$$ cat .gitignore # ... since it's ignored
5/model.h5
6$$ git add model.h5.dvc .gitignore
7$$ git commit -m 'import model.h5 (version 0.1)'
8$$ git tag v0.0.1

As time goes by and your project evolves, a new version of the import becomes available. Updating to the new version is easy:

1$$ dvc import --rev 0.2 $GIT_REPO model/model.h5 # bump revision to 0.2
2$$ # the file model.h5 now contains the newer version of the model
3$$ git status
4    modified:   model.h5.dvc
5$$ git add model.h5.dvc
6$$ git commit -m 'update model.h5 to version 0.2'
7$$ git tag v0.0.2 # version 0.0.2 of our project uses version 0.2 of model.h5

Cloning a project with DVC dependencies

In the project created in the previous section, the Git repository does not contain the binary file model.h5. So, if one of your team members clones this project, she will not receive model.h5.

1$$ # checkout tag v0.0.1 of the software project
2$$ ls
3model.h5.dvc # the model is not part of the git repository

How does she obtain model.h5 without reproducing the entire pipeline? She can reproduce the import stage using dvc repro. (At the time of writing, the stage file has to be manually unlocked/locked. However, there are plans to implement the use case in dvc pull.)

1$$ dvc unlock model.h5.dvc
2$$ dvc repro model.h5.dvc
3$$ dvc lock model.h5.dvc # fix lock
4$$ ls
5model.h5  model.h5.dvc # dvc downloaded version 0.1 of model.h5

When issuing the same commands again, DVC detects that the version of model.h5 did not change, and therefore does not download the data again.

1$$ dvc unlock model.h5.dvc; dvc repro model.h5.dvc; dvc lock model.h5.dvc
2Stage 'model.h5.dvc' didnt change.
3Data and pipelines are up to date.

However, when updating the import for another version, DVC detects that another binary must be fetched.

1$$ git checkout -f v0.0.2
2$$ dvc unlock model.h5.dvc; dvc repro model.h5.dvc; dvc lock model.h5.dvc
3WARNING: Output 'model.h5' of 'model.h5.dvc' changed because it is 'not in cache'
4WARNING: Stage 'model.h5.dvc' changed.
5Reproducing 'model.h5.dvc'
6Importing 'model/model.h5 (https://github.com/bbesser/dvc-deps-management-playground.git)' -> 'model.h5'
7Preparing to download data from 's3://dvc-deps-management.bertatcodecentric.de/dvc-deps-management-playground'
8[...]

After the command has executed, the correct version of model.h5 was downloaded and is available in your file system.

Wrap-up

We introduced you to the DVC commands get, import, and update which allow you to manage artifacts from a DVC project as dependencies. While get simply downloads an artifact, import and update enable you to track the version of the artifact in your software project.

In an upcoming post, we will demo these features in a real-world application. We’re happy to meet you there 😀

Notes

¹ To install boto3 alongside DVC, issue the command pip install 'dvc[s3]', preferably in a virtual environment .

Was this post helpful?

Likes

Blog authors

Bert Besser

Do you still have questions? Just send me a message.

Veronika Schwan

Do you still have questions? Just send me a message.

fromBert Besser & Veronika Schwan

Remote training with GitLab-CI and DVC

In many Data Science projects there is a point in time where the workstation under your desk is not the ideal machine to perform the model training anymore. More potent processors and GPUs are required, e.g. a suitable server in your company’s rack or...

Git
Machine Learning
CI/CD
AI
GitLab

27.1.2020 | 15 Minuten Lesezeit

Marcel Mikl

Bert Besser

A walkthrough of DVC

This post is on how to systematially organize Machine Learning (ML) model development. A model’s performance improves when, e.g., you tune its parameters or when more training data becomes available. To measure improvement, you should track at least ...

Data
AI
Machine Learning
Python

13.3.2019 | 12 Minuten Lesezeit

Bert Besser

Schülerpraktikum bei codecentric

Jeder in der 9. Klasse muss ein Praktikum machen. Nachdem ich einen, diesem hier ähnlichen, Blog-Beitrag gesehen hatte und ihn sehr interessant fand, wollte ich mein Praktikum bei codecentric in Frankfurt machen. Jetzt, am letzten Tag, kann ich sagen...

HR
Python

13.4.2020 | 2 Minuten Lesezeit

Veronika Schwan

Your job at codecentric?

Jobs

Agile Developer und Consultant (w/d/m)

Alle Standorte

Green Cloud: Daten und Emissionen sparen

Das Internet produziert jährlich 900 Millionen Tonnen CO₂ – das ist deutlich mehr als Deutschland insgesamt emittiert. Hauptverantwortlich ist der immer weiter steigende Stromverbrauch beim Transport und der Speicherung von Daten. Wenn ihr kurz darüber...

Cloud
Green IT
Softwarearchitektur
Data

11.3.2024 | 5 Minuten Lesezeit

Dennis

Charge your APIs Volume 23: REST vs. gRPC

APIs dienen als Verbindungsstück zwischen Daten und Verarbeitung und erlauben uns damit, Daten im richtigen Kontext als Informationen zu interpretieren. Passende fachliche Themen sind dabei präsenter denn je und erreichen bald auch den Endverbraucher...

Java
Softwareentwicklung
Spring
Softwarearchitektur
API
Data

11.2.2024 | 7 Minuten Lesezeit

Sebastian Tiemann

Eine Einführung in Federated Learning im industriellen Kontext: Fortgeschritten

Im Bereich des maschinellen Lernens wurde eine lange Zeit angenommen, dass die Eingabedaten von Modellen und Gewichten sicher sei und nicht extrahiert werden könnten. In den letzten Jahren veröffentlichte Forschung hat diese Annahme in Frage gestellt...

Machine Learning
Big Data
Data Science
Data

18.9.2023 | 8 Minuten Lesezeit

Ihsan Kisi

Eine Einführung in Federated Learning im industriellen Kontext: Grundlagen

Mithilfe von Daten können Unternehmen fundiertere Entscheidungen treffen, ihre Arbeitsabläufe optimieren und mit der Kraft des maschinellen Lernens (ML) einen Vorteil in der wettbewerbsintensiven Geschäftswelt erlangen. Allerdings ist der Umgang mit ...

Machine Learning
Data Science
Data
Big Data

25.8.2023 | 7 Minuten Lesezeit

Ihsan Kisi

Große Sprachmodelle: Was ist ein LLM?

Große Sprachmodelle (Large Language Models oder LLM) haben in den letzten Jahren enorme Fortschritte gemacht und spielen eine entscheidende Rolle in verschiedenen Anwendungen. Aber was ist ein LLM? Es ist sinnvoll zu erklären, was ein „einfaches“ Sprachmodell...

Machine Learning

20.6.2023 | 4 Minuten Lesezeit

Elvira Siegel

Bessere SQL-Datenpipelines mit dbt

SQL ist weiterhin aus der Datenanalyse nicht wegzudenken – es ist vergleichsweise einfach zu lernen und Anwender können es ohne zusätzliche Werkzeuge auf einer Datenbank ausführen. Entsprechend ist es bei vielen Datenanalysten und Engineers beliebt. ...

Data

22.2.2023 | 2 Minuten Lesezeit

Matthias Niehoff

Streaming Wikipedia mit Apache Kafka

Apache Kafka ist in aller Munde und entwickelt sich im Kontext von verteilten Systemen zum De-facto-Standard als Plattform für Event Streaming. Im Rahmen unserer OffProject Time (Weiterbildungszeit) haben wir uns die Plattform auch näher angeschaut und...

Kotlin
Data
Java
Messaging
Spring

15.8.2022 | 10 Minuten Lesezeit

Christoph Metzger

Felix Rieß

Einführung in die Welt der Tourenoptimierung – Echte Routen und realistischere...

In diesem Artikel möchte ich euch mit einem Python Jupyter Notebook zeigen, wie ihr Anwendungsfälle der Tourenoptimierung inklusive Nebenbedingungen lösen und visualisieren könnt. Außerdem zeige ich euch, wie ihr mit OpenStreetMaps die Route zwischen...

Data

21.6.2022 | 7 Minuten Lesezeit

Lukas Heidemann

Einführung in die Welt der Tourenoptimierung – Visualisierung und Lösungsverfahren...

In diesem Artikel möchte ich euch zeigen, wie ihr Probleme der Tourenoptimierung in einem Python Jupyter Notebook lösen und visualisieren könnt. Am Beispiel eines Fahrradkurierdienst zeige ich außerdem, wie das Grundproblem um gängige Nebenbedingungen...

Data

16.6.2022 | 9 Minuten Lesezeit

Lukas Heidemann

Einführung in die Welt der Tourenoptimierung (1/3)

In vielen Unternehmen fallen täglich verschiedene Transportprozesse an. Klassische Beispiele sind die Optimierung von Warenein- und ausgängen, die Einsatzplanung von Servicetechnikern oder die optimale Reihenfolge der Auslieferung bei Lieferdiensten....

Data

12.6.2022 | 8 Minuten Lesezeit

Lukas Heidemann

Smart DistancR – Perspektivisch korrekte Distanzmessung zwischen Personen

Die Corona-Krise ist weiterhin in aller Munde und wird uns mit hoher Wahrscheinlichkeit noch etwas länger begleiten. Wie man aus unterschiedlichen Statistiken erfährt, schwanken die Fallzahlen weiter und sorgen für zusätzliche Restriktionen. Diese werden...

Computer Vision
Künstliche Intelligenz
IoT
Machine Learning

13.12.2021 | 7 Minuten Lesezeit

Michel Ehmen

Machine-Learning-Modelle bewerten – Quality Gates etablieren

Die Qualität bzw. Nützlichkeit von Machine-Learning-Modellen lässt sich mit Hilfe von Testdaten und Metriken bewerten. Allerdings in welchem Umfang? Manuell, automatisiert, einmalig, regelmäßig? Manuell lassen sich die ersten Modelle als Ergebnis eines...

Data
Machine Learning
Softwareentwicklung
CI/CD

7.12.2021 | 7 Minuten Lesezeit

Berthold Schulte

Kürzere Time-to-Market für ML-Modelle durch Googles BigQuery ML

Machine Learning (ML) erzeugt erst dann realen Mehrwert, wenn es in Produktion benutzt wird. Allerdings kann die Zeitspanne zwischen der Entwicklung eines belastbaren Modells und dessen Einsatz frustrierend lange sein. Insbesondere in schnelllebigen ...

Agile Methoden
Cloud
Machine Learning

26.7.2021 | 5 Minuten Lesezeit

Timo Böhm

Niklas Haas

Schnelles Training eines Recommendation-Modells durch BigQuery ML

Machine Learning (ML) kann nur durch Modelle in der Produktion Business Value erzeugen. Allerdings kann die Zeitspanne zwischen der Entwicklung der nächsten Iteration eines Modells und dessen Einsatz in einer Produktionsumgebung massiv sein. Dies gilt...

Accelerate
Cloud
Data
Google Cloud
Machine Learning

26.7.2021 | 11 Minuten Lesezeit

Niklas Haas

Timo Böhm

KI, Daten und Infrastruktur – ML-Systeme schnell Ende-zu-Ende verproben...

Heutzutage steht fast alles, was mit den Labels „künstliche Intelligenz (KI)“ oder „Machine Learning (ML)“ versehen ist, für Fortschritt. Seltsamerweise schließt diese Assoziation jedoch häufig die Themen Daten und Dateninfrastruktur nicht ausreichend...

Kultur
Data
Machine Learning

21.6.2021 | 12 Minuten Lesezeit

Marcel Mikl

Schnelles KI-Prototyping mit Google Cloud AutoML Vision

Bei klassischen Machine-Learning-(ML-)Projekten beschäftigen sich Data Scientists häufig längere Zeit (mehrere Monate) mit der Entwicklung eines ML-Modells. Dabei werden hohe Kosten verursacht und die Zeit, bis ein erstes Modell zur Verfügung steht, ...

Cloud
Computer Vision
Data
Künstliche Intelligenz
Google Cloud
Machine Learning

17.5.2021 | 5 Minuten Lesezeit

Nils Bauroth

Sven Rediske

The Good, the Bad and the Ugly: Daten effektiv visualisieren und kommunizieren

Dieser Artikel begleitet meinen Vortrag The Good, the Bad and the Ugly: Daten effektiv visualisieren und kommunizieren, den ich am 20.10.2020 auf der data2day gehalten habe.Datenvisualisierung ist ausschlaggebend für Verständnis und KommunikationDatenvisualisierung...

Data
Data Science

19.10.2020 | 11 Minuten Lesezeit

Shirin Elsinghorst

KI in der Praxis: Fehlerhafte Bauteile mit Rekognition auf AWS identifizieren

Noch vor kurzer Zeit mussten für den Einsatz von künstlicher Intelligenz (KI) unter großem Aufwand eigene KI-Modelle erstellt werden. Heute ist für viele Anwendungsfälle die Einstiegshürde in die Welt der KI durch Cloud-Computing-Dienste stark gesunken...

Cloud
Computer Vision
Data
Künstliche Intelligenz
Machine Learning
Python

29.7.2020 | 11 Minuten Lesezeit

Marcel Mikl

Nico Axtmann

KI in der Praxis: Fehlerhafte Bauteile mit AutoML in der Google Cloud ...

Noch vor kurzer Zeit war der Einsatz von künstlicher Intelligenz (KI) nur mit großem Aufwand und Konstruktion eigener neuronaler Netze möglich. Heute ist die Einstiegshürde in die Welt der KI durch Cloud-Computing-Dienste stark gesunken. So kann man ...

Cloud
Computer Vision
Data
Python
Machine Learning
Google Cloud
Künstliche Intelligenz

8.7.2020 | 11 Minuten Lesezeit

Nico Axtmann

Marcel Mikl

KI für KMU: (Teil-)Automatisierung der Qualitätskontrolle von Bauteilen

Noch vor kurzer Zeit war der Einsatz von künstlicher Intelligenz (KI) nur mit großem Aufwand und ausreichend Spezialwissen möglich. Hauptsächlich große Internet-Konzerne wie Google, Apple und Facebook hatten das Geld, die Daten und die Expertise, um ...

Data
Machine Learning
Künstliche Intelligenz

6.7.2020 | 7 Minuten Lesezeit

Marcel Mikl

Nico Axtmann

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Du stehst vor einer großen IT-Herausforderung? Wir sorgen für eine maßgeschneiderte Unterstützung. Informiere dich jetzt.

Hilf uns, noch besser zu werden.

Wir sind immer auf der Suche nach neuen Talenten. Auch für dich ist die passende Stelle dabei.

Contact

Send

DVC dependency management – a guide

Recap of the original walkthrough

DVC dependency management: Creating the playground

Configuring the S3 remote

Using a DVC project as a DVC dependency

Setup

dvc get

dvc import

Cloning a project with DVC dependencies

Wrap-up

Notes

Was this post helpful?

Ja

Blog authors

Get in contact

Get in contact

Contact Bert

Contact Veronika

More articles

Remote training with GitLab-CI and DVC

A walkthrough of DVC

Schülerpraktikum bei codecentric

Your job at codecentric?

Agile Developer und Consultant (w/d/m)

View Job

More articles in this subject area

Green Cloud: Daten und Emissionen sparen

Charge your APIs Volume 23: REST vs. gRPC

Eine Einführung in Federated Learning im industriellen Kontext: Fortgeschritten

Eine Einführung in Federated Learning im industriellen Kontext: Grundlagen

Große Sprachmodelle: Was ist ein LLM?

Bessere SQL-Datenpipelines mit dbt

Streaming Wikipedia mit Apache Kafka

Einführung in die Welt der Tourenoptimierung – Echte Routen und realistischere...

Einführung in die Welt der Tourenoptimierung – Visualisierung und Lösungsverfahren...

Einführung in die Welt der Tourenoptimierung (1/3)

Smart DistancR – Perspektivisch korrekte Distanzmessung zwischen Personen

Machine-Learning-Modelle bewerten – Quality Gates etablieren

Kürzere Time-to-Market für ML-Modelle durch Googles BigQuery ML

Schnelles Training eines Recommendation-Modells durch BigQuery ML

KI, Daten und Infrastruktur – ML-Systeme schnell Ende-zu-Ende verproben...

Schnelles KI-Prototyping mit Google Cloud AutoML Vision

The Good, the Bad and the Ugly: Daten effektiv visualisieren und kommunizieren

KI in der Praxis: Fehlerhafte Bauteile mit Rekognition auf AWS identifizieren

KI in der Praxis: Fehlerhafte Bauteile mit AutoML in der Google Cloud ...

KI für KMU: (Teil-)Automatisierung der Qualitätskontrolle von Bauteilen

Gemeinsam bessere Projekte umsetzen.

Wir helfen deinem Unternehmen.

Unsere Leistungen

Hilf uns, noch besser zu werden.

Zu den Jobangeboten