Thinking AI means re-thinking data 05/28/20 by Marcel Mikl While doing AI is sexy and cool, data infrastructure is considered neither, even though machine learning applications rely on it.
From PDF data sheets to shared understanding with serverless SHACL 04/02/20 by Matthias Frank In this article we implement a generic approach to prepare formalized knowledge for a shared understanding by extracting the information contained in arbitrary PDF data sheets.
Thoughts after completing the Coursera “Data Engineering, Big Data, and Machine Learning on GCP Specialization” 09/09/19 by Niklas Haas Having worked with Google Cloud Platform’s Big Data services for almost a year, I wanted to gain a broader view of GCP’s capabilities.
Hands-on Spark intro: Cross Join customers and products with business logic 08/06/19 by Niklas Haas In this blog post, I want to share the aha moments I had during the development of my...
Window Functions in Stream Analytics 10/12/18 by Frank Rosner Introduction to Stream Analytics Why should we talk about stream analytics? In the past decades data analytics was dominated by...
ETL with Kafka 03/02/18 by Akhlaq Malik You don’t need to write any code to push data into Kafka; instead, just choose your connector and start the job with the necessary configuration. And it’s completely open source!
Deep Learning Workshop at codecentric AG in Solingen 02/06/18 by Shirin Elsinghorst and Maria Hartmann How does Big Data Analysis work? To find that out, I participated in the workshop on "Deep Learning with Keras and TensorFlow".
Change Streams in MongoDB 3.6 01/15/18 by Tobias Trelle MongoDB 3.6 introduces an interesting API enhancement called change streams. With change streams you can watch for changes to certain collections by means of the driver API.
Validating Topic Configurations in Apache Kafka 12/07/17 by Florian Troßbach Messages in Apache Kafka are appended to (partitions of) a topic. Topics have a partition count, a replication factor and various other configuration values. What could possibly go wrong?