Introduction In the ever-evolving landscape of data management, adaptability and choice are key. Stackable, renowned for its open and flexible Stackable Data Platform, continues to pioneer innovations that empower businesses to tailor their data architectures to unique needs. The latest … Read More
Author Archives: Sönke Liebau
A Modern Data Lakehouse: Stackable and dbt on the TPC-H Dataset
Introduction At Stackable, we like to push the limits of what our Stackable Data Platform can do, whether it’s the millisecond performance of HBase on Kubernetes (https://stackable.tech/hbase-performance-benchmark/) or connecting to legacy databases via Hive (https://stackable.tech/stackable-and-trino-part-3-migrating-hive-tables-using-ctas/). This time we focused on … Read More
Spark on Kubernetes Operators – A Comparison
Introduction Apache Spark has come a long way since it became a top-level Apache project in 2014. It was originally envisioned as a more general model for working with data than the earlier MapReduce concept. It has been widely adopted since then and … Read More
Tools To Make Interacting With Your Kubernetes Clusters A Breeze
How do you take the busywork out of handling Kubernetes and finding out what is going on in your cluster? If you feel like you have written quite enough kubectl commands for a while, this is for you! In this post, we will … Read More
Building a Kubernetes-Hosted Web UI for Our Data Platform
Approachability is not one of the biggest strengths of Kubernetes. Anybody who can’t express their questions in the form of kubectl commands will have a hard time finding out what’s going on inside a cluster. Not that it is an easy task. The … Read More
Benchmarking Apache HBase on Kubernetes
TL;DR In this blog post, I describe a benchmarking exercise we conducted to compare the performance impact of running HBase and HDFS on Kubernetes versus running them on bare metal. The blog post explains the challenges of running stateful workloads on … Read More
What Were All Those Project Names on the KubeCon Europe 2023 Page?
When checking out this year’s KubeCon in Amsterdam, the following text snippet caught my attention: “Join containerd, CoreDNS, Envoy, etcd, Fluentd, Harbor, Helm, Jaeger, Kubernetes, Linkerd, Open Policy Agent, Prometheus, Rook, TiKV, TUF, Vitess, Argo, Backstage, Buildpacks, Chaos Mesh, Cilium, … Read More
Stumbling Over ‘ubi-8-appstream’ In ‘ubi8/ubi-minimal’
Today our Docker image builds started crashing out of the blue. The following error message showed up: The offending line was calling microdnf with --enablerepo=ubi-8-appstream. Suddenly, images that were building just fine in the morning stopped doing so. This might be a very … Read More
Deleting Custom Resources in a Kubernetes Namespace
The custom resources I thought I had deleted were in fact around all this time!? When working with Kubernetes and custom resources, you need to be able to go back to a clean state. Unfortunately, almost all kubectl commands ignore … Read More
7 Useful CLI Tools You Can Give a Try
What CLI tools do you LOVE to use? Is there something you wouldn’t want to miss? I asked this question in the Stackable Slack a few days ago, got some cool replies, and wanted to share them. Interested in … Read More