Stackable

Stackable

case-study_discovery

illustration shape

Case Study: Discovery

Success Story, Discovery x Stackable

A Modern Data Platform for AI.

Transforming Data Management in Financial Services.

You can read the Case Study on this page or download the PDF here. 

Discovery South Africa Logo shown in Macbook

Initial situation and challenges

Discovery, one of the largest South Africa-founded financial services organisations, provides most of its local business units with more than 60 data scientists with a comprehensive data platform for advanced analytics and machine learning. Initially, their solution was a complex on-premises setup based on Cloudera, including HBase, YARN, HDFS, Kafka, NiFi and CDSW. However, the team was facing increasingly significant challenges with package management, version conflicts and failed attempts to run YARN with Kubernetes. HDFS was not optimised for their InfiniBand network, resulting in network overhead and poor hardware utilisation. There was a need for a more flexible solution to make updates to frameworks less cumbersome and time-consuming. More agility and scalability was required, particularly for their data science and machine learning workloads. Following a thorough evaluation period, Discovery decided to transition to the Stackable Data Platform.

"The Stackable Data Platform greatly simplifies the management of complex environments with numerous components thanks to its K8s operators. This significantly increases flexibility while reducing operational effort."

Nick Alexander, 
Systems Architect, Discovery Health

"By adopting our Stackable Data Platform, Discovery created a comprehensive data and AI environment optimised down to the last detail. Discovery's forward-thinking approach is truly commendable."

Dr. Stefan Igel,
COO, Stackable

The solution for Discovery Health

Discovery addressed its challenges by migrating to the Stackable Data Platform, which is built on a modular, Kubernetes-native architecture. While the existing hardware could be reused by reallocating resources, the platform’s software components have been completely renewed or updated: The team migrated from HDFS to VAST DataStore and configured Apache HBase® to run on top of it. Other systems such as Apache Hive™ and Impala have been replaced with Trino, which improved flexibility. Apache Iceberg™ was introduced as the new storage format, together with enhanced privacy features. The team also transitioned from Apache Spark™ 2 to Spark 3 to leverage improved performance, GPU acceleration, and the PySpark experience. Finally, the data migration, including 400 TB of storage, was completed over a weekend with a prior dry-run to ensure system stability and minimal downtime.

Result and Successes

Following its migration to the Stackable Data Platform, Discovery has achieved a significant increase in agility, scalability, and operational efficiency across its data architecture. Most business units, including those in insurance and healthcare, now run on shared infrastructure that supports ETL, streaming, and machine learning workloads. The shift to modular, operator-managed components has reduced complexity and staffing needs while enabling the use of new tools such as the AI compute engine Ray alongside Spark. Discovery has significantly improved the performance and flexibility of queries, as well as transitioning to a future-proof data architecture. Enhanced data virtualisation provides seamless access to sources such as Oracle and Netezza. Day-2 operations are made easy with Stackable. Its open approach to system maintenance, monitoring, and observability allows for modular component updates and integration with other third-party solutions. The result is a flexible, modern data stack that supports performance and innovation.

Highlights at a glance

icon_benefits-300x300

High-performance AI data plaform to support over 60 data scientists in several business units

icon_benefits-300x300

Minimized dependency on single vendor, enhancing flexibility and control over technology choices

icon_benefits-300x300

Capability to incrementally upgrade to the latest versions of data products without disruption

icon_benefits-300x300

Trino as central part of the new solution allowing data virtualization across the organization

icon_benefits-300x300

Greater adaptability and innovation through open-source solutions and customization

icon_benefits-300x300

Efficient data migration with minimal downtime

About Discovery

Discovery is a proudly South African-founded financial services organisation that operates in the healthcare, life insurance, shortterm insurance, long-term savings, banking and wellness markets. 

Since inception in 1992, Discovery has been guided by a clear core purpose – to make people healthier and to enhance and protect their lives. They have been able to do this by pioneering the shared-value insurance model, which delivers better health and value for clients, superior actuarial dynamics for the insurer, and a healthier society. The success of the model in the markets where Discovery operates has been testament to its importance to society.

More references

screenshot shows the end to end security with stackable

Discover how Stackable’s data platform enhances security and governance for financial services in this demo. It highlights seamless single sign-on, advanced impersonation for role-based access, and key features like row-level security and data masking to protect sensitive information.

Our specialist for financial services solutions

Sönke Liebau

Sönke Liebau
CPO & Co-Founder of Stackable

Subscribe to our Newsletter

With the Stackable newsletter, you’ll always stay up to date on the latest from Stackable!

illustration of an envelope entering a mail box
An illustration of a laptop and phone on a desk

Newsletter

Subscribe to the newsletter

With the Stackable newsletter you’ll always be up to date when it comes to updates around Stackable!