Margaret Rouse is an award-winning technical writer and teacher known for her ability to explain complex technical subjects simply to a non-technical, business audience. Over…
A data lakehouse is a unified storage architecture that combines the cost benefits of a data lake with the analytic benefits of a data warehouse.
An important purpose of a data lakehouse is to make it easier for machine learning engineers (MLEs) to use the same large data sets for different types of artificial intelligence (AI) workloads.
A data lakehouse architecture has five layers:
A data lakehouse allows the same unified storage layer to be used for multiple purposes — including predictive analytics, prescriptive analytics, deep learning and reporting.
This emerging architecture uses metadata to combine the flexibility of a data lake with the benefits of a data warehouse. Popular data lakehouse vendors include:
Cloudera – this open source, open standards-based data lakehouse is built on Apache Iceberg’s open table format.
Databricks – the Databricks Lakehouse Platform can be delivered and managed as a service on AWS, Microsoft Azure and Google Cloud.
Dremio – provides fully-managed services designed to help customers experiment with using a lakehouse architecture with less TCO.
Snowflake – integrates subject-specific data marts, data warehouses and data lakes into a single source of truth (SSOT) that can be used to power different types of workloads.
Techopedia’s editorial policy is centered on delivering thoroughly researched, accurate, and unbiased content. We uphold strict sourcing standards, and each page undergoes diligent review by our team of top technology experts and seasoned editors. This process ensures the integrity, relevance, and value of our content for our readers.
Margaret is an award-winning technical writer and teacher known for her ability to explain complex technical subjects to a non-technical business audience. Over the past twenty years, her IT definitions have been published by Que in an encyclopedia of technology terms and cited in articles by the New York Times, Time Magazine, USA Today, ZDNet, PC Magazine, and Discovery Magazine. She joined Techopedia in 2011. Margaret's idea of a fun day is helping IT and business professionals learn to speak each other’s highly specialized languages.
What is Security Service Edge (SSE)? Security Service Edge (SSE) is an information and communication technology (ICT) market segment that...
Margaret RouseTechnology Expert
What Is Cloud Robotics? Cloud robotics is a concept where robots are connected to the cloud, enabling them to access...
Marshall GunnellIT & Cybersecurity Expert
What Does Edge Computing Mean? Edge computing is a distributed network architecture that processes data as close to its source...
Trending NewsLatest GuidesReviewsTerm of the Day