A data lakehouse is a data management architecture that combines the strengths of data lakes with those of data warehouses. It pairs the inexpensive, reliable storage of a data lake with the data management and data structure capabilities of a data warehouse, so that unstructured data can be stored and used without sacrificing performance. Because both sets of capabilities live in a single system, data teams can move faster than they could if they had to work across separate lake and warehouse systems. Data lakehouses are enabled by new query engine designs that run high-performance SQL directly against data held in the lake. Lakehouses also separate storage from compute, so that each can be scaled independently depending on the needs of a particular job.
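The separation of storage and compute described above can be illustrated with a deliberately small sketch. It is a toy model under stated assumptions, not how any production lakehouse engine is built: a temporary directory of CSV files stands in for the lake's file storage, and a short-lived in-memory SQLite connection stands in for the SQL query engine. The function names (`write_lake_file`, `query_lake`, `demo`) and the `events` table with its `user_id` and `amount` columns are hypothetical, chosen only for illustration.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path


def write_lake_file(lake_dir: Path, name: str, rows: list[dict]) -> Path:
    """Storage layer: persist rows as a plain CSV file (stand-in for lake storage)."""
    path = lake_dir / name
    with path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
    return path


def query_lake(lake_dir: Path, sql: str) -> list[tuple]:
    """Compute layer: a throwaway SQL engine that reads the lake files on demand."""
    conn = sqlite3.connect(":memory:")  # holds no durable state of its own
    try:
        # Hypothetical schema applied to the raw files at query time.
        conn.execute("CREATE TABLE events (user_id TEXT, amount REAL)")
        for path in sorted(lake_dir.glob("*.csv")):
            with path.open(newline="") as f:
                rows = [(r["user_id"], float(r["amount"])) for r in csv.DictReader(f)]
            conn.executemany("INSERT INTO events VALUES (?, ?)", rows)
        return conn.execute(sql).fetchall()
    finally:
        conn.close()  # the engine goes away; the files in the lake remain


def demo() -> list[tuple]:
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)
        write_lake_file(lake, "part-0.csv", [
            {"user_id": "a", "amount": 10.0},
            {"user_id": "b", "amount": 5.0},
        ])
        write_lake_file(lake, "part-1.csv", [{"user_id": "a", "amount": 2.5}])
        return query_lake(
            lake,
            "SELECT user_id, SUM(amount) FROM events GROUP BY user_id ORDER BY user_id",
        )


if __name__ == "__main__":
    print(demo())
```

In this sketch the files outlive every query, and any number of `query_lake` calls could be started or discarded without touching them, which is the property that lets storage and compute scale independently. A real lakehouse would typically use a columnar file format and a table layer that tracks metadata and transactions, rather than reloading CSV files for each query.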

