lakehouse
Here are 78 public repositories matching this topic...
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
-
Updated
Jul 17, 2024 - Java
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
-
Updated
Jul 16, 2024 - Java
YTsaurus is a scalable and fault-tolerant open-source big data platform.
-
Updated
Jul 16, 2024 - C++
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
-
Updated
Jul 16, 2024 - Java
Use SQL to build ELT pipelines on a data lakehouse.
-
Updated
May 25, 2022 - JavaScript
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
-
Updated
Jul 16, 2024 - Python
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
-
Updated
Jun 17, 2024 - Python
Examples of using Terraform to deploy Databricks resources
-
Updated
Jul 5, 2024 - HCL
Unified storage framework for the entire machine learning lifecycle
-
Updated
Mar 3, 2024 - Python
A curated list of open source tools used in analytical stacks and data engineering ecosystem
-
Updated
May 7, 2024
Lakehouse storage system benchmark
-
Updated
Feb 22, 2023 - Scala
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
-
Updated
Sep 2, 2023 - Dockerfile
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
-
Updated
Dec 2, 2023 - Python
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/
-
Updated
Nov 27, 2023 - Scala
Improve this page
Add a description, image, and links to the lakehouse topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the lakehouse topic, visit your repo's landing page and select "manage topics."