Databricks has announced its acquisition of Tabular, Inc., a data management startup founded by Ryan Blue, Daniel Weeks, and Jason Reid.
This acquisition brings together the original creators of Apache Iceberg™ and Linux Foundation Delta Lake, two leading open-source lakehouse formats.
The collaboration aims to lead the way in data compatibility, eliminating limitations based on lakehouse formats.
The Rise of Lakehouse Architecture and Format Incompatibility
Databricks pioneered the lakehouse architecture in 2020, integrating traditional data warehousing workloads with AI workloads on a single, governed copy of data.
This architecture requires all data to be in an open format, allowing different workloads, applications, and engines to access the same data.
With ANYRUN You can Analyze any URL, Files & Email for Malicious Activity : Start your Analysis
The lakehouse architecture has revolutionized enterprise productivity by democratizing data access, contrasting with proprietary data warehouses that create vendor lock-in.
According to a survey by the MIT Technology Review, 74% of enterprises have deployed a lakehouse, highlighting the architecture’s widespread adoption.
The foundation of the lakehouse is open-source data formats that enable ACID transactions on data stored in object storage, improving the reliability and performance of data operations.
The Road to Interoperability
Databricks introduced Delta Lake UniForm last year to address the challenges of format incompatibility. Uniform tables provide interoperability across Delta Lake, Iceberg, and Hudi, supporting the Iceberg restful catalog interface.
This allows companies to use familiar analytics engines and tools across all their data.
With the addition of the original Iceberg team, Databricks plans to invest heavily in broadening the ambitions of Delta Lake UniForm, aiming for more excellent compatibility and interoperability.
Both Databricks and Tabular have a history of championing open-source formats.
Databricks, the largest and most successful independent open-source company by revenue, has donated 12 million lines of code to open-source projects.
This acquisition underscores Databricks’ commitment to open formats and open-source data in the cloud, ensuring companies remain in control of their data and free from proprietary vendor lock-in.
The acquisition of Tabular by Databricks marks a significant step towards achieving data interoperability and enhancing the lakehouse architecture.
By bringing together the creators of Apache Iceberg and Delta Lake, Databricks is poised to lead the way in data compatibility, ensuring that enterprises can maximize the value of their data without being constrained by format limitations.
The collaboration promises a future where data interoperability is the norm, driving innovation and productivity across industries.
Looking for Full Data Breach Protection? Try Cynet's All-in-One Cybersecurity Platform for MSPs:
Try Free Demo