Building the next generation of data engineering with Delta Lake #sparkaisummit
Sameer Paranjpye’s Post
-
Apache Spark and Delta Lake continue to lead in developing new, best-in-class data processing functionality and making it openly available.
This is a BIG DEAL: open variant data type. Makes JSON and semi-structured data processing 10x faster without losing the flexibility. We have heard from many customers that they really want this data type that’s available in some proprietary data warehouses, but don’t like the lock-in. So here you go. An open Variant type in open source Apache Spark and Delta Lake. https://lnkd.in/gJwD6mgh
Introducing the Open Variant Data Type in Delta Lake and Apache Spark
databricks.com
-
Databricks' ingestion capabilities are about to get a huge boost by joining forces with Arcion Labs. Looking forward to working with Rajkumar Sen and the Arcion team! https://lnkd.in/gna4Mq7S
Databricks Agrees to Acquire Arcion, the Leading Provider for Real-Time Enterprise Data Replication Technology
databricks.com
-
Had an amazing week at our Bengaluru R&D site! Energizing to be around such a great set of folks, and super excited for all the new projects kicking off at the new center. Rohit A, Prasad Deshpande, Pulkit Singhal, Jeffry Issac, Xavier S Raj, shivaraj Krishnan, Gopala KrishnaMurthy Sangesapu, and many other Bricksters have come together to build our new site! https://lnkd.in/gAcCrRM6
-
Do your data crunching with Spark and program in plain English. Try out the Spark English SDK!
The hottest new programming language is English! The English SDK for Apache Spark is now available. You can download it at http://pyspark.ai/. Give it a try today! We welcome your contributions! #apachespark #pyspark #chatgpt4 #llm #databricks #dais2023
Introducing English as the New Programming Language for Apache Spark
databricks.com
-
Sameer Paranjpye reposted this
STANDING ROOM ONLY for "Introduction to Data Streaming on the Lakehouse" with Zoé Durand & Yue Zhang! #DAIS #databricks #datastreaming
-
Sameer Paranjpye reposted this
Good morning, #DataAISummit 😎 Whether you're joining us in-person or virtually, we want to hear from you! What's on your agenda this week?
-
Sameer Paranjpye reposted this
Super excited to launch the preview of Unity Catalog's Apache Hive Metastore API, which allows any system that understands Hive to connect to Unity! This is a big part of our open platform philosophy, so that customers can use their favorite analytics systems while centralizing and simplifying their data architecture. Apache Hive is the most widely used catalog API in the industry, so lots of software knows how to connect to it, including Amazon Athena, Presto, Trino, Apache Spark and more. All of these work with our preview!
Extending Databricks Unity Catalog with an Open Apache Hive Metastore API
databricks.com
-
Databricks docs, now powered by #LLMs!
Did you all notice the new #AI assistant on Databricks docs pages? Just ask your question and get the answer with a link to the best page to read. If you wish to learn how to build such applications using LLMs, register for this free course on #EdX: https://lnkd.in/gDfHQ_in
-
Chief Technology Officer at U.S. Bank
Terrific work! Congrats Sameer!!