An Exploration Of The Expectations, Ecosystem, and Realities Of Real-Time Data Applications

Data Engineering Podcast

In this episode, Shruti Bhat gives her view on the state of the ecosystem for real-time data and the work that she and her team at Rockset are doing to make it easier for engineers to build those experiences. Just connect it to your database/data warehouse/data lakehouse/whatever you’re using and let them do the rest.

StreamNative Brings Streaming Data To The Cloud Native Landscape With Pulsar

Data Engineering Podcast

Pulsar is a well-engineered and robust platform for building the core of any system that relies on durable access to easily scalable streams of data. You monitor your website to make sure that you’re the first to know when something goes wrong, but what about your data? Can you start by giving an overview of what Pulsar is?

Exploring Processing Patterns For Streaming Data Integration In Your Data Lake

Data Engineering Podcast

In this episode, Ori Rafael shares his experiences from Upsolver and building scalable stream processing for integrating and analyzing data, and what the tradeoffs are when coming from a batch-oriented mindset. Can you start by giving an overview of the state of the market for data lakes today?

Data Lake 100

Maintaining Your Data Lake At Scale With Spark

Data Engineering Podcast

This conversation was useful for getting a better idea of the challenges that exist in large-scale data analytics, and the current state of the tradeoffs between data lakes and data warehouses in the cloud. Interview. Introduction. How did you get involved in the area of data management?

Data Lake 100

Data Ingestion: 7 Challenges and 4 Best Practices

Monte Carlo

Data ingestion is the process of collecting data from various sources and moving it to your data warehouse or lake for processing and analysis. It is the first step in modern data management workflows. Source: Fundamentals of Data Engineering by Joe Reis and Matt Housley. There are trade-offs.

Building A Data Lake For The Database Administrator At Upsolver

Data Engineering Podcast

What used to be entirely managed by the database engine is now a composition of multiple systems that need to be properly configured to work in concert. In order to bring the DBA into the new era of data management, the team at Upsolver added a SQL interface to their data lake platform. We talked last in November of 2018.

Data Lake 100