Table file formats - Z-Order compaction: Apache Iceberg
Waitingforcode
APRIL 7, 2023
Last time you discovered the Z-Order compaction in Delta Lake. But guess what? Apache Iceberg also has this feature!
ArcGIS
APRIL 7, 2023
We're happy to announce that the conda init command is now enabled for ArcGIS users of Python! Learn how to use it, how it works, and its benefits.
KDnuggets
APRIL 7, 2023
Utilizing the power of GPT-3.5 to develop a simple summary generator application.
databricks
APRIL 7, 2023
We are excited to announce that cluster policies are now generally available. Why Databricks cluster policies? Databricks cluster policies enable administrators to: limit.
Speaker: Timothy Chan, PhD., Head of Data Science
Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.
KDnuggets
APRIL 7, 2023
Sparse Data Survival Guide: Strategies for Success with Machine Learning.
databricks
APRIL 7, 2023
This is part two of a multi-part series to share key insights and tactics with Senior Executives leading data and AI transformation initiatives.
Data Engineering Digest brings together the best content for data engineering professionals from the widest variety of industry thought leaders.
Cloudyard
APRIL 7, 2023
In the last post we discussed DIRECTORY tables. We saw how the directory table helps retrieve the Snowflake-hosted file URL for each file present in a stage. Continuing from there, we will discuss three key functions used with directory tables. These functions generate the URL and grant access based on authorization.
Confluent
APRIL 7, 2023
Confluent public sector CTO shares thoughts on how a data in motion approach can meet the goals of the National Cybersecurity Strategy.
Snowflake
APRIL 7, 2023
Spring has sprung—and with it comes a new crop of Snowflake Startup Challenge semi-finalists! The 2023 submission pool was the largest to date—twice as many submissions as last year—with entries that spanned not just the globe but the breadth of the Snowflake platform. Our judges put a lot of careful consideration into selecting the top 10, and we offer our sincere thanks to every company that sent in an entry this year—we know how much hard work goes into these submissions, and we appreciate it
The Modern Data Company
APRIL 7, 2023
Technical debt is something that many companies are aware of and are attempting to address. It is a big enough issue that several of our recent blog posts (Lessons in Technical Debt from Southwest Airlines, Start Paying Down Your Technical Debt Today, and A Better Way to Plan the Payoff of Technical Debt) discussed it at length. What about data debt?
Speaker: Anne Steiner and David Laribee
As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.
Ascend.io
APRIL 7, 2023
Every business leader’s dream is to have real-time data at their fingertips. With it, they could discover invaluable insights, pivot in real-time, and connect their work to direct revenue impact. But they’re living in what is essentially the opposite of their fantasy. With the monolithic architectures most organizations have today, business users are stuck, constantly waiting for new data pipelines to be built or amended based on their requests.