Fri.Apr 07, 2023

article thumbnail

Table file formats - Z-Order compaction: Apache Iceberg

Waitingforcode

Last time you discovered the Z-Order compaction in Delta Lake. But guess what? Apache Iceberg also has this feature!

130
130
article thumbnail

Conda Init and ArcGIS Pro

ArcGIS

We're happy to announce the conda init command is now enabled for ArcGIS users of Python! Learn about how to use it, how it works, and benefits.

Python 111
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Text Summarization Development: A Python Tutorial with GPT-3.5

KDnuggets

Utilizing the power of GPT-3.5 to develop a simple summarize generator application.

Python 134
article thumbnail

Announcing General Availability of Cluster Policies

databricks

We are excited to announce that cluster policies are now generally available. Why Databricks cluster policies? Databricks cluster policies enable administrators to: limit.

78
article thumbnail

Beyond the Basics of A/B Tests: Innovative Experimentation Tactics You Need to Know as a Data or Product Professional

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Best Machine Learning Model For Sparse Data

KDnuggets

Sparse Data Survival Guide: Strategies for Success with Machine Learning.

article thumbnail

The Executive’s Guide to Data, Analytics and AI Transformation, Part 2: Identify and prioritize use cases

databricks

This is part two of a multi-part series to share key insights and tactics with Senior Executives leading data and AI transformation initiatives.

More Trending

article thumbnail

Directory Tables functions

Cloudyard

Read Time: 3 Minute, 44 Second During the last post we discuss about the DIRECTORY tables. We have seen how the directory table helps to retrieve the snowflake hosted file URL for each file present in stage.In continuation of the same we will discuss about three key functions being use with Directory tables. These functions are used to generate the URL and grant the access based on their authorizations.

article thumbnail

Putting the National Cybersecurity Strategy in Motion

Confluent

Confluent public sector CTO shares thoughts on how a data in motion approach can meet the goals of the National Cybersecurity Strategy.

Data 57
article thumbnail

Snowflake Startup Challenge 2023: Meet the 10 Semi-Finalists

Snowflake

Spring has sprung—and with it comes a new crop of Snowflake Startup Challenge semi-finalists! The 2023 submission pool was the largest to date—twice as many submissions as last year—with entries that spanned not just the globe but the breadth of the Snowflake platform. Our judges put a lot of careful consideration into selecting the top 10, and we offer our sincere thanks to every company that sent in an entry this year—we know how much hard work goes into these submissions, and we appreciate it

Raw Data 105
article thumbnail

Do You Manage Your Data Debt Alongside Your Technical Debt?

The Modern Data Company

Technical debt is something that many companies are aware of and are attempting to address. It is a big enough issue that several of our recent blog posts ( Lessons in Technical Debt from Southwest Airlines , Start Paying Down Your Technical Debt Today , and A Better Way to Plan the Payoff of Technical Debt) discussed it at length. What about data debt?

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Data Mesh vs. Data Fabric: Which One Is Right for You?

Ascend.io

Every business leader’s dream is to have real-time data at their fingertips. With it, they could discover invaluable insights, pivot in real-time, and connect their work to direct revenue impact. But they’re living in what is essentially the opposite of their fantasy. With the monolithic architectures most organizations have today, business users are stuck, constantly waiting for new data pipelines to be built or amended based on their requests.