Sat. May 15, 2021 - Fri. May 21, 2021

Confluent CLI Launches Exciting New Features and an Intuitive UI

Confluent

With so many technologies in the modern development ecosystem, a common complaint is having to go through the mental gymnastics of adopting new products and keeping up with ever-expanding feature […].

A Holistic Approach To Data Governance Through Self Reflection At Collibra

Data Engineering Podcast

Summary: Data governance is a phrase that means many different things to many different people. This is because it is actually a concept that encompasses the entire lifecycle of data, across all of the people in an organization who interact with it. Stijn Christiaens co-founded Collibra with the goal of addressing the wide variety of technological aspects that are necessary to realize such an important and expansive process.

#ClouderaLife Spotlight: Kathleen Merto, Early Talent Program Manager

Cloudera

Meet Kathleen Merto. To her colleagues, she’s Kat. She works on our Emerging Talent team, managing the hiring process for interns and entry-level roles. It’s a job she feels passionately about, so much so that she was eager to give her whole team a shout-out! Kat fell into the perfect career path for her. Growing up, she witnessed her mom, a nurse of 41 years now, dedicate so much of herself to helping others.

Twelve Thoughts About the Data Mesh

Teradata

The concept of Data Mesh is abuzz in the industry right now. Find out why we're so enthusiastic about it.

Data 97

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

Error Handling Patterns for Apache Kafka Applications

Confluent

Apache Kafka® applications run in a distributed manner across multiple containers or machines. And in the world of distributed systems, what can go wrong often goes wrong. This blog post […].

Kafka 126

A visual guide to Azure Data Factory

A Cloud Guru: Data Engineering

A while back we published the Visual Guide to Azure Fundamentals on A Cloud Guru. The post got a lot of positive feedback so we thought we’d do another one — this time focused on Azure Data Factory! What is a visual guide? Visual guides are hi-resolution “sketchnotes.” They summarize a given topic or content […]

Data 52

More Trending

Thirteen Thoughts About the Data Mesh

Teradata

The concept of Data Mesh is abuzz in the industry right now. Find out why we're so enthusiastic about it.

Data 72

Kafka Summit Europe 2021 Recap

Confluent

And that’s a wrap on Kafka Summit Europe 2021, the first of three global Kafka Summits this year. We’ve seen 17,000 registrations from over 7,000 companies and 137 different countries. […].

Kafka 121

Data Makes Your Tools Smarter

Grouparoo

When I was in charge of Product/Engineering at TaskRabbit, it was always challenging to prioritize integrations being requested by our Marketing, Sales, and Customer Success teams. First and foremost, most engineers just hate working on these kinds of integrations. Often, this preference alone is the deciding factor in organizations for what gets prioritized or not.

Data 52

Coffee With Cloudera Partners: AWS

Cloudera

Enterprises are adopting a hybrid cloud approach. While more Cloudera customers want to move apps and data to the cloud, they also want to continue using their data centers for security and governance. By having both on-premises and cloud environments, organizations increase their agility, and the hybrid model is gaining momentum. A hybrid approach benefits many organizations as it allows them to make the best use of on-premises infrastructure while taking advantage of additional compute capacity and l

AWS 64

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Why CEOs Must Lead a New Relationship with Data

Teradata

CEOs need a new relationship with data if they are to successfully transition to the hyper-personalized, hyper-localized future most recognize as today’s immediate imperative.

Data 52

Introducing Cluster RBAC, Audit Logs, and BYOK for Enterprise-Grade Security

Confluent

When it comes to launching your next app with data in motion, few things pose the same risk to going live as meeting requirements for data security and compliance. Doing […].

Data-driven performance improvements in grocery retail: Pursuing the 1%

Retail Insight

Elite sport is a results business. The difference between winning and losing often comes down to the finest of margins. As Al Pacino said in Any Given Sunday, it is all about the ‘inches’.

Retail 52

How XOps Is Hoping to Unite All the Disparate Ops Disciplines Under One Banner

DataKitchen

52

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

Popular Use Cases for Real-Time Analytics

Rockset

In 2008, Domino’s Pizza released its pizza tracker so that fans could monitor in real time if their pizza was in the oven or out for delivery. By 2019, 65% of Domino’s sales came through digital channels including home devices and emoji texts, reimagining the brand for the digital era. The Domino’s Pizza Tracker is the quintessential example of real-time analytics.

Retail 40

Blinkist Chooses Monte Carlo to Deliver More Reliable Data Pipelines Through Data Observability

Monte Carlo

Monte Carlo today announced Berlin-based microlearning app Blinkist has selected Monte Carlo to achieve more reliable data through data observability. As a high-growth company with over 16 million users worldwide, Blinkist leverages paid performance marketing to fuel customer acquisition — and those channels rely on accurate behavioral data to optimize campaign spend.

Data Analyst Salary- The Complete 2023 Guide

ProjectPro

If you are wondering about the average data analyst salary, you have landed on the right page. This post will cover the demand for data analysts, followed by their salaries based on factors such as experience level, skills, location, industries, roles, etc. The job title data analyst usually comes with a massive set of roles and responsibilities, which is why data analyst salaries tend to be high.

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

Listen Carefully

Teradata

The data-driven, digital-first era has multiplied the complexity of customer conversations – but it has also provided the means to generate and act on real insight.

IT 52

The Complete Customer Data Stack: Data Validation

RudderStack

In this post, you will learn about common challenges in data validation and how RudderStack can break them down and make validation a smooth step in your workflow.

How AutoTrader built a more reliable data platform with Monte Carlo

Monte Carlo

As companies ingest more and more data to power their data platforms, the opportunity for broken data pipelines only grows. Wouldn’t it be great if you had a 10,000-foot view of your data pipeline health? Here’s how Ed Kent, Principal Developer at Auto Trader UK, the largest digital automotive marketplace in the UK and Ireland, is solving this problem by applying end-to-end Data Observability across their cloud warehouse, business intelligence systems, and beyond.

Connecting Your Data Mesh with DataOps

DataKitchen

Data 52

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

NVIDIA RAPIDS in Cloudera Machine Learning

Cloudera

Introduction. In the previous blog post in this series, we walked through the steps for leveraging Deep Learning in your Cloudera Machine Learning (CML) projects. This year, we expanded our partnership with NVIDIA, enabling your data teams to dramatically speed up compute processes for data engineering and data science workloads with no code changes using RAPIDS AI.

Simplifying Event Filtering and Value Aggregation with RudderStack

RudderStack

RudderStack offers a mechanism that simplifies event filtering and value aggregation without introducing common mistakes.

IT 40

Unlocking The Power of Data Lineage In Your Platform with OpenLineage

Data Engineering Podcast

Summary: Data lineage is the common thread that ties together all of your data pipelines, workflows, and systems. In order to get a holistic understanding of your data quality, where errors are occurring, or how a report was constructed, you need to track the lineage of the data from beginning to end. The complicating factor is that every framework, platform, and product has its own concepts of how to store, represent, and expose that information.

Metadata 100

The Architecture of Uber’s API gateway

Uber Engineering

API gateways have become an integral part of microservices architecture in recent years. An API gateway provides a single point of entry for all our apps and provides an interface to access data, logic, or functionality from back-end microservices. It also …

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.

Streaming Market Data with Flink SQL Part II: Intraday Value-at-Risk

Cloudera

This article is the second in a multipart series to showcase the power and expressibility of Flink SQL applied to market data. In case you missed it, part I starts with a simple case of calculating streaming VWAP. Code and data for this series are available on GitHub. Speed matters in financial markets. Whether the goal is to maximize alpha or minimize exposure, financial technologists invest heavily in having the most up-to-date insights on the state of the market and where it is going.

SQL 100

Protect Personally Identifiable Information (PII) in Your Apps Using RudderStack

RudderStack

Use RudderStack to protect the personally identifiable information (PII) in your apps. We simplify the process of making PII notes on the streaming data.

Process 40

How do thread priorities affect your Android app?

Booking.com Engineering

Introduction: Threads are essential for responsive UI applications. When programming in Android, we make sure that any kind of work that could cause the slightest lagging is scheduled to a separate thread, other than the one responsible for the UI updates. And even though there are various high level constructs available for the developer’s convenience, how threading works at a very low level leaks from all these abstractions nonetheless.

Java 52

Kafka to Delta Lake, as fast as possible

Scribd Technology

Streaming data from Apache Kafka into Delta Lake is an integral part of Scribd’s data platform, but has been challenging to manage and scale. We use Spark Structured Streaming jobs to read data from Kafka topics and write that data into Delta Lake tables. This approach gets the job done but in production our experience has convinced us that a different approach is necessary to efficiently bring data from Kafka to Delta Lake.

Kafka 52

The Big Payoff of Application Analytics

Outdated or absent analytics won’t cut it in today’s data-driven applications – not for your end users, your development team, or your business. That’s what drove the five companies in this e-book to change their approach to analytics. Download this e-book to learn about the unique problems each company faced and how they achieved huge returns beyond expectation by embedding analytics into applications.