Sat.Apr 17, 2021 - Fri.Apr 23, 2021

article thumbnail

What’s New in Apache Kafka 2.8

Confluent

I’m proud to announce the release of Apache Kafka 2.8.0 on behalf of the Apache Kafka® community. The 2.8.0 release contains many new features and improvements. This blog post highlights […].

Kafka 138
article thumbnail

Moving Machine Learning Into The Data Pipeline at Cherre

Data Engineering Podcast

Summary Most of the time when you think about a data pipeline or ETL job what comes to mind is a purely mechanistic progression of functions that move data from point A to point B. Sometimes, however, one of those transformations is actually a full-fledged machine learning project in its own right. In this episode Tal Galfsky explains how he and the team at Cherre tackled the problem of messy data for Addresses by building a natural language processing and entity resolution system that is served

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Relationship intelligence will shape the workplace of the future

Cloudera

Our latest Influential Women in Data session featured Brenda Le Sueur from Cambridge Assessments. Brenda has worked across many organisations and continents, but what has always been crucial to her is relationships – how we cultivate them, how we nurture them and how they, in turn, define us. I sat down with Brenda to ask her about her journey as a woman in tech and understand more about the impact of relationships on our career.

article thumbnail

Reshaping the supermarket post-pandemic

Retail Insight

Social distancing and a life lived largely online have been the reality for over a year. But, as the world gradually emerges from lockdown, ha s the shape of retail really changed forever?

Retail 52
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Monitoring Your Event Streams: Tutorial for Observability Into Apache Kafka Clients

Confluent

Why should you monitor your Apache Kafka® client applications? Apart from the usual reasons for monitoring any application, such as ensuring uptime SLAs, there are a few specific reasons for […].

Kafka 70
article thumbnail

Welcome, Pedro!

Grouparoo

Building an open source tool to connect data to many different services means a lot of integrations. It can be pretty tricky, so we were lucky to meet Pedro S Lopez a few weeks back when he started adding several plugins to that integration list. He has now come aboard officially and will work more on the core product. Pedro makes the Grouparoo team an international one.

More Trending

article thumbnail

How to Approach Your Data Engineering Transformation

Silectis

Should you build your own tooling, take a “best of breed” approach, or buy a turnkey data engineering platform? We’ve got you covered. Data Engineering Platforms: Build, Best of Breed, or Buy? Every company wants to be data-driven. Modern organizations that thrive based on data have a common strength: a solid data engineering practice.

article thumbnail

The Worst of Times - The Best of Times

Teradata

As customer behavior changes rapidly, the challenges & opportunities for fast, flexible, agile, and future fit improvements for retailers are huge. Read more.

Retail 52
article thumbnail

The battle to combat data sprawl: what CIOs need to do now

DataKitchen

The post The battle to combat data sprawl: what CIOs need to do now first appeared on DataKitchen.

Data 52
article thumbnail

Deep Learning with Nvidia GPUs in Cloudera Machine Learning

Cloudera

Introduction. In our previous blog post in this series , we explored the benefits of using GPUs for data science workflows, and demonstrated how to set up sessions in Cloudera Machine Learning (CML) to access NVIDIA GPUs for accelerating Machine Learning Projects. While the time-saving potential of using GPUs for complex and large tasks is massive, setting up these environments and tasks such as wrangling NVIDIA drivers, managing CUDA versions and deploying custom engines for your specific proje

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Making the Remote Onboarding a Success

Zalando Engineering

When the pandemic started in 2020 many Zalando employees went into home office. It changed our working habits and many other things and Zalando published remote working guidelines to support their employees. This concentrates only on remote working, but what happens if you change companies during the pandemic? Joining a new company and getting onboarded can be already pretty tough during normal times.

article thumbnail

Hyper-Personalization: Understanding Customers Using Digital Payments Data

Teradata

Hyper-personalization is a must-have for businesses today. But how do digital payments data help? By bringing granularity to your personalization strategies.

Data 52
article thumbnail

Data Analyst Responsibilities-What does a data analyst do?

ProjectPro

Are you passionate about numbers and algebraic functions? Does the idea of evaluating, processing, analyzing, and interpreting statistical data makes you roll up your sleeves and get the job done? Do you love to distinguish the trends and patterns in data? Do you enjoy sharing your work and communicating your knowledge with others in the team? Do you have the attitude of self-learning and can figure things out on your own?

article thumbnail

HDFS Data Encryption at Rest on Cloudera Data Platform

Cloudera

Introduction: Encryption of Data at Rest is a highly desirable or sometimes mandatory requirement for data platforms in a range of industry verticals including HealthCare, Financial & Government organizations. The capability increases security and protects sensitive data from various kinds of attack that could be internal or external to the platform.

MySQL 68
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Understanding Types with SQLite and Node.js

Grouparoo

Two fun facts about SQLite : The initial release was more than 20 years ago! It is the most widely used database (and likely one of the most widely deployed pieces of software). And here are a few of my opinions on SQLite: It's super cool. We don't talk about it enough. It's actually really easy to use (which is likely why it's so widely used).

Bytes 52
article thumbnail

How Resident Reduced Data Issues by 90% with Monte Carlo

Monte Carlo

Many data leaders tell us that their data scientists and engineers spend 40 percent or more of their time tackling data issues instead of working on projects that actually move the needle. It doesn’t have to be this way. Here’s how the data engineering team at Resident, a house of direct-to-consumer furnishings brands, reduced their data incidents by 90% with data observability at s cale.

article thumbnail

Drinking our own champagne – Cloudera upgrades to CDP Private Cloud

Cloudera

Like most of our customers, Cloudera’s internal operations rely heavily on data. For more than a decade, Cloudera has built internal tools and data analysis primarily on a single production CDH cluster. This cluster runs workloads for every department – from real-time user interfaces for Support to providing recommendations in the Cloudera Data Platform (CDP) Upgrade Advisor to analyzing our business and closing our books.

Cloud 116
article thumbnail

Apache Ozone and Dense Data Nodes

Cloudera

This post was co-authored by two Cisco Employees as well: Karthik Krishna, Silesh Bijjahalli. Today’s enterprise data analytics teams are constantly looking to get the best out of their platforms. Storage plays one of the most important roles in the data platforms strategy, it provides the basis for all compute engines and applications to be built on top of it.

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

#ClouderaLife Spotlight: Bogi Egyed, Engineering Manager

Cloudera

Meet Boglarka Egyed, also known as “Bogi” to her colleagues. . She’s a 5-year Clouderan who recently transitioned into the role of Engineering Manager. . Bogi originally graduated from college with her degree in Applied Mathematics but has spent her career as a Software Engineer. “Mathematics provided me with solid fundamentals to use in this field but programming was what really caught my attention due to its creative nature while being able to get results fast.” .

article thumbnail

The Intersection of Climate and Capital Markets

Cloudera

Happy Earth Day! Earth Day was introduced in 1970 and has celebrated various milestone achievements including expanding globally and leveraging the power of social media to expand climate awareness and action. A great summary and history can be found on earthday.org/history. For those in financial services, climate initiatives are another major market event with far-reaching impact on capital adequacy and compliance regulations.