Sat. Oct 24, 2020 - Fri. Oct 30, 2020

Preparing Your Clients and Tools for KIP-500: ZooKeeper Removal from Apache Kafka

Confluent

As described in the blog post Apache Kafka® Needs No Keeper: Removing the Apache ZooKeeper Dependency, when KIP-500 lands next year, Apache Kafka will replace its usage of Apache ZooKeeper […].

Kafka 140

Cloud Native Data Security As Code With Cyral

Data Engineering Podcast

Summary: One of the most challenging aspects of building a data platform has nothing to do with pipelines and transformations. If you are putting your workflows into production, then you need to consider how you are going to implement data security, including access controls and auditing. Different databases and storage systems all have their own method of restricting access, and they are not all compatible with each other.

Data security vs usability: you can have it all

Cloudera

Growing up, were you ever told you can’t have it all? That you can’t eat all the snacks in one sitting? That you can’t watch the complete Back to the Future trilogy as well as study for your science exam in one evening? Over time, we learn to set priorities, make a decision for one thing over the other, and compromise. Just like when it comes to data access in business.

Survey: Enterprise Data More Important Than Ever Since Onset of COVID-19

Teradata

Our new global survey reveals how business leaders are changing the way they think about data, from their trust in it to the role it plays in a post-pandemic recovery.

Data 64

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Building Streaming Data Architectures with Qlik Replicate and Apache Kafka

Confluent

A fundamental challenge with today’s “data explosion” is finding the best answer to the question, “So where do I put my data?” while avoiding the longer-term problem of data warehouses, […].

How Grouparoo works as a team

Grouparoo

When Brian, Evan, and I first talked about starting a company, we already had some ideas in mind about what we might want to do differently from our past roles. The three of us had all worked together before at TaskRabbit, but since we were starting a brand new company, we decided to approach how we would work from first principles. I thought we’d share some tidbits about how we work right now.

More Trending

Reconnecting the Retail Brain: Learning From the Octopus

Teradata

For too many retailers, brain & body have become separate, with data informing discrete projects & engagements but not used to transform entire business processes.

Retail 52

How to Get Your Organization to Appreciate Apache Kafka

Confluent

If you want to enable your organization to leverage the full value of event-driven architectures, it is not enough to just integrate Apache Kafka® and wait for people to join […].

Kafka 83

Why I Am Joining Rockset

Rockset

I’m excited to soon be the newest member of Rockset. I will be joining a truly spectacular engineering team, working on a product that leverages deep technical insights to make real-time analytics easy. My passion is building infrastructure that makes things simpler for users, supporting people at higher levels of the stack by giving them clean APIs and predictable behavior.

Why Veterans at Cloudera are Urging Us to Vote

Cloudera

With both the US election and Veterans Day right around the corner, veterans at Cloudera have been telling us their stories, including why they want us all to Make Time to Vote. The US has a long and evolving history with voting. Across generations, people have fought to gain and improve voting rights for all Americans, and in return, are honored for their sacrifices when those rights are exercised and their visions fulfilled.

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

Reimagining Business Amidst the COVID-19 Pandemic

Teradata

COVID-19 has forced many businesses to rethink their business models due to changes in customer requirements, but it has also opened up a world of new opportunities.

IT 52

Netflix Android and iOS Studio Apps, now powered by Kotlin Multiplatform

Netflix Tech

By David Henry & Mel Yahya. Over the last few years Netflix has been developing a mobile app called Prodicle to innovate in the physical production of TV shows and movies. The world of physical production is fast-paced, and needs vary significantly by country, by region, and even from one production to the next.

Coding 111

Rockset Raises $40M Series B to Empower Developers Building Real-Time Analytics

Rockset

Today, Rockset announced $40M in Series B funding from Sequoia and Greylock, our two investors who have partnered with us right from the beginning. Additionally, we announced support for fully managed, secure private deployments of Rockset within a customer’s Amazon VPC. These are important milestones for both our company and product, but this announcement is less a celebration of Rockset than a recognition of our hundreds of beloved customers who have launched amazing real-time applications.

Listening to the Customer in the 21st Century: It’s All About Data

Cloudera

The customer has never been more right. Across industries, customers have become conditioned to demand not only near-instant responses to their needs but that their needs be anticipated in advance. Financial institutions are not given a pass, despite a competitive landscape flooded with regulation and privacy considerations. The customer still has expectations for a personalized, timely, and relevant experience.

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Look at the Cloud. What Do You See?

Teradata

Identify the true capabilities of a modern analytics architecture and achieve what really matters: answers. Find out more.

Cloud 59

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

Netflix Tech

By Tianlong Chen and Ioannis Papapanagiotou. Netflix has more than 195 million subscribers that generate petabytes of data every day. Data scientists and engineers collect this data from our subscribers and videos, and implement data analytics models to discover customer behaviour with the goal of maximizing user joy. Usually, data scientists and engineers write Extract-Transform-Load (ETL) jobs and pipelines using big data compute technologies, like Spark or Presto, to process this data and perio

Case Study: Rumble’s Real-Time Leaderboards Empower Users to Lead Healthier Lifestyles

Rockset

Many of us have become more conscious about how much activity we’re getting in a day, and it shows. Purchases of smartwatches that track calories and activities have dramatically increased since 2014. These smartwatches have helped people train for races, track different types of workouts, and be mindful of how much movement they are getting in a day.

Log Reduction Techniques with CFM

Cloudera

Cloudera services logs offer a breadth of information to assist in cluster maintenance, from security checks and auditing tasks to validation for performance tuning and testing, to name a few. However, log records generated by these services do not hold the same value for every organisation. For example, cyber teams may find more value in logs that outline user behaviour when accessing the data, whilst operational teams may prefer logs that show the spikes in load time throug

Kafka 65

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

New Applied ML Research: Meta-Learning & Structural Time Series

Cloudera

At Cloudera Fast Forward we work to make the recently possible useful. Our goal is to take the incredible data science and machine learning research developments we see emerging from academia and large industrial labs, and bridge the gap to products and processes that are useful to practitioners working across industries. In the past year, we’ve released research reports and prototypes exploring Deep Learning for Anomaly Detection, Causality for Machine Learning and NLP for Automated Question A

Healthcare Data Impact Awards finalists shine in Data for Good category

Cloudera

Cloudera’s annual Data Impact Awards will be announced during a virtual celebration on November 18, 2020. If you’d like to join us and hear more about the winners, you can register here. As always, we’re excited that the finalists represent a cross-section of industries. Personally, I’m thrilled to talk more about one of our healthcare finalists.

CDP Data Visualization: Self-Service Data Visualization For The Full Data Lifecycle

Cloudera

With the massive explosion of data across the enterprise (both structured and unstructured, from existing sources and new innovations such as streaming and IoT), businesses have needed to find creative ways of managing their increasingly complex data lifecycle to speed time to insight. At Cloudera, we set out to directly address these lifecycle challenges through the Cloudera Data Platform (CDP), the only hybrid-cloud, multi-cloud enterprise data platform built for the full data lifecycle.

DELL/EMC taking the next step with PowerScale and ECS certification on CDP Private Cloud Base

Cloudera

Cloudera and Dell/EMC are continuing our long and successful partnership of developing shared storage solutions for analytic workloads running in hybrid cloud. Customer demand has always been the key driver of roadmap features on our platforms. Since the inception of Cloudera Data Platform (CDP), Dell/EMC PowerScale and ECS have been highly requested solutions to be certified by Cloudera.

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies that are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground up, graduating from just getting started with data-driven development to operating

A Day in the Life of a Content Analytics Engineer

Netflix Tech

Part of our series on who works in Analytics at Netflix, and what the role entails. By Rocio Ruelas. Back when we were all working in offices, my favorite days were Monday, Wednesday, and Friday. Those were the days with the best hot breakfast, and I’ve always been a sucker for free food. I started the day by arriving at the LA office right before 8am and finding a parking spot close to the entrance.