Sat.Jul 24, 2021 - Fri.Jul 30, 2021

article thumbnail

Uber’s Fulfillment Platform: Ground-up Re-architecture to Accelerate Uber’s Go/Get Strategy

Uber Engineering

Introduction to Fulfillment at Uber. Uber’s mission is to help our consumers effortlessly go anywhere and get anything in thousands of cities worldwide. At its core, we capture a consumer’s intent and fulfill it by matching it with the right … The post Uber’s Fulfillment Platform: Ground-up Re-architecture to Accelerate Uber’s Go/Get Strategy appeared first on Uber Engineering Blog.

article thumbnail

Protecting Data Integrity in Confluent Cloud: Over 8 Trillion Messages Per Day

Confluent

It’s about maintaining the right data even when no one is watching. Last year, Confluent announced support for Infinite Storage, which fundamentally changes data retention in Apache Kafka® by allowing […].

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Building a Multi-Tenant Managed Platform For Streaming Data With Pulsar at Datastax

Data Engineering Podcast

Summary Everyone expects data to be transmitted, processed, and updated instantly as more and more products integrate streaming data. The technology to make that possible has been around for a number of years, but the barriers to adoption have still been high due to the level of technical understanding and operational capacity that have been required to run at scale.

Building 100
article thumbnail

#ClouderaLife Spotlight: Vinicius Cardoso, Sr Solutions Engineering

Cloudera

Meet Vinicius Cardoso, better known as Vini. . He is a Sr. Solutions Engineer (SE) working in Australia. . In his role, customers are at the center of everything he does. Wearing the hat of Enterprise Architect, he dives deep to understand customer’s organization goals, initiatives and requirements in order to identify the key capabilities that need to be delivered. .

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Building a Roadmap for Enterprise Data and Analytics – A Framework

Teradata

Building a data analytics roadmap for a large, complex enterprise can be daunting. Breaking it down into essentials helps manage complexity, avoid pitfalls, & set the program in the right direction.

article thumbnail

Speed, Scale, Storage: Our Journey from Apache Kafka to Performance in Confluent Cloud

Confluent

At Confluent, we focus on the holy trinity of performance, price, and availability, with the goal of delivering a similar performance envelope for all workloads across all supported cloud providers. […].

Cloud 120

More Trending

article thumbnail

Enterprise Data Science Workflows with AMPs and Streamlit

Cloudera

Here in the virtual Fast Forward Lab at Cloudera , we do a lot of experimentation to support our applied machine learning research, and Cloudera Machine Learning product development. We believe the best way to learn what a technology is capable of is to build things with it. Only through hands-on experimentation can we discern truly useful new algorithmic capabilities from hype.

article thumbnail

Ransomware is Becoming the Most Prevalent Malware Attack - Don’t Become the Next Victim

Teradata

Ransomware attacks can be devastating. That’s why it’s important to stay informed about what ransomware is, how it works and the types of ransomware there are.

IT 52
article thumbnail

Design Considerations for Cloud-Native Data Systems

Confluent

Twenty years ago, the data warehouses of choice were Oracle and Teradata. Since then, growth and innovation has shifted to the cloud, and a new generation of data systems have […].

Systems 113
article thumbnail

How The Farmer’s Dog Achieved Rapid ML-Based Anomaly Detection with Monte Carlo

Monte Carlo

Companies across all industries are striving to become data-driven: making decisions based on data and building a culture of data trust and transparency. But data downtime —periods of time where data is missing, broken or otherwise erroneous—undermines those efforts and can cost companies upwards of $15 million annually. And very often, the ability to achieve more reliable data is both time-intensive and intensely manual.

Food 40
article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Learn How Cloudera Drives Healthcare Data Insights at HIMSS 21

Cloudera

HIMSS21 is just a few days away, and we hope you will join us to talk about how we can all achieve better health outcomes by working together. Health organizations across the world are evaluating safety precautions as COVID-19 cases continue to wax and wane and they consider universal questions such as, when is it safe to allow our administrative staff to return to the office, and how can we reassure our patients that we are committed to their health and safety?

article thumbnail

Ransomware is Becoming the Most Prevalent Malware Attack - Don’t Become the Next Victim

Teradata

Ransomware attacks can be devastating. That’s why it’s important to stay informed about what ransomware is, how it works and the types of ransomware there are.

IT 52
article thumbnail

From On-Prem to Cloud-Native: Multi-Tenancy in Confluent Cloud

Confluent

Multi-tenancy brings cost-efficiency to infrastructure, and when done correctly, creates an economy of scale. Done incorrectly and you degrade the user experience and create maintenance nightmares for operators. This is […].

Cloud 109
article thumbnail

Google Data Scientist Interview Questions To Get You Hired

ProjectPro

Google data science interviews are challenging. The data scientist interview questions are tricky, specific to Google’s data products, and cover a wide range of data science and machine learning concepts. The good news is that the right preparation can make a big difference and get you hired at one of the FANG companies. If you’re interviewing for a data scientist role at Google or you’re just curious about what a data scientist interview at Google looks like - we’ve brok

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Driving Standards & Collaboration in Telco with Data & AI

Cloudera

I’m thrilled to report that Cloudera today announced its membership of the TM Forum , the leading industry standards and collaboration group for the telecommunications industry. This is an important step for our company and for our telecommunications and media customers and partners, adding significant momentum and acceleration to our development of solutions for the industry.

article thumbnail

20x Faster Ingestion with Rockset's New DynamoDB Connector

Rockset

Since its introduction in 2012, Amazon DynamoDB has been one of the most popular NoSQL databases in the cloud. DynamoDB, unlike a traditional RDBMS, scales horizontally, obviating the need for careful capacity planning, resharding, and database maintenance. As a result, DynamoDB is the database of choice for companies building event-driven architectures and user-friendly, performant applications at scale.

NoSQL 40
article thumbnail

Making Apache Kafka Serveless: Lessons From Confluent Cloud

Confluent

Serverless offerings in the cloud are a favorite among software engineers—a prime example are object stores such as AWS S3. For the system designer, however, it is an engineering challenge […].

Cloud 104
article thumbnail

15 TensorFlow Projects Ideas for Beginners to Practice in 2023

ProjectPro

Developed by the Google Brain Team, TensorFlow is an open-source platform that helps machine learning engineers and data scientists build models and deploy applications easily. With TensorFlow, getting started, building models, and debugging is made easy with access to high-level APIs like Keras. TensorFlow is equipped with features, like state-of-the-art pre-trained models, p opular machine learning datasets , and increased ease of execution for mathematical computations, making it popular amon

Project 40
article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

How to Teach Data Engineering

Pipeline Data Engineering

Frequently we receive positive comments about our curriculum and our general approach to data engineering, especially in the light of the hype vs. the realness that surrounds it. However, the selected content would be ineffective and empty for our students without the right delivery. Selecting the fitting pedagogical approach and combining it with the most effective didactical methods are key to running a successful course, so here are the whys and hows behind what a Pipeline Academy student is

article thumbnail

Maximize Your Data Lakehouse Using Dremio Architecture

Preset

Apache Superset™ and Dremio provide a powerful accelerated lakehouse platform.

article thumbnail

Kafka Summit APAC 2021 Recap

Confluent

The second of this year’s three online Kafka Summits is now complete! We hope you were able to join us for Kafka Summit APAC 2021 yesterday. We had over 13,000 […].

Kafka 104
article thumbnail

RudderStack Product News Vol. #009 - Our Q3 Product Roadmap

RudderStack

In this update, we give you visibility into our product roadmap. The team has been working hard on planning and focus areas, and we're excited about Q3.

40
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

article thumbnail

Data Movement in Netflix Studio via Data Mesh

Netflix Tech

By Andrew Nguonly , Armando Magalhães , Obi-Ike Nwoke , Shervin Afshar , Sreyashi Das , Tongliang Liu , Wei Liu , Yucheng Zeng Background Over the next few years, most content on Netflix will come from Netflix’s own Studio. From the moment a Netflix film or series is pitched and long before it becomes available on Netflix, it goes through many phases.

Data 101
article thumbnail

Five Strategies to Accelerate Data Product Development

Cloudera

Introduction. With this first article of the two-part series on data product strategies, I am presenting some of the emerging themes in data product development and how they inform the prerequisites and foundational capabilities of an Enterprise data platform that would serve as the backbone for developing successful data product strategies. Once we have identified those capabilities, the second article explores how the Cloudera Data Platform delivers those prerequisite capabilities and has enab

article thumbnail

Reduce Your Data Infrastructure TCO with Confluent’s New Splunk S2S Source Premium Connector

Confluent

Data is at the center of our world today, especially with the ever-increasing amount of machine-generated log data collected from applications, devices, and sensors from almost every modern technology. The […].

article thumbnail

Dogfooding at RudderStack: Our Data Stack

RudderStack

Here is a quick guide to how we leverage some of the functionality in and around the RudderStack application right here at RudderLabs.

Data 40
article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

Recommender Systems: Behind the Scenes of Machine-Learning-Based Personalization

AltexSoft

Steve Jobs once said, “People don’t know what they want until you show it to them”. Well, try arguing that considering that we all watch videos suggested by YouTube, buy goods suggested by Amazon, and watch TV shows suggested by Netflix. People like being guided and given relevant offers and recommendations. They like being treated in a personal manner.

article thumbnail

Adding Context And Comprehension To Your Analytics Through Data Discovery With SelectStar

Data Engineering Podcast

Summary Companies of all sizes and industries are trying to use the data that they and their customers generate to survive and thrive in the modern economy. As a result, they are relying on a constantly growing number of data sources being accessed by an increasingly varied set of users. In order to help data consumers find and understand the data is available, and help the data producers understand how to prioritize their work, SelectStar has built a data discovery platform that brings everyone

BI 100
article thumbnail

Data cleaning for nulls with SQL vs. code

Grouparoo

When preparing your data set for analysis, it is crucial to ensure that your data set is both complete and accurate. One step in this process is deciding how to handle null values. Depending on how your data is going to be used, you may not want null values at all! Let's clean some data We're going to take a look at calculating Lifetime Value (LTV) of a customer.

SQL 52
article thumbnail

Knowledge Graph Technologies Accelerate and Improve the Data Model Definition for Master Data

Zalando Engineering

The Master Data Management Challenge Master data management (MDM) is a technology-enabled discipline in which business and Information Technology work together to ensure the uniformity, accuracy, stewardship, semantic consistency and accountability of the enterprise's official shared master data assets. 1 At Zalando we are at an early phase of realising MDM for our internal data assets and we have chosen to do it in a consolidated style.

article thumbnail

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.