Sat.Sep 19, 2020 - Fri.Sep 25, 2020

article thumbnail

Apache Kafka DevOps with Kubernetes and GitOps

Confluent

Operating critical Apache Kafka® event streaming applications in production requires sound automation and engineering practices. Streaming applications are often at the center of your transaction processing and data systems, requiring […].

Kafka 143
article thumbnail

Cutting Through The Noise And Focusing On The Fundamentals Of Data Engineering With The Data Janitor

Data Engineering Podcast

Summary Data engineering is a constantly growing and evolving discipline. There are always new tools, systems, and design patterns to learn, which leads to a great deal of confusion for newcomers. Daniel Molnar has dedicated his time to helping data professionals get back to basics through presentations at conferences and meetups, and with his most recent endeavor of building the Pipeline Data Engineering Academy.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Five Steps Towards Delivering Better Analytic Outcomes

Teradata

Get tips on how to cast a more critical eye on the seemingly endless amount of data-driven conclusions presented to us. Learn more.

Data 106
article thumbnail

Operational Database Security – Part 2

Cloudera

In this blogpost, we are going to take a look at some of the OpDB related security features of a CDP Private Cloud Base deployment. We are going to talk about auditing, different security levels, security features of Data Catalog, and Client Considerations. You can find part 1 of this series, here. . Auditing. Comprehensive auditing is provided to enable enterprises to effectively and efficiently meet their compliance requirements by auditing access and other types of operations across OpDB (thr

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Building a Machine Learning Logging Pipeline with Kafka Streams at Twitter

Confluent

Twitter, one of the most popular social media platforms today, is well known for its ever-changing environment—user behaviors evolve quickly; trends are dynamic and versatile; and special and emergent events […].

article thumbnail

3 Ways to Offload Read-Heavy Applications from MongoDB

Rockset

According to over 40,000 developers, MongoDB is the most popular NOSQL database in use right now. The tool’s meteoric rise is likely due to its JSON structure which makes it easy for Javascript developers to use. From a developer perspective, MongoDB is a great solution for supporting modern data applications. Nevertheless, developers sometimes need to pull specific workflows out of MongoDB and integrate them into a secondary system while continuing to track any changes to the underlying MongoDB

MongoDB 52

More Trending

article thumbnail

Choosing the right Data Warehouse SQL Engine: Apache Hive LLAP vs Apache Impala

Cloudera

Aren’t two superheroes better than one? Some of the most powerful results come from combining complementary superpowers, and the “dynamic duo” of Apache Hive LLAP and Apache Impala, both included in Cloudera Data Warehouse , is further evidence of this. Both Impala and Hive can operate at an unprecedented and massive scale, with many petabytes of data.

article thumbnail

Infrastructure Modernization with Google Anthos and Apache Kafka

Confluent

The promise of cloud computing is simplicity, speed, and cost savings. But what about workloads that can’t move to the cloud? Are they stuck using expensive legacy tooling and practices? […].

Kafka 57
article thumbnail

Exports is not a function

Grouparoo

I have been working on the Salesforce integration. That experience will be its own story. In the process, though, I found something tricky that I might be uniquely experiencing given the combinatorics of the modern Node/Javascript/Typescript world. Grouparoo connects with sources, processes the data from them, and sends that data to destinations. When data comes from a source, we call it an import.

Coding 52
article thumbnail

Today’s ‘Breakfast Roll People’ Will Change How Energy Retail Operates

Teradata

What will it take for energy retailers to transition to world-class segment leaders? The answer is millions of modest improvements, implemented by business users themselves.

Retail 52
article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Using Data to Drive Meaningful Diversity and Inclusion Efforts

Cloudera

A conversation with civil rights activist and author Dr. Mary Frances Berry about the importance of data in diversity and inclusions initiatives. . COVID-19 has forced businesses to change in ways we didn’t know was possible, and at a speed many had never imagined. The summer of 2020 made something else very clear; while we’ve been agile on digital transformation in business, we have failed to tap data to help us confront deep-seated social justice issues.

Data 64
article thumbnail

Build a Slack Dashboard (Part 1): Extracting Data Using Meltano

Preset

Build a beautiful Slack dashboard using open source tools Meltano and Superset. Part 1 of 3.

article thumbnail

Building a Data Science Platform in 10 days

Afterpay Tech

Photo by Pietro Jeng on Unsplash By Letian Wang Context At Afterpay, we are generating lots of data from customer transactions, website views and consumer referrals every day. Being able to derive insights from this data, and to use those insights to  improve our consumer experience and provide value to our merchants and consumers, is a critical competitive differentiator for Afterpay.

article thumbnail

Customer Journey Analytics & Real-Time Marketing: Lessons Learned from Those That Got it Right

Teradata

Learn about the early adopters for both Customer Journey Analytics and Real-Time Marketing who overcame initial hurdles and realized superior business outcomes.

IT 52
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Cloudera Data Platform in AWS Marketplace Simplifies and Accelerates Cloud Adoption

Cloudera

As organizations look to optimize the speed and cost of their cloud journey in today’s rapidly evolving economy, Cloudera is delighted to announce the availability of Cloudera Data Platform (CDP) Public Cloud in AWS Marketplace. Now customers can easily, confidently and cost-effectively discover, procure and deploy the world’s first Enterprise Data Cloud, powered by AWS, for faster time-to-insight from their advanced analytics and machine learning services.

AWS 83