Sat.Oct 03, 2020 - Fri.Oct 09, 2020

article thumbnail

Self Service Real Time Data Integration Without The Headaches With Meroxa

Data Engineering Podcast

Summary Analytical workloads require a well engineered and well maintained data integration process to ensure that your information is reliable and up to date. Building a real-time pipeline for your data lakes and data warehouses is a non-trivial effort, requiring a substantial investment of time and energy. Meroxa is a new platform that aims to automate the heavy lifting of change data capture, monitoring, and data loading.

article thumbnail

Project Metamorphosis Month 6: Secure Apache Kafka in Confluent Cloud

Confluent

The cloud opens up exciting new opportunities for information gathering, analysis, and sharing that can make every organization’s products and services better. Thanks to the cloud and its decentralized nature, […].

Cloud 109
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

7 New Ways Cloudera Is Investing in Our Culture

Cloudera

As Cloudera offices around the world continue to cope with the impact of COVID-19, we have worked hard to ease stress and adapt to remote working. People are the heart of our company and we’re investing in creative, new ways to make every Clouderan feel valued and appreciated. Clouderans are superstars at work and at home, and burn-out is unhealthy for employees, their families, and the company.

Designing 102
article thumbnail

Announcing Vantage on Google Cloud

Teradata

Teradata Vantage on Google Cloud is now generally available! Vantage on Google Cloud is an as-a-service offer in which customers can get the most analytic value from their data. Read more.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

The Curse of Dimensionality

Domino Data Lab: Data Engineering

Danger of Big Data Big data is the rage. This could be lots of rows (samples) and few columns (variables) like credit card transaction data, or lots of columns (variables) and few rows (samples) like genomic sequencing in life sciences research. The Curse of Dimensionality , or Large P, Small N, ((P >> N)) problem applies to the latter case of lots of variables measured on a relatively few number of samples.

article thumbnail

Restoring Balance to the Cluster: Self-Balancing Clusters in Confluent Platform 6.0

Confluent

Apache Kafka® scales well. A Kafka cluster can grow to tens or hundreds of brokers and easily sustain tens of GB per second of read and write traffic. But scaling […].

Kafka 101

More Trending

article thumbnail

Zero Down – and Pay Only for What You Use with Teradata Consumption Pricing

Teradata

Consumption Pricing is a usage-based option with automatic elasticity in which you pay only for compute resources consumed for successful queries, plus storage. Learn more.

59
article thumbnail

3 Tools to Help Debug Slow Queries in MongoDB

Rockset

Regardless of what database you pick to run your application—MongoDB, Postgres, Oracle, or Cassandra—you will eventually encounter the same issue: slow queries. Slow queries can be the result of inefficient query design, inefficient table design, or general infrastructure problems. Although it may be tempting to add more machines or further complicate your data infrastructure to speed up your queries, improving the queries themselves is usually the best place to start when you want to improve da

MongoDB 40
article thumbnail

Introducing Cluster Linking in Confluent Platform 6.0

Confluent

With the release of Confluent Platform 6.0 comes a preview of Confluent Cluster Linking available to self-managed customers and in Confluent Cloud for our early access partners. Cluster Linking is […].

Cloud 93
article thumbnail

Building a Simple CRUD web application and image store using Cloudera Operational Database and Flask

Cloudera

The Cloudera Operational Database (COD) is a managed dbPaaS solution available as an experience in Cloudera Data Platform (CDP). It offers multi-modal client access with NoSQL key-value using Apache HBase APIs and relational SQL with JDBC (via Apache Phoenix). The latter makes COD accessible to developers who are used to building applications that use MySQL, Postgres, etc.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Accelerating Innovation in the Analytic Ecosystem: Accessibility

Teradata

In the final part of this 3-part series on reducing conflict between business & IT to accelerate innovation, we focus on enabling accessibility to data with security & governance.

article thumbnail

Build A StackOverflow Dashboard (Part 2): Crafting BigQuery Views and Superset Charts

Preset

In part 2, we'll start to visualize trends using Superset charts.

article thumbnail

Getting Started with Kafka Connect for New Relic

Confluent

It’s 3:00 am and PagerDuty keeps firing you alerts about your application being down. You need to figure out what the issue is, if it’s impacting users, and resolve it […].

Kafka 49
article thumbnail

How Zalando prepares for Cyber Week

Zalando Engineering

Introduction Cyber Week has become an increasingly important time of the year in e-commerce. In 2019 , we have attracted 840,000 new customers and our sales (Gross Merchandise Volume) increased by 32% compared to the previous year. During the event we grew faster as a business than throughout the year where we grow at a 20-25% rate. Our peak orders per minute reached 7,200 compared to 4,200 the year before (+71% YoY).

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Retailers - Don't be a Data Zombie!

Teradata

The retailer of the future is like a live organism - it will use a data brain to develop new agility and effective responses to rapidly evolving situations. Read more.

Retail 52
article thumbnail

The Superset REST API

Preset

A high level tour of Apache Superset's REST API

40
article thumbnail

Collaboration is Key to Reducing Pain and Finding Value in Data

Cloudera

This is a guest blog post, authored by John Zantey, Director and Co-founder, Qabsu. When it comes to cloud, being an early adopter does not necessarily put you ahead of the game. I know of companies that have been perpetually “doing cloud” for 10 years, but very few that have “done cloud” in a way that democratises and makes data accessible, with minimal pain points.

Bytes 65
article thumbnail

7 Requirements for Digital Transformation

Cloudera

Digital transformation is not just about technological transformation of the organization, it’s about transforming the culture of an organization. It’s not enough to bolt technology onto an existing strategy and consider it transformed. That’s the message from our Chief Marketing Officer Mick Hollison discussing digital transformation with Charlene Li at Cloudera Now. .

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.