Sat.Aug 08, 2020 - Fri.Aug 14, 2020

article thumbnail

Closing The Loop On Event Data Collection With Iteratively

Data Engineering Podcast

Summary Event based data is a rich source of information for analytics, unless none of the event structures are consistent. The team at Iteratively are building a platform to manage the end to end flow of collaboration around what events are needed, how to structure the attributes, and how they are captured. In this episode founders Patrick Thompson and Ondrej Hrebicek discuss the problems that they have experienced as a result of inconsistent event schemas, how the Iteratively platform integrat

article thumbnail

Teradata Vantage: Born for Cloud Before Cloud Was Born

Teradata

Teradata Workload Management enables Vantage to be fully optimized for cloud & hybrid deployments & to efficiently deliver the lowest cost for enterprise analytics.

Cloud 124
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Confluent Announces Offer for Nonprofits Providing COVID-19 Relief

Confluent

In March, I wrote about Confluent’s commitment to our customers, employees, and community during the COVID-19 pandemic. In some respects, it’s hard to believe that only a few months have […].

104
104
article thumbnail

Improving our video encodes for legacy devices

Netflix Tech

by Mariana Afonso , Anush Moorthy , Liwei Guo , Lishan Zhu , Anne Aaron Netflix has been one of the pioneers of streaming video-on-demand content?—?we announced our intention to stream video over 13 years ago, in January 2007?—?and have only increased both our device and content reach since then. Given the global nature of the service and Netflix’s commitment to creating a service that members enjoy, it is not surprising that we support a wide variety of streaming devices, from set-top-boxes and

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Power BI Template App for Stripe

FreshBI

So, what is a Power BI Template App? A Power BI Template App is a published Power BI solution that can be used by any company that has the data platform for which the Template App was created. Can you imagine picking your entire Power BI Solution off the shelf - one crafted for your specific business needs and your specific data structure. Power BI Template Apps are designed to be such an out-of-the-box solution and this blog post is an example of such for a Power BI Solution for Stripe.

BI 52
article thumbnail

Chief Data Analytics Officers – The Key to Data-Driven Success?

Teradata

Banks were among the pioneers of the new role of Chief Data Officer in the early 21st century, yet the role remains hard to define and under-utilized. Learn more.

More Trending

article thumbnail

Analytics-on-the-fly: from batch to real-time user engagement

Rockset

It was the winter of 2007 when I logged into my newly created Facebook account for the very first time and I was amazed to see Facebook immediately show me three of my friends with whom I had lost touch since elementary school. One of them was working in London in a multinational bank, the other one was an engineer at Google in their Silicon Valley office office and the third one was running a restaurant in my town of Guwahati, a sleepy town on the India-Myanmar border.

Hadoop 52
article thumbnail

How Nielsen Scaled Access To Data Analytics Using Apache Superset

Preset

Learn why Nielsen migrated to Superset for visualization and dashboards.

article thumbnail

Answers in the Cloud, No Matter Where Your Data Is

Teradata

Vantage on Azure provides enterprise-grade real-time business intelligence through a comprehensive solution that combines analytics, data lakes, & data warehouse technologies.

article thumbnail

The Curious Incident of the State Store in Recovery in ksqlDB

Confluent

When operating cloud infrastructure, “time is money” is more than a cliché—it is interpreted literally as every processing second stacks up on the monthly bill. ksqlDB strives to reduce these […].

Cloud 83
article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

The Future is Serverless: What About Your Data Stack?

Rockset

Originally published on July 8, 2020 Yesterday I read an analyst report that the serverless architecture market will be $21B by 2025. I also recently met with Alex DeBrie, author of the DynamoDB book and enjoyed learning about his serverless philosophy. He wrote a great post about the key factors for choosing serverless databases here , and we had a fascinating conversation about serverless indexing systems that complement them.

BI 40
article thumbnail

Superset 0.37, Viz Plugins, Row-Level Security, Better Code Quality

Preset

Summary of Superset 0.

Coding 40
article thumbnail

Accelerating Innovation in the Analytic Ecosystem: Flexibility

Teradata

In part 1 of this 3 part series on reducing conflict between business & IT to accelerate innovation, we focus on enabling flexibility for tools, languages & libraries.

IT 52
article thumbnail

Apache Ozone Fault Injection Framework

Cloudera

One of the key challenges of building an enterprise-class robust scalable storage system is to validate the system under duress and failing system components. This includes, but is not limited to: failed networks, failed or failing disks, arbitrary delays in the network or IO path, network partitions, and unresponsive systems. Apache Ozone fault injection framework is designed to validate Ozone under heavy stress and failed or failing system components.

Hadoop 96
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Case Study: eGoGames Esports Platform Uses Rockset for Real-Time Analytics on Gaming Data

Rockset

From business communications and financial transactions to trip planning and activity tracking, much of our lives run through smartphones today. eGoGames will help you add competitive esports to that list. As the first European esports platform for mobile devices, eGoGames offers head-to-head, league, and tournament competition for skill-based mobile games.

BI 40
article thumbnail

Telltale: Netflix Application Monitoring Simplified

Netflix Tech

By Andrei U., Seth Katz , Janak Ramachandran , Jeff Butsch , Peter Lau , Ram Vaithilingam , and Greg Burrell Our Telltale Vision An alert fires and you get paged in the middle of the night. A metric crossed a threshold. You’re half awake and wondering, “Is there really a problem or is this just an alert that needs tuning? When was the last time somebody adjusted our alert thresholds?

article thumbnail

Multi-Threaded Message Consumption with the Apache Kafka Consumer

Confluent

Multithreading is “the ability of a central processing unit (CPU) (or a single core in a multi-core processor) to provide multiple threads of execution concurrently, supported by the operating system.” […].

Kafka 104
article thumbnail

Integration: Apache Kafka & Nifi

RandomTrees

By Anshul Ghogre Introduction Apache NiFiis designed to automate the flow of data between software systems. It is based on the “NiagaraFiles” software previously developed by the NSA, it supports powerful and scalable directed graphs of data routing, transformation, and system mediation logic. Apache Kafka is used for building real-time data pipelines and streaming apps.

Kafka 52
article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Rapid Experimentation and Growth Using Real-Time Analytics

Rockset

You may hear the phrase that the world is moving from batch to real-time a lot. While traditional “business intelligence” has come a long way in the past 20 years, the world of real-time analytics is still in its early days. Traditional BI had its Renaissance moments with the advent of Big Data technologies such as Hadoop, and then cloud data lakes and warehouses have brought everyone to the Modern era.

BI 40
article thumbnail

Computational Causal Inference at Netflix

Netflix Tech

Jeffrey Wong , Colin McFarland Every Netflix data scientist, whether their background is from biology, psychology, physics, economics, math, statistics, or biostatistics, has made meaningful contributions to the way Netflix analyzes causal effects. Scientists from these fields have made many advancements in causal effects research in the past few decades, spanning instrumental variables, forest methods, heterogeneous effects, time-dynamic effects, quantile effects, and much more.