Sat. Sep 05, 2020 - Fri. Sep 11, 2020

Simplify Your Data Architecture With The Presto Distributed SQL Engine

Data Engineering Podcast

Summary: Databases are limited in scope to the information that they directly contain. For analytical use cases, you often want to combine data across multiple sources and storage locations. This frequently requires cumbersome and time-consuming data integration. To address this problem, Martin Traverso and his colleagues at Facebook built the Presto distributed query engine.

Cloudera Named Leader in The Forrester Wave: Notebook-Based Predictive Analytics and Machine Learning, Q3 2020

Cloudera

Cloudera has been named a Leader in The Forrester Wave: Notebook-Based Predictive Analytics and Machine Learning, Q3 2020. At Cloudera, we are committed to always staying at the forefront of data and analytics innovation, enabling enterprises to work with data more effectively and deliver analytic results across the business quickly and securely. For enterprise machine learning teams, this means having the right platform, tools, and processes that streamline end-to-end ML to tackle once-impossibl

The Cause and Effect of Supply Chain Fragility, and How to Fix It

Teradata

The fragility of your supply chain existed long before COVID-19 brought it into sharp relief. Discover the secret to true supply chain resilience.


Seamlessly Swapping the API backend of the Netflix Android app

Netflix Tech

How we migrated our Android endpoints out of a monolith into a new microservice, by Rohan Dhruva and Ed Ballot. As Android developers, we usually have the luxury of treating our backends as magic boxes running in the cloud, faithfully returning us JSON. At Netflix, we have adopted the Backend for Frontend (BFF) pattern: instead of having one general purpose “backend API”, we have one backend per client (Android/iOS/TV/web).


Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

How to introduce Data Science at your company

DareData

Machine Learning, Data Science and Artificial Intelligence are three terms that have been intertwined and used in multiple conversations during the past decade. In the business world, probably no other theme has caused so many questions, doubts, raised eyebrows and El Dorado hopes. If you are reading this post, you might have some level of interest in understanding what Data Science, Machine Learning or Artificial Intelligence are and, trust me, you are not alone in the world.

Covid-19 Accelerates The Need for Retail, Manufacturing Supply Chains To Adapt

Cloudera

The ongoing disruption to critical supply chains in both the manufacturing and retail space has seen businesses having to respond quickly, turning to data, analytics, and new technologies to better predict and manage ‘real-time’ business disruptions. To find out more about how COVID-19 has impacted the manufacturing and retail industries, Vijay Raja, Director of Industry & Solutions Marketing at Cloudera, sat down for a round-table discussion with Michael Ger, Managing Director of Manufactu

More Trending

Test-data management support in Test Automation Development

Data Science Blog: Data Engineering

Data is central to the testing of many applications because data is critical to organizations. Businesses are becoming more data-driven, and hence it is imperative that Automation Test developers understand and fully harness the value of test data during Test Automation development. The test data involved in both Manual and Automation testing encompasses the test-data inputs, test-data outputs, and the test-data flow.

Deploying Confluent Operator on Red Hat OpenShift Container Platform on AWS

Confluent

Confluent Operator allows you to deploy and manage Confluent Platform as a cloud-native, stateful container application on Kubernetes and OpenShift. The automation provided by Kubernetes, Operator, and Helm greatly simplifies […].


How-to: Index Data from S3 Using CDP Data Hub

Cloudera

This blog post will present a simple “hello world” kind of example on how to get data that is stored in S3 indexed and served by an Apache Solr service hosted in a Data Discovery and Exploration cluster in CDP. For the curious: DDE is a pre-templated, Solr-optimized cluster deployment option in CDP, recently released in tech preview. We will only cover AWS and S3 environments in this blog.


Accelerate your Data Migration to Snowflake

RandomTrees

Snowflake Overview: A data warehouse is a critical part of any business organization. Many cloud-based data warehouses are available in the market today; of these, let us focus on Snowflake. Snowflake is an analytical data warehouse that is provided as Software-as-a-Service (SaaS). Built on a new SQL database engine, it provides a unique architecture designed for the cloud.

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

Teradata: An Enduring Legacy

Teradata

Teradata’s legacy of success is based upon three building blocks: People, Technology, and Partnership. Learn how a small piece of that legacy began and grew.

Implementing Message Prioritization in Apache Kafka

Confluent

Users of messaging technologies such as JMS and AMQP often use message prioritization so that messages can be processed in a different order based on their importance. It doesn’t take […].


The Future Of The Telco Industry And Impact Of 5G & IoT – Part 1

Cloudera

Technologies like IoT, edge computing and 5G are changing the face of CSPs. Communication Service Providers (CSPs) are in the middle of a data-driven transformation. The current scale and pace of change in the Telecommunications sector is being driven by the rapid evolution of new technologies like the Internet of Things (IoT), 5G, advanced data analytics, and edge computing.

Top Marketing Challenges for Tech Companies

Grouparoo

Martech Challenges in 2020: In the process of starting Grouparoo, we interviewed a hundred people who work in Marketing at various levels and roles. They spanned levels from independent contributors to executives and covered a wide range of marketing disciplines including Marketing Ops, Marketing Automation, Product Marketing, and more. Across our interviews, we heard about a diversity of experiences, but we heard a few common themes: Marketing’s scope is increasing; Marketing is becoming more and

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

To Integrate or Not to Integrate Data? That is the Question.

Teradata

Learn why a data-centric organization requires an objective approach to manage and integrate its data.


Accelerate AI & ML projects using Databricks

RandomTrees

Databricks and its role in Data Prep for AI solutions: Databricks is a buzzword in Data Science, and for many reasons. Apache Spark is widely used to work with massive amounts of data, in petabytes or even more. Apache Spark is an open-source, fast cluster computing system and a highly popular framework for big data analysis. This framework processes data in parallel, which helps to boost performance.


Operational Database Security – Part 1

Cloudera

In this blog post, we are going to take a look at some of the OpDB-related security features of a CDP Private Cloud Base deployment. We are going to talk about encryption, authentication and authorization. Data-at-rest encryption: Transparent data-at-rest encryption is available through the Transparent Data Encryption (TDE) feature in HDFS. TDE provides the following features: Transparent, end-to-end encryption of data.

Meet Boris Malensek, Our Head Of Engineering In Merchant Operations

Zalando Engineering

We spoke about his professional journey within Zalando, the evolution of Merchant Operations, and the engineering culture within the company. The interview was initially conducted for Zalando’s External Talent Community. Boris, let’s go back to the start. What attracted you to Zalando in the first place? The main reason for my attraction to Zalando was how quickly the company was able to adapt to change.

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

Data Champions: Balancing IT and Business Needs

Cloudera

Digital transformation has been on the agenda for a long time, but the sudden need to respond to the unprecedented challenges of 2020 has meant the buzzword has become an executable reality for many enterprises. I recently came across a KPMG report that revealed that 80% of executives are increasing investments in emerging technologies now, to drive higher realized value in the future.


Building an effective data approach in a hybrid cloud world – part 3

Cloudera

In our last two posts, we talked with Deloitte’s Marc Beierschoder and Martin Mannion respectively about the requirement organizations have to deploy their data and analytics quickly into a hybrid environment. On top of that, there is the fundamental aspect of consistent security and governance of your enterprise data cloud and the need for multiple users with different requirements to access data flexibly.
