Top Data Engineering Digest Kafka Google Cloud Content for Week of Jul 18

Sat.Jul 18, 2020 - Fri.Jul 24, 2020

AWS RDS PostgreSQL Setup

Start Data Engineering

JULY 18, 2020

RDS AWS RDS is a managed service provided by AWS to run a relational database. We will see how to setup a postgres instance using AWS RDS. Log in to your AWS account. Go to Services -> RDS Click on Create Database, In the Create Database prompt, choose Standard Create option with PostgreSQL as engine type. In the Template section choose Free Tier and type in a DB Identifier, Master username and Master password.

PostgreSQL

PostgreSQL AWS Relational Database Database

Introducing Domain-Oriented Microservice Architecture

Uber Engineering

JULY 23, 2020

Introduction. Recently there has been substantial discussion around the downsides of service oriented architectures and microservice architectures in particular. While only a few years ago, many people readily adopted microservice architectures due to the numerous benefits they provide such as … The post Introducing Domain-Oriented Microservice Architecture appeared first on Uber Engineering Blog.

Architecture

Architecture Engineering

Join 16,000+

Insiders

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Webinars

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

MORE WEBINARS

Trending Sources

Doing Good with Data: Teradata's COVID-19 Resiliency Dashboard

Teradata

JULY 19, 2020

To help our customers navigate the world's new normal, our teams have created a business-centric, execution-focused tool – we call it the Resiliency Dashboard.

Data

Data IT

Webinars

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Understanding User Needs and Satisfying Them

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

MORE WEBINARS

Making Wind Energy More Efficient With Data At Turbit Systems

Data Engineering Podcast

JULY 20, 2020

Summary Wind energy is an important component of an ecologically friendly power system, but there are a number of variables that can affect the overall efficiency of the turbines. Michael Tegtmeier founded Turbit Systems to help operators of wind farms identify and correct problems that contribute to suboptimal power outputs. In this episode he shares the story of how he got started working with wind energy, the system that he has built to collect data from the individual turbines, and how he is

Systems

Systems Manufacturing Machine Learning Algorithm

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

Database

Stitch S3 DB Integration

Start Data Engineering

JULY 18, 2020

Given Source S3 path and file delimiter data warehouse connection details (endpoint, port, username, password and database name) data warehouse schema name and table name Run frequency Steps Log into your stitch account, here Click on the Destination tab and use the data warehouse connection details to establish a destination database. Click on Add Integration button on your dashboard.

Data Warehouse

Data Warehouse Database Data

Measuring and Monitoring a Stream Processing Cloud Service: Inside Confluent Cloud ksqlDB

Confluent

JULY 20, 2020

While preparing for the launch of Confluent Cloud ksqlDB, the ksqlDB Team built a system of metrics and monitoring that enabled insight into the experience of operating ksqlDB, the associated […].

Cloud

Cloud Process Systems

Machine Learning for a Better Developer Experience

Netflix Tech

JULY 20, 2020

Stanislav Kirdey , William High Imagine having to go through 2.5GB of log entries from a failed software build?—?3 million lines?—?to search for a bug or a regression that happened on line 1M. It’s probably not even doable manually! However, one smart approach to make it tractable might be to diff the lines against a recent successful build, with the hope that the bug produces unusual lines in the logs.

Machine Learning

Machine Learning Algorithm Data Science Coding

More Trending

Machine Learning for a Better Developer Experience

Netflix Tech

JULY 20, 2020

Machine Learning

Machine Learning Algorithm Data Science Coding

Why You Need to Treat Models Like Data

Teradata

JULY 20, 2020

Models are crucial in creating business value and need to be handled with care. That's why models should be treated like data. Learn more.

Data

Stitch Database to data warehouse Integration

Start Data Engineering

JULY 18, 2020

Given Source database connection details (endpoint, port, username, password and database name) Source table to replicate destination schema name run frequency can be set to 10min We are assuming the destination data warehouse is already setup in stitch. Steps Log into your stitch account. here Click on Add Integration button on your dashboard. Choose PostgreSQL option as the integration in the next page.

Data Warehouse

Data Warehouse Database PostgreSQL Data

Data Privacy, Security, and Compliance for Apache Kafka

Confluent

JULY 21, 2020

Why data privacy for Apache Kafka®? As companies seek to leverage all forms of data for competitive advantage, there is a growing regulatory and reputational risk that calls for the […].

Kafka

Kafka Data Programming

Sharing Code in Next.JS Apps with Plugins

Grouparoo

JULY 22, 2020

At Grouparoo, our front-end website is built using React and Next.js. Next.js is an excellent tool made by Vercel that handles all the hard parts of making a React app for you - Routing, Server-side Rendering, Page Hydration and more. It includes a simple starting place to build your routes and pages, based on the file system. If you want a /about page, just make an /pages/about.tsx file!

Coding

Coding Project Building Process

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

Certification

I’ve got the latest tech – now I’m a data business, right?

Teradata

JULY 23, 2020

Without a data strategy, led from the top and encompassing the whole bank, tech investments run the risk of becoming expensive pet projects, driven by fads & siloed thinking.

Banking

Banking Data Project

Building Real-Time Data Architectures to Foster Innovation

Rockset

JULY 21, 2020

Lessons from scaling facebook's online data infrastructure There are 3 growth numbers that stand out when I look back at the hyper-growth years of facebook from 2007 until 2015, when I was managing facebook's online data infrastructure team: user growth, team growth and infrastructure growth. Facebook’s user base grew from ~50 million monthly active users to a billion and half during that time, which is about a 30x growth.

Data Architecture

Data Architecture Architecture Building Programming Language

Improved Robustness and Usability of Exactly-Once Semantics in Apache Kafka

Confluent

JULY 24, 2020

This blog post talks about the recent improvements on exactly-once semantics (EOS) to make it simpler to use and more resilient. EOS was first released in Apache Kafka® 0.11 and […].

Kafka

Kafka IT

Time-Series Bar Charts in Apache Superset

Preset

JULY 20, 2020

Diving deep into time-series bar charts in Superset.

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Data Science

Move Up the (Data) Property-Ladder

Teradata

JULY 21, 2020

Soon, retailers will need to run millions of queries everyday just to compete. But scaling to these levels needs an enterprise-wide strategy to overcome current barriers.

Retail

Retail Data

Improving MongoDB Read Performance - Indexing, Replication and Sharding

Rockset

JULY 23, 2020

Read performance is crucial for databases. If it takes too long to read a record from a database, this can stall the request for data from the client application, which could result in unexpected behavior and adversely impact user experience. For these reasons, the read operation on your database should last no more than a fraction of a second. There are a number of ways to improve database read performance, though not all of these methods will work for every type of application.

MongoDB

MongoDB Database Project SQL

Data Enrichment in ksqlDB Using UDTFs

Confluent

JULY 23, 2020

This blog post applies to ksqlDB version 0.8.1 and later. Keeping a datacenter up and running is no walk in the park. It’s a job that involves mind-boggling amounts of […].

Data

Data Process

Apache Kafka as a Service with Confluent Cloud Now Available on AWS Marketplace

Confluent

JULY 22, 2020

You may already know that Confluent Cloud is available across AWS, Azure, and Google Cloud, allowing you to access the amazing stack built by Confluent including a battle-tested version of […].

AWS

AWS Cloud Google Cloud Kafka

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Building

Sat.Jul 18, 2020 - Fri.Jul 24, 2020

AWS RDS PostgreSQL Setup

Introducing Domain-Oriented Microservice Architecture

Webinars

Trending Sources

Doing Good with Data: Teradata's COVID-19 Resiliency Dashboard

Webinars

Making Wind Energy More Efficient With Data At Turbit Systems

Get Better Network Graphs & Save Analysts Time

Stitch S3 DB Integration

Measuring and Monitoring a Stream Processing Cloud Service: Inside Confluent Cloud ksqlDB

Machine Learning for a Better Developer Experience

Sign up to get articles personalized to your interests!

More Trending

Machine Learning for a Better Developer Experience

Why You Need to Treat Models Like Data

Stitch Database to data warehouse Integration

Data Privacy, Security, and Compliance for Apache Kafka

Sharing Code in Next.JS Apps with Plugins

Understanding User Needs and Satisfying Them

I’ve got the latest tech – now I’m a data business, right?

Building Real-Time Data Architectures to Foster Innovation

Improved Robustness and Usability of Exactly-Once Semantics in Apache Kafka

Time-Series Bar Charts in Apache Superset

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Move Up the (Data) Property-Ladder

Improving MongoDB Read Performance - Indexing, Replication and Sharding

Data Enrichment in ksqlDB Using UDTFs

Apache Kafka as a Service with Confluent Cloud Now Available on AWS Marketplace

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Stay Connected