Sat.Jun 20, 2020 - Fri.Jun 26, 2020

article thumbnail

Aws Account

Start Data Engineering

1. AWS account Sign up for an AWS account at AWS Sign Up. You will be eligible for some free services for the first time sign up, ref: AWS Free Tier get your access key by clicking on your name -> My Security Credentials on the top pane and then clicking Create New Access Key.

AWS 130
article thumbnail

Bringing Business Analytics To End Users With GoodData

Data Engineering Podcast

Summary The majority of analytics platforms are focused on use internal to an organization by business stakeholders. As the availability of data increases and overall literacy in how to interpret it and take action improves there is a growing need to bring business intelligence use cases to a broader audience. GoodData is a platform focused on simplifying the work of bringing data to employees and end users.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

How Merging Companies Will Give Rise to Unified Data Streams

Confluent

Company mergers are becoming more common as businesses strive to improve performance and grow market share by saving costs and eliminating competition through acquisitions. But how do business mergers relate […].

Data 113
article thumbnail

Modernization Means Simplicity and Sophistication

Teradata

When it comes to being a modern data warehouse, your age really is just a number. It’s the underlying capabilities that actually count. Read more.

article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Aws Emr

Start Data Engineering

EMR AWS EMR is a managed service provided by AWS to run Spark, HDFS, HIVE and other select software.

AWS 130
article thumbnail

Learnings from Distributed XGBoost on Amazon SageMaker

Zalando Engineering

Overview XGBoost is a popular Python library for gradient boosted decision trees. The implementation allows practitioners to distribute training across multiple compute instances (or workers), which is especially useful for large training sets. One tool used at Zalando for deploying production machine learning models is the managed service from Amazon called SageMaker.

More Trending

article thumbnail

How to Leverage Advanced Analytics in the Healthcare Domain

Teradata

Learn how Teradata Vantage's advanced analytics capabilities can analyze and predict useful diagnoses and insights in biomedicine and healthcare.

article thumbnail

Getting Started - Time Series Charts

Preset

In this blog we will understand better what are Time Series and provide some examples of time series visualizations in Superset

40
article thumbnail

Real-Time Recommendations for Event Ticketing Using MongoDB and Rockset

Rockset

When building data-driven applications, it’s been a common practice for years to move analytics away from the source database into either a slave, data warehouse or something similar. The main reason for this is that analytical queries, such as aggregations and joins, tend to require a lot more resources. When running, the detrimental impact on database performance could reverberate back to front-end users and have a negative impact on their experience.

MongoDB 40
article thumbnail

Announcing the Snowflake Sink Connector for Apache Kafka in Confluent Cloud

Confluent

We are excited to announce the preview release of the fully managed Snowflake sink connector in Confluent Cloud, our fully managed event streaming service based on Apache Kafka®. Our managed […].

Kafka 104
article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Big Tech is Poised to Pounce on Banking

Teradata

COVID-19 has changed banking, possibly for ever. But as banks wrestle with the pandemic & its after-effects, they must also focus on a bigger, imminent threat to their existence – & it’s not from FinTechs.

Banking 80
article thumbnail

PgBouncer on Kubernetes and how to achieve minimal latency

Zalando Engineering

Introduction In the new Postgres Operator release 1.5 we have implemented couple of new interesting features , including connection pooling support. Master Wq says there is "No greatest tool", to run something successfully in production one needs to understand pros and cons. Let's try to dig into the topic, and take a look at the performance aspect of connection pooler support, mostly from a scaling perspective.

article thumbnail

Announcing ksqlDB 0.10.0

Confluent

We’re excited to announce the release of ksqlDB 0.10.0, available now in the standalone distribution and on Confluent Cloud! This version includes a first-class Java client, improved Apache Kafka® key […].

Java 90
article thumbnail

Data is the Prize and the Strategy

Teradata

Big Tech wants your data. It will monetize it & deliver great services to your customers. If banks can’t find a way to do the same, they should give up now.

Banking 80
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Reducing the Total Cost of Operations for Self-Managed Apache Kafka

Confluent

We kicked off Project Metamorphosis last month by announcing a set of features that make Apache Kafka® more elastic, one of the most important traits of cloud-native data systems. This […].

Kafka 70