Sat. Oct 31, 2020 - Fri. Nov 06, 2020

The Journey Begins

Team Data Science

Week 1: 10/9/20 - 10/16/20 In my quest to further improve my overall data science skills, I pulled the trigger on October 9th, 2020, and enrolled in a Data Engineering boot camp led by Andreas Kretz. First, a little bit about myself. I have a background in Aerospace Engineering and have been in the industry for close to 15 years now. A little more than a year ago, I decided to pivot to Machine Learning and Data Science.

Add Version Control To Your Data Lake With LakeFS

Data Engineering Podcast

Summary Data lakes are gaining popularity due to their flexibility and reduced cost of storage. Along with the benefits there are some additional complexities to consider, including how to safely integrate new data sources or test out changes to existing pipelines. In order to address these challenges the team at Treeverse created LakeFS to introduce version control capabilities to your storage layer.

Data Lake 100

Cloudera’s Pivot to a Virtual Internship Program

Cloudera

Typically, running smooth and successful internship programs requires in-person interactions with high touchpoints. From onboarding and regular meetings to coffee chats and welcome events to meet the team – it takes a lot to integrate a new intern. They’re not only new to the organization but new to the workforce, after all. Yet, with most tech companies going fully remote, Early Talent teams had to consider their options.

Announcing Pull Queries in Preview in Confluent Cloud ksqlDB

Confluent

“Persistent” queries have historically formed the basis of ksqlDB applications, which continuously transform, enrich, aggregate, materialize, and join your Apache Kafka® data using a familiar SQL interface. ksqlDB continuously executes […].

Cloud 77

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enhance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

Branding Yourself

Team Data Science

Week 2: 10/16/20 - 10/23/20 Week 2 of the course consists of Modules 3 & 4. If you have not read my first blog go here. Module 3 focuses on creating a professional LinkedIn profile. Your LinkedIn profile is the world's access to you and how you want to be seen professionally. Below is a screenshot. So here, I have a professionally taken photograph, what I am interested in below, and the 'About' section that summarizes me in a professional sense.

Connect Teradata Vantage to Salesforce Data With Azure Data Factory

Teradata

This "how-to" guide will help you to connect Teradata Vantage using the Native Object Store feature to query Salesforce data sourced by Microsoft Azure Data Factory.

Data 59

More Trending

What’s New in Confluent Cloud Security

Confluent

Today, the ability to capture and harness the value of data in real time is critical for businesses to remain competitive in a data-driven world. Apache Kafka®, a scalable, open-source, […].

Cloud 52

Liquidity Monitoring: Dislocation

Ripple Engineering

In a recent post, my teammate Jennifer Xia outlined our motivation and initial direction for tracking XRP liquidity in support of RippleNet’s On-Demand Liquidity (ODL) service. ODL leverages the digital asset XRP to facilitate cross-border payments by sourcing destination currencies right at the time of payment. Jennifer’s post introduces the concept of order books and defines the implied FX rate, that is, the FX rate implied by a pair of trades bridged through XRP.

Finance 52

Power BI Template App for SalesForce

FreshBI

So, what is a Power BI Template App? A Power BI Template App is a published Power BI solution that can be used by any company that has the data platform for which the Template App was created. Wouldn’t it be nice to pick your entire Power BI Solution off the shelf, one crafted for your specific business needs and your specific data structure? Power BI Template Apps are designed to be such an out-of-the-box solution, and this blog post presents an example: a Power BI Solution for Salesforce.

BI 52

What Happened to the CEO in Waiting?

Teradata

Since the 2008 financial crisis the CFO's role has turned inward, & they have lost influence. What role should they play in the Bank of the Future, and can data be their savior?

Banking 52

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

Build a Slack Dashboard (Part 3): Transforming Data and Creating Cross Channel Visualizations

Preset

Build a beautiful Slack dashboard using open source tools Meltano and Superset. Part 3 of 3.

Keeping Netflix Reliable Using Prioritized Load Shedding

Netflix Tech

How viewers are able to watch their favorite show on Netflix while the infrastructure self-recovers from a system failure. By Manuel Correa, Arthur Gonigberg, and Daniel West. Getting stuck in traffic is one of the most frustrating experiences for drivers around the world. Everyone slows to a crawl, sometimes for a minor issue or sometimes for no reason at all.

An Overview of Real Time Data Warehousing on Cloudera

Cloudera

Users today are asking ever more from their data warehouse. This is resulting in advancements of what is provided by the technology, and a resulting shift in the art of the possible. As an example of this, in this post we look at Real Time Data Warehousing (RTDW), which is a category of use cases customers are building on Cloudera and which is becoming more and more common amongst our customers.

Data Quality at Airbnb

Airbnb Tech

Part 1 —  Rebuilding at Scale Authors: Jonathan Parks, Vaughn Quoss, Paul Ellwood Introduction At Airbnb, we’ve always had a data-driven culture. We’ve assembled top-notch data science and engineering teams, built industry-leading data infrastructure, and launched numerous successful open source projects, including Apache Airflow and Apache Superset.

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Responding to Security Vulnerabilities in Open Source Project

Preset

Preset's commitment to security in Apache Superset™

Project 40

How insurers can better deliver at “The Moment of Truth”

Cloudera

It’s all about the Customer. Customers today expect services to be highly personalized. In a digital world tuned to understand your likes, dislikes, interests and preferences we expect a similar level of customization in all aspects of our lives. Insurance is no different. Insurance is not something the average consumer thinks about every day but when a life changing event happens, insurance becomes extremely important.

Insurance 111

The Security Challenges of Data Warehousing in the Cloud

Cloudera

Many organizations struggle to meet growing and variable data warehouse demands. No matter how much they pad their annual IT budgets, there never seems to be enough capacity to cover unexpected business requests. This leads to resource restrictions for the various business units that use the platform. When business units are not well served by central IT, “shadow IT” emerges.

Cloud 69

Cloudera at BioData World Congress 2020 – Use Cases at Top 5 Pharmaceutical Organizations

Cloudera

BioData World Congress 2020 is next week, and I am looking forward to the opportunity to meet with decision makers and thought leaders working in omics, diagnostics and R&D from across Europe and beyond. Cloudera’s work with BioPharma organizations helps them link clinical and business knowledge with analytics expertise to drive patient-level insights and operational decision making in a dynamic environment.

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.