Sat. Jul 27, 2019 - Fri. Aug 02, 2019

Simplifying Data Integration Through Eventual Connectivity

Data Engineering Podcast

Summary: The ETL pattern that has become commonplace for integrating data from multiple sources has proven useful but complex to maintain. For a small number of sources it is a tractable problem, but as the overall complexity of the data ecosystem continues to expand, it may be time to identify new ways to tame the deluge of information. In this episode Tim Ward, CEO of CluedIn, explains the idea of eventual connectivity as a new paradigm for data integration.

Top 10 Best Podcasts on AI, Analytics, Data Science, Machine Learning

KDnuggets

Check out our latest Top 10 Most Popular Data Science and Machine Learning podcasts available on iTunes. Stay up to date in the field with these recent episodes and join in with the current data conversations.

Evolution of Netflix Conductor: v2.0 and beyond

Netflix Tech

By Anoop Panicker and Kishore Banala. Conductor is a workflow orchestration engine developed and open-sourced by Netflix. If you’re new to Conductor, this earlier blog post (Netflix Conductor: A microservices orchestrator) and the documentation should help you get started and acclimatized to Conductor. In the last two years since inception, Conductor has seen wide adoption and is instrumental in running numerous core workflows at Netflix.

Enterprise Data Strategy: The Upside of Scarce Funding

Teradata

In a cost-cutting culture, directly linking data projects to top business initiatives is a good way to keep them from getting clipped. Learn more.

Data 73

Beyond the Basics of A/B Tests: Innovative Experimentation Tactics You Need to Know as a Data or Product Professional

Speaker: Timothy Chan, PhD, Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Crafting the Perfect Internship Playlist

Pandora Engineering

Credit: Kanok Sulaiman. Disclaimer: These are my experiences from being a Pandora software developer intern in the summer of 2019. All opinions expressed are my own, and represent no one except myself. I recently spent the last summer of my undergraduate program as an intern for Pandora Media in Oakland, CA. I gained a lot from my experience, and I’m writing this post to detail the application process, the lessons that I learned, and the company culture.

Java 52

7 Tips for Dealing With Small Data

KDnuggets

At my workplace, we produce a lot of functional prototypes for our clients. Because of this, I often need to make Small Data go a long way. In this article, I’ll share 7 tips to improve your results when prototyping with small datasets.

Datasets 121

More Trending

Why Multi-Dimensional Personalization is Worth the Investment

Teradata

It's not enough just to drive personalization in your marketing efforts; you need to take a multi-dimensional approach. Find out why it's worth the investment.

IT 49

Introducing Confluent Platform 5.3

Confluent

Confluent Platform 5.3 delivers the new Confluent Operator for cloud-native automation on Kubernetes, a redesigned Confluent Control Center user interface to simplify how you manage event streams, and a preview of Role-Based Access Control for enterprise-grade security. Over the past year, we’ve been amazed at how fast Confluent Platform has matured within our user base, both in terms of size and criticality of deployments.

Kafka 19

Ten more random useful things in R you may not know about

KDnuggets

I had a feeling that R has developed as a language to such a degree that many of us are using it now in completely different ways. This means that there are likely to be numerous tricks, packages, functions, etc. that each of us uses, but that others are completely unaware of and would find useful if they knew about them.

IT 119

Understanding Tensor Processing Units

KDnuggets

The Tensor Processing Unit (TPU) is Google's custom tool to accelerate machine learning workloads using the TensorFlow framework. Learn more about what TPUs do and how they can work for you.

Process 119

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

PyTorch Cheat Sheet for Beginners and Udacity Deep Learning Nanodegree

KDnuggets

This cheat sheet should be easier to digest than the official documentation and should serve as a transitional tool that gets students and beginners reading the documentation sooner.

How a simple mix of object-oriented programming can sharpen your deep learning prototype

KDnuggets

By mixing simple concepts of object-oriented programming, like functionalization and class inheritance, you can add immense value to deep learning prototyping code.

A 2019 Guide to Object Detection

KDnuggets

Object detection has been applied widely in video surveillance, self-driving cars, and object/people tracking. In this piece, we’ll look at the basics of object detection and review some of the most commonly used algorithms, as well as a few brand-new approaches.

Algorithm 113

Easily Deploy Deep Learning Models in Production

KDnuggets

Deploying trained neural networks in applications and services can pose challenges for infrastructure managers. Challenges like multiple frameworks, underutilized infrastructure, and lack of standard implementations can even cause AI projects to fail. This blog explores how to navigate these challenges.

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Here’s how you can accelerate your Data Science on GPU

KDnuggets

Data Scientists need computing power. Whether you’re processing a big dataset with Pandas or running some computation on a massive matrix with NumPy, you’ll need a powerful machine to get the job done in a reasonable amount of time.

Opening Black Boxes: How to leverage Explainable Machine Learning

KDnuggets

A machine learning model that predicts some outcome provides value. One that explains why it made the prediction creates even more value for your stakeholders. Learn how Interpretable and Explainable ML technologies can help while developing your model.

What 70% of Data Science Learners Do Wrong

KDnuggets

Lessons learned from repeatedly smashing my head with a 2-meter-long metal pole for a college engineering course.

Can we trust AutoML to go on full autopilot?

KDnuggets

We put an AutoML tool to the test on a real-world problem, and the results are surprising. Even with automatic machine learning, you still need expert data scientists.

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

A Data Science Playbook for explainable ML/xAI

KDnuggets

This technical webinar on Aug 14 discusses traditional and modern approaches for interpreting black box models. Additionally, we will review cutting edge research coming out of UCSF, CMU, and industry.

P-values Explained By Data Scientist

KDnuggets

This article is designed to give you a full picture, from constructing a hypothesis test to understanding the p-value and using it to guide your decision-making process.

Data 88

GPU Accelerated Data Analytics & Machine Learning

KDnuggets

The future is here! Speed up your Machine Learning workflow with support from the RAPIDS Python libraries.

Exploring Python Basics

KDnuggets

This free ebook is a great resource for data science beginners, providing a good introduction to Python, coding with Raspberry Pi, and using Python to build predictive models.

Python 70

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies that are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground up, graduating from just getting started with data-driven development to operating

Five Command Line Tools for Data Science

KDnuggets

You can do more data science than you think from the terminal.

Decentralized and Collaborative AI: How Microsoft Research is Using Blockchains to Build More Transparent Machine Learning Models

KDnuggets

Recently, AI researchers from Microsoft open-sourced the Decentralized & Collaborative AI on Blockchain project, which enables the implementation of decentralized machine learning models based on blockchain technologies.

Top KDnuggets tweets, Jul 24-30: Nothing but NumPy: Understanding and Creating Neural Nets w. Computational Graphs from Scratch; How Netflix works

KDnuggets

How Netflix works: the (hugely simplified) complex stuff that happens every time; Top Certificates and Certifications in Analytics, Data Science, ML; Nothing but NumPy: Understanding & Creating Neural Networks with Computation.

Top Stories, Jul 22-28: Top 13 Skills To Become a Rockstar Data Scientist; This New Google Technique Helps Us Understand How Neural Networks are Thinking

KDnuggets

Also: Convolutional Neural Networks: A Python Tutorial Using TensorFlow and Keras; Fantastic Four of Data Science Project Preparation; The Death of Big Data and the Emergence of the Multi-Cloud Era; The title CDO started out as a joke.

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

South Dakota State University: Data Visualization Developer and Analyst [Brookings, SD]

KDnuggets

South Dakota State University is seeking a Data Visualization Developer and Analyst in Brookings, SD, to create business intelligence tools and reports to support the use of a campus-wide business intelligence and decision support system, compile multiple visualizations into intuitive dashboards for campus-wide use, and more.

Monash University: Lecturer / Sr Lecturer – Blockchain [Melbourne, Australia]

KDnuggets

Seeking a Lecturer / Sr Lecturer for the Monash Blockchain Technology Centre (Monash BTC): a visionary enterprise that will bring together world-leading expertise from across Monash to explore, develop and innovate the technology of blockchain, in collaboration with various industry and societal sectors.

KDnuggets™ News 19:n28, Jul 31: Top 13 Skills To Become a Rockstar Data Scientist; Best Podcasts on AI, Analytics, Data Science

KDnuggets

Learn the essential skills needed to become a Data Science rockstar; Understand CNNs with Python + TensorFlow + Keras tutorial; Discover the best podcasts about AI, Analytics, Data Science; and find out where you can get the best Certificates in the field.

Are We Ready to Partner With Machines? Data Science Salon Miami, September 10-11

KDnuggets

When it comes to AI, there’s plenty of talk of the future of machines. But it’s the people behind AI development who have the insights needed to shape that future. Register now to catch all of our speakers at the Data Science Salon Miami, Sep 10-11, 2019.

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.