Sat.Nov 09, 2019 - Fri.Nov 15, 2019

article thumbnail

Designing For Data Protection

Data Engineering Podcast

Summary The practice of data management is one that requires technical acumen, but there are also many policy and regulatory issues that inform and influence the design of our systems. With the introduction of legal frameworks such as the EU GDPR and California’s CCPA it is necessary to consider how to implement data protectino and data privacy principles in the technical and policy controls that govern our data platforms.

Designing 100
article thumbnail

How to Speed up Pandas by 4x with one line of code

KDnuggets

While Pandas is the library for data processing in Python, it isn't really built for speed. Learn more about the new library, Modin, developed to distribute Pandas' computation to speedup your data prep.

Coding 122
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Rich Model, Poor Model

Teradata

An integrated data foundation allows data science models to be more accurate, actionable and engage more customers. Find out how your model can positively impact your bottom line.

article thumbnail

Workforce Analytics is Reinventing HR

U-Next

Introduction to Workforce Analytics Today, the need to understand what attracts skillful individuals to join an organization, stay motivated, and deliver outstanding results has become more important than ever. However, this is not a task which can be shouldered by the HR team alone; they need the right tools to deliver optimal results. Over the years, organizations around the globe have spent billions of dollars on employee performance analysis, talent recruitment, leadership training, and deve

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Page Simulator

Netflix Tech

Page Simulation for Better Offline Metrics at Netflix by David Gevorkyan , Mehmet Yilmaz , Ajinkya More , Gaurav Agrawal , Richard Wellington , Vivek Kaushal , Prasanna Padmanabhan , Justin Basilico At Netflix, we spend a lot of effort to make it easy for our members to find content they will love. To make this happen, we personalize many aspects of our service, including which movies and TV shows we present on each member’s homepage.

article thumbnail

How Data Analytics Can Assist in Fraud Detection

KDnuggets

A primary advantage of data analytics tools is that they can handle massive quantities of information at once. These solutions typically learn what's normal within a collection of information and how to spot anomalies.

More Trending

article thumbnail

The Complete Data Science LinkedIn Profile Guide

KDnuggets

With so many Data Scientists showing up on LinkedIn, it's time to make sure your profile is top-notch because your talent is still highly sought after. Recruitment specialists want to find you fast, and this guide will help you create the best profile to feature your expertise.

article thumbnail

Transfer Learning Made Easy: Coding a Powerful Technique

KDnuggets

While the revolution of deep learning now impacts our daily lives, these networks are expensive. Approaches in transfer learning promise to ease this burden by enabling the re-use of trained models -- and this hands-on tutorial will walk you through a transfer learning technique you can run on your laptop.

Coding 92
article thumbnail

Beginners Guide to the Three Types of Machine Learning

KDnuggets

The following article is an introduction to classification and regression — which are known as supervised learning — and unsupervised learning — which in the context of machine learning applications often refers to clustering — and will include a walkthrough in the popular python library scikit-learn.

article thumbnail

Topics Extraction and Classification of Online Chats

KDnuggets

This article provides covers how to automatically identify the topics within a corpus of textual data by using unsupervised topic modelling, and then apply a supervised classification algorithm to assign topic labels to each textual document by using the result of the previous step as target labels.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

How to Visualize Data in Python (and R)

KDnuggets

Producing accessible data visualizations is a key data science skill. The following guidelines will help you create the best representations of your data using R and Python's Pandas library.

Python 87
article thumbnail

On the sensationalism of artificial intelligence news

KDnuggets

With artificial intelligence and machine learning now a mainstay of our daily awareness, news organizations have been seen to overstate the reality behind progress in the field. Learn more about recent examples of media hyperbole and explore why this may be happening.

Media 84
article thumbnail

Python Lists and List Manipulation

KDnuggets

In Python, lists store an ordered collection of items which can be of different types. This post is an overview of lists and their manipulation.

Python 88
article thumbnail

Research Guide for Depth Estimation with Deep Learning

KDnuggets

In this guide, we’ll look at papers aimed at solving the problems of depth estimation using deep learning.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Testing Your Machine Learning Pipelines

KDnuggets

Let’s take a look at traditional testing methodologies and how we can apply these to our data/ML pipelines.

article thumbnail

How I Got Better at Machine Learning

KDnuggets

Check out this author's collection of tips and tricks that I learned over the years to get better at Machine Learning.

article thumbnail

Python Workout / Practices of a Python Pro / Classic Computer Science Problems in Python

KDnuggets

Whether you’re a beginner or an expert, there’s always new ways you can improve your Python coding. Save 40% off this trio of Manning Python books today! Just enter the code nlpropython40 at checkout when you buy from manning.com.

Python 59
article thumbnail

Tips for a cost-effective machine learning project

KDnuggets

Spoiler: you don’t need a VM running 24/7 to handle 16 requests a day.

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

AI ROI: The Questions You Need To Be Asking

KDnuggets

During this free Metis Corporate Training webinar, Dec 5 @ 12pm ET, Kerstin Frailey, Senior Data Scientist and Head of Executive Corporate Training at Metis, will walk through what you need to ask before, during, and after the lifetime of a data science project to accurately assess its impact on the business.

article thumbnail

KDnuggets™ News 19:n43, Nov 13: Dynamic Reports in Python and R; Creating NLP Vocabularies; What is Data Science?

KDnuggets

On KDnuggets this week: Orchestrating Dynamic Reports in Python and R with Rmd Files; How to Create a Vocabulary for NLP Tasks in Python; What is Data Science?; The Complete Data Science LinkedIn Profile Guide; Set Operations Applied to Pandas DataFrames; and much, much more.

article thumbnail

Top Stories, Nov 4-10: 10 Free Must-read Books on AI

KDnuggets

Also: Understanding Boxplots; Probability Learning: Maximum Likelihood; Designing Your Neural Networks; Facebook Has Been Quietly Open Sourcing Some Amazing Deep Learning Capabilities for PyTorch; 5 Statistical Traps Data Scientists Should Avoid.

article thumbnail

How to Extract Google Maps Coordinates

KDnuggets

In this article, I will show you how to quickly extract Google Maps coordinates with a simple and easy method.

62
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

article thumbnail

Top KDnuggets tweets, Nov 06-12: 10 FREE must-read ebooks on AI. Things just keep getting more interesting in the field, so use these resources to stay up to speed.

KDnuggets

Also: It's time to make your Data Science LinkedIn profile ready for recruiters.; Python Libraries for Interpretable Machine Learning - KDnuggets; Process your data with Pandas up to 4x faster with this new Python library.; How to Extract Google Maps Coordinates.

article thumbnail

MLOps for production-level machine learning [Nov 14 Webinar]

KDnuggets

This live webinar, Nov 14 @ 12pm EST, on MLOps for production-level machine learning, will detail MLOps, a compound of “machine learning” and “operations”, a practice for collaboration and communication between data scientists and operations professionals to help manage the production machine learning lifecycle. Register now.

article thumbnail

Understanding NLP and Topic Modeling Part 1

KDnuggets

In this post, we seek to understand why topic modeling is important and how it helps us as data scientists.

IT 61
article thumbnail

Facebook Adds This New Framework to It’s Reinforcement Learning Arsenal

KDnuggets

ReAgent is a new framework that streamlines the implementation of reasoning systems.

Systems 51
article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

Page Simulator

Netflix Tech

Page Simulation for Better Offline Metrics at Netflix by David Gevorkyan , Mehmet Yilmaz , Ajinkya More , Gaurav Agrawal , Richard Wellington , Vivek Kaushal , Prasanna Padmanabhan , Justin Basilico At Netflix, we spend a lot of effort to make it easy for our members to find content they will love. To make this happen, we personalize many aspects of our service, including which movies and TV shows we present on each member’s homepage.

article thumbnail

Page Simulator

Netflix Tech

Page Simulation for Better Offline Metrics at Netflix by David Gevorkyan , Mehmet Yilmaz , Ajinkya More , Gaurav Agrawal , Richard Wellington , Vivek Kaushal , Prasanna Padmanabhan , Justin Basilico At Netflix, we spend a lot of effort to make it easy for our members to find content they will love. To make this happen, we personalize many aspects of our service, including which movies and TV shows we present on each member’s homepage.