Sat.Jul 20, 2019 - Fri.Jul 26, 2019

article thumbnail

Straining Your Data Lake Through A Data Mesh

Data Engineering Podcast

Summary The current trend in data management is to centralize the responsibilities of storing and curating the organization’s information to a data engineering team. This organizational pattern is reinforced by the architectural pattern of data lakes as a solution for managing storage and access. In this episode Zhamak Dehghani shares an alternative approach in the form of a data mesh.

Data Lake 100
article thumbnail

Convolutional Neural Networks: A Python Tutorial Using TensorFlow and Keras

KDnuggets

Different neural network architectures excel in different tasks. This particular article focuses on crafting convolutional neural networks in Python using TensorFlow and Keras.

Python 123
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

What Should Your Enterprise Expect from its Cloud Analytics Vendor?

Teradata

Large enterprises are investing heavily in cloud-based analytics technologies. What qualities should they be looking for in these cloud vendors? Find out more.

Cloud 69
article thumbnail

Fault Tolerance in Distributed Systems: Tracing with Apache Kafka and Jaeger

Confluent

Using Jaeger tracing, I’ve been able to answer an important question that nearly every Apache Kafka ® project that I’ve worked on posed: how is data flowing through my distributed system? Quick disclaimer: if you’re simply looking for an answer to that question, this post won’t provide that answer directly. Instead, in this post I will point you to an earlier blog post where I already answered that question and then I will focus on what should be your next question: now that I’m relying on Jaege

Kafka 54
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Operational Analytics: What every software engineer should know about low-latency queries on large data sets

Rockset

Introduction to Operational Analytics Operational analytics is a very specific term for a type of analytics which focuses on improving existing operations. This type of analytics, like others, involves the use of various data mining and data aggregation tools to get more transparent information for business planning. The main characteristic that distinguishes operational analytics from other types of analytics is that it is “analytics on the fly," which means that signals emanating from the vari

article thumbnail

This New Google Technique Help Us Understand How Neural Networks are Thinking

KDnuggets

Recently, researchers from the Google Brain team published a paper proposing a new method called Concept Activation Vectors (CAVs) that takes a new angle to the interpretability of deep learning models.

More Trending

article thumbnail

Why I Can’t Wait for Kafka Summit San Francisco

Confluent

The Kafka Summit Program Committee recently published the schedule for the San Francisco event, and there’s quite a bit to look forward to. For starters, it is a two-day event, which means we get to attend 14 talks, miss out on 42 talks (that we’ll later watch on video), and spend two days hanging out with our favorite community friends. While the keynotes have not been announced yet (they will be soon!

Kafka 18
article thumbnail

Is SQL needed to be a data scientist?

KDnuggets

As long as there is ‘data’ in data scientist, Structured Query Language (or see-quel as we call it) will remain an important part of it. In this blog, let us explore data science and its relationship with SQL.

SQL 116
article thumbnail

Top 13 Skills To Become a Rockstar Data Scientist

KDnuggets

Education, coding, SQL, big data platforms, storytelling and more. These are the 13 skills you need to master to become a rockstar data scientist.

Education 123
article thumbnail

Fantastic Four of Data Science Project Preparation

KDnuggets

This article takes a closer look at the four fantastic things we should keep in mind when approaching every new data science project.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Top Certificates and Certifications in Analytics, Data Science, Machine Learning and AI

KDnuggets

Here are the top certificates and certifications in Analytics, AI, Data Science, Machine Learning and related areas.

article thumbnail

A Gentle Introduction to Noise Contrastive Estimation

KDnuggets

Find out how to use randomness to learn your data by using Noise Contrastive Estimation with this guide that works through the particulars of its implementation.

article thumbnail

50% ends Friday – Research Frontiers, AI Kick-start, BootCamp, and Career Expo

KDnuggets

ODSC focuses on research at its conferences and invites the experts pushing the boundaries of AI to speak. Between the two upcoming conferences, researchers from more than 20 of the top research institutes in the country (Open AI, NASA’s JPL, Google, MIT CSAIL, BAIR, The Turing Institute, and Max Planck and more) will deliver talks and lead trainings at ODSC West 2019.

IT 54
article thumbnail

Neural Code Search: How Facebook Uses Neural Networks to Help Developers Search for Code Snippets

KDnuggets

Developers are always searching for answers to questions about their code. But how do they ask the right questions? Facebook is creating new NLP neural networks to help search code repositories that may advance information retrieval algorithms.

Coding 51
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Top KDnuggets tweets, Jul 17-23: Papers with Code: A Fantastic GitHub Resource for Machine Learning

KDnuggets

Also: Data Science Jobs Report 2019: Python Way Up, TensorFlow Growing Rapidly, R Use Double SAS; The Hundred-Page Machine Learning Book Book Review; The Evolution of a ggplot; Notes on Feature Preprocessing: The What, the Why, and the How.

article thumbnail

How to Share Data Science Secrets Without Sacrificing Security

KDnuggets

Learn how to incorporate security into your practices without slowing down your project. Read this ActiveState blog post to learn more.

article thumbnail

High-Quality AI And Machine Learning Data Labeling At Scale: A Brief Research Report

KDnuggets

Analyst firm Cognilytica estimates that as much as 80% of machine learning project time is spent on aggregating, cleaning, labeling, and augmenting machine learning model data. So, how do innovative machine learning teams prepare data in such a way that they can trust its quality, cost of preparation, and the speed with which it’s delivered?

article thumbnail

Wake Forest University: Executive Director, Business Analytics Programs, School of Business [Winston Salem, NC]

KDnuggets

Responsible for operational leadership and management of the Master of Science in Business Analytics programs. Serves as a thought partner with the program Associate Dean to develop and execute program strategy.

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.