Sat.Apr 23, 2022 - Fri.Apr 29, 2022

article thumbnail

15 Python Coding Interview Questions You Must Know For Data Science

KDnuggets

Solving the Python coding interview questions is the best way to get ready for an interview. That’s why we’ll lead you through 15 examples and five concepts these questions cover.

Coding 157
article thumbnail

Gain Visibility Into Your Entire Machine Learning System Using Data Logging With WhyLogs

Data Engineering Podcast

Summary There are very few tools which are equally useful for data engineers, data scientists, and machine learning engineers. WhyLogs is a powerful library for flexibly instrumenting all of your data systems to understand the entire lifecycle of your data from source to productionized model. In this episode Andy Dang explains why the project was created, how you can apply it to your existing data systems, and how it functions to provide detailed context for being able to gain insight into all o

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Introducing Current 2022: The Next Generation of Kafka Summit

Confluent

Data streaming is a new category of technology that is reshaping the way businesses operate, but there hasn’t been a place for everyone in the ecosystem to come together and […].

Kafka 104
article thumbnail

Emerging Risks are Systemic

Teradata

Managing the new class of emerging risks requires infusing the principles of resiliency and efficient risk analytics into traditional risk management frameworks.

Systems 97
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Top 5 Free Cloud Notebooks in 2022

KDnuggets

Create and collaborate on data science projects or train machine learning models using free cloud Jupyter notebook platforms. You get a hassle-free IDE experience and free compute resources.

Cloud 151
article thumbnail

#ClouderaLife Spotlight: Susana López Huertas, Senior Account Manager

Cloudera

April is Autism Awareness Month, and as we close out the month I sat down with Clouderan Susana L ó pez Huertas, who shared her story of raising a son with autism and the work she is doing to promote an environment where autistic adults can thrive in the workforce. . Meet Susana L ó pez Huertas. Susana, who has been a part of Cloudera for about a year, works out of the Madrid office as a senior account manager for the country’s Telecom, Media, and Central Public Sector accounts.

More Trending

article thumbnail

Big tech versus the airlines – who’s going to win in the modern retailing battle?

Teradata

Find out why data analytics and connectivity will be the difference between retailing taking off and being grounded.

Retail 98
article thumbnail

Data Scientist, Data Engineer & Other Data Careers, Explained

KDnuggets

In this article, we will have a look at five distinct data careers, and hopefully provide some advice on how to get one's feet wet in this convoluted field.

article thumbnail

Reflections of a Rockset UXer

Rockset

It is often said time flies when you are having fun and I couldn't agree more. I have been at Rockset for almost three years now and it is still so interesting to me. On one hand, I am just getting started and have so much more to do and on the other, I am so proud of the distance we have covered in the last few years! Photo by Daoudi Aissa on Unsplash Our customers tell us that the work we are doing matters to them: Rockset made me a hero on day three of my new job.

Medical 52
article thumbnail

3 Simple Steps For Snowflake Cost Optimization Without Getting Too Crazy

Monte Carlo

Most data pros know Snowflake’s pricing model is consumption based–you pay for what you use. What many don’t know is Snowflake actually WANTS you to optimize your costs and has provided helpful features to rightsize your consumption. Waste isn’t good for anyone. Instead of spinning cycles on deteriorated SQL queries, the data cloud provider would rather have you focus those Snowflake credits toward projects like building data apps.

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Beyond Matrix Factorization: Using hybrid features for user-business recommendations

Yelp Engineering

Yelp’s mission is to connect people with great local businesses. On the Recommendations & Discovery team, we sift through billions of users-business interactions to learn user preferences. Our solutions power several products across Yelp such as personalized push notifications, email engagement campaigns, the home feed, Collections and more. Here we discuss the generalized user to business recommendation model which is crucial to a lot of these applications.

article thumbnail

How Metadata Improves Security, Quality, and Transparency

KDnuggets

Metadata is the data providing context about the data, more than what you see in the rows and columns. By managing your metadata, you're effectively creating an encyclopedia of your data assets.

Metadata 149
article thumbnail

Streaming Data and Real-Time Analytics With Kafka + Rockset

Rockset

As Kafka Summit is in full swing in London this week and the topic of event streaming is all over my Linkedin feed, I saw a post asking " Is streaming dead? " referring to CNN+ being shut down. In the last few days, Netflix took a once-in-a-lifetime beating in the stock market , and CNN redefined fail fast ( pioneered by Silicon Valley ) when it announced the breaking news that it will shut down CNN+ just weeks after a very splashy debut.

Kafka 52
article thumbnail

Scribd is presenting at Data and AI Summit 2022

Scribd Technology

We are very excited to be presenting and attending this year’s Data and AI Summit which will be hosted virtually and physically in San Francisco from June 27th-30th. Throughout the course of 2021 we completed a number of really interesting projects built around delta-rs and the Databricks platform which we are thrilled to share with a broader audience.

Kafka 40
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Operational Analytics At Speed With Minimal Busy Work Using Incorta

Data Engineering Podcast

Summary A huge amount of effort goes into modeling and shaping data to make it available for analytical purposes. This is often due to the need to simplify the final queries so that they are performant for visualization or limited exploration. In order to cut down the level of effort involved in making data usable, Matthew Halliday and his co-founders created Incorta as an end-to-end, in-memory analytical engine that removes barriers to insights on your data.

article thumbnail

Best Data Science Career Tracks of 2022

KDnuggets

Top-rated data science tracks consist of multiple project-based courses covering all aspects of data. It includes an introduction to Python/R, data ingestion & manipulation, data visualization, machine learning, and reporting.

article thumbnail

A Window Into the Future of Data in Motion and What It Means for Businesses

Cloudera

Modern businesses have vast amounts of data at their fingertips and are acutely aware of how enterprise data strategies positively impact business outcomes. Despite this, only a handful of organisations interact with all stages of the data life cycle process to truly distill information that distinguishes future-ready businesses from the rest. Much potential remains untapped when businesses do not translate their data into actionable insights from the point it is created, eroding the usefulness

IT 94
article thumbnail

Operation-Based SLOs

Zalando Engineering

Anyone who has been following the topic of Site Reliability Engineering (SRE) has likely heard of Service Level Objectives (SLOs) , and Service Level Indicators (SLIs). SLIs and SLOs are at the core of the SRE practices. They are fundamental to establish the balance between building new features on a product, shipping fast, or working on the reliability of that product.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Evolution of ML Fact Store

Netflix Tech

by Vivek Kaushal At Netflix, we aim to provide recommendations that match our members’ interests. To achieve this, we rely on Machine Learning (ML) algorithms. ML algorithms can be only as good as the data that we provide to it. This post will focus on the large volume of high-quality data stored in Axion?—?our fact store that is leveraged to compute ML features offline.

article thumbnail

KDnuggets Top Posts for March 2022: Why Are So Many Data Scientists Quitting Their Jobs?

KDnuggets

Also: 8 Free MIT Courses to Learn Data Science Online; Build a Machine Learning Web App in 5 Minutes; Best Data Science Books for Beginners; Linear vs Logistic Regression; and more!

article thumbnail

Data Is Now a Team Sport

Cloudera

This week I participated in an informative event that Cloudera hosted with TechCrunch: Data and the Culture Transformation. The event was moderated by tech industry analyst Maribel Lopez, and we were joined by Shirley Collie, chief health analytics actuary at Discovery Health in South Africa. The conversations focused on how company data cultures are rapidly evolving and delivering new levels of value to businesses with the emergence of data ecosystems.

article thumbnail

DataOps Explained: How To Not Screw It Up

Monte Carlo

What is DataOps? DataOps is a discipline that merges data engineering and data science teams to support an organization’s data needs, in a similar way to how DevOps helped scale software engineering. Similar to how DevOps applies CI/CD to software development and operations, DataOps entails a CI/CD-like, automation-first approach to building and scaling data products.

IT 64
article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

KDnuggets News, April 27: A Brief Introduction to Papers With Code; Machine Learning Books You Need To Read In 2022

KDnuggets

A Brief Introduction to Papers With Code; Machine Learning Books You Need To Read In 2022; Building a Scalable ETL with SQL + Python; 7 Steps to Mastering SQL for Data Science; Top Data Science Projects to Build Your Skills.

article thumbnail

Want to Use Your Data Skills to Solve Global Problems? Here’s What You Need to Know

KDnuggets

Global risk management is an arena where data brings order to an unpredictable world. Johns Hopkins University’s part-time Master of Arts in Global Risk (online) takes just 18 to 21 months to complete. This multidisciplinary program helps professionals develop the skills to make forward-looking decisions that contribute to risk management.

article thumbnail

Connecting the Knowledge Ecosystem

KDnuggets

We’re proud to announce that the 4th annual Knowledge Graph Conference is taking place on May 2-6 at Cornell Tech, NYC and virtually on Airmeet.

107
107
article thumbnail

Data Management: How to Stay on Top of Your Customer’s Mind?

KDnuggets

Extract, profile, and manage your customer data in a flash with customer data management solutions, and achieve a customer-centric culture.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Top Data Science Projects to Build Your Skills

KDnuggets

Check out this list of data science project ideas that you can use to boost your skills, organized by level of expertise.

article thumbnail

Why You Need To Learn Python In 2022

KDnuggets

If you don’t already know a programming language, or if you’re deciding to choose another language, have a read and see if Python is for you.

Python 106
article thumbnail

Getting Deep Learning working in the wild: A Data-Centric Course

KDnuggets

Data-centric learning resources are somewhat scattered today, and that’s why we developed a new Data Centric Deep Learning course on the co:rise education platform. It is an introduction to a set of approaches and best practices, for people who are trying to do deep learning in the wild.

article thumbnail

7 Steps to Mastering SQL for Data Science

KDnuggets

SQL is a must-know for anyone working in the data industry. Here’s how you can learn it from scratch.

SQL 108
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating