Sat.Mar 07, 2020 - Fri.Mar 13, 2020

article thumbnail

20+ Machine Learning Datasets & Project Ideas

KDnuggets

Upgrading your machine learning, AI, and Data Science skills requires practice. To practice, you need to develop models with a large amount of data. Finding good datasets to work with can be challenging, so this article discusses more than 20 great datasets along with machine learning project ideas for you to tackle today.

Datasets 160
article thumbnail

Why We Leverage Multi-tenancy in Uber’s Microservice Architecture

Uber Engineering

The performance of Uber’s services relies on our ability to quickly and stably launch new features on our platform , regardless of where the corresponding service lives in our tech stack. Foundational to our platform’s power is its microservice-based architecture … The post Why We Leverage Multi-tenancy in Uber’s Microservice Architecture appeared first on Uber Engineering Blog.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Scaling Data Governance For Global Businesses With A Data Hub Architecture

Data Engineering Podcast

Summary Data governance is a complex endeavor, but scaling it to meet the needs of a complex or globally distributed organization requires a well considered and coherent strategy. In this episode Tim Ward describes an architecture that he has used successfully with multiple organizations to scale compliance. By treating it as a graph problem, where each hub in the network has localized control with inheritance of higher level controls it reduces overhead and provides greater flexibility.

article thumbnail

Sharpening your Stream Processing Skills with Kafka Tutorials

Confluent

In the Apache Kafka® ecosystem, ksqlDB and Kafka Streams are two popular tools for building event streaming applications that are tightly integrated with Apache Kafka. While ksqlDB and Kafka Streams […].

Kafka 113
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

New Poll: Coronavirus impact on AI/Data Science/Machine Learning community

KDnuggets

Has coronavirus impacted your conference or other travel plans, and do you anticipate it causing further professional or educational disruption in the near future? Take part in the new KDnuggets poll and have your say.

article thumbnail

Query Lambdas: Increasing Developer Velocity for Application Development

Rockset

At Rockset we strive to make building modern data applications easy and intuitive. Data-backed applications come with an inherent amount of complexity - managing the database backend, exposing a data API (often using hard-coded SQL or an ORM to write queries), keeping the data and application code in sync. the list goes on. Just as Rockset has reimagined and dramatically simplified the traditional ETL pipeline on the data-loading side , we’re now proud to release a new product feature - Query La

SQL 52

More Trending

article thumbnail

Women in Tech: Growing Business and Shaping Culture at Confluent

Confluent

Every year on March 8th, Confluent is proud to celebrate International Women’s Day, a global holiday dedicated to honoring the accomplishments of women and advocating for gender equality around the […].

84
article thumbnail

Resources for Women in AI, Data Science, and Machine Learning

KDnuggets

For the international women's day, we feature resources to help more women enter and succeed in AI, Big Data, Data Science, and Machine Learning fields.

article thumbnail

How to work remotely at Zalando

Zalando Engineering

This document is heavily informed by remote work guidance from other companies and authors. Notable sources include FYI's 11 Best Practices for Working Remotely and Laurel Farrer’s How to Design Powerful Rituals for Successful Distributed Companies. Special thanks to Timo from GiantSwarm for sharing learnings in an ad-hoc phone call. Other sources are linked in the appendix.

article thumbnail

An Introduction to Teradata’s R and Python Package Bundles for Vantage Table Operators

Teradata

In the final part of this 3-part series, Tim Miller describes how to run R and Python in-database in Vantage using SCRIPT Table Operators.

Python 52
article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Kafka Summit London 2020 Update

Confluent

Given the growing concern and global impact of COVID-19 (better known as the coronavirus), we’ve made the decision to cancel the upcoming Kafka Summit London. While this decision was incredibly […].

Kafka 78
article thumbnail

50 Must-Read Free Books For Every Data Scientist in 2020

KDnuggets

In this article, we are listing down some excellent data science books which cover the wide variety of topics under Data Science.

article thumbnail

Stay informed about Covid-19 with your Superset Dashboard

Preset

Stay up to date on the Coronavirus cases with **Superset** dashboard & Public Data

Data 40
article thumbnail

Reflecting on my Career in Data for Women's History Month

Teradata

Monica Woolmer recaps her career in technology and business as an homage to Women's History Month.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Confluent’s Commitment to Our Customers, Employees, and Community Amid COVID-19 (Coronavirus)

Confluent

As the impact of COVID-19 (coronavirus) continues to spread, our top priority is the health and well-being of our customers, employees, and community. We are acutely aware that these are […].

65
article thumbnail

A Crash Course in Game Theory for Machine Learning: Classic and New Ideas

KDnuggets

Game theory is experiencing a renaissance driven by the evolution of AI. What are some classic and new ideas that data scientists should be aware of.

article thumbnail

Ready for changes with Hexagonal Architecture

Netflix Tech

by Damir Svrtan and Sergii Makagon As the production of Netflix Originals grows each year, so does our need to build apps that enable efficiency throughout the entire creative process. Our wider Studio Engineering Organization has built more than 30 apps that help content progress from pitch (aka screenplay) to playback: ranging from script content acquisition, deal negotiations and vendor management to scheduling, streamlining production workflows, and so on.

article thumbnail

Building a Mature Machine Learning Team

KDnuggets

After spending a lot of time thinking about the paths that software companies take toward ML maturity, this framework was created to follow as you adopt ML and then mature as an organization. The framework covers every aspect of building a team including product, process, technical, and organizational readiness, as well as recognizes the importance of cross-functional expertise and process improvements for bringing AI-driven products to market.

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Decision Boundary for a Series of Machine Learning Models

KDnuggets

I train a series of Machine Learning models using the iris dataset, construct synthetic data from the extreme points within the data and test a number of Machine Learning models in order to draw the decision boundaries from which the models make predictions in a 2D space, which is useful for illustrative purposes and understanding on how different Machine Learning models make predictions.

article thumbnail

How To Build Your Own Feedback Analysis Solution

KDnuggets

Automating the analysis of customer feedback will sound like a great idea after reading a couple hundred reviews. Building an NLP solution to provide in-depth analysis of what your customers are thinking is a serious undertaking, and this guide helps you scope out the entire project.

Building 110
article thumbnail

The Most Useful Machine Learning Tools of 2020

KDnuggets

This articles outlines 5 sets of tools every lazy full-stack data scientist should use.

article thumbnail

Python Pandas For Data Discovery in 7 Simple Steps

KDnuggets

Just getting started with Python's Pandas library for data analysis? Or, ready for a quick refresher? These 7 steps will help you become familiar with its core features so you can begin exploring your data in no time.

Python 105
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

article thumbnail

Covid-19, your community, and you — a data science perspective

KDnuggets

Let's talk about covid-19; the reality, the numbers, and the data science.

article thumbnail

Google Open Sources TFCO to Help Build Fair Machine Learning Models

KDnuggets

A new optimization framework helps to incorporate fairness constraints in machine learning models.

article thumbnail

Math for Programmers!

KDnuggets

Math for Programmers teaches you the math you need to know for a career in programming, concentrating on what you need to know as a developer.

article thumbnail

Generate Realistic Human Face using GAN

KDnuggets

This article contain a brief intro to Generative Adversarial Network(GAN) and how to build a Human Face Generator.

Building 109
article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

Few-Shot Image Classification with Meta-Learning

KDnuggets

Here is how you can teach your model to learn quickly from a few examples.

article thumbnail

KDnuggets™ News 20:n10, Mar 11: What impact is the coronavirus having on the AI/Data Science/Machine Learning community?; Recreating Fingerprints using Convolutional Autoencoders

KDnuggets

Also: Recreating Fingerprints using Convolutional Autoencoders; A simple and interpretable performance measure for a binary classifier; Resources for Women in AI, Data Science, and Machine Learning; Trends in Machine Learning in 2020; A Crash Course in Game Theory for Machine Learning; and much more.

article thumbnail

The Berlin Rent Freeze: How many illegal overpriced offers can I find online?

KDnuggets

This post presents an analysis of Berlin online real estate listings, investigating a controversial law capping rents in the state, which went into effect on February 23. Are current landlords already respecting the new rent cap?

article thumbnail

Software Interfaces for Machine Learning Deployment

KDnuggets

While building a machine learning model might be the fun part, it won't do much for anyone else unless it can be deployed into a production environment. How to implement machine learning deployments is a special challenge with differences from traditional software engineering, and this post examines a fundamental first step -- how to create software interfaces so you can develop deployments that are automated and repeatable.

article thumbnail

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.