Sat.Mar 05, 2022 - Fri.Mar 11, 2022

article thumbnail

Boost Your AI and ML Skills for Free at NVIDIA Conference

KDnuggets

Four-day conference offers hundreds of learning and development opportunities in AI, ML, DL, robotics, data science and high performance computing for developers at all levels.

article thumbnail

How to make Apache Kafka clients go fast(er) on Confluent Cloud

Confluent

Imagine your team wants to design a data streaming architecture and you’re in charge of creating the prototype. Within a few minutes, you provision a fully managed Apache Kafka® cluster […].

Kafka 124
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Women Leaders in Data Discuss Breaking Bias on International Women’s Day

Cloudera

As an official sponsor of International Women’s Da y, Cloudera is excited to celebrate Women’s History Month and International Women’s Day, and to take up the mantle of this year’s theme #BreakTheBias. . Even in industries where women are underrepresented, like tech, women have made a lot of progress. Progress over many decades has slowly transformed the workplace into an environment where women’s strengths are recognized and valued.

Big Data 113
article thumbnail

Connect Microsoft Azure Services to Vantage

Teradata

This getting started guide describes ‘high-level’ Teradata Vantage connectivity options with the Microsoft Azure Services. Find out more.

98
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Top Posts Feb 28 – Mar 6: The Complete Collection of Data Science Cheat Sheets – Part 2

KDnuggets

Also: Calculus: The hidden building block of machine learning; Decision Tree Algorithm, Explained; Telling a Great Data Story: A Visualization Decision Tree; The Complete Collection of Data Science Cheat Sheets – Part 1.

article thumbnail

An Introduction to Data Mesh

Confluent

Decentralized architectures continue to flourish as engineering teams look to unlock the potential of their people and systems. From Git, to microservices, to cryptocurrencies, these designs look to decentralization as […].

More Trending

article thumbnail

Women of Teradata: Hillary Ashton

Teradata

In honor of Women's History Month & Int'l Women's Day, we are spotlighting Hillary Ashton, Teradata's Chief Product Officer, as she looks back on her career and gives advice to young women in tech.

59
article thumbnail

8 Women in AI Who Are Striving to Humanize the World

KDnuggets

Some exceptional female researchers and engineers are working on projects to make the world a better place with the help of AI, data science, and machine learning.

article thumbnail

Confluent’s Data Streaming Platform Can Save Over $2.5M vs. Self-Managing Apache Kafka

Confluent

If you’re reading this, it’s likely because you are leveraging (or considering) Apache Kafka® in your organization—especially as it has become the de facto standard for data streaming. Adopted by […].

Kafka 62
article thumbnail

#ClouderaLife Spotlight: Vicki Zingiris

Cloudera

March 8 marks International Women’s Day and as we celebrate the accomplishments of dynamic women across the world, I sat across from one such Clouderan, Vicki Zingiris, Director of Value-Based Services. We discussed important initiatives at Cloudera, the influence that Martial Arts has had on how she leads, collaborates, and mentors, and concluded with some valuable advice for women in the workforce. .

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Why Mutability Is Essential for Real-Time Data Analytics

Rockset

This is the first post in a series by Rockset's CTO Dhruba Borthakur on Designing the Next Generation of Data Systems for Real-Time Analytics. We'll be publishing more posts in the series in the near future, so subscribe to our blog so you don't miss them! Posts published so far in the series: Why Mutability Is Essential for Real-Time Data Analytics Handling Out-of-Order Data in Real-Time Analytics Applications Handling Bursty Traffic in Real-Time Analytics Applications SQL and Complex Queries A

article thumbnail

The Significance of Data Quality in Making a Successful Machine Learning Model

KDnuggets

Good quality data becomes imperative and a basic building block of an ML pipeline. The ML model can only be as good as its training data.

article thumbnail

Monte Carlo Named To Enterprise Tech 30 For Second Consecutive Year

Monte Carlo

For the second consecutive year, Monte Carlo was today named to the Enterprise Tech 30 (ET30), an exclusive list of the most promising companies in enterprise technology, as determined by some of the world’s top venture capitalists. Sponsored by Wing Venture Capital and Nasdaq, more than 15,000 private venture-backed companies are considered. The list is then narrowed to 10 early stage ($25 million or less raised), 10 mid stage ($25 to $100 million raised), and 10 late stage ($100 million or mor

article thumbnail

dbt + Machine Learning: What makes a great baton pass?

dbt Developer Hub

Special Thanks: Emilie Schario, Matt Winkler dbt has done a great job of building an elegant, common interface between data engineers, analytics engineers, and any data-y role, by uniting our work on SQL. This unification of tools and workflows creates interoperability between what would normally be distinct teams within the data organization. I like to call this interoperability a “baton pass.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Why I Joined Rockset (With Six Months Hindsight)

Rockset

Photo by Adil from Pexels I’ve found that every startup today fits into one of two categories: A solution that focuses mainly on enhancing a larger solution or platform. A solution that has arrived in the wake of those that have come before. The first category is a symbiotic relationship — think of the shark and remora fish. Even though there is a mutual benefit, one of the parties has a more significant dependency on the other.

MongoDB 52
article thumbnail

Build a Machine Learning Web App in 5 Minutes

KDnuggets

In this article, you will learn to export your models and use them outside a Jupyter Notebook environment. You will build a simple web application that is able to feed user input into a machine learning model, and display an output prediction to the user.

article thumbnail

Treat Your Data Like An Engineering Problem: An Interview with Snowflake Director of Product Management Chris Child

Monte Carlo

Monte Carlo’s Barr Moses sat down with Snowflake Director of Product Management Chris Child to talk about building data platforms at scale, how awesome data teams approach data quality, the role of data observability tools in the modern data stack, and more. To put it simply, to understand modern data engineering, you need to understand Snowflake. And as your data platform becomes productized, you need to get serious about data quality.

article thumbnail

Developer Friendly Application Persistence That Is Fast And Scalable With HarperDB

Data Engineering Podcast

Summary Databases are an important component of application architectures, but they are often difficult to work with. HarperDB was created with the core goal of being a developer friendly database engine. In the process they ended up creating a scalable distributed engine that works across edge and datacenter environments to support a variety of novel use cases.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

New Features in Cloudera Streams Messaging for CDP Public Cloud 7.2.14

Cloudera

With the launch of CDP Public Cloud 7.2.14, Cloudera Streams Messaging for Data Hub deployments has gotten some powerful new features! In this release , the Streams Messaging templates in Data Hub will come with Apache Kafka 2.8 and Cruise Control 2.5 providing new core features and fixes. KConnect has been added and gains additional capabilities with new connectors and Stateless Apache NiFi capabilities which can run NiFi Flows as connectors.

Cloud 102
article thumbnail

How a Level System can Help Forecast AI Costs

KDnuggets

To forecast costs for AI systems, it can be useful to talk about their “level” just like SAE has levels for self-driving cars. Adopting a level system can help organizations plan and prepare for AI systems that scale in complexity over time.

Systems 116
article thumbnail

How Long Does It Take to Learn Data Science Fundamentals?

KDnuggets

This article discusses 2 levels of data science learning, and the amount of time that will need to go into each. From 6 months to 4 years, this write-up covers a number of skills and how long it takes to acquire them.

article thumbnail

How To Use Synthetic Data To Overcome Data Shortages For Machine Learning Model Training

KDnuggets

It takes time and considerable resources to collect, document, and clean data before it can be used. But there is a way to address this challenge – by using synthetic data.

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Building a Tractable, Feature Engineering Pipeline for Multivariate Time Series

KDnuggets

A time series feature engineering pipeline requires different transformations such as imputation and window aggregation, which follows a sequence of stages. This article demonstrates the building of a pipeline to derive multivariate time series features such that the features can then be easily tracked and validated.

Building 110
article thumbnail

Open Data and Why it is Necessary

KDnuggets

Open data improves accessibility and encourages universal participation, which allows companies to create cutting-edge, data-driven technologies and make the world a better place.

IT 105
article thumbnail

Data Science: Reality vs Expectations

KDnuggets

In the majority of companies, the executives in charge of data science and the decision-making process using data science, have little or no education or understanding in actual data science. Where does this leave you, the data scientist?

article thumbnail

5 Data Science Projects to Learn 5 Critical Data Science Skills

KDnuggets

Learn these to take any data science project idea from brainstorm to deployment.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

New Ways of Sharing Code Blocks for Data Scientists

KDnuggets

Share the interactive code blocks to impress your colleagues or post it on social media.

Coding 147
article thumbnail

Adding an attention mechanism to RNNs

KDnuggets

This article is an excerpt from the book Machine Learning with PyTorch and Scikit-learn is the new book from the widely acclaimed and bestselling Python Machine Learning series, fully updated and expanded to cover PyTorch, transformers, graph neural networks, and best practices.

article thumbnail

KDnuggets News March 9, 2022: Build a Machine Learning Web App in 5 Minutes; 5 Applications of Computer Vision

KDnuggets

This week's top posts are: Build a Machine Learning Web App in 5 Minutes by Natassha Selvaraj; 5 Applications of Computer Vision by Devin Partida; 5 Data Science Projects to Learn 5 Critical Data Science Skills by Nate Rosidi.

article thumbnail

Using Data Science to Make Clean Energy More Equitable

KDnuggets

Here are some lessons inspired by a recent panel the author moderated about how data scientists can help put equity into practice.

article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating