Sat.Oct 01, 2022 - Fri.Oct 07, 2022

article thumbnail

The Art and Science of Data Storytelling with Brent Dykes

Jesse Anderson

My guest this week is Brent Dykes , Founder and Chief Data Storyteller at Analytics Hero. Before he founded his own company, he was at Omniture, Adobe, and Domo. Analytics Hero is a consulting business based around data storytelling Data storytelling was a new concept to me. Brent defines it as “as a structured approach for communicating insights to a targeted audience using narrative elements and explanatory visuals.

article thumbnail

The ABCs of NLP, From A to Z

KDnuggets

There is no shortage of tools today that can help you through the steps of natural language processing, but if you want to get a handle on the basics this is a good place to start. Read about the ABCs of NLP, all the way from A to Z.

Process 160
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Gain Visibility And Insight Into Your Supply Chains Through Operational Analytics Powered By Roambee

Data Engineering Podcast

Summary The global economy is dependent on complex and dynamic networks of supply chains powered by sophisticated logistics. This requires a significant amount of data to track shipments and operational characteristics of materials and goods. Roambee is a platform that collects, integrates, and analyzes all of that information to provide companies with the critical insights that businesses need to stay running, especially in a time of such constant change.

Metadata 100
article thumbnail

What’s New in Apache Kafka 3.3

Confluent

Apache Kafka 3.3 includes KRaft mode, improves partition scalability and resiliency while simplifying Kafka deployment, as well as updates to Kafka Streams, Connect, and more.

Kafka 113
article thumbnail

Beyond the Basics of A/B Tests: Innovative Experimentation Tactics You Need to Know as a Data or Product Professional

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Does Cost Reduction Play a Role in Digital Transformation?

Cloudera

Digital transformation. Everyone has their own ideas about what digital transformation means, so I decided to look up a few definitions. . Gartner : “Digital transformation can refer to anything from IT modernization (for example, cloud computing), to digital optimization, to the invention of new digital business models.”. CIO blog post : “Digital transformation is a foundational change in how an organization delivers value to its customers.”.

article thumbnail

Key-Value Databases, Explained

KDnuggets

Among the four big NoSQL database types, key-value stores are probably the most popular ones due to their simplicity and fast performance. Let’s further explore how key-value stores work and what are their practical uses.

Database 158

More Trending

article thumbnail

Introducing Stream Designer: The Visual Builder for Streaming Data Pipelines

Confluent

Confluent’s new Stream Designer is the industry’s first visual interface for rapidly building, testing, and deploying streaming data pipelines natively on Apache Kafka.

article thumbnail

Scaling Kafka Brokers in Cloudera Data Hub

Cloudera

This blog post will provide guidance to administrators currently using or interested in using Kafka nodes to maintain cluster changes as they scale up or down to balance performance and cloud costs in production deployments. Kafka brokers contained within host groups enable the administrators to more easily add and remove nodes. This creates flexibility to handle real-time data feed volumes as they fluctuate.

Kafka 79
article thumbnail

How to Get Up and Running with SQL – A List of Free Learning Resources

KDnuggets

We have compiled a list of the top free resources to help new data practitioners learn SQL. These include free online courses and resources to get the most out of your SQL skills.

SQL 133
article thumbnail

Hyper-scale time series forecasting done right

Teradata

There are various approaches to doing time-series forecasting. Amongst all the approaches, the right way is using an in-database approach. Read more to find out why.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Bringing Data Into Real Time: What You Missed at Current 2022

Confluent

Current 2022 is a wrap! Here are some of the top keynote speeches, exciting new data streaming technologies, popular sessions, and where to find videos online.

Data 104
article thumbnail

PyTorch Infra's Journey to Rockset

Rockset

Open source PyTorch runs tens of thousands of tests on multiple platforms and compilers to validate every change as our CI (Continuous Integration). We track stats on our CI system to power custom infrastructure, such as dynamically sharding test jobs across different machines developer-facing dashboards, see hud.pytorch.org , to track the greenness of every change metrics, see hud.pytorch.org/metrics , to track the health of our CI in terms of reliability and time-to-signal Our requirements for

AWS 52
article thumbnail

AI in FinTech: Managing the Finance of the Future

KDnuggets

Digital transformation is evolving, and so is the fintech industry by implementing AI trends and leveraging several benefits, such as optimizing productivity, increasing ROI, and enhancing security.

Finance 131
article thumbnail

What is SQL? What are its Applications and Benefits?

Emeritus

Everyone leveraged data from small-scale enterprises to Fortune 500 companies to ensure efficient operations. Frontrunners like the MAANG companies (Meta, Amazon, Apple, Netflix, and Google), have vast databases that hold a wide range of customer data. Here is where database management systems like MS Access come in, and a programming language known as Structured Query… The post What is SQL?

SQL 52
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Confluent for Startups: Get it right from the start

Confluent

Announcing Confluent for Startups! Get started with Apache Kafka, leverage our data streaming expertise, and set your business up with the best infrastructure for scale and success.

IT 57
article thumbnail

Organizing Talent: Return of the Data Center of Excellence

Monte Carlo

Will Larson (writer of An Elegant Puzzle – recommended read) may have said it best when he wrote that one of the best kinds of reorganization is the one you don’t do. However, data leaders inevitably reach a point where, due to team growth or evolving business demands, things just don’t work. Faced with these challenges, data organizations may swing back-and-forth between centralized vs. decentralized organizational structures until they achieve the right balance.

Data 52
article thumbnail

NLP Interview Questions

KDnuggets

What is NLP, and what types of questions related to NLP can you expect at the NLP-related job interviews?

157
157
article thumbnail

What Is Data Observability? Everything You Need To Know

Meltano

A recent study by Gartner predicted that only 20% of analytic insights will lead to business outcomes this year. Given that organizations are collecting higher volumes of data now than ever before, this seems like cause for concern. So what’s the problem? Where is this prediction coming from? The problem seems to be that many fail to achieve data agility and spend too much time troubleshooting data errors.

Data 52
article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Event Streaming Architectures to Solve Problems for FinServ

Confluent

From real-time banking and mobile payments, learn how Apache Kafka and Confluent are powering the financial services industry with event-driven architecture for modern use cases.

article thumbnail

Reducing the Time to Value of your dbt Deployment with Slim CI

phData: Data Engineering

So you’ve been using dbt for a bit now… You have all of your transformations in dbt and your deployments are executing flawlessly, plus you noticed your development velocity has greatly increased. However, as your dbt repo has grown, you’ve begun to see that your deployments are taking even longer. You’ve spent a lot of time tagging your code to optimize your data refreshes, and while your refreshes run quickly, your deployments aren’t.

Cloud 52
article thumbnail

Interview Kickstart Data Science Interview Course — What Makes It Different?

KDnuggets

Interview Kickstart’s Data Science Interview Course is built by Data Scientists from MAANG and other big tech companies, the course promises to get you interview-ready in 15 weeks.

article thumbnail

A Brief Overview of Real-time Data

Striim

Traditionally, historical data (or batch data) was used for decision-making. However, lately, there’s a lot of focus on real-time data, which provides more business value. According to a survey by McKinsey , high-performing businesses are almost five times more likely to use real-time data, as compared to their counterparts. Real-time data is gaining prominence because it can help end-users to make decisions on the fly, allowing for more accurate and faster decision-making.

Media 52
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

article thumbnail

The Significance of O’Reilly’s Data Quality Fundamentals

Monte Carlo

In November of 2020, O’Reilly Media first approached us with the idea to author Data Quality Fundamentals: A Practitioner’s Guide to Building More Trustworthy Data Pipelines. It was an inflection point for a fledgling company that had only just begun to establish the category of data observability. We knew it wouldn’t be an easy feat, but we also knew it would be worthwhile – and important Poor data quality is one of the foremost challenges of our industry, and certainly one of the m

article thumbnail

Expensive Enterprise Hacks That Serve As A Lesson In Cybersecurity

U-Next

Every now and then, we come across news on data and network breaches in enterprises we thought had the most sophisticated and airtight cybersecurity measures. The fact is that, exploiters are not just becoming smarter but more creative as well. New avenues and loopholes are being exploited to infiltrate into networks and systems to extract sensitive data and information from businesses. .

Media 52
article thumbnail

Debunking the Myth of the Citizen Data Scientist

KDnuggets

While there are some benefits to having citizen data scientists, they are no silver bullet – and they certainly aren’t a replacement for true data scientists.

Data 115
article thumbnail

What Are the Benefits of a Multi-Cluster Warehouse in Snowflake? | Propel Data Analytics Blog

Propel Data

In Snowflake, you allocate “virtual warehouses” (computing clusters) to execute the SQL database commands that you run on the data platform.

article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

Data Governance and Strategy for the Global Enterprise

Cloudera

While the word “data” has been common since the 1940s, managing data’s growth, current use, and regulation is a relatively new frontier. . Governments and enterprises are working hard today to figure out the structures and regulations needed around data collection and use. According to Gartner, by 2023 65% of the world’s population will have their personal data covered under modern privacy regulations. .

article thumbnail

Automating Your Transformation Pipeline with dbt

phData: Data Engineering

So you’ve built your first set of transformations in dbt , but now you need to figure out how to automate your deployment and code changes to your various environments. However, you’re not sure where to even start planning, let alone making sure that you’re sticking to best practices (whether that’s running your code on a schedule or having it run based on certain actions within your git repository).

article thumbnail

Top Posts September 26 – October 2: Free Algorithms in Python Course

KDnuggets

Free Algorithms in Python Course • How to Select Rows and Columns in Pandas • Lessons from a Senior Data Scientist • A Day in the Life of a Data Scientist: Expert vs. Beginner • 7 Machine Learning Portfolio Projects to Boost the Resume.

Algorithm 108
article thumbnail

DataOps Observability: Taming the Chaos (part 1)

DataKitchen

Part 1: Defining the Problems. This is the first post in DataKitchen’s four-part series on DataOps Observability. Observability is a methodology for providing visibility of every journey that data takes from source to customer value across every tool, environment, data store, team, and customer so that problems are detected and addressed immediately.

article thumbnail

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.