Sat.Aug 05, 2023 - Fri.Aug 11, 2023

article thumbnail

Why Is Data Modeling So Challenging – How To Data Model For Analytics

Seattle Data Guy

Learning about how to data models from basic star schemas on the internet is like learning data science using the IRIS data set. It works great as a toy example. But it doesn’t match real life at all. Data modeling in real life requires you fully understand the data sources and your business use cases.… Read more The post Why Is Data Modeling So Challenging – How To Data Model For Analytics appeared first on Seattle Data Guy.

article thumbnail

A senior engineer/EM job search story

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of five topics from today’s subscriber-only The Pulse issue. To get full issues twice a week, subscribe here.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Senior Engineer – The Number One Skill

Confessions of a Data Guy

Do you think I’m just trying to get you to click? Maybe. Maybe not. After working in and around Data Teams for well over a decade, with both the smartest people to touch the keyboard, and the others, it’s become quite clear to me what the number one skill that identifies a Senior level Engineering […] The post Senior Engineer – The Number One Skill appeared first on Confessions of a Data Guy.

article thumbnail

_spark_metadata in Apache Spark Structured Streaming issue is no more!

Waitingforcode

There are probably not that many people working today on the flat files with Structured Streaming than 5 years ago thanks to the table file formats. However, if you are in this group and are still generating CSVs or JSONs with the streaming sink, brace yourself, the memory problems are coming if you don't take action!

130
130
article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

Quantifying The Return On Investment For Your Data Team

Data Engineering Podcast

Summary As businesses increasingly invest in technology and talent focused on data engineering and analytics, they want to know whether they are benefiting. So how do you calculate the return on investment for data? In this episode Barr Moses and Anna Filippova explore that question and provide useful exercises to start answering that in your company.

article thumbnail

Are reports of StackOverflow’s fall greatly exaggerated?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of five topics from today’s subscriber-only The Pulse issue. To get full issues twice a week, subscribe here.

Retail 172

More Trending

article thumbnail

Supercharging your Rust static executables with mimalloc

Tweag

Why link statically against musl? Have you ever faced compatibility issues when dealing with Linux binary executables? The culprit is often the libc implementation, glibc. Acting as the backbone of nearly all Linux distros, glibc is the library responsible for providing standard C functions. Yet, its version compatibility often poses a challenge. Binaries compiled with a newer version of glibc may not function on systems running an older one, creating a compatibility headache.

article thumbnail

Startup Spotlight: Tesorio Helps Finance Teams Tackle Cash Flow Challenges

Snowflake

Welcome to Snowflake’s Startup Spotlight, where we learn about awesome companies building businesses on Snowflake. Can accounts receivable be an agent of change? Tesorio Co-Founder and CTO Fabio Fleitas thinks so, and his startup’s AI/ML-driven platform aims to give finance teams better control over their cash flow so they can have greater impact on their organizations’ success.

Finance 90
article thumbnail

Fixit 2: Meta’s next-generation auto-fixing linter

Engineering at Meta

Fixit is dead! Long live Fixit 2 – the latest version of our open-source auto-fixing linter. Fixit 2 allows developers to efficiently build custom lint rules and perform auto-fixes for their codebases. Fixit 2 is available today on PyPI. Python is one of the most popular languages in use at Meta. Meta’s production engineers (PEs) are specialized software engineers (SWEs) who focus on reliability, efficiency, and scalability.

Python 84
article thumbnail

Data Scientists Need to Specialize to Survive the Tech Winter

KDnuggets

In this article, I explore the benefits of specialization for data scientists. Drawing on my own experience as a data scientist, I argue that specializing in a specific area can help you stand out in a crowded job market and provide you with more fulfilling career opportunities.

Data 95
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

How to execute your operating model for Data and AI

databricks

In Part 1 of this blog series, we discussed how Databricks enables organizations to develop, manage and operate processes that extract value from.

Data 90
article thumbnail

Reimagining a classic Cheysson thematic map

ArcGIS

Here's a a re-think on a classic. I'll rationalize some data-viz choices and layout choices and end up with something completely different.

Data 83
article thumbnail

Confluent Champion: Niki Kapsi’s Journey From SDR to Commercial Account Executive

Confluent

Meet Commercial AE Niki Kapsi and learn about the “entrepreneurial” side of her role at Confluent.

98
article thumbnail

Overcoming Barriers in Multi-lingual Voice Technology: Top 5 Challenges and Innovative Solutions

KDnuggets

Voice assistants like Siri, Alexa and Google Assistant are household names, but they still don't do well in multilingual settings. This article first provides an overview of how voice assistants work, and then dives into the top 5 challenges for voice assistants when it comes to providing a superior multilingual user experience. It also provides strategies for mitigation of these challenges.

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

What’s new with Databricks SQL?

databricks

At this year's Data+AI Summit, Databricks SQL continued to push the boundaries of what a data warehouse can be, leveraging AI across the.

SQL 93
article thumbnail

The LLM Factory: Driven by Snowflake and NVIDIA 

Snowflake

Snowflake recently announced a collaboration with NVIDIA to make it easy to run NVIDIA accelerated computing workloads directly within Snowflake accounts. One interesting use case is to train, customize, and deploy large language models (LLMs) safely and securely within Snowflake. Our new Snowpark Container Services , currently in private preview, together with NVIDIA AI, makes this possible.

article thumbnail

What is an Apache Kafka Cluster? (And Why You Should Care)

Confluent

Learn what an Apache Kafka cluster is, and what makes a cluster special.

Kafka 96
article thumbnail

A Comprehensive Guide to MLOps

KDnuggets

Machine Learning Operations (MLOps) is a relatively new discipline that provides the structure and support necessary for machine learning (ML) models to thrive in production environments.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Multiple Stateful Operators in Structured Streaming

databricks

In the world of data engineering, there are operations that have been used since the birth of ETL. You filter. You join. You.

article thumbnail

How to Build a Fully Automated Data Drift Detection Pipeline

Towards Data Science

An Automate Guide to Detect and Handle Data Drift Continue reading on Towards Data Science »

article thumbnail

Mental Models and the User Experience by David Rees

Scott Logic

Mental Models and the User Experience I look at a piano, I see a bunch of keys, three pedals, and a box of wood. But Beethoven, Mozart, they saw it, they could just play. I couldn’t paint you a picture, I probably can’t hit the ball out of Fenway, and I can’t play the piano. - Good Will Hunting, 1997 Like Will (Matt Damon in Good Will Hunting), I only see a piano as a box with black and white buttons and a guitar as a piece of wood with strings; I know they both make music, something I certainly

article thumbnail

Unveiling StableCode: A New Horizon in AI-Assisted Coding

KDnuggets

This article explores StableCode, an innovative AI product by Stability AI, designed to enhance coding efficiency and accessibility. It delves into its unique features, underlying technology, and potential impact on the developer community.

Coding 90
article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.

article thumbnail

A New Partnership with Redox and How We Unlock Healthcare Data to Drive Advanced Analytics

databricks

Healthcare is sitting on mountains of data Pop quiz: Which industry accounts for about 30% of newly created data around the world and.

article thumbnail

Aurora REST API Integration: 2 Easy Methods to Load Data

Hevo

There are various sources for your business to acquire data and use them for productive decision-making. One among those insightful sources is REST API. To centralize data and obtain in-depth data analytics benefits, you can migrate data to Amazon Aurora from REST APIs.

article thumbnail

How Let’s Encrypt Powers Confluent Cloud to Automate Its Certificate Operations

Confluent

Learn why Confluent Cloud has chosen Let’s Encrypt as its Certificate Authority and how it leverages its automation features to spend less time managing certificates and more time building private networking features.

article thumbnail

5 Python Packages For Geospatial Data Analysis

KDnuggets

This article discusses the importance of geospatial analysis and introduces five essential Python packages for effectively handling and visualizing valuable insights from geospatial data.

Python 89
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Beyond Data-Driven: How Today’s Leading Retailers Are Leveraging Insights to Sell Better

Snowflake

Supply chain disruption continues to affect retailers, consumer packaged goods companies (CPGs), and customers. Constraints on the ability to produce goods have limited the availability of in-demand products, leading to inflation. Not only are manufacturers not making enough products in line with demand in industries such as automotive and electronics, at the same time, those products have become much more expensive.

Retail 52
article thumbnail

Data Validation Testing: Techniques, Examples, & Tools

Monte Carlo

The Definitive Guide to Data Validation Testing Data validation testing ensures your data maintains its quality and integrity as it is transformed and moved from its source to its target destination. By applying rules and checks, data validation testing verifies the data meets predefined standards and business requirements to help prevent data quality issues and data downtime.

article thumbnail

How to Plot the Heatmap Charts in Angular?

Workfall

Reading Time: 9 minutes A heatmap chart is a visual representation of data presented in a matrix format. It uses different colors to represent the magnitude of values, making it easy to identify patterns and trends within complex datasets. Warm colors depict higher values, while cooler colors indicate lower ones. This type of chart finds application in diverse fields such as data analysis, biology, finance, and web analytics, offering an efficient means to detect significant data points and corr

article thumbnail

KDnuggets News, August 9: Forget ChatGPT, This New AI Assistant Is Leagues Ahead • 7 Steps to Mastering Data Cleaning and Preprocessing Techniques

KDnuggets

Forget ChatGPT, This New AI Assistant Is Leagues Ahead and Will Change the Way You Work Forever • 7 Steps to Mastering Data Cleaning and Preprocessing Techniques • Fundamentals Of Statistics For Data Scientists and Analysts • I Created An AI App In 3 Days • Using SHAP Values for Model Interpretability in Machine Learning

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.