Sat. Dec 09, 2023 - Fri. Dec 15, 2023

Uplevel your dbt workflow with these tools and techniques

Start Data Engineering

1. Introduction 2. Setup 3. Ways to uplevel your dbt workflow 3.1. Reproducible environment 3.1.1. A virtual environment with Poetry 3.1.2. Use Docker to run your warehouse locally 3.2. Reduce feedback loop time when developing locally 3.2.1. Run only required dbt objects with selectors 3.2.2. Use prod datasets to build dev models with defer 3.2.3. Parallelize model building by increasing thread count 3.

Datasets 130

Data+AI Summit 2023, retrospective part 2

Waitingforcode

One week later than initially announced, but here it is: the second part of the Data+AI Summit 2023 retrospective. I don't know how, but I managed to include some streaming-related talks here too!

Data 130

Enhancing LLM Reasoning: Unveiling Chain of Code Prompting

KDnuggets

Chain of Code is an approach to interacting with language models, enhancing reasoning abilities through a blend of writing, executing, and simulating code execution, extending the capabilities of language models in logic, arithmetic, and linguistic tasks, especially those requiring a combination of these.

Coding 129

Making Flink Serverless, With Queries for Less Than a Penny

Confluent

Dive into the serverless architecture of Confluent Cloud for Apache Flink and explore its benefits like reduced infrastructure costs, increased reliability, & seamless adoption.

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

Unapologetically Technical Episode 7 – Stephane Derosiaux

Jesse Anderson

What better way to start the Christmas season than to drop a new episode of Unapologetically Technical! In this episode, I interview Stephane Derosiaux from Conduktor. We talk about his time evolving architectures and creating real-time systems at Auchan (grocery) and Adeo/Leroy Merlin (home improvement). We discuss the issues of British food and how to find good food in London.

Food 100

Build GenAI Apps Faster with New Foundation Model Capabilities

databricks

Following the announcements we made last week about Retrieval Augmented Generation (RAG), we're excited to announce major updates to Model Serving. Databricks Model.

Building 114

More Trending

Real-Time Field Service Optimization

Confluent

Telcos use Confluent with event-driven microservices to enable real-time communications with 3rd-party field service providers, fulfilling customer service requests more efficiently.

108

Our First Netflix Data Engineering Summit

Netflix Tech

Holden Karau, Elizabeth Stone, Pedro Duarte, Chris Stephens, Pallavi Phadnis, Lee Woodridge, Mark Cho, Guil Pires, Sujay Jain, Tristan Reid, Senthilnathan Athinarayanan, Bharath Mummadisetty, Abhinaya Shetty, Judit Lantos, Amanuel Kahsay, Dao Mi, Mick Dreeling, Chris Colburn, and Agata Gryzbek. Introduction: Earlier this summer Netflix held our first-ever Data Engineering Forum.

Lakehouse Monitoring: A Unified Solution for Quality of Data and AI

databricks

Introduction: Databricks Lakehouse Monitoring allows you to monitor all your data pipelines – from data to features to ML models – without additional too.

5 Rare Data Science Skills That Can Help You Get Employed

KDnuggets

This article is about the less common data science skills that can help you get hired. While these skills are listed less often than technical ones in job requirements, they are certainly worth developing.

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enhance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

Tips for labeling images for object detection models

ArcGIS

In Part 1 of this two-part blog series, we will share tips for labeling objects on images for object detection deep learning models.

How Much Data Do We Need? Balancing Machine Learning with Security Considerations

Towards Data Science

For a data scientist, there’s no such thing as too much data. But when we take a broader look at the organizational context, we have to balance our goals with other considerations. Data Science vs Security/IT: A Battle for the Ages. Acquiring and keeping data is the focus of a huge amount of our mental energy as data scientists.

Even Santa Claus has AI fever

databricks

As CEO of the North Pole, Santa Claus oversees one of the world’s most complicated supply chain, manufacturing and logistics operations. Every year, S.

5 Tools to Help Build Your LLM Apps

KDnuggets

Whether you're a seasoned ML engineer or a new LLM developer, these tools will help you get more productive and accelerate the development and deployment of your AI projects.

Building 120

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

Big improvements for field management in Geoprocessing in ArcGIS Pro 3.2

ArcGIS

In ArcGIS Pro 3.2, the field map parameter has been redesigned for improved usability and new capabilities.

Cloudera Customer Story

Cloudera

Legal & General Investment Management (LGIM) is one of the largest global asset managers, managing £1.2 trillion on behalf of savers, retirees, and institutions worldwide. LGIM prides itself on being a responsible investor and is at the forefront of global index fund management and pension investment. Its strategies cover a broad array of asset classes and styles, including equities, bonds, property and alternatives, as well as multi-asset funds.

Offline LLM Evaluation: Step-by-Step GenAI Application Assessment on Databricks

databricks

Background: In an era where Retrieval-Augmented Generation (RAG) is revolutionizing the way we interact with AI-driven applications, ensuring the efficiency and effectiveness of.

Back to Basics Bonus Week: Deploying to the Cloud

KDnuggets

Welcome back to the KDnuggets’ "Back to Basics" series. This is the BONUS week and we will dive into learning about deploying to the cloud.

Cloud 118

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

ArcGIS AI Models – Year in Review

ArcGIS

Learn about our recently released pretrained deep learning models available in the ArcGIS Living Atlas of the World.

Predictions: The Cybersecurity Challenges of AI

Snowflake

Our recently released predictions report includes a number of important considerations about the likely trajectory of cybercrime in the coming years, and the strategies and tactics that will evolve in response. Every year, the story is “Attackers are getting more sophisticated, and defenders have to keep up.” As we enter a new era of advanced AI technology, we identify some surprising wrinkles to that perennial trend.

#Volunteer Spotlight: Remus Lim

Cloudera

During Week of Giving Clouderans across the globe took time out of their busy schedules to give back and support causes meaningful to them. For many colleagues, however, giving and volunteering during Week of Giving is just one of the many ways they support the causes meaningful to them. We had the privilege of sitting down with Remus Lim, Regional VP of Sales in APAC who not only volunteered alongside his Singapore-based colleagues during Week of Giving but is dedicating an upcoming trip to phi

IT 85

AI in Intimate Roles: Girlfriends and Therapists

KDnuggets

This article is a brief overview of the field of Emotion AI, and the potential applications of its technology in intimate roles.

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.

Layout sandwich

ArcGIS

How to make a layout sandwich with two synchronized map views, some masking, and some mischief.

114

Harnessing the Data Cloud to Empower Our Own Marketing Team: Building a Digital Ads Ecosystem on Snowflake

Snowflake

You need metrics to do your job well as a marketer but getting clear, meaningful metrics is a huge challenge. While digital advertisers and paid media professionals are on the hook to build ample sales pipeline and maximize return on ad spend (ROAS), they’re also expected to deliver personalized advertising content while navigating evolving privacy requirements and adhering to consumer expectations—all while extracting insights from siloed ad platforms.

Managing AI Security Risks: Introducing a new workshop for CISOs

databricks

Adopting AI is existentially vital for most businesses. Machine Learning (ML) and generative AI (GenAI) are revolutionizing the future of work. Organizations understand.

5 Free University Courses to Learn Python

KDnuggets

Looking for the best resources to learn Python programming? Check out these free university courses.

Python 133

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Monolith to Event-Driven Microservices: 5 Tips for Securing Business Buy-In

Confluent

Discover how McAfee saved significant hosting costs alone by shifting to microservices! McAfee’s Mahesh Tyagarajan spills the beans on getting business buy-in and what it means for customers.

IT 78

Privacy Preserving Single Post Analytics

LinkedIn Engineering

Authors: Ryan Rogers, Subbu Subramaniam, Lin Xu. Contributors: Mark Cesar, Praveen Chaganlal, Jefferson Lai, Jennifer Li, Stephanie Chung, Margaret Taormina, Gavin Uathavikul, Laura Chen, Rahul Tandra, Siyao Sun, Vinyas Maddi, Shuai Zhang. Content creators post on LinkedIn with the goal of reaching and engaging specific audiences. Post analytics helps creators measure their post performance overall and with specific viewer demographics, so they can better understand what resonates an

Undersampling Techniques Using Python

KDnuggets

The article discusses the undersampling data preprocessing techniques to address data imbalance challenges.

Python 128

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.