Sat. May 11, 2024 - Fri. May 17, 2024

5 Free University Courses to Learn Machine Learning

KDnuggets

Want to learn machine learning from the best resources? Check out these free machine learning courses from the top universities in the world.

Why You Should Replace Pandas with Polars

Confessions of a Data Guy

I’m still amazed to this day how many folks hold onto stuff they love, they just can’t let it go. I get it, sorta, I’m the same way. There are reasons why people do the things they do, even if they are hard for us to understand. It blows my mind when I see something […]

IT 147

Release Management For Data Platform Services And Logic

Data Engineering Podcast

Summary: Building a data platform is a substantial engineering endeavor. Once it is running, the next challenge is figuring out how to address release management for all of the different component parts. The services and systems need to be kept up to date, but so does the code that controls their behavior. In this episode your host Tobias Macey reflects on his current challenges in this area and some of the factors that contribute to the complexity of the problem.

Mind the map: a new design for the London Underground map

ArcGIS

A modern take on the London tube map with updated accessible colours, a re-classification of lines by type, and line symbols scaled by frequency

Designing 135

Going Beyond Chatbots: Connecting AI to Your Tools, Systems, & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

If AI agents are going to deliver ROI, they need to move beyond chat and actually do things. But, turning a model into a reliable, secure workflow agent isn’t as simple as plugging in an API. In this new webinar, Alex Salazar and Nate Barbettini will break down the emerging AI architecture that makes action possible, and how it differs from traditional integration approaches.

Mastering Python: 7 Strategies for Writing Clear, Organized, and Efficient Code

KDnuggets

Optimize Your Python Workflow: Proven Techniques for Crafting Production-Ready Code

Python 146

Data News — Week 24.20

Christophe Blefari

Hello you. The sun is out, the days are getting longer and Data News is still here. Next week marks 3 years of this newsletter/blog (yay 🎉). It'll be a time for looking back, reflecting and celebrating, but next week. This week, we reached 5000 members. Yes, 5000 of you read my content periodically. Just thank you ❤️ In recent days I've been working on a new side project.

Food 130

More Trending

How to Crush the Spider Benchmark with Ease on Databricks

databricks

How we reached 79.9% on the Spider dev dataset with Llama3 8B through savvy prompting and fine-tuning on Databricks.

Datasets 130

10 Free Must-Take Data Science Courses to Get Started

KDnuggets

Want to start your data science journey? Then, let these courses guide you on that trip.

Snowflake Invests in Metaplane for Deep, End-to-End Observability in the Data Cloud

Snowflake

According to Infosys, 35% of AI projects will either fail or experience delays because of poor data quality. There’s a huge gap between the data quality most companies have by default and the data quality needed for successful AI. And that gap is directly affecting the performance and reliability of AI systems everywhere. As organizations expand their use of Snowflake to build and deploy new AI-powered data applications, comprehensive data observability is critical to success.

Cloud 126

What’s New for Spatial Analyst in ArcGIS Pro 3.3

ArcGIS

Spatial Analyst in ArcGIS Pro 3.3 offers new capabilities for suitability modeling, as well as density, distance, solar, and zonal analysis.

109

Smart Tech + Human Expertise = How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

Unlocking the Potential of Private Data Sharing using Databricks Private Exchanges

databricks

We are thrilled to announce an exciting new feature on the Databricks Marketplace that simplifies the process of setting up private exchanges for.

Data 119

The Easiest Way of Running Llama 3 Locally

KDnuggets

Download, install, and type one command in the terminal to start using Llama 3 on your laptop.

140

5 Ways Advertising, Media and Entertainment Companies are Using Gen AI

Snowflake

The emergence of generative AI (gen AI) heralds a new, groundbreaking era for advertising, media and entertainment. According to a recent Snowflake report, Advertising, Media and Entertainment Data + AI Predictions 2024, gen AI is going to transform the industry — from content creation to customer experience. The companies that will come out ahead during this time are those that most successfully and quickly supercharge their data strategy.

Designing and testing for accessibility in GIS and mapping

ArcGIS

Review best practices for designing and testing accessible maps and apps throughout the ArcGIS system during the development process.

The Ultimate Guide to Apache Airflow DAGs

With Airflow being the open-source standard for workflow orchestration, knowing how to write Airflow DAGs has become an essential skill for every data engineer. This eBook provides a comprehensive overview of DAG writing features with plenty of example code. You’ll learn how to: understand the building blocks of DAGs, combine them in complex pipelines, and schedule your DAG to run exactly when you want it to; write DAGs that adapt to your data at runtime and set up alerts and notifications; scale you

Building DBRX-class Custom LLMs with Mosaic AI Training

databricks

We recently introduced DBRX: an open, state-of-the-art, general-purpose LLM. DBRX was trained, fine-tuned, and evaluated using Mosaic AI Training, scaling training to.

Building 119

The Best Strategies for Fine-Tuning Large Language Models

KDnuggets

Learn how to master the art of fine-tuning LLMs for specialized tasks.

139

Preserving Data Privacy in Life Sciences: How Snowflake Data Clean Rooms Make It Happen

Snowflake

The pharmaceutical industry generates a great deal of identifiable data (such as clinical trial data, patient engagement data) that has guardrails around “use and access.” Data captured for the intended purpose of use described in a protocol is called “primary use.” However, once anonymized, this data can be used for other inferences in what we can collectively define as secondary analyses.

Multiresolution Object Detection with Text SAM

ArcGIS

This blog post will walk you through the process of running multiresolution deep learning over a range of cell sizes.

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

Databricks Is a Glassdoor Best-Led Company in 2024

databricks

Databricks is pleased to announce we are ranked #2 in the inaugural annual Glassdoor Award List of Best-Led Companies in 2024! At.

111

All About the AI Regulatory Landscape

KDnuggets

This post explores the evolving AI regulatory landscape and essential aspects of the EU AI Act, crucial for understanding its impact.

IT 137

Snowflake Advanced Certifications: Level Up to SnowPro Advanced and Show Off Your Snowflake Expertise

Snowflake

Did you know that Snowflake has five advanced role-based certifications to help you stand out in the data community as a Snowflake expert? The Snowflake Advanced Certification Series (Architect, Data Engineer, Data Scientist, Administrator, Data Analyst) offers role-based certifications designed for Snowflake practitioners with one to two years of experience (depending on the program).

Text SAM: Extracting GIS Features Using Text Prompts

ArcGIS

Prompt Segment Anything Model (SAM) with free-form text to extract features in your imagery.

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

Research Survey: Productivity benefits from Databricks Assistant

databricks

In the fast-paced landscape of data science and engineering, integrating Artificial Intelligence (AI) has become integral for enhancing productivity. We’ve seen many tools.

Pursue a Master’s in Data Science with the 3rd Best Online Program 2024

KDnuggets

100% online master’s program with flexible schedules designed for working professionals. Enrolling now for October 28th.

How Snowflake and Merit Helped Provide Over 120,000 Students with Access to Education Funding 

Snowflake

Snowflake joined forces with Merit to provide an identity verification platform and a set of program delivery services that help run large-scale government programs in areas such as licensing regulations, workforce development, emergency management, and educational grants and scholarships. By augmenting the power of the Snowflake Data Cloud with Merit’s platform, Snowflake is enabling government and education entities to share data and empower secure, modern and robust collaboration.

Education 105

Tools for Building Community Climate Resilience

ArcGIS

Discover the latest climate resilience planning tools with 18 new ready-to-use layers. Explore the Climate Resilience Index layers in ArcGIS Living Atlas.

Building 101

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Unapologetically Technical Episode 11 – Hubert Dulay

Jesse Anderson

In this episode of Unapologetically Technical, I interview Hubert Dulay, the author of Streaming Data Mesh and Developer Advocate at StarTree. We talked about his early experience with web backends like CORBA and SOAP and how those prepared him for data work. He shares his advice for those with web development skills to transition into data and what it’s like for a person leaving a company after a long tenure there.

IT 100

LSTMs Rise Again: Extended-LSTM Models Challenge the Transformer Superiority

KDnuggets

Can LSTMs become the de facto standard for Language Modeling tasks once again?

129

What Separates Hybrid Cloud and ‘True’ Hybrid Cloud?

Cloudera

Hybrid cloud plays a central role in many of today’s emerging innovations—most notably artificial intelligence (AI) and other emerging technologies that create new business value and improve operational efficiencies. But getting there requires data, and a lot of it. More than that, though, harnessing the potential of these technologies requires quality data—without it, the output from an AI implementation can end up inefficient or wholly inaccurate.

Cloud 99

Scrum Master Resume: Tips, Samples, Skills Required

Knowledge Hut

A Scrum Master is responsible for facilitating the process within a team and ensuring that all team members adhere to the Scrum methodology. The Scrum Master is also responsible for removing any impediments to the team's progress and ensuring the team can deliver its sprint commitments. To apply for the role, you need an outstanding Scrum Master resume.

Apache Airflow® 101 Essential Tips for Beginners

Apache Airflow® is the open-source standard to manage workflows as code. It is a versatile tool used in companies across the world from agile startups to tech giants to flagship enterprises across all industries. Due to its widespread adoption, Airflow knowledge is paramount to success in the field of data engineering.