Sat.Apr 06, 2024 - Fri.Apr 12, 2024

article thumbnail

Weekend maintenance kicks an Italian bank offline for days

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of four topics from today’s subscriber-only The Pulse issue. To get full issues twice a week, subscribe here.

Banking 199
article thumbnail

Databricks Doubles Cost. Reddit Explodes. I’m in Trouble!

Confessions of a Data Guy

I recently did a post on Linkedin and Reddit about Databricks removing Standard Tier and forcing folks into Unity Catalog. The post got big traction and blew up, more than I thought. Enough for the Databricks folk to hunt me down at work and tell me I’m naughty. I will be writing a more in-depth […] The post Databricks Doubles Cost. Reddit Explodes.

Data 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 24.15

Christophe Blefari

The fest we deserve ( credits ) I hope this Data News finds you well. In today's edition we have a large selection of links, I think you will enjoy it. But first I want to welcome all the new members joining this week after my new episode on DataGen with Robin Conquet. This is an episode in French and we talked mainly about the eventual end of the modern data stack.

BI 130
article thumbnail

Data enrichment strategies in Apache Flink

Waitingforcode

Data enrichment is a crucial step in making data more usable by the business users. Doing that with a batch is relatively easy due to the static nature of the dataset. When it comes to streaming, the task is more challenging.

Datasets 130
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

10 GitHub Repositories to Master Python

KDnuggets

Learn Python through tutorials, blogs, books, project work, and exercises. Access all of it on GitHub for free and join a supportive open-source community.

Python 135
article thumbnail

How to JOIN datasets in Polars … compared to Pandas.

Confessions of a Data Guy

It’s been a while since I wrote about Polars on this blog, I’ve been remiss. Some time ago I wrote a very simple comparison of switching from Pandas to Polars, I didn’t put much real effort into it, yet it was popular, so this is my attempt at trying to expand on that topic a […] The post How to JOIN datasets in Polars … compared to Pandas. appeared first on Confessions of a Data Guy.

Datasets 113

More Trending

article thumbnail

Unapologetically Technical Episode 10 – Michael Drogalis

Jesse Anderson

And just like that, we’re down to the 10th episode of Unapologetically Technical! In this episode, I interview Michael Drogalis, the founder and CEO of ShadowTraffic where we talked about the early Hadoop era and how he saw the need for Kafka in the industry. He shared his journey of starting a new company in his 20s and being acquired by Confluent.

Hadoop 100
article thumbnail

7 Steps to Mastering Data Engineering

KDnuggets

The only data engineering roadmap you need for an introduction to concepts, tools, and techniques to collect, store, transform, analyze, and model data.

article thumbnail

Writing Apache Spark with Rust! Spark Connect Introduced.

Confessions of a Data Guy

I never thought I would live to see the day, it’s crazy. I’m not sure who’s idea it was to make it possible to write Apache Spark with Rust, Golang, or Python … but they are all genius. As of Apache Spark 3.4 it is now possible to use Spark Connect … a thin API […] The post Writing Apache Spark with Rust! Spark Connect Introduced. appeared first on Confessions of a Data Guy.

Python 100
article thumbnail

Snowflake Startup Challenge 2024: Announcing the 10 Semi-Finalists

Snowflake

In 2020, Snowflake announced a new global competition to recognize the work of early-stage startups building their apps — and their businesses — on Snowflake, offering up to $250,000 in investment as the top prize. Four years later, the Snowflake Startup Challenge has grown into a premiere showcase for emerging startups, garnering interest from companies in over 100 countries and offering a prize package featuring a portion of up to $1 million in potential investment opportunities and exclusive

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Bringing MegaBlocks to Databricks

databricks

At Databricks, we’re committed to building the most efficient and performant training tools for large-scale AI models. With the recent release of DBRX.

Building 121
article thumbnail

Project Management Organizational Structure: Types & Examples

Knowledge Hut

Project management plays a significant role in the success of every organization. It ensures that the project is on track, aids in efficient management of resources, and also keeps the stakeholders know what is project and what's happening in it. In this blog, we will look at three different project organizational structures: functional, matrix, and process.

Project 98
article thumbnail

High resolution data updates to Living Atlas World Elevation Layers (April 2024)

ArcGIS

In April 2024, elevation layers have been updated with high-res datasets of Wales, New Zealand & German states of Bavaria, Saxony and Brandenburg

Datasets 105
article thumbnail

7 Things Students Are Missing in a Data Science Resume

KDnuggets

Adding these 7 key elements to your resume will improve your odds of getting an interview call. Remember, after graduating from the university, your full-time job is to find a job, so put some effort into preparing your resume.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

DSPy on Databricks

databricks

Large language models (LLMs) have generated interest in effective human-AI interaction through optimizing prompting techniques. “Prompt engineering” is a growing methodology for tailoring.

article thumbnail

Learn About Cloudera’s Partner Network

Cloudera

Businesses around the world rely on an extensive network of partnerships to deliver quality customer experiences—and it’s no different here at Cloudera. Cloudera is building a robust partner ecosystem to meet the unique needs of its customers, working to provide exceptional and fulfilling experiences that help make Cloudera a leader in the multi-cloud data platform space.

Food 86
article thumbnail

May I Borrow That Idea? – Pasting Feature Layer Properties

ArcGIS

Starting with ArcGIS Pro 3.2, you can copy layer properties from one feature layer and paste them to another.

125
125
article thumbnail

The Case of Homegrown Large Language Models

KDnuggets

Recent developments in building large language models (LLMs) to boost generative AI in local languages have caught everyone’s attention. This post focuses on the needs and challenges of homegrown LLMs amid the fast-evolving technology landscape.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Snowflake Achieves C5 and TISAX Certifications, Expanding Compliance Scope in Germany

Snowflake

As Snowflake continues to expand our commitment to compliance, we are pleased to announce that we have successfully completed both C5 and TISAX attestations in Germany. Cloud Computing Compliance Controls Catalog (C5) C5 is an audited standard establishing baselines for cloud security. It was initially created for government agencies and organizations that work with the government to ensure security baselines are met by their cloud service providers (CSPs).

article thumbnail

Databricks Wins 2024 Google Cloud Partner of the Year Award

databricks

We're excited to announce that Databricks has been honored with the 2024 Google Cloud Technology Partner of the Year award for Data -.

article thumbnail

Multi-Scale Contour Styling in ArcGIS Pro

ArcGIS

How to configure scale-appropriate contour lines and their labels.

135
135
article thumbnail

The AI Transformation Strategy in the GenAI Era

KDnuggets

Similar to the iterative nature of AI projects, AI strategy also requires continuous adjustments to bring successful AI transformation.

Project 102
article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Introducing the next-gen Meta Training and Inference Accelerator

Engineering at Meta

We are sharing details of our next generation chip in our Meta Training and Inference Accelerator (MTIA) family. MTIA is a long-term bet to provide the most efficient architecture for Meta’s unique workloads.

article thumbnail

Creating Brand-Aligned Images Using Generative AI

databricks

Image-generating technologies offer significant benefits for retail and consumer goods companies. By using generative models that produce both stylized and photo-realistic images from.

Retail 74
article thumbnail

Navigating the Cloud Modernization Journey: Insights from Precisely’s Partnership with AWS

Precisely

In an era where cloud technology is not just an option but a necessity for competitive business operations, the collaboration between Precisely and Amazon Web Services (AWS) has set a new benchmark for mainframe and IBM i modernization. As a Technical Architect at Precisely, I’ve had the unique opportunity to lead the AWS Mainframe Modernization Data Replication for IBM i initiative, a project that not only challenged our technical capabilities but also enriched our understanding of cloud

AWS 73
article thumbnail

5 Free SQL Courses for Data Science Beginners

KDnuggets

Are you looking to make a career in data science? Start by learning SQL with these free courses.

SQL 112
article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Schema Registry Clients in Action

Confluent

Learn what happens behind the scenes in Apache Kafka producer and consumer clients when communicating with Schema Registry and serializing/deserializing messages.

Kafka 76
article thumbnail

How Snowflake Enhanced GTM Efficiency with Data Sharing and Outreach Customer Engagement Data

Snowflake

Like many companies, Snowflake uses Outreach as a sales execution platform to help our sales teams improve prospecting efforts and efficiently follow up on leads. For Snowflake sales reps, Outreach is the central repository for almost all inbound and outbound communications with current and potential customers. For the sales development representative (SDR) leadership team, it’s an immensely valuable source of insights for sales enablement and automation.

BI 70
article thumbnail

Data Engineering Weekly #166

Data Engineering Weekly

dbt: 2024 State of Analytics Engineering The 2024 dbt’s state of analytical engineering report is out. Poor data quality and unlcear data ownership remains the top challenges for the data teams. Data Mesh continuously gaining popularity among the enterprises. It is a stark difference from the Gartner report about data mesh. I guess only the time will tell who wins in the data mesh vs data fabric war.

article thumbnail

5 Free Resources to Master Your Data Science Job Search

KDnuggets

Learn how to use various data science platforms to secure your first job.

article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating