Sat.Apr 06, 2024 - Fri.Apr 12, 2024

article thumbnail

Weekend maintenance kicks an Italian bank offline for days

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of four topics from today’s subscriber-only The Pulse issue. To get full issues twice a week, subscribe here.

Banking 210
article thumbnail

Databricks Doubles Cost. Reddit Explodes. I’m in Trouble!

Confessions of a Data Guy

I recently did a post on Linkedin and Reddit about Databricks removing Standard Tier and forcing folks into Unity Catalog. The post got big traction and blew up, more than I thought. Enough for the Databricks folk to hunt me down at work and tell me I’m naughty. I will be writing a more in-depth […] The post Databricks Doubles Cost. Reddit Explodes.

Data 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Data News — Week 24.15

Christophe Blefari

The fest we deserve ( credits ) I hope this Data News finds you well. In today's edition we have a large selection of links, I think you will enjoy it. But first I want to welcome all the new members joining this week after my new episode on DataGen with Robin Conquet. This is an episode in French and we talked mainly about the eventual end of the modern data stack.

BI 130
article thumbnail

Data enrichment strategies in Apache Flink

Waitingforcode

Data enrichment is a crucial step in making data more usable by the business users. Doing that with a batch is relatively easy due to the static nature of the dataset. When it comes to streaming, the task is more challenging.

Datasets 130
article thumbnail

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving everyday. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

article thumbnail

10 GitHub Repositories to Master Python

KDnuggets

Learn Python through tutorials, blogs, books, project work, and exercises. Access all of it on GitHub for free and join a supportive open-source community.

Python 140
article thumbnail

How to JOIN datasets in Polars … compared to Pandas.

Confessions of a Data Guy

It’s been a while since I wrote about Polars on this blog, I’ve been remiss. Some time ago I wrote a very simple comparison of switching from Pandas to Polars, I didn’t put much real effort into it, yet it was popular, so this is my attempt at trying to expand on that topic a […] The post How to JOIN datasets in Polars … compared to Pandas. appeared first on Confessions of a Data Guy.

Datasets 113

More Trending

article thumbnail

Snowflake Startup Challenge 2024: Announcing the 10 Semi-Finalists

Snowflake

In 2020, Snowflake announced a new global competition to recognize the work of early-stage startups building their apps — and their businesses — on Snowflake, offering up to $250,000 in investment as the top prize. Four years later, the Snowflake Startup Challenge has grown into a premiere showcase for emerging startups, garnering interest from companies in over 100 countries and offering a prize package featuring a portion of up to $1 million in potential investment opportunities and exclusive

article thumbnail

Unapologetically Technical Episode 10 – Michael Drogalis

Jesse Anderson

And just like that, we’re down to the 10th episode of Unapologetically Technical! In this episode, I interview Michael Drogalis, the founder and CEO of ShadowTraffic where we talked about the early Hadoop era and how he saw the need for Kafka in the industry. He shared his journey of starting a new company in his 20s and being acquired by Confluent.

Hadoop 100
article thumbnail

Writing Apache Spark with Rust! Spark Connect Introduced.

Confessions of a Data Guy

I never thought I would live to see the day, it’s crazy. I’m not sure who’s idea it was to make it possible to write Apache Spark with Rust, Golang, or Python … but they are all genius. As of Apache Spark 3.4 it is now possible to use Spark Connect … a thin API […] The post Writing Apache Spark with Rust! Spark Connect Introduced. appeared first on Confessions of a Data Guy.

Python 100
article thumbnail

7 Steps to Mastering Data Engineering

KDnuggets

The only data engineering roadmap you need for an introduction to concepts, tools, and techniques to collect, store, transform, analyze, and model data.

article thumbnail

Leading the Development of Profitable and Sustainable Products

Speaker: Jason Tanner

While growth of software-enabled solutions generates momentum, growth alone is not enough to ensure sustainability. The probability of success dramatically improves with early planning for profitability. A sustainable business model contains a system of interrelated choices made not once but over time. Join this webinar for an iterative approach to ensuring solution, economic and relationship sustainability.

article thumbnail

Bringing MegaBlocks to Databricks

databricks

At Databricks, we’re committed to building the most efficient and performant training tools for large-scale AI models. With the recent release of DBRX.

Building 121
article thumbnail

High resolution data updates to Living Atlas World Elevation Layers (April 2024)

ArcGIS

In April 2024, elevation layers have been updated with high-res datasets of Wales, New Zealand & German states of Bavaria, Saxony and Brandenburg

Datasets 109
article thumbnail

Project Management Organizational Structure: Types & Examples

Knowledge Hut

Project management plays a significant role in the success of every organization. It ensures that the project is on track, aids in efficient management of resources, and also keeps the stakeholders know what is project and what's happening in it. In this blog, we will look at three different project organizational structures: functional, matrix, and process.

Project 98
article thumbnail

7 Things Students Are Missing in a Data Science Resume

KDnuggets

Adding these 7 key elements to your resume will improve your odds of getting an interview call. Remember, after graduating from the university, your full-time job is to find a job, so put some effort into preparing your resume.

article thumbnail

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

article thumbnail

DSPy on Databricks

databricks

Large language models (LLMs) have generated interest in effective human-AI interaction through optimizing prompting techniques. “Prompt engineering” is a growing methodology for tailoring.

article thumbnail

May I Borrow That Idea? – Pasting Feature Layer Properties

ArcGIS

Starting with ArcGIS Pro 3.2, you can copy layer properties from one feature layer and paste them to another.

131
131
article thumbnail

Snowflake Achieves C5 and TISAX Certifications, Expanding Compliance Scope in Germany

Snowflake

As Snowflake continues to expand our commitment to compliance, we are pleased to announce that we have successfully completed both C5 and TISAX attestations in Germany. Cloud Computing Compliance Controls Catalog (C5) C5 is an audited standard establishing baselines for cloud security. It was initially created for government agencies and organizations that work with the government to ensure security baselines are met by their cloud service providers (CSPs).

article thumbnail

The Case of Homegrown Large Language Models

KDnuggets

Recent developments in building large language models (LLMs) to boost generative AI in local languages have caught everyone’s attention. This post focuses on the needs and challenges of homegrown LLMs amid the fast-evolving technology landscape.

article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Learn About Cloudera’s Partner Network

Cloudera

Businesses around the world rely on an extensive network of partnerships to deliver quality customer experiences—and it’s no different here at Cloudera. Cloudera is building a robust partner ecosystem to meet the unique needs of its customers, working to provide exceptional and fulfilling experiences that help make Cloudera a leader in the multi-cloud data platform space.

Food 86
article thumbnail

Multi-Scale Contour Styling in ArcGIS Pro

ArcGIS

How to configure scale-appropriate contour lines and their labels.

135
135
article thumbnail

Databricks Wins 2024 Google Cloud Partner of the Year Award

databricks

We're excited to announce that Databricks has been honored with the 2024 Google Cloud Technology Partner of the Year award for Data -.

article thumbnail

The AI Transformation Strategy in the GenAI Era

KDnuggets

Similar to the iterative nature of AI projects, AI strategy also requires continuous adjustments to bring successful AI transformation.

Project 107
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Introducing the next-gen Meta Training and Inference Accelerator

Engineering at Meta

We are sharing details of our next generation chip in our Meta Training and Inference Accelerator (MTIA) family. MTIA is a long-term bet to provide the most efficient architecture for Meta’s unique workloads.

article thumbnail

Combine, Visualize, and Analyze Responses from Participatory Mapping

ArcGIS

Answering regional geographers' favorite question: Where is the Midwest to you?

117
117
article thumbnail

How Snowflake Enhanced GTM Efficiency with Data Sharing and Outreach Customer Engagement Data

Snowflake

Like many companies, Snowflake uses Outreach as a sales execution platform to help our sales teams improve prospecting efforts and efficiently follow up on leads. For Snowflake sales reps, Outreach is the central repository for almost all inbound and outbound communications with current and potential customers. For the sales development representative (SDR) leadership team, it’s an immensely valuable source of insights for sales enablement and automation.

BI 76
article thumbnail

5 Free SQL Courses for Data Science Beginners

KDnuggets

Are you looking to make a career in data science? Start by learning SQL with these free courses.

SQL 117
article thumbnail

How To Get Promoted In Product Management

Speaker: John Mansour

If you're looking to advance your career in product management, there are more options than just climbing the management ladder. Join our upcoming webinar to learn about highly rewarding career paths that don't involve management responsibilities. We'll cover both career tracks and provide tips on how to position yourself for success in the one that's right for you.

article thumbnail

Creating Brand-Aligned Images Using Generative AI

databricks

Image-generating technologies offer significant benefits for retail and consumer goods companies. By using generative models that produce both stylized and photo-realistic images from.

Retail 73
article thumbnail

Navigating the Cloud Modernization Journey: Insights from Precisely’s Partnership with AWS

Precisely

In an era where cloud technology is not just an option but a necessity for competitive business operations, the collaboration between Precisely and Amazon Web Services (AWS) has set a new benchmark for mainframe and IBM i modernization. As a Technical Architect at Precisely, I’ve had the unique opportunity to lead the AWS Mainframe Modernization Data Replication for IBM i initiative, a project that not only challenged our technical capabilities but also enriched our understanding of cloud

AWS 72
article thumbnail

Schema Registry Clients in Action

Confluent

Learn what happens behind the scenes in Apache Kafka producer and consumer clients when communicating with Schema Registry and serializing/deserializing messages.

Kafka 76
article thumbnail

5 Free Resources to Master Your Data Science Job Search

KDnuggets

Learn how to use various data science platforms to secure your first job.

article thumbnail

How Embedded Analytics Gets You to Market Faster with a SAAS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.