Sat.May 06, 2023 - Fri.May 12, 2023

article thumbnail

Datadog’s $65M/year customer mystery solved

The Pragmatic Engineer

The internet has been speculating the past few days on which crypto company spent $65M on Datadog in 2022. I confirmed it was Coinbase, and here are the details of what happened. Originally published on 11 May 2023. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of six topics in today’s subscriber-only The Scoop issue.

AWS 306
article thumbnail

OLTP Vs OLAP – What Is The Difference

Seattle Data Guy

If you’re relying on your OLTP system to provide analytics, you might be in for a surprise. While it can work initially, these systems aren’t designed to handle complex queries. Adding databases like MongoDB and CassandraDB only makes matters worse, since they’re not SQL-friendly – the language most analysts and data practitioners are used to.… Read more The post OLTP Vs OLAP – What Is The Difference appeared first on Seattle Data Guy.

MongoDB 208
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Polars – Laziness and SQL Context.

Confessions of a Data Guy

Polars is one of those tools that you just want … no … NEED a reason to use it. It’s gotten so bad, I’ve started to use it in my Rust code on the side, Polars that is. I mean you have a problem if you could use Polars Python, and you find yourself using […] The post Polars – Laziness and SQL Context. appeared first on Confessions of a Data Guy.

SQL 182
article thumbnail

Kinesis sequence number is not an Apache Kafka offset

Waitingforcode

I have used to say "Kinesis Data Streams is like Apache Kafka, an append-only streaming broker with partitions and offsets". Although often it's true, it's not that simple unfortunately.

Kafka 130
article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Compensation at Publicly Traded Tech Companies

The Pragmatic Engineer

Insights from 50 publicly traded tech companies, and a list of those paying the most and the least in median total compensation. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover two out of seven topics from today’s subscriber-only deep-dive on Compensation at publicly traded tech companies.

article thumbnail

Confluent Will Beat Your Cost of Running Kafka (or $100 on us)

Confluent

Running Kafka is costly, but Confluent has created a far more efficient product to lower your costs. Join the Cost Savings challenge to see for yourself.

Kafka 142

More Trending

article thumbnail

Data Scientist’s Guide to Cognitive Biases: A Free eBook

KDnuggets

Are you interested in exploring the topic of cognitive biases? Want to see how they may be affecting your data science practice? Check out this free ebook for this and more.

article thumbnail

PagerDuty alternatives

The Pragmatic Engineer

This is a response to a tweet asking: "Why is there no competition to PagerDuty/Opsgenie? People in my team say it’s “just connecting to the Twilio API” but if it were that easy, there’d probably be a ton of competition." PagerDuty is the market-leading incident alerting tool. OpsGenie is Atlassian's incident management tool, which is widespread thanks to distribution.

Systems 216
article thumbnail

How Lakehouse powers NLP for Customer Service Analytics in Insurance

databricks

Download the Databricks Insurance NLP Solution Accelerator Introduction The current economic and social climate has redefined customer expectations and preferences. Society has been.

Insurance 107
article thumbnail

New Approaches to Visualizing Snowflake Query Statistics with Snowflake Technology Partners

Snowflake

As of December, customers got a whole new level of insight into Snowflake query performance and query execution statistics when Snowflake announced the public preview of the new get_query_operator_stats function, opening up programmatic access to Snowflake query profiles and providing customers a whole new level of insight into Snowflake query performance and query execution statistics.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Top Posts May 1-7: Machine Learning with ChatGPT Cheat Sheet

KDnuggets

Machine Learning with ChatGPT Cheat Sheet • HuggingChat Python API: Your No-Cost Alternative • AutoGPT: Everything You Need To Know • 8 Open-Source Alternative to ChatGPT and Bard • LangChain 101: Build Your Own GPT-Powered Applications

article thumbnail

Tackling the Hidden and Unhidden Costs of Kafka

Confluent

Low utilization and operational complexity dramatically increases Kafka costs, so we reinvented Kafka as a cloud-native and complete service to reduce costs for thousands of businesses at any scale.

Kafka 105
article thumbnail

Precisely Women in Technology: Meet Samantha Martino

Precisely

Technology is a vast industry that has something for everybody. Because of this, it attracts people from all backgrounds and areas of expertise. At Precisely, having diverse representation is the key to success, and as a result, it’s been highly important for the organization to support the unique perspective that employees bring to the table. The Precisely Women in Technology (PWIT) program was designed to connect women from across the organization to one another to offer support, an internal n

article thumbnail

PostgreSQL Import CSV: 3 Easy Methods

Hevo

As a business grows, the demand to efficiently handle and process the exponentially growing data also rises. A popular open-source relational database used by several organizations across the world is PostgreSQL. It is a perfect database management system that also assists developers to build applications, and administrators to protect data integrity and develop fault-tolerant environments.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

8 Free AI and LLMs Playgrounds

KDnuggets

If you’re interested in trying out AI for fun or learning more about them, then take a look at our list and explore the cutting-edge LLMs available in the wild.

111
111
article thumbnail

What Makes Confluent the World’s Most Trusted Cloud Data Streaming Platform

Confluent

Confluent manages 30,000+ Kafka clusters, produces over 3 trillion messages, and does durability checks on over 80 trillion Kafka messages per day while offering 99.99% uptime. Check out our cool stats!

Kafka 104
article thumbnail

Earned Value Management (EVM): Elements, Formulas, Benefits

Knowledge Hut

Many think that Earned value management is complicated paperwork and thus a lot of professionals stay away from it. On the other hand, successful project managers become superheroes to break this myth of earned value management (EVM). Earned Value Management has taken an important place in the world of project management and plays a vital role in the career of project management certification aspirants like PMI PMP certifications and PRINCE2 certificate.

article thumbnail

Connect Excel to PostgreSQL in 2 Easy Ways

Hevo

Microsoft Excel is a spreadsheet program included in the Microsoft Office Suite. It’s compatible with Windows, Mac OS X, Android, and iOS. It simplifies the creation of text and numeric grids, formulas calculations, graphing tools, pivot tables, and the VBA Macro programming language (Visual Basic for Applications).

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

KDnuggets News, May 10: HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data

KDnuggets

HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data • Stop Doing this on ChatGPT and Get Ahead of the 99% of its Users • ChatGPT as a Personalized Tutor for Learning Data Science Concepts • The Ultimate Open-Source Large Language Model Ecosystem

article thumbnail

#ClouderaLife Volunteer Spotlight: Alex Campos, Principal Technical Leader, Spain

Cloudera

Originally from Brazil, Alex previously lived in Chile and now lives in Spain. During his time living in Latin America in early 2016, Alex saw what he describes as a “knowledge gap” —s eeing the way skills, content and expertise are shared in an open, friendly way at conferences in the US, Alex wanted to replicate that in Latin America. To address this gap, Alex started planning meetups.

article thumbnail

Using Structured Streaming with Delta Sharing in Unity Catalog

databricks

We are excited to announce that support for using Structured Streaming with Delta Sharing is now generally available (GA) in Azure, AWS, and.

AWS 96
article thumbnail

12 Best Data Management Tools in 2023

Hevo

One of the biggest stumbling blocks of a business is the expansion of its Database. A few problems one might have to deal with while trying to expand their Database are storage problems, complicated management issues, and difficulty in the location, sharing, and checking of isolated data.

article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

article thumbnail

Chatbot Arena: The LLM Benchmark Platform

KDnuggets

Chatbot Arena is a benchmark platform for large language models, where the community can contribute new models and evaluate them.

Process 113
article thumbnail

An ML based approach to proactive advertiser churn prevention

Pinterest Engineering

Erika Sun ML Engineer | Advertiser Growth Modeling Team; Ogheneovo Dibie Engineering Manager | Advertiser Growth Modeling Team Photo by Jason Blackeye on Unsplash Summary In this blog post, we describe a Machine Learning (ML) powered proactive churn prevention solution that was prototyped with our small & medium business (SMB) advertisers. Results from our initial experiment suggest that we can detect future churn with a high degree of predictive power and consequently empower our sales par

article thumbnail

Unifying Your Data Ecosystem with Delta Lake Integration

databricks

As organizations are maturing their data infrastructure and accumulating more data than ever before in their data lakes, Open and Reliable table formats.

article thumbnail

Import Excel into MySQL: 4 Easy Methods

Hevo

Microsoft Excel has been a traditional choice as a spreadsheet application for organizations across the world. The ease of access, power formulas, and the ability to make visually stunning reports has made Microsoft Excel is widely used tool.

MySQL 83
article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

Build a ChatGPT-like Chatbot with These Courses

KDnuggets

Let's dive into some courses that will give you the knowledge you need to build an AI-based chatbot that looks good and functions well.

Building 109
article thumbnail

Visibility and Transparency

Cloudera

Out of the box Cloudera Data platform (CDP) performs superbly but over time, if data architecture, data engineering, and DevOps best practices are not maintained, you can get stuck maintaining the wild, wild west. In this six-part series, we’re focused on improving the health of your environment. Visibility and Transparency Improving environmental health is impossible if you’re flying blind.

article thumbnail

Cluster Policy Onboarding Primer

databricks

Introduction This blog is part of our Admin Essentials series, where we'll focus on topics important to those managing and maintaining Databricks environments.

article thumbnail

Snowflake Connector for Django Now Available 

Snowflake

We’re excited to announce that the Snowflake Connector for Django is now available on Snowflake Labs on GitHub. This integration provides Django apps easy access to data within the Snowflake Data Cloud without manually integrating against API endpoints. Now Python developers can easily and quickly build web applications that access Snowflake data by leveraging the Django framework.

Python 64
article thumbnail

Reimagined: Building Products with Generative AI

“Reimagined: Building Products with Generative AI” is an extensive guide for integrating generative AI into product strategy and careers featuring over 150 real-world examples, 30 case studies, and 20+ frameworks, and endorsed by over 20 leading AI and product executives, inventors, entrepreneurs, and researchers.