Sat.May 06, 2023 - Fri.May 12, 2023

article thumbnail

Datadog’s $65M/year customer mystery solved

The Pragmatic Engineer

The internet has been speculating the past few days on which crypto company spent $65M on Datadog in 2022. I confirmed it was Coinbase, and here are the details of what happened. Originally published on 11 May 2023. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of six topics in today’s subscriber-only The Scoop issue.

AWS 343
article thumbnail

OLTP Vs OLAP – What Is The Difference

Seattle Data Guy

If you’re relying on your OLTP system to provide analytics, you might be in for a surprise. While it can work initially, these systems aren’t designed to handle complex queries. Adding databases like MongoDB and CassandraDB only makes matters worse, since they’re not SQL-friendly – the language most analysts and data practitioners are used to.… Read more The post OLTP Vs OLAP – What Is The Difference appeared first on Seattle Data Guy.

MongoDB 208
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Polars – Laziness and SQL Context.

Confessions of a Data Guy

Polars is one of those tools that you just want … no … NEED a reason to use it. It’s gotten so bad, I’ve started to use it in my Rust code on the side, Polars that is. I mean you have a problem if you could use Polars Python, and you find yourself using […] The post Polars – Laziness and SQL Context. appeared first on Confessions of a Data Guy.

SQL 182
article thumbnail

Data Teams Survey 2023 Follow-Up

Jesse Anderson

The results and analysis from my 2023 Data Teams Survey left a few open questions. Let’s revisit these questions with some answers. Methodologies and Size of Company Figure 1 – Methodologies Broken Down By Size of Company Using Them We see a few commonalities across different company sizes, as shown in Figure 1. One striking commonality is that so many companies are using data mesh.

Data 147
article thumbnail

The Definitive Entity Resolution Buyer’s Guide

Are you thinking of adding enhanced data matching and relationship detection to your product or service? Do you need to know more about what to look for when assessing your options? Our Entity Resolution Buyer’s Guide gives you step-by-step details about everything you should consider when evaluating entity resolution technologies. We discuss use cases, technology, and deployment options, top ten evaluation criteria and more.

article thumbnail

Compensation at Publicly Traded Tech Companies

The Pragmatic Engineer

Insights from 50 publicly traded tech companies, and a list of those paying the most and the least in median total compensation. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover two out of seven topics from today’s subscriber-only deep-dive on Compensation at publicly traded tech companies.

More Trending

article thumbnail

Kinesis sequence number is not an Apache Kafka offset

Waitingforcode

I have used to say "Kinesis Data Streams is like Apache Kafka, an append-only streaming broker with partitions and offsets". Although often it's true, it's not that simple unfortunately.

Kafka 130
article thumbnail

Upscaling LinkedIn's Profile Datastore While Reducing Costs

LinkedIn Engineering

Co-Authors: Estella Pham and Guanlin Lu At peak, LinkedIn serves over 1.4 million member profiles per second. The number of requests to our storage infrastructure doubles every year. In the past, we addressed latency, throughput and cost issues by migrating off Oracle onto Espresso , an open-source document platform, and adding more nodes. We are now at the point where some of the core components are straining under the increasing load, and we can no longer address scaling concerns with the node

Database 135
article thumbnail

PagerDuty alternatives

The Pragmatic Engineer

This is a response to a tweet asking: "Why is there no competition to PagerDuty/Opsgenie? People in my team say it’s “just connecting to the Twilio API” but if it were that easy, there’d probably be a ton of competition." PagerDuty is the market-leading incident alerting tool. OpsGenie is Atlassian's incident management tool, which is widespread thanks to distribution.

article thumbnail

Confluent Will Beat Your Cost of Running Kafka (or $100 on us)

Confluent

Running Kafka is costly, but Confluent has created a far more efficient product to lower your costs. Join the Cost Savings challenge to see for yourself.

Kafka 142
article thumbnail

How Lakehouse powers NLP for Customer Service Analytics in Insurance

databricks

Download the Databricks Insurance NLP Solution Accelerator Introduction The current economic and social climate has redefined customer expectations and preferences. Society has been.

Insurance 114
article thumbnail

New Approaches to Visualizing Snowflake Query Statistics with Snowflake Technology Partners

Snowflake

As of December, customers got a whole new level of insight into Snowflake query performance and query execution statistics when Snowflake announced the public preview of the new get_query_operator_stats function, opening up programmatic access to Snowflake query profiles and providing customers a whole new level of insight into Snowflake query performance and query execution statistics.

SQL 112
article thumbnail

Metal as a Service (MaaS): DIY server-management at scale

LinkedIn Engineering

Guaranteeing that our servers are continually upgraded to secure and vetted operating systems is one major step that we take to ensure our members and customers can access LinkedIn to look for new roles, access new learning programs, or exchange knowledge with other professionals. LinkedIn has quite a large fleet of servers on-premise that depend on internal tooling to ensure they stay on the latest operating systems.

article thumbnail

Tackling the Hidden and Unhidden Costs of Kafka

Confluent

Low utilization and operational complexity dramatically increases Kafka costs, so we reinvented Kafka as a cloud-native and complete service to reduce costs for thousands of businesses at any scale.

Kafka 105
article thumbnail

Data Scientist’s Guide to Cognitive Biases: A Free eBook

KDnuggets

Are you interested in exploring the topic of cognitive biases? Want to see how they may be affecting your data science practice? Check out this free ebook for this and more.

Data 100
article thumbnail

Core Data Engineering: DAGs

Medium Data Engineering

🚀 Did you know that Directed Acyclic Graphs (DAGs) have several properties that make them well-suited for data flow programming and… Continue reading on Medium »

article thumbnail

Precisely Women in Technology: Meet Samantha Martino

Precisely

Technology is a vast industry that has something for everybody. Because of this, it attracts people from all backgrounds and areas of expertise. At Precisely, having diverse representation is the key to success, and as a result, it’s been highly important for the organization to support the unique perspective that employees bring to the table. The Precisely Women in Technology (PWIT) program was designed to connect women from across the organization to one another to offer support, an internal n

article thumbnail

What Makes Confluent the World’s Most Trusted Cloud Data Streaming Platform

Confluent

Confluent manages 30,000+ Kafka clusters, produces over 3 trillion messages, and does durability checks on over 80 trillion Kafka messages per day while offering 99.99% uptime. Check out our cool stats!

Kafka 104
article thumbnail

PostgreSQL Import CSV: 3 Easy Methods

Hevo

As a business grows, the demand to efficiently handle and process the exponentially growing data also rises. A popular open-source relational database used by several organizations across the world is PostgreSQL. It is a perfect database management system that also assists developers to build applications, and administrators to protect data integrity and develop fault-tolerant environments.

article thumbnail

Prompt engineering

Medium Data Engineering

Prompt engineering is the art and science of designing prompts that elicit the desired responses from a natural language generation (NLG)… Continue reading on Medium »

article thumbnail

Top Posts May 1-7: Machine Learning with ChatGPT Cheat Sheet

KDnuggets

Machine Learning with ChatGPT Cheat Sheet • HuggingChat Python API: Your No-Cost Alternative • AutoGPT: Everything You Need To Know • 8 Open-Source Alternative to ChatGPT and Bard • LangChain 101: Build Your Own GPT-Powered Applications

article thumbnail

SoftBank Selects Cloudera Data Platform to Leverage Customer Intelligence While Ensuring Data Security

Cloudera

SoftBank Corp. provides Japan-based mobile communications services, mobile device sales, fixed-line communications, and ISP services, with more than 80 million users nationwide. The company also provides a variety of solutions for enterprises, including data centers, cloud, security, global, artificial intelligence (AI), IoT, and digital marketing services.

article thumbnail

Connect Excel to PostgreSQL in 2 Easy Ways

Hevo

Microsoft Excel is a spreadsheet program included in the Microsoft Office Suite. It’s compatible with Windows, Mac OS X, Android, and iOS. It simplifies the creation of text and numeric grids, formulas calculations, graphing tools, pivot tables, and the VBA Macro programming language (Visual Basic for Applications).

article thumbnail

How to Shift-Left Data Reliability

Medium Data Engineering

As organizations rely more heavily on data analytics for decision-making, the amount of data being captured and fed into analytics data… Continue reading on Medium »

Data 98
article thumbnail

8 Free AI and LLMs Playgrounds

KDnuggets

If you’re interested in trying out AI for fun or learning more about them, then take a look at our list and explore the cutting-edge LLMs available in the wild.

100
100
article thumbnail

#ClouderaLife Volunteer Spotlight: Alex Campos, Principal Technical Leader, Spain

Cloudera

Originally from Brazil, Alex previously lived in Chile and now lives in Spain. During his time living in Latin America in early 2016, Alex saw what he describes as a “knowledge gap” —s eeing the way skills, content and expertise are shared in an open, friendly way at conferences in the US, Alex wanted to replicate that in Latin America. To address this gap, Alex started planning meetups.

AWS 91
article thumbnail

Earned Value Management (EVM): Elements, Formulas, Benefits

Knowledge Hut

Many think that Earned value management is complicated paperwork and thus a lot of professionals stay away from it. On the other hand, successful project managers become superheroes to break this myth of earned value management (EVM). Earned Value Management has taken an important place in the world of project management and plays a vital role in the career of project management certification aspirants like PMI PMP certifications and PRINCE2 certificate.

article thumbnail

DATA SCIENCE EGGS: CRISP MODEL

Medium Data Engineering

In my years of practicing Data Science and the continuous application of many different models which help in arriving at a data-backed… Continue reading on Medium »

article thumbnail

KDnuggets News, May 10: HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data

KDnuggets

HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data • Stop Doing this on ChatGPT and Get Ahead of the 99% of its Users • ChatGPT as a Personalized Tutor for Learning Data Science Concepts • The Ultimate Open-Source Large Language Model Ecosystem

article thumbnail

12 Best Data Management Tools in 2023

Hevo

One of the biggest stumbling blocks of a business is the expansion of its Database. A few problems one might have to deal with while trying to expand their Database are storage problems, complicated management issues, and difficulty in the location, sharing, and checking of isolated data.

article thumbnail

Using Structured Streaming with Delta Sharing in Unity Catalog

databricks

We are excited to announce that support for using Structured Streaming with Delta Sharing is now generally available (GA) in Azure, AWS, and.

AWS 100
article thumbnail

Data-Oriented Programming with Python

Medium Data Engineering

A recap on Data-Oriented Programming by Yehonathan Sharvit illustrated with Python examples (instead of JavaScript and Java) Continue reading on Towards Data Science »

Python 98
article thumbnail

Data-Oriented Programming with Python

Towards Data Science

Data-Oriented Programming in Python A recap on Data-Oriented Programming by Yehonathan Sharvit but illustrated with Python examples (instead of JavaScript and Java) Photo by AltumCode on Unsplash Data-Oriented Programming by Yehonathan Sharvit is a great book that gives a gentle introduction to the concept of data-oriented programming (DOP) as an alternative to good old object-oriented programming (OOP).

Python 87
article thumbnail

Chatbot Arena: The LLM Benchmark Platform

KDnuggets

Chatbot Arena is a benchmark platform for large language models, where the community can contribute new models and evaluate them.

Process 101