Sat.May 06, 2023 - Fri.May 12, 2023

article thumbnail

Datadog’s $65M/year customer mystery solved

The Pragmatic Engineer

The internet has been speculating the past few days on which crypto company spent $65M on Datadog in 2022. I confirmed it was Coinbase, and here are the details of what happened. Originally published on 11 May 2023. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of six topics in today’s subscriber-only The Scoop issue.

AWS 318
article thumbnail

OLTP Vs OLAP – What Is The Difference

Seattle Data Guy

If you’re relying on your OLTP system to provide analytics, you might be in for a surprise. While it can work initially, these systems aren’t designed to handle complex queries. Adding databases like MongoDB and CassandraDB only makes matters worse, since they’re not SQL-friendly – the language most analysts and data practitioners are used to.… Read more The post OLTP Vs OLAP – What Is The Difference appeared first on Seattle Data Guy.

MongoDB 208
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Polars – Laziness and SQL Context.

Confessions of a Data Guy

Polars is one of those tools that you just want … no … NEED a reason to use it. It’s gotten so bad, I’ve started to use it in my Rust code on the side, Polars that is. I mean you have a problem if you could use Polars Python, and you find yourself using […] The post Polars – Laziness and SQL Context. appeared first on Confessions of a Data Guy.

SQL 182
article thumbnail

Kinesis sequence number is not an Apache Kafka offset

Waitingforcode

I have used to say "Kinesis Data Streams is like Apache Kafka, an append-only streaming broker with partitions and offsets". Although often it's true, it's not that simple unfortunately.

Kafka 130
article thumbnail

Driving Responsible Innovation: How to Navigate AI Governance & Data Privacy

Speaker: Aindra Misra, Senior Manager, Product Management (Data, ML, and Cloud Infrastructure) at BILL

Join us for an insightful webinar that explores the critical intersection of data privacy and AI governance. In today’s rapidly evolving tech landscape, building robust governance frameworks is essential to fostering innovation while staying compliant with regulations. Our expert speaker, Aindra Misra, will guide you through best practices for ensuring data protection while leveraging AI capabilities.

article thumbnail

Compensation at Publicly Traded Tech Companies

The Pragmatic Engineer

Insights from 50 publicly traded tech companies, and a list of those paying the most and the least in median total compensation. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover two out of seven topics from today’s subscriber-only deep-dive on Compensation at publicly traded tech companies.

article thumbnail

Confluent Will Beat Your Cost of Running Kafka (or $100 on us)

Confluent

Running Kafka is costly, but Confluent has created a far more efficient product to lower your costs. Join the Cost Savings challenge to see for yourself.

Kafka 142

More Trending

article thumbnail

How Lakehouse powers NLP for Customer Service Analytics in Insurance

databricks

Download the Databricks Insurance NLP Solution Accelerator Introduction The current economic and social climate has redefined customer expectations and preferences. Society has been.

Insurance 118
article thumbnail

PagerDuty alternatives

The Pragmatic Engineer

This is a response to a tweet asking: "Why is there no competition to PagerDuty/Opsgenie? People in my team say it’s “just connecting to the Twilio API” but if it were that easy, there’d probably be a ton of competition." PagerDuty is the market-leading incident alerting tool. OpsGenie is Atlassian's incident management tool, which is widespread thanks to distribution.

Systems 231
article thumbnail

New Approaches to Visualizing Snowflake Query Statistics with Snowflake Technology Partners

Snowflake

As of December, customers got a whole new level of insight into Snowflake query performance and query execution statistics when Snowflake announced the public preview of the new get_query_operator_stats function, opening up programmatic access to Snowflake query profiles and providing customers a whole new level of insight into Snowflake query performance and query execution statistics.

article thumbnail

Top Posts May 1-7: Machine Learning with ChatGPT Cheat Sheet

KDnuggets

Machine Learning with ChatGPT Cheat Sheet • HuggingChat Python API: Your No-Cost Alternative • AutoGPT: Everything You Need To Know • 8 Open-Source Alternative to ChatGPT and Bard • LangChain 101: Build Your Own GPT-Powered Applications

article thumbnail

Launching LLM-Based Products: From Concept to Cash in 90 Days

Speaker: Christophe Louvion, Chief Product & Technology Officer of NRC Health and Tony Karrer, CTO at Aggregage

Christophe Louvion, Chief Product & Technology Officer of NRC Health, is here to take us through how he guided his company's recent experience of getting from concept to launch and sales of products within 90 days. In this exclusive webinar, Christophe will cover key aspects of his journey, including: LLM Development & Quick Wins 🤖 Understand how LLMs differ from traditional software, identifying opportunities for rapid development and deployment.

article thumbnail

Tackling the Hidden and Unhidden Costs of Kafka

Confluent

Low utilization and operational complexity dramatically increases Kafka costs, so we reinvented Kafka as a cloud-native and complete service to reduce costs for thousands of businesses at any scale.

Kafka 103
article thumbnail

Precisely Women in Technology: Meet Samantha Martino

Precisely

Technology is a vast industry that has something for everybody. Because of this, it attracts people from all backgrounds and areas of expertise. At Precisely, having diverse representation is the key to success, and as a result, it’s been highly important for the organization to support the unique perspective that employees bring to the table. The Precisely Women in Technology (PWIT) program was designed to connect women from across the organization to one another to offer support, an internal n

article thumbnail

PostgreSQL Import CSV: 3 Easy Methods

Hevo

As a business grows, the demand to efficiently handle and process the exponentially growing data also rises. A popular open-source relational database used by several organizations across the world is PostgreSQL. It is a perfect database management system that also assists developers to build applications, and administrators to protect data integrity and develop fault-tolerant environments.

article thumbnail

Data Scientist’s Guide to Cognitive Biases: A Free eBook

KDnuggets

Are you interested in exploring the topic of cognitive biases? Want to see how they may be affecting your data science practice? Check out this free ebook for this and more.

article thumbnail

How To Speak The Language Of Financial Success In Product Management

Speaker: Jamie Bernard

Success in product management goes beyond delivering great features - it’s about achieving measurable financial outcomes that resonate across the organization. By connecting your product’s journey with the company’s financial success, you’ll ensure that every feature, release, and innovation contributes to the bottom line, driving both customer satisfaction and business growth.

article thumbnail

What Makes Confluent the World’s Most Trusted Cloud Data Streaming Platform

Confluent

Confluent manages 30,000+ Kafka clusters, produces over 3 trillion messages, and does durability checks on over 80 trillion Kafka messages per day while offering 99.99% uptime. Check out our cool stats!

Kafka 103
article thumbnail

Snowflake Connector for Django Now Available 

Snowflake

We’re excited to announce that the Snowflake Connector for Django is now available on Snowflake Labs on GitHub. This integration provides Django apps easy access to data within the Snowflake Data Cloud without manually integrating against API endpoints. Now Python developers can easily and quickly build web applications that access Snowflake data by leveraging the Django framework.

Python 94
article thumbnail

Connect Excel to PostgreSQL in 2 Easy Ways

Hevo

Microsoft Excel is a spreadsheet program included in the Microsoft Office Suite. It’s compatible with Windows, Mac OS X, Android, and iOS. It simplifies the creation of text and numeric grids, formulas calculations, graphing tools, pivot tables, and the VBA Macro programming language (Visual Basic for Applications).

article thumbnail

Data Masking: The Core of Ensuring GDPR and other Regulatory Compliance Strategies

KDnuggets

This article has provided an overview of data masking and its importance in ensuring compliance with GDPR and other global regulations.

Data 137
article thumbnail

What Is Entity Resolution? How It Works & Why It Matters

Entity Resolution Sometimes referred to as data matching or fuzzy matching, entity resolution, is critical for data quality, analytics, graph visualization and AI. Learn what entity resolution is, why it matters, how it works and its benefits. Advanced entity resolution using AI is crucial because it efficiently and easily solves many of today’s data quality and analytics problems.

article thumbnail

An ML based approach to proactive advertiser churn prevention

Pinterest Engineering

Erika Sun ML Engineer | Advertiser Growth Modeling Team; Ogheneovo Dibie Engineering Manager | Advertiser Growth Modeling Team Photo by Jason Blackeye on Unsplash Summary In this blog post, we describe a Machine Learning (ML) powered proactive churn prevention solution that was prototyped with our small & medium business (SMB) advertisers. Results from our initial experiment suggest that we can detect future churn with a high degree of predictive power and consequently empower our sales par

article thumbnail

Unifying Your Data Ecosystem with Delta Lake Integration

databricks

As organizations are maturing their data infrastructure and accumulating more data than ever before in their data lakes, Open and Reliable table formats.

Data Lake 100
article thumbnail

Earned Value Management (EVM): Elements, Formulas, Benefits

Knowledge Hut

Many think that Earned value management is complicated paperwork and thus a lot of professionals stay away from it. On the other hand, successful project managers become superheroes to break this myth of earned value management (EVM). Earned Value Management has taken an important place in the world of project management and plays a vital role in the career of project management certification aspirants like PMI PMP certifications and PRINCE2 certificate.

article thumbnail

8 Free AI and LLMs Playgrounds

KDnuggets

If you’re interested in trying out AI for fun or learning more about them, then take a look at our list and explore the cutting-edge LLMs available in the wild.

116
116
article thumbnail

Provide Real Value in Your Applications with Data and Analytics

The complexity of financial data, the need for real-time insight, and the demand for user-friendly visualizations can seem daunting when it comes to analytics - but there is an easier way. With Logi Symphony, we aim to turn these challenges into opportunities. Our platform empowers you to seamlessly integrate advanced data analytics, generative AI, data visualization, and pixel-perfect reporting into your applications, transforming raw data into actionable insights.

article thumbnail

#ClouderaLife Volunteer Spotlight: Alex Campos, Principal Technical Leader, Spain

Cloudera

Originally from Brazil, Alex previously lived in Chile and now lives in Spain. During his time living in Latin America in early 2016, Alex saw what he describes as a “knowledge gap” —s eeing the way skills, content and expertise are shared in an open, friendly way at conferences in the US, Alex wanted to replicate that in Latin America. To address this gap, Alex started planning meetups.

article thumbnail

Using Structured Streaming with Delta Sharing in Unity Catalog

databricks

We are excited to announce that support for using Structured Streaming with Delta Sharing is now generally available (GA) in Azure, AWS, and.

AWS 103
article thumbnail

12 Best Data Management Tools in 2023

Hevo

One of the biggest stumbling blocks of a business is the expansion of its Database. A few problems one might have to deal with while trying to expand their Database are storage problems, complicated management issues, and difficulty in the location, sharing, and checking of isolated data.

article thumbnail

KDnuggets News, May 10: HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data

KDnuggets

HuggingChat Python API: Your No-Cost Alternative • Exploratory Data Analysis Techniques for Unstructured Data • Stop Doing this on ChatGPT and Get Ahead of the 99% of its Users • ChatGPT as a Personalized Tutor for Learning Data Science Concepts • The Ultimate Open-Source Large Language Model Ecosystem

article thumbnail

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving everyday. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

article thumbnail

Visibility and Transparency

Cloudera

Out of the box Cloudera Data platform (CDP) performs superbly but over time, if data architecture, data engineering, and DevOps best practices are not maintained, you can get stuck maintaining the wild, wild west. In this six-part series, we’re focused on improving the health of your environment. Visibility and Transparency Improving environmental health is impossible if you’re flying blind.

article thumbnail

Cluster Policy Onboarding Primer

databricks

Introduction This blog is part of our Admin Essentials series, where we'll focus on topics important to those managing and maintaining Databricks environments.

article thumbnail

Import Excel into MySQL: 4 Easy Methods

Hevo

Microsoft Excel has been a traditional choice as a spreadsheet application for organizations across the world. The ease of access, power formulas, and the ability to make visually stunning reports has made Microsoft Excel is widely used tool.

MySQL 83
article thumbnail

What are Large Language Models and How Do They Work?

KDnuggets

Large language models represent a significant advancement in natural language processing and have transformed the way we interact with language-based technology. Learn why they’re important and how they work.

article thumbnail

The AI Superhero Approach to Product Management

Speaker: Conrado Morlan

In this engaging and witty talk, industry expert Conrado Morlan will explore how artificial intelligence can transform the daily tasks of product managers into streamlined, efficient processes. Using the lens of a superhero narrative, he’ll uncover how AI can be the ultimate sidekick, aiding in data management and reporting, enhancing productivity, and boosting innovation.