Tue.Mar 18, 2025

article thumbnail

Top 5 Data Visualization Tools for Data Scientists

KDnuggets

Out of many data visualization tools, which five should you use? Three Python libraries, JavaScript, and R library should cover most of your data science needs.

article thumbnail

Unapologetically Technical Episode 18 – Adrian Woodhead

Jesse Anderson

In this episode of Unapologetically Technical, I interview Adrian Woodhead, a distinguished software engineer at Human and a true trailblazer in the European Hadoop ecosystem. Adrian, who even authored a chapter in the seminal work “Hadoop: The Definitive Guide,” shares his remarkable journey through the tech world, from his roots in South Africa to his current role pushing the boundaries of data engineering.

Hadoop 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Survey: What’s in your tech stack?

The Pragmatic Engineer

We want to capture an accurate snapshot of software engineering, today – and need your help! Tell us about your tech stack and get early access to the final report, plus extra analysis We’d like to know what tools, languages, frameworks and platforms  you  are using today. Which tools/frameworks/languages are popular and why?

article thumbnail

Snowflake Startup Spotlight: DeepTempo

Snowflake

Welcome to Snowflakes Startup Spotlight, where we learn about awesome companies building businesses on Snowflake. In this edition, find out how Evan Powell, founder and CEO of DeepTempo , is harnessing AI alongside a team of skilled security experts to protect the digital world from increasingly sophisticated cyberattacks. Describe your company in one sentence.

article thumbnail

Whats New in Apache Airflow 3.0 –– And How Will It Reshape Your Data Workflows?

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Gartner Data & Analytics Summit Takeaway: “Why is nobody listening?”

Precisely

Is your data AI-ready? That was a consistent theme at this years Gartner Data & Analytics Summit in Orlando, Florida. There were many Gartner keynotes and analyst-led sessions that had titles like: Scale Data and Analytics on Your AI Journeys” What Everyone in D&A Needs to Know About (Generative) AI: The Foundations AI Governance: Design an Effective AI Governance Operating Model The advice offered during the event was relevant, valuable, and actionable.

article thumbnail

How to quickly deliver data to business users? #1. Adv Data types & Schema evolution

Start Data Engineering

1. Introduction 1.1. Pre-requisites 2. Use Schema evolution & advanced data types to quickly deliver new columns to the end-user 2.1. Enable schema evolution for additive column changes 2.2. Model 1:1 relationship as STRUCT and 1:M relationships as ARRAY[STRUCTS] to keep schema changes self contained 2.3. Naming conventions should represent relationship 3.

Data 130

More Trending

article thumbnail

Insights on AI Sustainability at Data Centre World 2025 by Oliver Cronk

Scott Logic

Last week, I had the opportunity to speak at and attend Data Centre World as part of the larger Tech Show London 2025. This massive event sprawled across half of the ExCeL centre, bringing together industry vendors, academics, and innovators across multiple technology domains. While the conference covered loads of topics, I was particularly drawn to sessions focusing on sustainable data centres and AI computing.

article thumbnail

Introducing Apache Kafka® 4.0

Confluent

Major milestone release Apache Kafka 4.0 removes ZooKeeper entirely, provides early access to Queues for Kafka, and enables faster rebalances, in addition to many other new KIPs.

Kafka 119
article thumbnail

Make your business apps smarter with ThoughtSpot Embedded

ThoughtSpot

In todays digital economy, businesses arent just competing on products and servicestheyre competing on insights and decisions. The ability to deliver real-time, contextual analytics within applications and portals isnt just a nice to have; its a critical advantage. Your users expect instant access to insights without switching between tools, hunting for reports, or waiting for analysts to provide answers.

article thumbnail

Case Study: Flywire

Preset

Product Preset Cloud Fully-managed, cloud-hosted service for Apache Superset Managed Private Cloud Preset with additional security in your private cloud Preset Certified Superset Deploy QA-approved Superset on any infrastructure Preset Embedded Dashboards Interactive analytics in your custom applications Preset API Managing your Preset workspaces as code Use Cases Business Intelligence (BI) Analytics and visualizations powered by Apache Superset for modern data stacks Internal Tooling Embedded a

BI 52
article thumbnail

What’s New in Apache Airflow® 3.0—And How Will It Reshape Your Data Workflows?

Speaker: Tamara Fingerlin, Developer Advocate

Apache Airflow® 3.0, the most anticipated Airflow release yet, officially launched this April. As the de facto standard for data orchestration, Airflow is trusted by over 77,000 organizations to power everything from advanced analytics to production AI and MLOps. With the 3.0 release, the top-requested features from the community were delivered, including a revamped UI for easier navigation, stronger security, and greater flexibility to run tasks anywhere at any time.

article thumbnail

Simplify your map

ArcGIS

Two short videos with tips for making pop-ups and symbols for maps in ArcGIS Pro and ArcGIS Online.

Designing 106
article thumbnail

Do I Need to Learn MicroPython as a Data Scientist?

KDnuggets

A simple guide that tells you what you need to know about MicroPython and why you should use it as a Data Scientist

Data 101
article thumbnail

Simplify your map

ArcGIS

Two short videos with tips for making pop-ups and symbols for maps in ArcGIS Pro and ArcGIS Online.

Designing 103
article thumbnail

Downloading tens of millions of container images daily from the Serverless optimized Artifact Registry

databricks

Introduction In this blog, we share the journey of building a Serverless optimized Artifact Registry from the ground up. The main goals are to ensure.

article thumbnail

A Guide to Debugging Apache Airflow® DAGs

In Airflow, DAGs (your data pipelines) support nearly every use case. As these workflows grow in complexity and scale, efficiently identifying and resolving issues becomes a critical skill for every data engineer. This is a comprehensive guide with best practices and examples to debugging Airflow DAGs. You’ll learn how to: Create a standardized process for debugging to quickly diagnose errors in your DAGs Identify common issues with DAGs, tasks, and connections Distinguish between Airflow-relate

article thumbnail

Getting Started with AutoGluon: Your First Steps in Automated Machine Learning

KDnuggets

Want to build powerful machine learning models in minutes, not weeks? AutoGluon makes it ridiculously easy — no PhD required!

article thumbnail

Tableflow Is GA: Unifying Apache Kafka® Topics with Apache Iceberg™️ and Delta Lake Tables in a Few Clicks

Confluent

Tableflow represents Kafka topics as Apache Iceberg (GA) and Delta Lake (EA) tables in a few clicks to feed any data warehouse, data lake, or analytics engine of your choice

Kafka 72
article thumbnail

Breaking the Tyranny of Chalk Brackets in March Madness

Elder Research

Filling out your March Madness bracket? Playing it safe might win your poolbut wheres the fun in that? Lets talk strategy.

IT 59
article thumbnail

New with Confluent Platform 7.9: Oracle XStream CDC Connector, Client-Side Field Level Encryption (EA), Confluent for VS Code, and More

Confluent

Confluent Platform 7.9 introduces the Oracle XStream CDC Connector, Client-Side Field Level Encryption (EA), Confluent for VS Code, and more.

Coding 72
article thumbnail

Agent Tooling: Connecting AI to Your Tools, Systems & Data

Speaker: Alex Salazar, CEO & Co-Founder @ Arcade | Nate Barbettini, Founding Engineer @ Arcade | Tony Karrer, Founder & CTO @ Aggregage

There’s a lot of noise surrounding the ability of AI agents to connect to your tools, systems and data. But building an AI application into a reliable, secure workflow agent isn’t as simple as plugging in an API. As an engineering leader, it can be challenging to make sense of this evolving landscape, but agent tooling provides such high value that it’s critical we figure out how to move forward.

article thumbnail

Snowflakeが注目するスタートアップ企業:DeepTempo

Snowflake

SnowflakeSnowflake DeepTempo CEOEvan PowellAI DeepTempoSnowflake 6 AI AIAILogLMAIAI DeepTempo AI TempoSnowflake Snowflake TempoAscentPoC DeepTempoSOCSnowflake Snowflake Snowflake Snowflake SnowflakeSnowflake45AnvilogicSplunk SnowflakeCIOSnowflakeSnowflake Snowflake LogLMMLSnowflake Snowflake SnowflakeLLMAISaaSAISnowflake Snowflake DeepTempo deeptempo.ai deeptempo.

52
article thumbnail

Confluent for VS Code Simplifies Real-Time Data Streaming Projects for Developers

Confluent

Confluent for VS Code streamlines workflows, accelerates development cycles, and enhances real-time data processing, all within a unified environment.

Coding 72
article thumbnail

Information Overload? 5 Data Sustainability Tips for 2025

Monte Carlo

Managing company data is a lot like running a kitchen. When everything is labeled, organized, and properly stored, cooking is a breeze. But without a system? You end up with expired ingredients, duplicate spices, and a fridge full of things you forgot you had. Data sustainability keeps your information accurate, accessible, and useful over timeso youre not wasting storage space, money, or time hunting down what you need.

article thumbnail

New with Confluent Platform 7.9: Oracle XStream CDC Connector, Client-Side Field Level Encryption (EA), Confluent for VS Code, and More

Confluent

Confluent Platform 7.9 introduces the Oracle XStream CDC Connector, Client-Side Field Level Encryption (EA), Confluent for VS Code, and more.

Coding 69
article thumbnail

How to Modernize Manufacturing Without Losing Control

Speaker: Andrew Skoog, Founder of MachinistX & President of Hexis Representatives

Manufacturing is evolving, and the right technology can empower—not replace—your workforce. Smart automation and AI-driven software are revolutionizing decision-making, optimizing processes, and improving efficiency. But how do you implement these tools with confidence and ensure they complement human expertise rather than override it? Join industry expert Andrew Skoog as he explores how manufacturers can leverage automation to enhance operations, streamline workflows, and make smarter, data-dri

article thumbnail

New in Confluent Cloud: Tableflow, Freight Clusters, Apache Flink® AI Enhancements, and More

Confluent

CC 2025 Q1 adds Tableflow, Freight clusters, Flink AI enhancements, and more!

Cloud 59
article thumbnail

Meet the Oracle XStream CDC Source Connector

Confluent

Meet the Oracle XStream CDC Source Connector and learn how to unlock real-time operational data and achieve high performance data streaming from Oracle databases.

article thumbnail

Tableflow Is GA: Unifying Apache Kafka® Topics with Apache Iceberg™️ and Delta Lake Tables in a Few Clicks

Confluent

Tableflow represents Kafka topics as Apache Iceberg (GA) and Delta Lake (EA) tables in a few clicks to feed any data warehouse, data lake, or analytics engine of your choice

Kafka 45
article thumbnail

New in Confluent Cloud: Tableflow, Freight Clusters, Apache Flink® AI Enhancements, and More

Confluent

CC 2025 Q1 adds Tableflow, Freight clusters, Flink AI enhancements, and more!

Cloud 45
article thumbnail

The Ultimate Guide to Apache Airflow DAGS

With Airflow being the open-source standard for workflow orchestration, knowing how to write Airflow DAGs has become an essential skill for every data engineer. This eBook provides a comprehensive overview of DAG writing features with plenty of example code. You’ll learn how to: Understand the building blocks DAGs, combine them in complex pipelines, and schedule your DAG to run exactly when you want it to Write DAGs that adapt to your data at runtime and set up alerts and notifications Scale you