Sat.Oct 07, 2023 - Fri.Oct 13, 2023

The Power of a Semantic Layer: A Data Engineer’s Guide

KDnuggets

Looking to understand the semantic layer and how it can improve your data stack? This GigaOm Sonar report on semantic layers can help you delve deeper.

Going from Developer to CEO: Chronosphere

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover three out of eight topics from today’s deepdive into tech scaleup Chronosphere. To get full issues twice a week, subscribe here.

Using Data To Illuminate The Intentionally Opaque Insurance Industry

Data Engineering Podcast

Summary: The insurance industry is notoriously opaque and hard to navigate. Max Cho found that fact frustrating enough that he decided to build a business of making policy selection more navigable. In this episode he shares his journey of data collection and analysis and the challenges of automating an intentionally manual industry. Announcements: Hello and welcome to the Data Engineering Podcast, the show about modern data management. Introducing RudderStack Profiles.

How to use the DockerOperator

Marc Lamberti

Do you wonder how to use the DockerOperator in Airflow to kick off a Docker image? Or how to run a task without creating dependency conflicts? In this tutorial, you will discover everything you need to know about the DockerOperator with practical examples. If you’re new to Airflow, I’ve created a course you can check out here. Ready? Let’s go!

Navigating the Future: Generative AI, Application Analytics, and Data

Generative AI is upending the way product developers & end-users alike are interacting with data. Despite the potential of AI, many are left with questions about the future of product development: How will AI impact my business and contribute to its success? What can product managers and developers expect in the future with the widespread adoption of AI?

Table file formats - vacuum: Delta Lake

Waitingforcode

If you have some experience with RDBMS, who doesn't btw, you have probably run a VACUUM command to reclaim the storage space occupied by deleted or obsolete rows. If you're now working with Delta Lake, you can do the same!

Data News — Week 23.40

Christophe Blefari

Hey, I'm a bit late once again. I hope this newsletter edition finds you well. This is almost a raw edition. I had quite a large number of links, and I hope you will like this selection. Gen AI 🤖: OpenAI’s plan to build the "iPhone of artificial intelligence". Obviously this is one of the main struggles for OpenAI.

More Trending

Why SQL is THE Language to Learn for Data Science

KDnuggets

SQL is the essential data science language due to its universal database accessibility, efficient data cleaning capabilities, seamless integration with other languages, and requirement for most data science jobs.

Unapologetically Technical Episode 5 – Neil Avery

Jesse Anderson

Unapologetically Technical is finally back with a new episode! In this episode of Unapologetically Technical, I had the pleasure of interviewing Neil Avery from Liquidlabs. We discussed his experiences creating grid computing systems at major banks like Royal Bank of Scotland and Deutsche Bank, as well as his journey to founding a startup called Logscape and working as a consultant at Excellian.

How to Become a Project Director? In 5 Simple Steps

Knowledge Hut

Project management involves multifaceted skills and competencies. Many skilled people are involved in project management, from project coordinators to project consultants. One key role in project management is the project director. These individuals sit at the top of project management and are responsible for making crucial decisions on projects.

Llama 2 Foundation Models Available in Databricks Lakehouse AI

databricks

We’re excited to announce that Meta AI’s Llama 2 foundation chat models are available in the Databricks Marketplace for you to fine-tune and dep.

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enhance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

Accelerate Your Machine Learning Journey with Uplimit’s Metaflow Mastery Course

KDnuggets

Ready to take your machine learning skills to new heights? Dive into the world of Metaflow with us and elevate your expertise with Uplimit's Full-Stack Machine Learning with Metaflow course!

Designing Loggi's Event-Driven Architecture for Flexibility and Engineering Productivity

Confluent

With Confluent Cloud, Loggi migrated to an event-driven architecture, powering real-time analytics, boosting productivity, and cutting costs.

Build an Actionable Customer 360 in the Data Cloud with Hightouch Events

Snowflake

Easily collect and store digital events directly to create a complete composable customer data platform (CDP). Marketers are increasingly leveraging the Snowflake Data Cloud as the foundation for all of their customer data analytics and activation. Marketing teams are creating composable customer data platforms (CDPs) on the Data Cloud to build a 360-degree view of each customer.

Databricks Obtains ISO 27701 Certification

databricks

We’re excited to announce that Databricks has obtained the International Organization for Standardization (ISO) 27701 certification as a data processor. This certification reflects our c.

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

Rust Burn Library for Deep Learning

KDnuggets

A new deep learning framework built entirely in Rust that aims to balance flexibility, performance, and ease of use for researchers, ML engineers, and developers.

Introducing Apache Kafka 3.6

Confluent

Apache Kafka 3.6 brings Tiered Storage Early Access, migrating clusters from ZooKeeper to KRaft with no downtime, a grace period for stream-table joins, and more!

Mastering data integration from SAP Systems with prompt engineering

Towards Data Science

In our previous publication, From Data Engineering to Prompt Engineering, we demonstrated how to utilize ChatGPT to solve data preparation tasks. Apart from the good feedback we have received, one critical point has been raised: prompt engineering may help with simple tasks, but is it really useful in a more challenging environment?

Scalable, In-House Quality Measurement with a NCQA-Certified Engine on the Lakehouse

databricks

This blog was written in collaboration with David Roberts (Analytics Engineering Manager), Kevin P. Buchan Jr (Assistant Vice President, Analytics), and Yubin Park.

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD, Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Best Practices for Building ETLs for ML

KDnuggets

This article talks about several best practices for writing ETLs for building training datasets. It delves into several software engineering techniques and patterns applied to ML.

Unleashing the Potential of Demand Forecasting with Data Streaming

Confluent

Learn how data streaming enables you to accurately predict future customer demands while delivering the right products in the right quantities to satisfy customer demand without creating a surplus.

Snowflake and Partners Develop Award-Winning Solution to Give Telecoms and Consumers the Power to Reduce Carbon Emissions with Generative AI

Snowflake

In the age of climate consciousness, industries worldwide are grappling with the urgent need to reduce their carbon footprints. One industry that has come under increased scrutiny is telecommunications, where Scope 3 emissions, or the indirect emissions that occur in a company’s value chain that the company has no direct control over, alone account for a staggering 85% of a typical telecom company’s carbon footprint.

Announcing public preview of Databricks Asset Bundles: Apply software development best practices with ease

databricks

We are delighted to announce that Databricks Asset Bundles are now in public preview. Bundles, for short, facilitate the adoption of software engineering.

How Embedded Analytics Gets You to Market Faster with a SaaS Offering

Start-ups & SMBs launching products quickly must bundle dashboards, reports, & self-service analytics into apps. Customers expect rapid value from your product (time-to-value), data security, and access to advanced capabilities. Traditional Business Intelligence (BI) tools can provide valuable data analysis capabilities, but they have a barrier to entry that can stop small and midsize businesses from capitalizing on them.

KDnuggets News, October 11: 3 Data Science Projects to Land That Job • 7 Steps to Mastering NLP

KDnuggets

This week: What three data science projects should you choose to guarantee you get the job? • A 7-step guide to help you go from the fundamentals of machine learning and Python to Transformers, recent advances in NLP, and beyond.

Propel Telecom Growth with Location-Based Context

Precisely

Telecom providers invest heavily in infrastructure, so it’s vital that they optimize those investments by using an intelligent planning process. That means making data-driven decisions based on rich, contextual, location-based data. Is your company making the right investments in infrastructure? That depends on the answers to three questions: Are you building in the right place?

Data Streaming and Artificial Intelligence: The Future of Real-Time Social Media Monitoring

Confluent

Learn how data streaming and artificial intelligence enable you to protect your brand’s reputation with real-time social media monitoring.

Databricks and Shell collaborate to simplify industrial time series data analytics on the Lakehouse

databricks

Written in partnership with Shell. The energy industry is all about physical assets – from terminals, ships and pipelines to refineries and wind f.

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Comparing Natural Language Processing Techniques: RNNs, Transformers, BERT

KDnuggets

RNNs, Transformers, and BERT are popular NLP techniques with tradeoffs in sequence modeling, parallelization, and pre-training for downstream tasks.

Google DeepMind’s Eli Collins to Headline IMPACT: The Data Observability Summit on November 8

Monte Carlo

Today, I’m thrilled to announce that Eli Collins, VP of Product at Google DeepMind, will join us on stage as our surprise keynote speaker at IMPACT: The Data Observability Summit! Alongside Billy Beane (yes, that Billy Beane), Annie Duke, author of one of my favorite books, Thinking in Bets, and Nga Phan, SVP of Product at Salesforce AI, Eli will round out our slate of data and AI keynotes for the conference.

Life Happens in Real Time, Not in Batches: Choosing a Data Streaming Platform and Stream Processing Engine

Confluent

Learn about the key capabilities of a data streaming platform and what factors to consider when choosing a stream processing engine like Apache Flink® to fuel use cases with real-time data.

Announcing the General Availability of the Databricks SQL Statement Execution API

databricks

Today, we are excited to announce the general availability of the Databricks SQL Statement Execution API on AWS and Azure, with support for.

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.