Tue.Oct 29, 2024

article thumbnail

Unapologetically Technical Episode 14 – Cliff Crosland

Jesse Anderson

Unapologetically Technical’s newest episode is now live! In this episode of Unapologetically Technical, I interview Cliff Crosland, the co-founder and CEO of Scanner.dev. Cliff Crosland is a data engineer passionate about helping people wrangle massive log volumes. He sees logs as a treasure trove of insights and believes effective log analysis is critical in today’s complex systems.

article thumbnail

Fine-Tuning GPT-4o

KDnuggets

Learn how to enhance GPT-4o performance for legal text clarification on your old laptop with just a few lines of code.

Coding 118
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Differential Backups in MyRocks Based Distributed Databases at Uber

Uber Engineering

Learn about how the Storage team at Uber significantly reduced costs and improved speed for backups of its Petabyte-scale, MyRocks-based distributed databases by devising a Differential Backups solution.

article thumbnail

10 Useful Python One-Liners for Data Cleaning

KDnuggets

Here are some useful Python one-liners for common data cleaning tasks.

Python 131
article thumbnail

Apache Airflow® Best Practices for ETL and ELT Pipelines

Whether you’re creating complex dashboards or fine-tuning large language models, your data must be extracted, transformed, and loaded. ETL and ELT pipelines form the foundation of any data product, and Airflow is the open-source data orchestrator specifically designed for moving and transforming data in ETL and ELT pipelines. This eBook covers: An overview of ETL vs.

article thumbnail

Upgrading Uber’s MySQL Fleet  to version 8.0

Uber Engineering

Learn all about our journey of successfully upgrading our MySQL fleet at Uber from v5.7 to v8.0, enhancing performance and reliability.

MySQL 85
article thumbnail

Model Selection and Experimentation Automation with LLMs

KDnuggets

Automate the machine learning modelling important step with LLMs.

More Trending

article thumbnail

Understanding K-Fold Target Encoding to Handle High Cardinality

Towards Data Science

Balancing complexity and performance: An in-depth look at K-fold target encoding Photo by Mika Baumeister on Unsplash Introduction Data science practitioners encounter numerous challenges when handling diverse data types across various projects, each demanding unique processing methods. A common obstacle is working with data formats that traditional machine learning models struggle to process effectively, resulting in subpar model performance.

article thumbnail

Making Uber’s ExperimentEvaluation Engine 100x Faster

Uber Engineering

Learn how Uber was able to reduce evaluation latencies by a factor of 100x in their Experimentation platform, which is used to empower decision making across the company by processing over 10 million evaluations per second.

article thumbnail

How to Measure Design System at Scale

Uber Engineering

Learn how Uber made a breakthrough in tracking design metrics across Figma, Android, and iOS with Design System Observability.

article thumbnail

Streamlining Financial Precision: Uber’s Advanced Settlement Accounting System

Uber Engineering

Discover how Uber’s cutting-edge settlement accounting system processes over 1.2 billion transactions monthly, ensuring precise financial tracking, preventing fraud, and managing regulatory compliance with unmatched efficiency.

Systems 53
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Shifting E2E Testing Left at Uber

Uber Engineering

Learn how we achieved diff-time E2E testing for thousands of microservices at Uber.

67
article thumbnail

Lucene: Uber’s Search Platform Version Upgrade

Uber Engineering

Search powers critical business operations across Uber like in-app geo search and matching riders to drivers. Dive into how Uber performed a multi-version Lucene upgrade across a complex monorepo to enhance search accuracy.

40
article thumbnail

Transforming Executive Travel: Delegate Booking with Uber

Uber Engineering

Find out how Uber for Business launched delegate profiles on Administrative Professionals Day, empowering executive assistants to manage executive travel, streamlining processes, and optimizing efficiency.

Process 40