Thu.Apr 20, 2023

article thumbnail

Uber’s engineering level changes

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get full newsletters twice a week, subscribe here. This is a bit of a ‘late scoop,’ which I initially missed when it happened. Better late than never! Until early 2022, the software engineering levels at Uber were: Engineering levels at Uber, 2014-2022 Back when I was at Uber in around 2020, I saw statisti

article thumbnail

Big Data Warsaw 2023 retrospective - for data engineers

Waitingforcode

After a 2-years break, I had a chance to speak again, this time at the Big Data Warsaw 2023. Even though I couldn't be at Warsaw that day, I enjoyed the experience and also watched other sessions available through the conference platform.

Big Data 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Unveiling the Potential of CTGAN: Harnessing Generative AI for Synthetic Data

KDnuggets

CTGAN and other generative AI models can create synthetic tabular data for ML training, data augmentation, testing, privacy-preserving sharing, and more.

Data 159
article thumbnail

Viral spam content detection at LinkedIn

LinkedIn Engineering

On the LinkedIn platform, members from around the world share their knowledge, perspectives, and discuss topics important to them. Our goal at LinkedIn is to enable them to do so in a safe, trusted, and professional environment. We’ve previously discussed the various systems used to create a safe and trusted experience for our members and how we keep the LinkedIn Feed relevant for our members on LinkedIn.

article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

Building a Data-Centric Platform for Generative AI and LLMs at Snowflake

Snowflake

Generative AI and large language models (LLMs) are revolutionizing many aspects of both developer and non-coder productivity with automation of repetitive tasks and fast generation of insights from large amounts of data. Snowflake users are already taking advantage of LLMs to build really cool apps with integrations to web-hosted LLM APIs using external functions , and using Streamlit as an interactive front end for LLM-powered apps such as AI plagiarism detection , AI assistant , and MathGPT.

Building 114
article thumbnail

Data capture techniques for business

InData Labs

Gaining valuable insight into customer preferences and concerns is paramount to the success of any business. The most efficient way of doing so is by implementing sophisticated yet straightforward data capture techniques. These involve types of data capture methods such as surveys, interviews, focus groups, market studies, and many more. Knowing your customers’ needs and.

Data 98

More Trending

article thumbnail

PyTorch on Databricks - Introducing the Spark PyTorch Distributor

databricks

Background and Motives Deep Learning algorithms are complex and time consuming to train, but are quickly moving from the lab to production because.

article thumbnail

The Base Rate Fallacy and its Impact on Data Science

KDnuggets

The base rate fallacy is the importance of considering all relevant data.

article thumbnail

Building Data Applications on the Lakehouse With the Databricks SQL Driver for GO

databricks

We are excited to announce the general availability of the Databricks SQL Driver for GO. This follows the recent general availability of Databricks.

SQL 81
article thumbnail

Creating a YouTube Data Pipeline with AWS and Apache Airflow

Towards Data Science

A solution for effectively managing YouTube data with cloud services and job schedulers Continue reading on Towards Data Science »

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

Building Data Applications on the Lakehouse With the Databricks SQL Driver for Node.js

databricks

We are excited to announce the general availability of the Databricks SQL Driver for NodeJS. This follows the recent general availability of Databricks.

SQL 71
article thumbnail

Data Governance: Framework, Tools, Principles, Benefits

Knowledge Hut

Data governance refers to the set of policies, procedures, mix of people and standards that organisations put in place to manage their data assets. It involves establishing a framework for data management that ensures data quality, privacy, security, and compliance with regulatory requirements. The mix of people, procedures, technologies, and systems ensures that the data within a company is reliable, safe, and simple for employees to access.

article thumbnail

Understanding and Optimizing Your Kafka Costs – Part 1: Infrastructure

Confluent

Quantifying the cost of running Kafka is challenging. In part 1, learn how to calculate Kafka costs stemming from infrastructure and the impact networking has on your cloud bill.

Kafka 57
article thumbnail

Top 12 Essential UI/UX Designer Skills in 2023

Knowledge Hut

The terms UI (User Interface) and UX (User Experience) are closely related and play an important role in the design of digital products and services that we use in our day-to-day lives. A well-designed UI and UX can make the product simple to use, aesthetically pleasing, and highly functional as compared to other products. But poor UI and UX design of a product can lead to confusion, frustration, and sometimes even abandonment of a product or service by the user.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Boosting Spark Union Operator Performance: Optimization Tips for Improved Query Speed

Towards Data Science

Demystify Spark Performance in Union Operator Continue reading on Towards Data Science »

article thumbnail

How RPR Provides Top-Notch Geocoding Data with Precisely

Precisely

By Reggie Nicolay, VP Marketing, RPR Every REALTOR ® wants to be a trusted local market expert. To accomplish that, they must stay on top of data and market insights daily. They also need to be prepared to field a wide variety of questions because they work with a wide variety of consumers, from first-time buyers to seasoned buyers, sellers, and investors.

article thumbnail

Cyber Security vs Data Science: Key Difference & Similarities

Knowledge Hut

In today's world, where technology is advancing at an unprecedented pace, the world of cybersecurity faces sophisticated threats and complex challenges daily. To combat these dirty challenges thrown by hackers, the field of data science has emerged as a powerful player in the battleground against cybercrimes. With the advanced growth in data analysis and machine learning, data scientists are able to uncover hidden patterns, predict attacks, and reveal insights in large datasets that would he

article thumbnail

Measuring Performance for iOS Apps at Uber Scale

Uber Engineering

Curious about the magic behind Uber’s iOS app performance? Check out our blog post to learn how we overcame scalability challenges in our approach to measuring app reliability metrics.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Kanban for Manufacturing: Benefits, Challenges, Examples

Knowledge Hut

Kanban manufacturing is a type of lean manufacturing that has become more popular in recent years because it makes manufacturing operations more efficient, reduces waste, and improves quality. Kanban was invented in Japan for the auto industry, but now a wide range of businesses and organizations from different industries across the globe use it. This article talks about Kanban for manufacturing, including its benefits, problems, and examples of its usage.

article thumbnail

Launching a New Files Experience for the Databricks Workspace

databricks

Today, we are excited to announce the general availability of files throughout the Databricks workspace. Files support allows Databricks users to store Python.

Python 52
article thumbnail

Immersive Learning: Implementation, Best Practices, Benefits

Knowledge Hut

Ever since its inception, immersive learning has been revolutionizing many aspects of our lives. From education to employee training - acquiring knowledge with immersive learning technologies is easy and exciting. It infuses advanced learning theory, spatial design, and data science to boost engagement and make the training experience much more effective and quicker.

article thumbnail

Using Dead Letter Queues with SQL Stream Builder

Cloudera

What is a dead letter queue (DLQ)? Cloudera SQL Stream builder gives non-technical users the power of a unified stream processing engine so they can integrate, aggregate, query, and analyze both streaming and batch data sources in a single SQL interface. This allows business users to define events of interest for which they need to continuously monitor and respond quickly.

SQL 79
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

What is React? Examples, Features, Components, Pros and Cons

Knowledge Hut

Could you spend your entire day without using your phone or the Internet? Probably, not! We live in an era where mobile phones and web applications have made a significant space in our lives. Everything has gone digital, from shopping to booking cabs, ordering food, and conducting transactions. All of these have been made possible by apps that cater to specific needs.

article thumbnail

New Snowflake Features Released in March 2023

Snowflake

In March, Snowflake released a number of exciting new capabilities including Python Worksheets, the Snowflake Connector for ServiceNow, and expanded support for geospatial data. Read on to learn more about the full set of features that were just announced. Snowflake Connectors Snowflake Connector for ServiceNow – public preview Ingest data from ServiceNow into Snowflake automatically.

Medical 54
article thumbnail

Top 10 Reasons Why Web Development is So Important

Knowledge Hut

In today's digital age, having a strong online presence is critical for businesses to attract and retain customers. Customers are increasingly using the internet to research products and services before making purchases, emphasizing the importance of online presence for businesses. The various ways in which a company promotes itself online, with a company's website being a critical component, are referred to as its online presence.

Media 52
article thumbnail

Benefits of Data Mesh and Top Examples to Unlock Success

Ascend.io

Parsing current data stacks is like playing the longest-lasting game of Jenga. Datasets and code are centralized into one big monolithic architecture. Only the most experienced data engineers can extract pieces of it. And every time they do, they risk toppling the whole infrastructure over. But there’s a better way to play the game: data mesh. Data mesh decentralizes and democratizes data across the organization.

article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

Project Resource Management: Definition, Process and Template

Knowledge Hut

Project management is important for leading a business, as a function of which, we need to ensure that projects are completed within set timelines, planned budgets, and required quality standards of delivery. Effective project resource management is a highly essential ingredient of project management which involves identifying, acquiring, and effectively utilizing resources required to complete a required project successfully.

Project 52
article thumbnail

Mutt Data & H2O.ai Partner Up

Mutt Data

We’re Excited To Announce Our New Partnership! Partnerships are integral to Mutt Data’s mission of enabling companies to transition into an AI-driven future by implementing large-scale intelligent automation. This is why we are excited to announce our technology partnership with H2O.ai , the open-source leader in AI and automatic machine learning. They are expert providers of open-source and commercial AI and ML platforms used to build and deploy AI and ML models at scale.

article thumbnail

How to Manage Projects Effectively? Step-by-Step Guide

Knowledge Hut

In business organizations, projects are envisioned, planned, and initiated to take a business to the next level, and Project management plays an important role in achieving this through planning, organizing, and creating the path to meet both long-term and short-term business goals. Project management defines and determines the efficacy and effectiveness of a project.

Project 52
article thumbnail

17 Super Valuable Automated Data Lineage Use Cases With Examples

Monte Carlo

Data lineage is a visual diagram showing how data flows through your ETL pipeline from ingestion to consumption. Solutions with automated data lineage capabilities constantly update these graphs and illustrate them as nodes and edges, or in other words, the objects through which the data travels and the relationship between them. While not every automated data lineage solution is the same , important nodes include: The connectors or syncs that ingest the data Tables within a data warehouse or la

article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating