August, 2023

Top 5 questions Data Engineers should ask before joining a startup

Towards Data Science

Advice from a startup founder in the data space on how to find a startup that works for you Photo by Leeloo Thefirst from Pexels.com So you want to join a startup huh? I’m not talking about a fancy Series E startup that’s about to go IPO funded by a16z. I’m talking about a real startup, from seed to series B — where every day can feel like you’re either about to soar or crash and burn — and there’s little in between.

Why Is Data Modeling So Challenging – How To Data Model For Analytics

Seattle Data Guy

Learning how to data model from basic star schemas on the internet is like learning data science using the IRIS data set. It works great as a toy example. But it doesn’t match real life at all. Data modeling in real life requires that you fully understand the data sources and your business use cases.… Read more The post Why Is Data Modeling So Challenging – How To Data Model For Analytics appeared first on Seattle Data Guy.

What is a Senior Software Engineer at Wise and Amazon?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. To get full issues twice a week, subscribe here. Over the past month, we’ve done deep dives in the newsletter on what a senior software engineer is at Big Tech, and at scaleups.

MSSQL vs MySQL: Comparing Powerhouses of Databases

Analytics Vidhya

Introduction In the bustling arena of database management systems, two heavyweight contenders emerge, each carrying its arsenal of features and capabilities. In one corner, we have the suave and sophisticated Microsoft SQL Server (MSSQL), donned in the elegance of enterprise-level prowess. And in the other corner the scrappy and open-source MySQL, armed with its community-driven […] The post MSSQL vs MySQL: Comparing Powerhouses of Databases appeared first on Analytics Vidhya.

The AI Superhero Approach to Product Management

Speaker: Conrado Morlan

In this engaging and witty talk, we’ll explore how artificial intelligence can transform the daily tasks of product managers into streamlined, efficient processes. Using the lens of a superhero narrative, we’ll uncover how AI can be the ultimate sidekick, aiding in decision-making, enhancing productivity, and boosting innovation. Attendees will leave with practical tools and actionable insights, motivated to embrace AI and leverage its potential in their work. 🦸 🏢 Key objectives:

The fancy data stack—batch version

Christophe Blefari

Summer Edition (credits) This is the first article of the Data News Summer Edition: how to build a data platform. I tried to keep this first article as short as possible; details will come in the following ones. The modern data stack has been criticised a lot: some say it's dead, others say we are in the post-modern era. The modern data stack, as a collection of tools that interact to serve data to consumers, is still relevant.

Table file formats - commits: Delta Lake

Waitingforcode

One of the great features of modern table file formats is the ability to handle write conflicts. It wouldn't be possible without commits that are the topic of this new blog post.

More Trending

The Case of the Mysterious Recursive CTE

Confessions of a Data Guy

I still remember that day. A day that shall live on in infamy in my mind. Well over a decade ago, in the days when SQL Server roamed the land devouring souls on the Altar of Stored Procedures. There was only one tool available at the time. SQL. That’s it. There was one problem that […] The post The Case of the Mysterious Recursive CTE appeared first on Confessions of a Data Guy.

How Games Typically Get Built

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of four topics from the past newsletter issue Game Development Basics. To get the full issues, twice a week, subscribe here.

ELT vs ETL: Unveiling the Differences and Similarities

Analytics Vidhya

Introduction In today’s data-driven world, seamless data integration plays a crucial role in driving business decisions and innovation. Two prominent methodologies have emerged to facilitate this process: Extract, Transform, Load (ETL) and Extract, Load, Transform (ELT). In this article, we will discuss ELT vs ETL, comparing their characteristics, benefits, and suitability for various use cases. […] The post ELT vs ETL: Unveiling the Differences and Similarities appeared first on Ana

The Burtch Works 2023 Data Science & AI Professionals Salary Report is Here!

KDnuggets

The Burtch Works 2023 Data Science & AI Professionals salary report is here, and includes insightful data such as hiring and marketplace trends, compensation changes over time, and salary data. Get your copy here.

Provide Real Value in Your Applications with Data and Analytics

The complexity of financial data, the need for real-time insight, and the demand for user-friendly visualizations can seem daunting when it comes to analytics - but there is an easier way. With Logi Symphony, we aim to turn these challenges into opportunities. Our platform empowers you to seamlessly integrate advanced data analytics, generative AI, data visualization, and pixel-perfect reporting into your applications, transforming raw data into actionable insights.

Table file formats - isolation levels: Delta Lake

Waitingforcode

If Delta Lake implemented only commits, I could stop exploring this transactional part after the previous article. But as with an RDBMS, Delta Lake implements other ACID-related concepts. One of these is isolation levels.

Snowflake and Instacart: The Facts

Snowflake

In the past few days, the scope and trajectory of Instacart’s use of Snowflake has been misrepresented by some on social media. Snowflake has partnered closely with Instacart to scale up to meet the company’s massive demand growth, and then to optimize for efficiency. Optimizations are undertaken on a workload-by-workload basis, and have been extremely successful.

Missing Data Demystified: The Absolute Primer for Data Scientists

Towards Data Science

Data Quality Chronicles: Missing data, missing mechanisms, and missing data profiling. Missing data prevents data scientists from seeing the entire story the data has to tell. Sometimes, even the smallest pieces of information can provide a completely unique view of the world. Photo by Ronan Furuta on Unsplash. Earlier this year, I started a piece on several data quality issues (or characteristics) that heavily compromise our machine learning models.

A senior engineer/EM job search story

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of five topics from today’s subscriber-only The Pulse issue. To get full issues twice a week, subscribe here.

Entity Resolution: Your Guide to Deciding Whether to Build It or Buy It

Adding high-quality entity resolution capabilities to enterprise applications, services, data fabrics or data pipelines can be daunting and expensive. Organizations often invest millions of dollars and years of effort to achieve subpar results. This guide will walk you through the requirements and challenges of implementing entity resolution. By the end, you'll understand what to look for, the most common mistakes and pitfalls to avoid, and your options.

Activating Data from the Lakehouse: Databricks Ventures Invests in Hightouch

databricks

It’s no secret that modern organizations are doubling down on their investments in data - investments that uncover deep customer insights that provide a.

KDnuggets News, August 30: 7 Projects Built with Generative AI • Beyond Numpy and Pandas: Lesser-Known Python Libraries

KDnuggets

7 Projects Built with Generative AI • Beyond Numpy and Pandas: Unlocking the Potential of Lesser-Known Python Libraries • 5 Ways You Can Use ChatGPT’s Code Interpreter For Data Science • GPT-4: 8 Models in One; The Secret is Out

What is Data Observability? 5 Key Pillars To Know

Monte Carlo

Editor’s Note : So much has happened since we first published this post and created the data observability category and Monte Carlo in 2019. We have updated this post to reflect this rapidly maturing space. You can read the original article linked at the bottom of this page. What is Data observability? The five pillars My data observability definition has not changed since I first coined it in 2019: Data observability refers to an organization’s comprehensive understanding of the health an

16+ fascinating Big data examples

InData Labs

The world is generating an unprecedented amount of data every second. From online transactions and social media interactions to sensor readings and scientific research, the sheer volume, velocity, and variety of data have given rise to the concept of “Big data.” This vast ocean of information holds immense potential, capable of revolutionizing industries, driving innovation, The post 16+ fascinating Big data examples first appeared on InData Labs.

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving every day. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

Precisely Women in Technology: Meet Monica Di Martino

Precisely

With an increasing number of women joining IT, it’s becoming a more inclusive environment. Precisely is committed to building a more inclusive work environment, which is why there are ample opportunities for women in the organization. One of the company’s initiatives is the Precisely Women in Technology (PWIT) program, which was established to be a place for women to come together, support each other, offer guidance, and more.

Are reports of StackOverflow’s fall greatly exaggerated?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. In every issue, I cover topics related to Big Tech and startups through the lens of engineering managers and senior engineers. In this article, we cover one out of five topics from today’s subscriber-only The Pulse issue. To get full issues twice a week, subscribe here.

Sunrise: Zalando's developer platform based on Backstage

Zalando Engineering

Introduction Since 2021, Zalando has invested in building a developer portal called Sunrise, which aims to become the starting point for Builders at Zalando. The portal is based on Spotify's Backstage platform with additional extensions built internally. Sunrise enables everyone at Zalando to view and discover information about teams, applications, APIs, events, CI/CD pipelines, infrastructure accounts and costs, and much more.

5 Skills All Marketing Analytics and Data Science Pros Need Today

KDnuggets

Join us at the MADS conference in Washington, D.C., from Sept. 26 to 28, 2023. Learn more here and register with code KDN100 for $100 off your conference pass.

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Organizing Generative AI Teams: 5 Lessons Learned From Data Science

Monte Carlo

You did it! After executive leadership vaguely promised stakeholders that new Gen AI features would be incorporated across the organization, your tiger team sprinted to produce an MVP that checks the box. Integrating that OpenAI API into your application wasn’t that difficult and it may even turn out to be useful. But now what happens? Tiger teams can’t sprint forever.

A step-by-step guide to build an Effective Data Quality Strategy from scratch

Towards Data Science

A Step-by-Step Guide to Building an Effective Data Quality Strategy from Scratch How to build an interpretable data quality framework based on user expectations Photo by Rémi Müller on Unsplash As data engineers, we are (or should be) responsible for the quality of the data we provide. This is nothing new, but every time I join a data project I ask myself the same questions: When should I start working on data quality?

Forging a Data Strategy for Success in Uncertain Times

Precisely

The results are in! The 2023 Data Integrity Trends and Insights Report, published in partnership between Precisely and Drexel University’s LeBow College of Business, delivers groundbreaking insights into the importance of trusted data. For the report, more than 450 data and analytics professionals worldwide were surveyed about the state of their data programs.

Google Shutting down Firebase Dynamic Links

The Pragmatic Engineer

👋 Hi, this is Gergely with a free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Pulse issue. If you’re not yet a full subscriber, you missed this week’s deepdive: The 2023 tech market, as seen by hiring managers. To get full newsletters twice a week, subscribe here.

Demystifying DAPs: A Practical Guide to Digital Adoption Success

Speaker: Pulkit Agrawal

Digital Adoption Platforms (DAPs) are revolutionizing the way organizations interact with and optimize their software applications. As digital transformation continues to accelerate, DAPs have become essential tools for enhancing user engagement and software efficiency. This session is your guide into the robust world of DAPs, exploring their origins, evolution, and the current trends shaping their development.

Dashboard Design That Dazzles Your CEO

FreshBI

Understanding the CEO’s Design Perspective To design a dashboard suited to your CEO, you need to think like a CEO and get into the mind of a CEO. If anyone on the team understands the importance of good design, it's often the CEO. CEOs understand the importance of good design so well that they often take it for granted that the products they build, and the products they surround themselves with, are designed well, for beauty and for function.

Who Will Make Money from the Generative AI Gold Rush?

KDnuggets

Buckle up for the Generative AI gold rush! Will BigTech rule with its picks and shovels? Which startups will strike it rich? Will “copilot for X” be the business strategy to hit pay dirt? How can startups dig moats to keep out other prospectors? And will the US once again have the richest gold seams?

Supercharging your Rust static executables with mimalloc

Tweag

Why link statically against musl? Have you ever faced compatibility issues when dealing with Linux binary executables? The culprit is often the libc implementation, glibc. Acting as the backbone of nearly all Linux distros, glibc is the library responsible for providing standard C functions. Yet, its version compatibility often poses a challenge. Binaries compiled with a newer version of glibc may not function on systems running an older one, creating a compatibility headache.

Robinhood Wallet Adds Support for Bitcoin and Dogecoin, and Enables Ethereum Swaps

Robinhood

Bitcoin and Dogecoin support is now available to all Robinhood Wallet users, and in-app Ethereum Swaps started rolling out today. Since launching to the general public nearly six months ago, Robinhood Wallet has seen significant adoption globally, with hundreds of thousands of users in more than 140 countries worldwide. We are always gathering feedback, and have heard loud and clear that people want access to more coins on more chains.

Deliver Mission Critical Insights in Real Time with Data & Analytics

In the fast-moving manufacturing sector, delivering mission-critical data insights to empower your end users or customers can be a challenge. Traditional BI tools can be cumbersome and difficult to integrate - but it doesn't have to be this way. Logi Symphony offers a powerful and user-friendly solution, allowing you to seamlessly embed self-service analytics, generative AI, data visualization, and pixel-perfect reporting directly into your applications.