November, 2022

article thumbnail

Who is Still Hiring Software Engineers and EMs?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. This article was updated in December 2022. In the midst of gloomy news about hiring freezes and layoffs, let's highlight companies which are growing  and hiring.

article thumbnail

Data News — Week 22.47

Christophe Blefari

Capturing the news ( credits ) Hello you, I hope this data news finds you well. Time flies to be honest. I've launched in a rush an Advent of Data. The goal is simple, in December: 24 data people will produce 24 data gems. Every day a new piece of content will be release on a dedicated website. If you wanna join the initiative please reply, we are still looking for a few slots to be filled in.

Data 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

DuckDB: Getting started for Beginners

Marc Lamberti

DuckDB is an in-process OLAP DBMS written in C++ blah blah blah, too complicated. Let’s start simple, shall we? DuckDB is the SQLite for Analytics. It has no dependencies, is extremely easy to set up, and is optimized to perform queries on data. In this hands-on tutorial, you will learn what DuckDB is, how to use it, and why it is essential for you.

Python 130
article thumbnail

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet

Data Engineering Podcast

Summary The problems that are easiest to fix are the ones that you prevent from happening in the first place. Sifflet is a platform that brings your entire data stack into focus to improve the reliability of your data assets and empower collaboration across your teams. In this episode CEO and founder Salma Bakouk shares her views on the causes and impacts of "data entropy" and how you can tame it before it leads to failures.

Data 130
article thumbnail

A Diatribe against Data Contracts and their Abuses.

Confessions of a Data Guy

Ok, so I don’t really mean all that. Or do I? I have no idea what the future holds. Sometimes it’s easy to pick out the winners, like Databricks and Snowflake, you can see, feel, and taste the results of those data products, a delicious and delectable bounty to feast upon. Other things are harder […] The post A Diatribe against Data Contracts and their Abuses. appeared first on Confessions of a Data Guy.

Data 130
article thumbnail

Enabling The People, Enabling The Data with Kulani Likotsi

Jesse Anderson

My guest this week is Kulani Likotsi , the Head of Data Management and Data Governance at one of the four biggest banks in Africa. She’s had a rising career journey going from an analyst, to a Business Intelligence developer, to the data warehouse team, to the data governance team. I was impressed with Kulani’s volunteer spirit. Whenever there was a need, she volunteered.

Data 130

More Trending

article thumbnail

Data News — Week 22.46

Christophe Blefari

Scracthing the surface ( credits ) Hey you, a new Friday means data news. This week feels a bit like old data news with a variety of articles on different cool topics while I navigate through the actual data trends. Next Monday I'll present "How to build a data dream team" at Y42 meetup. I'll share in next week edition a written form of my talk.

Data 130
article thumbnail

How Much Math Do You Need in Data Science?

KDnuggets

There exist so many great computational tools available for Data Scientists to perform their work. However, mathematical skills are still essential in data science and machine learning because these tools will only be black-boxes for which you will not be able to ask core analytical questions without a theoretical foundation.

article thumbnail

A Look At The Data Systems Behind The Gameplay For League Of Legends

Data Engineering Podcast

Summary The majority of blog posts and presentations about data engineering and analytics assume that the consumers of those efforts are internal business users accessing an environment controlled by the business. In this episode Ian Schweer shares his experiences at Riot Games supporting player-focused features such as machine learning models and recommeder systems that are deployed as part of the game binary.

Data 130
article thumbnail

Seeing through hardware counters: a journey to threefold performance increase

Netflix Tech

By Vadim Filanovsky and Harshad Sane In one of our previous blogposts, A Microscope on Microservices we outlined three broad domains of observability (or “levels of magnification,” as we referred to them)?—?Fleet-wide, Microservice and Instance. We described the tools and techniques we use to gain insight within each domain. There is, however, a class of problems that requires an even stronger level of magnification going deeper down the stack to introspect CPU microarchitecture.

Bytes 145
article thumbnail

How to Build and Deploy Scalable Machine Learning in Production with Apache Kafka

Confluent

Apache Kafka’s Streams API embeds Machine Learning into any app or microservice (Java, Docker, Kubernetes, etc.) to add business value.

article thumbnail

The Scoop: Tech Layoffs in 2022

The Pragmatic Engineer

I get a lot of scoop sent by readers (thank you!). Sadly, in 2022, a good part of the scoop is about companies laying off people. Some of this scoop has not been reported before. I don't want to broadcast layoffs on Twitter or LinkedIn continuously, but also don't want this information to be lost. This page collects scoops I receive, some of which might not have been reported elsewhere.

article thumbnail

Data News — Week 22.45

Christophe Blefari

Mastodon and Hadoop are on a boat. ( credits ) Hey you, 11th of November was usually off for me. Since I've started my freelancing activities I don't really follow the usual calendar, working whenever I need/want. I mainly work 3 to 4 days a week. Which is awesome but it has a major drawback I never took a break longer than 1 week. Which, yeah, kinda sucks.

Data 130
article thumbnail

Introduction to Pandas for Data Science

KDnuggets

The Pandas library is core to any Data Science work in Python. This introduction will walk you through the basics of data manipulating, and features many of Pandas important features.

article thumbnail

Build Data Products Without A Data Team Using AgileData

Data Engineering Podcast

Summary Building data products is an undertaking that has historically required substantial investments of time and talent. With the rise in cloud platforms and self-serve data technologies the barrier of entry is dropping. Shane Gibson co-founded AgileData to make analytics accessible to companies of all sizes. In this episode he explains the design of the platform and how it builds on agile development principles to help you focus on delivering value.

Data 130
article thumbnail

Machine Learning for Fraud Detection in Streaming Services

Netflix Tech

By Soheil Esmaeilzadeh , Negin Salajegheh , Amir Ziai , Jeff Boote Introduction Streaming services serve content to millions of users all over the world. These services allow users to stream or download content across a broad category of devices including mobile phones, laptops, and televisions. However, some restrictions are in place, such as the number of active devices, the number of streams, and the number of downloaded titles.

article thumbnail

Stream Processing, CEP, Event Sourcing, and Data Streaming Explained

Confluent

What is stream processing, or complex event processing (CEP), and how does it work? Learn about real-time data and event stream analytics in this tutorial.

Process 126
article thumbnail

Cruel Changes at Twitter

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. Last Thursday, I covered the turmoil at Twitter , of how people worked long hours through the weekend and how most expected layoffs of about 50%.

article thumbnail

Doing More with Less: 5 Ways Leading Organizations Maximize the Value of their Data

Teradata

"Doing more with less” is a familiar refrain echoing through the halls of many organizations. To answer this call, businesses are searching for efficiency gains & turning to data to unlock savings.

Data 98
article thumbnail

If I Had To Start Learning Data Science Again, How Would I Do It?

KDnuggets

While different ways to learn Data Science for the first time exist, the approach that works for you should be based on how you learn best. One powerful method is to evolve your learning from simple practice into complex foundations, as outlined in this learning path recommended by a physicist who turned into a Data Scientist.

article thumbnail

How DoorDash Secures Data Transfer Between Cloud and On-Premise Data Centers

DoorDash Engineering

As DoorDash’s business grows, engineers strive for a better network infrastructure to ensure more third-party services could be integrated into our system while keeping data securely transmitted. Due to security and compliance concerns, some vendors handling such sensitive data cannot expose services to the public Internet and therefore host their own on-premise data centers.

Cloud 97
article thumbnail

For your eyes only: improving Netflix video quality with neural networks

Netflix Tech

by Christos G. Bampis , Li-Heng Chen and Zhi Li When you are binge-watching the latest season of Stranger Things or Ozark, we strive to deliver the best possible video quality to your eyes. To do so, we continuously push the boundaries of streaming video quality and leverage the best video technologies. For example, we invest in next-generation, royalty-free codecs and sophisticated video encoding optimizations.

article thumbnail

Apache Kafka Goes 1.0

Confluent

The mission-critical deployments, the robust feature set, the long history all say that Kafka is an Enterprise-capable product. Apache Kafka is going 1.0!

Kafka 105
article thumbnail

Cloudera and Generation Partner to Reskill the Tech Talent of Tomorrow

Cloudera

Better job opportunities are life-changing, but the lack of accessible job training and placement opportunities can make landing the right opportunity very difficult, if not even impossible at times. . Generation is an economic mobility nonprofit working to prepare, place, and support people into life-changing careers that would otherwise be inaccessible. .

article thumbnail

Teradata Recognized as a Designated Member of the Amazon SageMaker Ready Program

Teradata

Teradata has joined the Amazon SageMaker Ready Program which differentiates Teradata as an AWS Partner Network member with a product that works with Amazon SageMaker & fully supports AWS customers.

article thumbnail

3 Useful Python Automation Scripts

KDnuggets

The post highlights three useful applications of using python to automate simple desktop tasks. Stay tuned till the end of the post to find the reference for a bonus resource.

Python 144
article thumbnail

Key Benefits of HR Digital Transformation

Analytics Training

Introduction to Digital HR . Digital HR refers to using technology, including software and apps, to improve how a company manages its employees. There are many digital transformation benefits associated with HR. The goal is to make it easier for businesses and their employees to connect, collaborate, share information and make decisions. . These are some of the top digital transformation statistics for HR and L&D: . 71% spend about a quarter of their time on social media to share human re

article thumbnail

Helping VFX studios pave a path to the cloud

Netflix Tech

By: Peter Cioni (Netflix), Alex Schworer (Netflix), Mac Moore (Conductor Tech.), Rachel Kelley (AWS), Ranjit Raju (AWS) Rendering is core to the the VFX process VFX studios around the world create amazing imagery for Netflix productions. Nearly every show that is produced today includes digital visual effects, from the creatures in Stranger Things , to recreating historic London in Bridgerton.

Cloud 99
article thumbnail

Running Kafka Streams Applications in AWS

Confluent

Zalando shares their experience and lessons learned running real-time Apache Kafka streams applications built in production on Amazon Web Services (AWS). .

Kafka 98
article thumbnail

When Private Cloud is the Right Fit for Public Sector Missions

Cloudera

It’s no secret that IT modernization is a top priority for the US federal government. A quick trip in the congressional time machine to revisit 2017’s Modernizing Government Technology Act surfaces some of the most salient points regarding agencies’ challenges: The federal government spends nearly 75% of its annual information technology funding on operating and maintaining existing legacy information technology systems.

Cloud 90
article thumbnail

Enabling static analysis of SQL queries at Meta

Engineering at Meta

UPM is our internal standalone library to perform static analysis of SQL code and enhance SQL authoring. UPM takes SQL code as input and represents it as a data structure called a semantic tree. Infrastructure teams at Meta leverage UPM to build SQL linters, catch user mistakes in SQL code, and perform data lineage analysis at scale. Executing SQL queries against our data warehouse is important to the workflows of many engineers and data scientists at Meta for analytics and monitoring use cases

SQL 74
article thumbnail

How LinkedIn Uses Machine Learning To Rank Your Feed

KDnuggets

In this post, you will learn to clarify business problems & constraints, understand problem statements, select evaluation metrics, overcome technical challenges, and design high-level systems.

article thumbnail

Impact of Digitization on HR Services and Processes

Analytics Training

Introduction to Digitization in Human Resources . Digitization in HR services is of utmost importance to an organization. It is a critical and strategic function that aims to optimize the workforce to meet business goals. The HR functions and processes have been evolving with advances in technology, changing consumer behavior patterns, and increasing globalization of markets.

Process 72
article thumbnail

Consistent caching mechanism in Titus Gateway

Netflix Tech

by Tomasz Bak and Fabio Kung Introduction Titus is the Netflix cloud container runtime that runs and manages containers at scale. In the time since it was first presented as an advanced Mesos framework, Titus has transparently evolved from being built on top of Mesos to Kubernetes, handling an ever-increasing volume of containers. As the number of Titus users increased over the years, the load and pressure on the system increased substantially.

Systems 83