Sat.Nov 12, 2022 - Fri.Nov 18, 2022

article thumbnail

Who is Still Hiring Software Engineers and EMs?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get this newsletter every week, subscribe here. This article was updated in December 2022. In the midst of gloomy news about hiring freezes and layoffs, let's highlight companies which are growing  and hiring.

article thumbnail

A Diatribe against Data Contracts and their Abuses.

Confessions of a Data Guy

Ok, so I don’t really mean all that. Or do I? I have no idea what the future holds. Sometimes it’s easy to pick out the winners, like Databricks and Snowflake, you can see, feel, and taste the results of those data products, a delicious and delectable bounty to feast upon. Other things are harder […] The post A Diatribe against Data Contracts and their Abuses. appeared first on Confessions of a Data Guy.

Data 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Enabling The People, Enabling The Data with Kulani Likotsi

Jesse Anderson

My guest this week is Kulani Likotsi , the Head of Data Management and Data Governance at one of the four biggest banks in Africa. She’s had a rising career journey going from an analyst, to a Business Intelligence developer, to the data warehouse team, to the data governance team. I was impressed with Kulani’s volunteer spirit. Whenever there was a need, she volunteered.

article thumbnail

Build Data Products Without A Data Team Using AgileData

Data Engineering Podcast

Summary Building data products is an undertaking that has historically required substantial investments of time and talent. With the rise in cloud platforms and self-serve data technologies the barrier of entry is dropping. Shane Gibson co-founded AgileData to make analytics accessible to companies of all sizes. In this episode he explains the design of the platform and how it builds on agile development principles to help you focus on delivering value.

Building 130
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

The Scoop: Tech Layoffs in 2022

The Pragmatic Engineer

I get a lot of scoop sent by readers (thank you!). Sadly, in 2022, a good part of the scoop is about companies laying off people. Some of this scoop has not been reported before. I don't want to broadcast layoffs on Twitter or LinkedIn continuously, but also don't want this information to be lost. This page collects scoops I receive, some of which might not have been reported elsewhere.

article thumbnail

Introduction to Pandas for Data Science

KDnuggets

The Pandas library is core to any Data Science work in Python. This introduction will walk you through the basics of data manipulating, and features many of Pandas important features.

More Trending

article thumbnail

Taking A Look Under The Hood At CreditKarma's Data Platform

Data Engineering Podcast

Summary CreditKarma builds data products that help consumers take advantage of their credit and financial capabilities. To make that possible they need a reliable data platform that empowers all of the organization’s stakeholders. In this episode Vishnu Venkataraman shares the journey that he and his team have taken to build and evolve their systems and improve the product offerings that they are able to support.

MongoDB 100
article thumbnail

Doing More with Less: 5 Ways Leading Organizations Maximize the Value of their Data

Teradata

"Doing more with less” is a familiar refrain echoing through the halls of many organizations. To answer this call, businesses are searching for efficiency gains & turning to data to unlock savings.

Data 98
article thumbnail

If I Had To Start Learning Data Science Again, How Would I Do It?

KDnuggets

While different ways to learn Data Science for the first time exist, the approach that works for you should be based on how you learn best. One powerful method is to evolve your learning from simple practice into complex foundations, as outlined in this learning path recommended by a physicist who turned into a Data Scientist.

article thumbnail

Write What You Know: Turning Your Apache Kafka® Knowledge into a Technical Talk

Confluent

The call for papers for Kafka Summit London 2023 has opened, and we’re looking to hear about your experiences using and working with Kafka. If you’re stuck looking for ideas on what to talk about, write what you know.

Kafka 82
article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

#Clouderalife Volunteer Spotlight: Glaucia Esppenchutz

Cloudera

Cloudera’s November Volunteer Spotlight is Glaucia Esppenchutz , staff data engineer, based in Lisbon, Portugal. . Glaucia volunteers with Free Code Camp , an organization founded in 2014 that helps aspiring technicians learn to code for free. . Through the creation and publication of videos, articles, and interactive coding lessons — all freely available to the public — Free Code Camp is able to reach and train millions of people annually.

Coding 81
article thumbnail

Move faster, wait less: Improving code review time at Meta

Engineering at Meta

Code reviews are one of the most important parts of the software development process At Meta we’ve recognized the need to make code reviews as fast as possible without sacrificing quality We’re sharing several tools and steps we’ve taken at Meta to reduce the time waiting for code reviews When done well, code reviews can catch bugs , teach best practices , and ensure high code qualit y.

Coding 56
article thumbnail

How LinkedIn Uses Machine Learning To Rank Your Feed

KDnuggets

In this post, you will learn to clarify business problems & constraints, understand problem statements, select evaluation metrics, overcome technical challenges, and design high-level systems.

article thumbnail

How Real-time Healthcare Analytics Helps Improve Patient Care

Striim

It’s a Tuesday night. A nurse in the emergency department (ED) receives an alert on her smartphone: the ED will be overcrowded after 1.5 hours. The alert also gives suggestions, such as the number of beds that will be filled or what type of care will be required. The nurse uses this information to communicate with transport, radiology, and lab teams to make the necessary preparations.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Once Upon a Time in the Land of Data

Cloudera

I recently had the privilege of attending the CDAO event in Boston hosted by Corinium. Tracks represented financial services, insurance, retail and consumer packaged goods, and healthcare. Overall, it struck me that while data science is not new, most firms are still defining the mission of the data office and data officer. It’s clear firms seek to leverage data and embrace its potential insights, but most are forging ahead in largely uncharted territory.

article thumbnail

DataOps Observability: Taming the Chaos (Part 3)

DataKitchen

Part 3: Considering the Elements of Data Journeys. This is the third post in DataKitchen’s four-part series on DataOps Observability. Observability is a methodology for providing visibility of every journey that data takes from source to customer value across every tool, environment, data store, team, and customer so that problems are detected and addressed immediately.

article thumbnail

What To Expect for AI Quality Trends In 2023

KDnuggets

Based on the recent discussions with dozens of Fortune 500 data science teams, we can expect to see a continued spotlight on AI model quality in 2023.

article thumbnail

Artificial Intelligence (AI) in Cloud Computing

U-Next

Introduction . Artificial Intelligence (AI) is a process of programming computers to make decisions for themselves. This technology creates intelligent applications capable of reasoning, learning, and acting independently. Among many things, AI finds innumerable applications in cloud computing. Cloud computing delivers computing services—including servers, storage, databases, networking, software, analytics, and intelligence—over the Internet (“the cloud”) to offer faster innovation

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Unlocking HBase on S3 With the New Store File Tracking Feature

Cloudera

CDP Operational Database (COD) is a real-time auto-scaling operational database powered by Apache HBase and Apache Phoenix. It is one of the main data services that run on Cloudera Data Platform (CDP) Public Cloud. You can access COD from your CDP console. The cost savings of cloud-based object stores are well understood in the industry. Applications whose latency and performance requirements can be met by using an object store for the persistence layer benefit significantly with lower cost of o

article thumbnail

3 Questions with Daniel Kahneman, Author of Thinking, Fast and Slow

Monte Carlo

Last month at IMPACT 2022: The Data Observability Summit, I had the distinct privilege of chatting with Daniel Kahneman, Nobel Prize-winning economist and author of one of my favorite books, Thinking, Fast and Slow. Most notably, Daniel discussed the difference between two major types of thinking: System 1, decision making that operates automatically (say, doing simple multiplication) and System 2, decision making that requires effort and attention (for instance, a complex Calculus problem).

Systems 52
article thumbnail

Git for Data Science Cheatsheet

KDnuggets

Knowing git is no longer an option for data professionals. Grab this handy reference sheet now and make sure you know how to git the job done.

article thumbnail

How Does AI Aid in Creating Sound Business Strategies?

U-Next

Introduction . The usage of AI technology has been on the rise in the business world, especially when it comes to creating business strategies. . Artificial Intelligence (AI) and Machine Learning are currently used by businesses to make their operations more efficient, improve customer experience and achieve better results. As per Artificial Intelligence Statistics 2022 , AI adoption by businesses around the globe continued at a steady pace in 2022, with more than a third of companies (35%) re

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Enriching Streams with Hive tables via Flink SQL

Cloudera

Introduction. Stream processing is about creating business value by applying logic to your data while it is in motion. Many times that involves combining data sources to enrich a data stream. Flink SQL does this and directs the results of whatever functions you apply to the data into a sink. Business use cases, such as fraud detection , advertising impression tracking, health care data enrichment, augmenting financial spend information, GPS device data enrichment, or personalized customer commun

SQL 56
article thumbnail

What is Data Engineering? Why is it a Popular Career Path?

Emeritus

Data has become a pivotal asset for all businesses but it can prove useless if it isn’t leveraged effectively. That’s where data engineering comes in. It lays down the foundation for data science applications by preparing raw data for collection and analysis. Specialized in a practice that mainly focuses on the end application of data… The post What is Data Engineering?

article thumbnail

Research Papers for NLP Beginners

KDnuggets

Read research papers on neural models, word embedding, language modeling, and attention & transformers.

Process 158
article thumbnail

Know Everything About AWK Advanced Filter

U-Next

Introduction . AWK is a scripting language for text processing that allows the user to perform operations on text files based on a set of conditions. The user can select which lines of text to process based on a set of criteria and can perform various operations on the text, such as printing, editing, or deleting. AWK is often used to extract data from text files or perform operations on files too large to be processed by other tools. .

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Write tests smarter, not harder

Booking.com Engineering

In my career, I’ve seen many times how teams started with automated testing. Not all attempts were successful. In this post, I’m going to share a few tips on creating a culture of automated testing in your team, and shaping the journey from zero-tests to a reliable set of tests at different levels. A common way in which some teams approach automated testing is that they set up a target, something like: “In this quarter, we will increase test coverage to X percent”.

Coding 52
article thumbnail

An EDGY approach to designing Enterprises by Oliver Cronk

Scott Logic

I recently attended the Intersection 22 Conference in Stockholm where EDGY was previewed. EDGY is a new graphical design language for visualising enterprises. This new open source design language shows great promise as a tool to bridge across siloed teams. Redefining Enterprise Architecture into a more holistic, cross functional form of “Enterprise Design” that drives shared understanding.

article thumbnail

6 Best Free Online Courses to Learn Python and Boost Your Career

KDnuggets

The demand for Data Scientists who are proficient in Python is at an all time high. Python has helped people boost their careers in finance, consulting, research, software tech, and robotics. Explore 6 courses designed to help you learn Python.

Python 115
article thumbnail

The Tale Of Success You Must Not Miss – Strategic Sales Management

U-Next

If the winners write history, then we are confident every one of our learners is here to make history. With the most robust industry-relevant curriculum delivered by expert faculty and a pedagogy lined by workshops and case studies for a comprehensive hands-on learning experience, our program is the best shot at owning a successful sales career for aspiring sales professionals. .

article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating