Thu.Jan 26, 2023

article thumbnail

Apple: The only big tech giant going against the job cuts tide

The Pragmatic Engineer

Comments

307
307
article thumbnail

The ChatGPT Cheat Sheet

KDnuggets

Impress your friends and loved ones by perfecting your ChatGPT prompt engineering game with this incredibly useful resource.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Building a Life Sciences Knowledge Graph with a Data Lake

databricks

This is a collaborative post from Databricks and wisecube.ai. We thank Vishnu Vettrivel, Founder, and Alex Thomas, Principal Data Scientist, for their contributions.

Data Lake 112
article thumbnail

Containerizing the Beast – Hadoop NameNodes in Uber’s Infrastructure

Uber Engineering

We recently containerized Hadoop NameNodes and upgraded hardware, improving NameNode RPC queue time from ~200 to ~20ms – A 10x improvement! With this radical change, Uber’s Hadoop customers are happier and admins rest more at night.

Hadoop 103
article thumbnail

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enahance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

article thumbnail

An Introduction to Markov Chains

KDnuggets

Markov chains are often used to model systems that exhibit memoryless behavior, where the system's future behavior is not influenced by its past behavior.

Systems 108
article thumbnail

Work With Large Monorepos With Sparse Checkout Support in Databricks Repos

databricks

For your data-centered workloads, Databricks offers the best-in-class development experience and gives you the tools you need to adhere to code development best.

Coding 94

More Trending

article thumbnail

Best Practices and Guidance for Cloud Engineers to Deploy Databricks on AWS: Part 2

databricks

This is part two of a three-part series in Best Practices and Guidance for Cloud Engineers to deploy Databricks on AWS. You can.

AWS 90
article thumbnail

Hyperparameter Optimization: 10 Top Python Libraries

KDnuggets

Become familiar with some of the most popular Python libraries available for hyperparameter optimization.

Python 110
article thumbnail

One Minute Map Hacks: 71-75

ArcGIS

Another five hacks in an endless stream of one-minute how-to videos.

98
article thumbnail

Top 8 Data Science Slack Communities to Join in 2023

KDnuggets

Take your Data Science journey to the next level by joining these Slack communities in 2023.

article thumbnail

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

article thumbnail

How to Compare Two Tables For Equality in BigQuery

Towards Data Science

Compare tables and extract their differences with standard SQL Continue reading on Towards Data Science »

article thumbnail

How to Use Terraform with Rockset

Rockset

The goal of this blog post is to provide best practices on how to use terraform to configure Rockset to ingest the data into two collections, and how to setup a view and query lambdas that are used in an application, plus to show the workflow of later updating the query lambdas. This mimics how we use terraform at Rockset to manage Rockset resources.

AWS 52
article thumbnail

What is Data Streaming? A Comprehensive Guide 101

Hevo

Real-time data is the need of the hour for businesses to make timely decisions, especially in cases of fraud detection or customer behavior analysis. Relying on traditional batch processing is not effective now.

Data 52
article thumbnail

Manufacturing Data Ingestion into Snowflake

Snowflake

Accessing data from the manufacturing shop floor is one of the key topics of interest with the majority of cloud platform vendors due to the pace of Industry 4.0 adoption. Industry 4.0, also known as the Fourth Industrial Revolution, refers to the emerging trend of technological transformation in manufacturing and related industries. It involves the integration of advanced technologies such as IoT, AI, and machine learning (ML) into the production process, resulting in “smarter” factories that a

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

To Understand Risk is to Understand the Opportunity to Create Value

Teradata

An organization’s potential for value-creation lies in its ability to embrace the intricate relationship & co-existence between risk & opportunity, & activate this through data-led transformation.

IT 52
article thumbnail

4 Useful BigQuery SQL Functions You May Not Know

Towards Data Science

And how to use them Continue reading on Towards Data Science »

SQL 86
article thumbnail

Podcast: Discovering Data: Build a little, test a little, learn a lot

DataKitchen

The post Podcast: Discovering Data: Build a little, test a little, learn a lot first appeared on DataKitchen.

article thumbnail

Building a Data Lake on PB scale with Apache Spark

Towards Data Science

How we deal with Big Data at Emplifi Continue reading on Towards Data Science »

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Unifying Messaging Experiences across LinkedIn

LinkedIn Engineering

Co-authors:�� Michele Ursino and Joe Xue Introduction At LinkedIn, we believe that an opportunity can arise from just one conversation, so having reliable and powerful messaging capabilities to enable people to have those meaningful and professional conversations is crucial. Over the years, we have evolved our messaging platform to meet the needs of our 900 million members and customers.�� From our legacy asynchronous email-based system to our current system, which features threaded conversation

article thumbnail

Tulip: Modernizing Meta’s data platform

Engineering at Meta

The technical journey discusses the motivations, challenges, and technical solutions employed for warehouse schematization, especially a change to the wire serialization format employed in Meta’s data platform for data interchange related to Warehouse Analytics Logging. Here, we discuss the engineering, scaling, and nontechnical challenges of modernizing Meta’s exabyte-scale data platform by migrating to the new Tulip format.

Bytes 102
article thumbnail

Streaming Big Data Files from Cloud Storage

Towards Data Science

Methods for efficient consumption of large files Photo by Aron Visuals on Unsplash Working with very large files can pose challenges to application developers related to efficient resource management and runtime performance. Text file editors, for example, can be divided into those that can handle large files, and those that make your CPU choke, make your PC freeze, and make you want to scream.

article thumbnail

PIM vs. CMS – What is the difference?

Precisely

With modern customers hungry for product data when making buying decisions, companies looking to boost their digital commerce results recognize the necessity for product content management. The question is – which system is best for disseminating their content to the market? When looking into available options, two platform types typically emerge: Product Information Management (PIM) or a Content Management System (CMS).

Retail 52
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Database Sharding: A Comprehensive Guide 101

Hevo

Nowadays, organizations store every single piece of data associated with their Products, Users, Sales, Marketing, and other departments. The reason behind this is to generate reports or perform analysis. The insights that are gained from the data are invaluable.

article thumbnail

PIM vs. CMS – What is the difference?

Precisely

With modern customers hungry for product data when making buying decisions, companies looking to boost their digital commerce results recognize the necessity for product content management. The question is – which system is best for disseminating their content to the market? When looking into available options, two platform types typically emerge: Product Information Management (PIM) or a Content Management System (CMS).

Retail 52
article thumbnail

Employee-facing Mutual TLS

Pinterest Engineering

Armen Tashjian | Security Engineer, Corporate Security This blog article is the second part of our recently released blog: Enforcing Device AuthN & Compliance at Pinterest. Intro As part of our device authentication and compliance initiative, Pinterest has implemented employee-facing mutual TLS with a custom identity provider in a way that results in a positive user experience.

article thumbnail

11 Predictions Data Experts Have for the Year Ahead

Snowflake

It’s 2023 and with the new year comes an opportunity to drive innovation, growth, and digital transformation with data in the face of ongoing economic turbulence. If Snowflake’s report, How to Win in Today’s Data Economy is any indication, data-driven organizations are poised to emerge as the winners of the year with 77% Data Economy Leaders, which is only 6% of those surveyed, experiencing annual revenue growth versus 36% of Data Economy Laggards, the lowest-performing survey group.

Retail 61
article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.