March, 2023

article thumbnail

5 Machine Learning Skills Every Machine Learning Engineer Should Know in 2023

KDnuggets

Most essential skills are programming, data preparation, statistical analysis, deep learning, and natural language processing.

article thumbnail

5 Advance Projects for Data Science Portfolio

KDnuggets

Work on data analytics, time series, natural language processing, machine learning, and ChatGPT projects to improve your chance of getting hired.

Portfolio 176
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Lyft in Trouble

The Pragmatic Engineer

Originally published on 30 March 2023. 👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get full issues twice a week, subscribe here. Disclaimer: I worked at Uber, Lyft's US competitor, between 2016-2020. As always, I aim to remain independent in my analysis: I hold no positions in any of the companies mentioned in this article, and have not been paid to write ab

article thumbnail

How to get started with dbt

Christophe Blefari

This article is meant to be a resource hub in order to understand dbt basics and to help get started your dbt journey. When I write dbt, I often mean dbt Core. dbt Core is an open-source framework that helps you organise data warehouse SQL transformation. dbt Core has been developed by dbt Labs, which was previously named Fishtown Analytics. The company has been founded in May 2016. dbt Labs also develop dbt Cloud which is a cloud product that hosts and runs dbt Core projects.

article thumbnail

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

article thumbnail

Advanced NumPy: Broadcasting and Strides

Analytics Vidhya

Introduction NumPy is an open-source library in python and a must-learn if you want to enter the data science ecosystem. It is the library underpinning other important libraries such as Pandas, matplotlib, Scipy, scikit-learn, etc. One of the reasons this library is so foundational is because of its array of programming capabilities. Array programming, or […] The post Advanced NumPy: Broadcasting and Strides appeared first on Analytics Vidhya.

Python 269
article thumbnail

Announcing FawltyDeps - a dependency checker for your Python code

Tweag

It is a truth universally acknowledged that the Python packaging ecosystem is in need of a good dependency checker. In the least, it’s our hope to convince you that Tweag’s new dependency checker, FawltyDeps, can help you maintain an environment that is minimal and reproducible for your Python project, by ensuring that required dependencies are explicitly declared and detecting unused dependencies.

Python 136

More Trending

article thumbnail

ChatGPT for Data Science Cheat Sheet

KDnuggets

The latest KDnuggets cheat sheet covers using ChatGPT to your advantage as a data scientist. It's time to master prompt engineering, and here is a handy reference for helping you along the way.

article thumbnail

Is there a drop in software engineer job openings, globally?

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics in today’s subscriber-only The Scoop issue. To get full issues twice a week, subscribe here. There’s plenty of news and anecdotal evidence suggesting the jobs market for software engineers is cooling. In October 2022, I wrote about the start of a Big Tech hiring slowdown.

article thumbnail

How LinkedIn automates cherry-picking commits to improve developer productivity

LinkedIn Engineering

Our developers at LinkedIn are constantly exploring ways to enhance and strengthen our platform, aiming to provide our members and customers with the greatest possible access to knowledge and connections. With approximately 15,000 code repositories, our developers work tirelessly to make thousands of code changes each day, improving functionality and resolving any issues that may arise.

Coding 116
article thumbnail

Top 6 Amazon S3 Interview Questions

Analytics Vidhya

Introduction S3 is Amazon Web Services cloud-based object storage service (AWS). It stores and retrieves large amounts of data, including photos, movies, documents, and other files, in a durable, accessible, and scalable manner. S3 provides a simple web interface for uploading and downloading data and a powerful set of APIs for developers to integrate S3.

article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Snowflake Connector for ServiceNow Available in Public Preview

Snowflake

ServiceNow, Inc. offers a well-known SaaS application, with companies in multiple industries using it to help manage digital workloads for a variety of departments and operations. What if it was as easy as just a few clicks to get ServiceNow data directly into your Snowflake account so you could combine it with other data sources, including ERPs, HRs, and CRMs?

article thumbnail

Streaming Ingestion for Apache Iceberg With Cloudera Stream Processing

Cloudera

Recently, we announced enhanced multi-function analytics support in Cloudera Data Platform (CDP) with Apache Iceberg. Iceberg is a high-performance open table format for huge analytic data sets. It allows multiple data processing engines, such as Flink, NiFi, Spark, Hive, and Impala to access and analyze data in simple, familiar SQL tables. In this blog post, we are going to share with you how Cloudera Stream Processing ( CSP ) is integrated with Apache Iceberg and how you can use the SQL Stream

Process 113
article thumbnail

A Complete Collection of Data Science Free Courses – Part 1

KDnuggets

The first part covers the list of Programming, Web scraping, Statistics & Probability, Data Analytics, SQL, and Business Intelligence free courses.

article thumbnail

Big Tech job-switching stats

The Pragmatic Engineer

👋 Hi, this is Gergely with a bonus, free issue of the Pragmatic Engineer Newsletter. We cover one out of five topics from The Scoop #39 , published two weeks ago, 23 February. To get full newsletters twice a week, subscribe here. I have collaborated with a tech recruiter - they’ve asked to be anonymous - who’s been running some very interesting queries on LinkedIn for software engineers.

article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Confluent Achieves Google Cloud Ready - AlloyDB Designation

Confluent

Confluent announced that it has successfully achieved Google Cloud Ready - AlloyDB Designation for AlloyDB for PostgreSQL, Google Cloud’s newest fully managed PostgreSQL-compatible database service for the most demanding enterprise database workloads.

article thumbnail

Complete Guide to Pub/Sub in Redis

Analytics Vidhya

Introduction Publish and Subscribe is a messaging mechanism having one or a set of senders sending messages and one or a group of receivers receiving these messages. These senders are called Publishers, responsible for publishing these messages, and the receivers are called Subscribers who subscribe to these Publishers to receive their notifications.

article thumbnail

Fine-Tuning Large Language Models with Hugging Face and DeepSpeed

databricks

Large language models (LLMs) are currently in the spotlight following the sensational release of ChatGPT. Many are wondering how to take advantage of.

article thumbnail

ML Training and Deployment Pipeline Using Databricks

Ripple Engineering

Summary Managing the entire lifecycle of a machine learning (ML) model from inception to deployment in production can be a daunting task involving multiple systems and lots of moving parts. At Ripple we have a mix of cloud providers (GCP and AWS) and internally managed tools (Gitlab, Artifactory, Vault etc.), and we needed a managed solution that would help us deliver models to product use cases within a short amount of time, which led us to choose Databricks.

article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Reading Minds with AI: Researchers Translate Brain Waves to Images

KDnuggets

Two researchers from Osaka University were able to reconstruct highly accurate images from human brain activity obtained by fMRI. Read this article if you are curious to find out what all the hype is about.

143
143
article thumbnail

The Collapse of Silicon Valley Bank

The Pragmatic Engineer

It’s been a wild weekend, starting Friday. In case you somehow missed it: we went through the fastest bank run in history, in an event that impacted about half of all VC-funded startups in the US and UK. On Friday night, Silicon Valley Bank (SVB) was shut down by regulators, triggering a weekend of fear and uncertainty for many people and businesses with questions like: “can we make payroll next week?

Banking 180
article thumbnail

What is GPT-4? How it is better than ChatGPT

Edureka

We were already surprised by the wonders ChatGPT has been doing, and now GPT-4 has arrived with features nobody could have ever imagined. These days, one really can’t say what else we are going to explore in the future of language models, as every day is like a new challenge for the developers of ChatGPT. OpenAI has announced the release of its latest large language model, GPT-4.

IT 98
article thumbnail

Top 6 Microsoft HDFS Interview Questions

Analytics Vidhya

Introduction Microsoft Azure HDInsight(or Microsoft HDFS) is a cloud-based Hadoop Distributed File System version. A distributed file system runs on commodity hardware and manages massive data collections. It is a fully managed cloud-based environment for analyzing and processing enormous volumes of data. HDInsight works seamlessly with the Hadoop ecosystem, which includes technologies like MapReduce, Hive, […] The post Top 6 Microsoft HDFS Interview Questions appeared first on Analytics V

Hadoop 246
article thumbnail

The Big Payoff of Application Analytics

Outdated or absent analytics won’t cut it in today’s data-driven applications – not for your end users, your development team, or your business. That’s what drove the five companies in this e-book to change their approach to analytics. Download this e-book to learn about the unique problems each company faced and how they achieved huge returns beyond expectation by embedding analytics into applications.

article thumbnail

Top 30+ Project Management (PMP) Terms - Every Project Manager Should Know

Knowledge Hut

Project management is vital to the success of any company. It is responsible for keeping all project details organized, prioritized, and on track to meet deadlines and ensure quality. It also has a lot of influence over whether or not a project is completed successfully. If you're an entrepreneur looking to build your business, you'll want to ensure your project management has the skills necessary to keep things on track.

Project 98
article thumbnail

Demand and ETR Forecasting at Airports

Uber Engineering

In this post we will dive into the algorithm, data modeling, and system design that go into estimating the length of time drivers would have to wait for a trip request at a given location, empowering them to strategically remain or reposition.

article thumbnail

A Complete Collection of Data Science Free Courses – Part 2

KDnuggets

The second part covers the list of Machine Learning, Deep Learning, Computer Vision, Natural Language Processing, Data Engineering, and MLOps.

article thumbnail

Amazon doubling down on return to office

The Pragmatic Engineer

Comments

273
273
article thumbnail

The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Communication

Speaker: David Bard, Principal at VP Product Coaching

In the fast-paced world of digital innovation, success is often accompanied by a multitude of challenges - like the pitfalls lurking at every turn, threatening to derail the most promising projects. But fret not, this webinar is your key to effective product development! Join us for an enlightening session to empower you to lead your team to greater heights.

article thumbnail

How Will Artificial Intelligence Help Good Managers Become Great?

U-Next

Introduction – Adaptation and Evolution of AI in Management Several businesses use Machine Learning and Artificial Intelligence in management. The most significant AI tools are based on a vast amount of data, recognizing patterns, learning from them, and making definitive predictions. AI is becoming popular in project management because of its exceptional capacity to track particular trends and predict project situations and results.

article thumbnail

Data Science Blogathon 30th Edition- Women in Data Science

Analytics Vidhya

The Biggest Data Science Blogathon is now live! “Knowledge is power. Sharing knowledge is the key to unlocking that power.”― Martin Uzochukwu Ugwu Analytics Vidhya is back with the largest data-sharing knowledge competition- The Data Science Blogathon. This 30th edition of the Data Science Blogathon is particularly very important because we are celebrating the women in […] The post Data Science Blogathon 30th Edition- Women in Data Science appeared first on Analytics Vidhya.

article thumbnail

Ready for Data Transformation but Don’t Know Where to Start? Start Here.

The Modern Data Company

Not Getting Value from Your Data Transformation? Fix it Download (PDF) The post Ready for Data Transformation but Don’t Know Where to Start? Start Here. appeared first on TheModernDataCompany.

Data 98
article thumbnail

Setting Uber’s Transactional Data Lake in Motion with Incremental ETL Using Apache Hudi

Uber Engineering

Uber’s Global Data Warehouse team leveraged Apache Hudi to drastically improve performance of traditional batch ETL pipelines by going incremental, improving business-critical data’s freshness, quality, and completeness.

article thumbnail

Driving Business Impact for PMs

Speaker: Jon Harmer, Product Manager for Google Cloud

Move from feature factory to customer outcomes and drive impact in your business! This session will provide you with a comprehensive set of tools to help you develop impactful products by shifting from output-based thinking to outcome-based thinking. You will deepen your understanding of your customers and their needs as well as identifying and de-risking the different kinds of hypotheses built into your roadmap.