Top Data Engineering Digest Data Management Big Data Content for 2019

2019

Open Source Projects by Google, Uber and Facebook for Data Science and AI

KDnuggets

NOVEMBER 28, 2019

Open source is becoming the standard for sharing and improving technology. Some of the largest organizations in the world, namely Google, Facebook and Uber, are open sourcing the technologies they use in their own workflows.

Data Science

Data Science Project Technology Data

Uber Infrastructure in 2019: Improving Reliability, Driving Customer Satisfaction

Uber Engineering

DECEMBER 19, 2019

Every day around the world, millions of trips take place across the Uber network, giving users more reliable transportation through ridesharing, bikes, and scooters, drivers and truckers additional opportunities to earn, employees and employers more convenient business travel, and hungry … The post Uber Infrastructure in 2019: Improving Reliability, Driving Customer Satisfaction appeared first on Uber Engineering Blog.

Transportation

Transportation Engineering Architecture

Trending Sources

Introducing ksqlDB

Confluent

NOVEMBER 20, 2019

Today marks a new release of KSQL, one so significant that we’re giving it a new name: ksqlDB. Like KSQL, ksqlDB remains freely available and community licensed, and you can […].

IT Process Management

Engineering a Studio Quality Experience With High-Quality Audio at Netflix

Netflix Tech

MAY 1, 2019

by Guillaume du Pontavice, Phill Williams and Kylee Peña (on behalf of our Streaming Algorithms, Audio Algorithms, and Creative Technologies teams) Remember the epic opening sequence of Stranger Things 2? The thrill of that car chase through Pittsburgh not only introduced a whole new set of mysteries, but it returned us to a beloved and dangerous world alongside Dustin, Lucas, Mike, Will and Eleven.

Engineering

Engineering Algorithm Media Entertainment

The AI Superhero Approach to Product Management

Speaker: Conrado Morlan

In this engaging and witty talk, we’ll explore how artificial intelligence can transform the daily tasks of product managers into streamlined, efficient processes. Using the lens of a superhero narrative, we’ll uncover how AI can be the ultimate sidekick, aiding in decision-making, enhancing productivity, and boosting innovation. Attendees will leave with practical tools and actionable insights, motivated to embrace AI and leverage its potential in their work.

Management

Our Commitment to Open Source Software

Cloudera

JULY 10, 2019

Open source has been core to the missions of both Hortonworks and Cloudera and central to our values and culture. With more than 700 engineers in the new Cloudera, our company writes a prodigious amount of open source code each year that’s contributed to more than 30 different open source projects. We’re also a very innovative open source company, having collectively launched more than a dozen new open source projects since the founding of the two companies.

Consulting

Consulting Kafka Project Data Science

What Is the Biggest Challenge Facing CMOs Today? Building, Measuring, and Maintaining Brand Equity.

Teradata

MAY 1, 2019

Teradata CMO Martyn Etherington discusses how brands can build, measure, and maintain brand equity. He also explains why customer experience is critical to a brand's success.

Building

10 Free Top Notch Machine Learning Courses

KDnuggets

DECEMBER 6, 2019

Are you interested in studying machine learning over the holidays? This collection of 10 free top notch courses will allow you to do just that, with something for every approach to improving your machine learning skills.

Machine Learning

Machine Learning Deep Learning Python

More Trending

10 Best and Free Machine Learning Courses, Online

KDnuggets

DECEMBER 26, 2019

Getting ready to leap into the world of Data Science? Consider these top machine learning courses curated by experts to help you learn and thrive in this exciting field.

Machine Learning

Machine Learning Data Science Deep Learning Education

Data Science Curriculum Roadmap

KDnuggets

DECEMBER 3, 2019

What follows is a set of broad recommendations, and it will inevitably require a lot of adjustments in each implementation. Given that caveat, here are our curriculum recommendations.

Data Science

Data Science Data IT Education

What is the most important question for Data Science (and Digital Transformation)

KDnuggets

DECEMBER 31, 2019

With so many buzzwords surrounding AI and machine learning, understanding which can bring business value and which are best left in the lab to mature is difficult. While machine learning offers significant power in driving digital transformations, a business must start with the right questions and leave the math to the development teams.

Data Science

Data Science Machine Learning Data

Uber’s Data Platform in 2019: Transforming Information to Intelligence

Uber Engineering

DECEMBER 17, 2019

Uber’s busy 2019 included our billionth delivery of an Uber Eats order, 24 million miles covered by bike and scooter riders on our platform, and trips to top destinations such as the Empire State Building, the Eiffel Tower, and the … The post Uber’s Data Platform in 2019: Transforming Information to Intelligence appeared first on Uber Engineering Blog.

Data

Data Engineering Building Big Data

Provide Real Value in Your Applications with Data and Analytics

The complexity of financial data, the need for real-time insight, and the demand for user-friendly visualizations can seem daunting when it comes to analytics - but there is an easier way. With Logi Symphony, we aim to turn these challenges into opportunities. Our platform empowers you to seamlessly integrate advanced data analytics, generative AI, data visualization, and pixel-perfect reporting into your applications, transforming raw data into actionable insights.

Raw Data

Interpretability part 3: LIME and SHAP

KDnuggets

DECEMBER 19, 2019

The third part in a series on leveraging techniques to take a look inside the black box of AI, this guide considers methods that try to explain each prediction instead of establishing a global explanation.

Getting Started with Automated Text Summarization

KDnuggets

NOVEMBER 28, 2019

This article will walk through an extractive text summarization process, using a simple word frequency approach, implemented in Python.

Python

Python Process
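
As a rough illustration of the word-frequency approach that the summarization entry above describes, the sketch below scores each sentence by the normalized frequencies of its words and keeps the highest-scoring ones. It is not the code from the linked article: the function names, the tiny stop-word list, and the naive sentence splitter are all assumptions made for this example, and it uses only the Python standard library.

```python
import re
from collections import Counter

# Small illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "and", "are", "as", "by", "for", "in", "is",
              "it", "of", "on", "that", "the", "to", "with"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase the sentence and return its words, minus stop words."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOP_WORDS]


def summarize(text, num_sentences=2):
    """Return the num_sentences highest-scoring sentences in original order."""
    sentences = split_sentences(text)
    freq = Counter(w for s in sentences for w in tokenize(s))
    if not freq:
        return ""
    top = max(freq.values())

    def score(sentence):
        # Sum of word frequencies, normalized by the most frequent word.
        return sum(freq[w] / top for w in tokenize(sentence))

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:num_sentences])  # restore document order
    return " ".join(sentences[i] for i in chosen)


if __name__ == "__main__":
    sample = ("Cats sleep a lot. Cats and dogs are popular pets. "
              "The weather is mild today.")
    print(summarize(sample, num_sentences=1))
```

Summing scores favors longer sentences; dividing each score by the sentence's word count is one common adjustment if that bias is unwanted.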

The 4 fastest ways not to get hired as a data scientist

KDnuggets

DECEMBER 18, 2019

Ready to try to get hired as a data scientist for the first time? Avoiding these common mistakes won’t guarantee an offer, but not avoiding them is a surefire way for your application to be tossed into the trash bin.

Data

Explainability: Cracking open the black box, Part 1

KDnuggets

DECEMBER 4, 2019

What is Explainability in AI and how can we leverage different techniques to open the black box of AI and peek inside? This practical guide offers a review and critique of the various techniques of interpretability.

Entity Resolution: Your Guide to Deciding Whether to Build It or Buy It

Adding high-quality entity resolution capabilities to enterprise applications, services, data fabrics or data pipelines can be daunting and expensive. Organizations often invest millions of dollars and years of effort to achieve subpar results. This guide will walk you through the requirements and challenges of implementing entity resolution. By the end, you'll understand what to look for, the most common mistakes and pitfalls to avoid, and your options.

Optimizing Observability with Jaeger, M3, and XYS at Uber

Uber Engineering

NOVEMBER 26, 2019

When something goes wrong with a piece of code, engineers want to know all the relevant details of the error immediately so they can get right to work remedying the malfunction. However, as technology has advanced, measuring system metrics and … The post Optimizing Observability with Jaeger, M3, and XYS at Uber appeared first on Uber Engineering Blog.

Engineering

Engineering Coding Technology Systems

Top KDnuggets tweets, Nov 20-26: How to Speed up Pandas by 4x with one line of code

KDnuggets

NOVEMBER 27, 2019

Also: Deep Learning for Image Classification with Less Data; How to Speed up Pandas by 4x with one line of code; 25 Useful #Python Snippets to Help in Your Day-to-Day Work; Automated Machine Learning Project Implementation Complexities.

Coding

Coding Deep Learning Machine Learning Python

The Essential Toolbox for Data Cleaning

KDnuggets

DECEMBER 5, 2019

Increase your confidence to perform data cleaning with a broader perspective of what datasets typically look like, and follow this toolbox of code snippets to make your data cleaning process faster and more efficient.

Datasets

Datasets Data Coding Process

Productionizing Distributed XGBoost to Train Deep Tree Models with Large Data Sets at Uber

Uber Engineering

DECEMBER 10, 2019

Michelangelo, Uber’s machine learning (ML) platform, powers machine learning model training across various use cases at Uber, such as forecasting rider demand, fraud detection, food discovery and recommendation for Uber Eats, and improving the accuracy of … The post Productionizing Distributed XGBoost to Train Deep Tree Models with Large Data Sets at Uber appeared first on Uber Engineering Blog.

Food

Food Machine Learning Data Engineering

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Speaker: Maher Hanafi, VP of Engineering at Betterworks & Tony Karrer, CTO at Aggregage

Executive leaders and board members are pushing their teams to adopt Generative AI to gain a competitive edge, save money, and otherwise take advantage of the promise of this new era of artificial intelligence. There's no question that it is challenging to figure out where to focus and how to advance when it’s a new field that is evolving every day. 💡 This new webinar featuring Maher Hanafi, VP of Engineering at Betterworks, will explore a practical framework to transform Generative AI pr

Data Collection

Alternative Cloud Hosted Data Science Environments

KDnuggets

DECEMBER 19, 2019

Over the years, new alternative providers have arisen to provide a single data science environment hosted in the cloud for data scientists to analyze, host and share their work.

Data Science

Data Science Cloud Data Cloud Computing

Automatic Text Summarization in a Nutshell

KDnuggets

DECEMBER 18, 2019

Marketing scientist Kevin Gray asks Dr. Anna Farzindar of the University of Southern California about Automatic Text Summarization and the various ways it is used.

A Non-Technical Reading List for Data Science

KDnuggets

DECEMBER 2, 2019

The world still cannot be reduced to numbers on a page because human beings are still the ones making all the decisions. So, the best data scientists understand the numbers and the people. Check out these great data science books that will make you a better data scientist without delving into the technical details.

Data Science

Data Science Data

Google’s New Explainable AI Service

KDnuggets

DECEMBER 20, 2019

Google has started offering a new service for “explainable AI” or XAI, as it is fashionably called. Presently offered tools are modest, but the intent is in the right direction.

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD., Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Data Science

10 Free Must-read Books on AI

KDnuggets

NOVEMBER 5, 2019

Artificial Intelligence continues to fill the media headlines while scientists and engineers rapidly expand its capabilities and applications. With such explosive growth in the field, there is a great deal to learn. Dive into these 10 free books that are must-reads to support your AI study and work.

Media

Media Engineering IT

Everything a Data Scientist Should Know About Data Management

KDnuggets

OCTOBER 22, 2019

For full-stack data science mastery, you must understand data management along with all the bells and whistles of machine learning. This high-level overview is a road map for the history and current state of the expansive options for data storage and infrastructure solutions.

Data Management

Data Management Management Data Storage Machine Learning

10 Free Top Notch Natural Language Processing Courses

KDnuggets

OCTOBER 7, 2019

Are you looking to learn natural language processing? This collection of 10 free top notch courses will allow you to do just that, with something for every approach to learning NLP and its varied topics.

Process

Process IT

How to Speed up Pandas by 4x with one line of code

KDnuggets

NOVEMBER 12, 2019

While Pandas is the library for data processing in Python, it isn't really built for speed. Learn more about the new library, Modin, developed to distribute Pandas' computation to speed up your data prep.

Coding

Coding Python Data Process Process

Demystifying DAPs: A Practical Guide to Digital Adoption Success

Speaker: Pulkit Agrawal

Digital Adoption Platforms (DAPs) are revolutionizing the way organizations interact with and optimize their software applications. As digital transformation continues to accelerate, DAPs have become essential tools for enhancing user engagement and software efficiency. This session is your guide into the robust world of DAPs, exploring their origins, evolution, and the current trends shaping their development.

Certification

Plotnine: Python Alternative to ggplot2

KDnuggets

DECEMBER 12, 2019

Python's plotting libraries, such as matplotlib and seaborn, do allow the user to create elegant graphics, but the lack of a standardized syntax for implementing the grammar of graphics, compared to the simple, readable, layered approach of ggplot2 in R, makes this more difficult in Python.

Python

Python IT Data Science Data

Which Data Science Skills are core and which are hot/emerging ones?

KDnuggets

SEPTEMBER 17, 2019

We identify two main groups of Data Science skills: A: 13 core, stable skills that most respondents have and B: a group of hot, emerging skills that most do not have (yet) but want to add. See our detailed analysis.

Data Science

Data Science Data Deep Learning Scala

Convolutional Neural Networks: A Python Tutorial Using TensorFlow and Keras

KDnuggets

JULY 26, 2019

Different neural network architectures excel in different tasks. This particular article focuses on crafting convolutional neural networks in Python using TensorFlow and Keras.

Python

Python Architecture

Nothing but NumPy: Understanding & Creating Neural Networks with Computational Graphs from Scratch

KDnuggets

AUGUST 23, 2019

Entirely implemented with NumPy, this extensive tutorial provides a detailed review of neural networks followed by guided code for creating one from scratch with computational graphs.

Coding

Coding Python

Deliver Mission Critical Insights in Real Time with Data & Analytics

In the fast-moving manufacturing sector, delivering mission-critical data insights to empower your end users or customers can be a challenge. Traditional BI tools can be cumbersome and difficult to integrate - but it doesn't have to be this way. Logi Symphony offers a powerful and user-friendly solution, allowing you to seamlessly embed self-service analytics, generative AI, data visualization, and pixel-perfect reporting directly into your applications.

Data Analytics
