Sat. Nov 23, 2019 - Fri. Nov 29, 2019

Open Source Projects by Google, Uber and Facebook for Data Science and AI

KDnuggets

Open source is becoming the standard for sharing and improving technology. Some of the largest organizations in the world, namely Google, Facebook, and Uber, are open-sourcing technologies that they use in their own workflows.

Optimizing Observability with Jaeger, M3, and XYS at Uber

Uber Engineering

When something goes wrong with a piece of code, engineers want to know all the relevant details of the error immediately so they can get right to work remedying the malfunction. However, as technology has advanced, measuring system metrics and … The post Optimizing Observability with Jaeger, M3, and XYS at Uber appeared first on Uber Engineering Blog.

Building A Real Time Event Data Warehouse For Sentry

Data Engineering Podcast

Summary: The team at Sentry has built a platform that lets anyone in the world send software errors and events. As they scaled the volume of customers and data, they began running into the limitations of their initial architecture. To address the needs of their business and continue to improve their capabilities, they settled on ClickHouse as the new storage and query layer.

Conquering Hybrid Cloud with Replicated Event-Driven Architectures

Confluent

Potential advantages of hybrid cloud architectures include avoiding vendor lock-in, increasing system resilience, optimizing costs, and inducing price competition among cloud providers. Hybrid cloud architectures require the ability to securely […].

Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You Need to Know

Speaker: Timothy Chan, PhD, Head of Data Science

Are you ready to move beyond the basics and take a deep dive into the cutting-edge techniques that are reshaping the landscape of experimentation? 🌐 From Sequential Testing to Multi-Armed Bandits, Switchback Experiments to Stratified Sampling, Timothy Chan, Data Science Lead, is here to unravel the mysteries of these powerful methodologies that are revolutionizing how we approach testing.

Getting Started with Automated Text Summarization

KDnuggets

This article will walk through an extractive text summarization process, using a simple word frequency approach, implemented in Python.
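
A minimal sketch of the word-frequency approach described above, not the article's own code: each sentence is scored by summing the corpus-wide frequencies of its non-stopword tokens, and the top-scoring sentences are kept in their original order. The `STOPWORDS` set and the `summarize` function are illustrative assumptions.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; real lists are much longer).
STOPWORDS = {"the", "a", "an", "is", "are", "of", "to", "and", "in", "it", "on", "for"}

def summarize(text, n_sentences=1):
    """Return the n highest-scoring sentences, kept in their original order."""
    # Split on whitespace that follows sentence-ending punctuation.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    # Count every non-stopword token across the whole text.
    words = [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]
    freq = Counter(words)

    def score(sentence):
        tokens = re.findall(r"[a-z']+", sentence.lower())
        return sum(freq[t] for t in tokens if t not in STOPWORDS)

    # Rank sentence indices by score, then restore document order.
    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:n_sentences])
    return " ".join(sentences[i] for i in keep)

text = "Cats sleep a lot. Cats chase mice and cats climb trees. Dogs bark."
print(summarize(text, 1))  # Cats chase mice and cats climb trees.
```

Summing raw frequencies favors long sentences; dividing each score by the sentence length is a common adjustment.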

TDA Announces InData Labs as the Winners in Global Agency Awards

InData Labs

Today we witness new technologies arising and new tendencies influencing the way we work and live. In order to stay competitive, it’s important to be aware of the latest trends, maintain a level of strong expertise, and offer best-in-class services. Our team is always striving to move with the times and keep up with all modern requirements. The post TDA Announces InData Labs as the Winners in Global Agency Awards first appeared on InData Labs.

More Trending

Top KDnuggets tweets, Nov 20-26: How to Speed up Pandas by 4x with one line of code

KDnuggets

Also: Deep Learning for Image Classification with Less Data; How to Speed up Pandas by 4x with one line of code; 25 Useful #Python Snippets to Help in Your Day-to-Day Work; Automated Machine Learning Project Implementation Complexities.

Two Years In The Life of AI, Machine Learning, Deep Learning and Java

KDnuggets

Where does Java stand in the world of artificial intelligence, machine learning, and deep learning? Learn more about how to do these things in Java, and the libraries and frameworks to use.

A Doomed Marriage of Machine Learning and Agile

KDnuggets

Sebastian Thrun, the founder of Udacity, ruined my machine learning project and wedding.

Lit BERT: NLP Transfer Learning In 3 Steps

KDnuggets

PyTorch Lightning is a lightweight framework that allows anyone using PyTorch to scale deep learning code easily while making it reproducible. In this tutorial we’ll use Hugging Face's implementation of BERT to do a fine-tuning task in Lightning.

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

Probability Learning: Naive Bayes

KDnuggets

This post describes various simplifications of Bayes' Theorem that make it more practical and applicable to real-world problems; these simplifications are known as Naive Bayes. To make this concrete, it also walks through an illustrative example of how Naive Bayes can be applied to classification.
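
As a rough illustration of the idea (not the post's own example), the sketch below implements a small multinomial Naive Bayes text classifier: the "naive" simplification is treating words as conditionally independent given the class, so the class score is the log prior plus a sum of per-word log likelihoods. The `train` and `predict` names, the add-one smoothing, and the toy data are assumptions.

```python
import math
from collections import Counter, defaultdict

def train(docs):
    """docs: list of (tokens, label). Returns class counts, per-class word counts, vocabulary."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab

def predict(model, tokens):
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_counts.items():
        # log P(label) + sum of log P(word | label), with add-one (Laplace) smoothing.
        score = math.log(n_docs / total_docs)
        total_words = sum(word_counts[label].values())
        for t in tokens:
            score += math.log((word_counts[label][t] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy training data (hypothetical).
docs = [
    (["free", "prize", "now"], "spam"),
    (["win", "free", "cash"], "spam"),
    (["meeting", "at", "noon"], "ham"),
    (["lunch", "at", "noon"], "ham"),
]
model = train(docs)
print(predict(model, ["free", "cash"]))   # spam
print(predict(model, ["lunch", "noon"]))  # ham
```

Working in log space avoids numerical underflow when many small probabilities are multiplied.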

Top 8 Data Science Use Cases in Marketing

KDnuggets

In this article, we highlight some key data science use cases in marketing. We concentrate on several that are of particular interest and have proven their effectiveness over time.

Would you buy insights from this guy? (How to assess and manage a Data Science vendor)

KDnuggets

With all the hype from data science vendors selling "actionable insights" to boost your company's bottom line, selecting your analytics partner should follow the same careful process as any traditional business endeavor. Follow these questions and best practices to ensure you manage accordingly.

Machine Learning 101: The What, Why, and How of Weighting

KDnuggets

Weighting is a technique for improving models. In this article, learn more about what weighting is, why you should (and shouldn’t) use it, and how to choose optimal weights to minimize business costs.

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

Spark NLP 101: LightPipeline

KDnuggets

A Pipeline is specified as a sequence of stages, and each stage is either a Transformer or an Estimator. These stages are run in order, and the input DataFrame is transformed as it passes through each stage. Now let’s see how this can be done in Spark NLP using Annotators and Transformers.
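
The stage mechanics described above can be sketched in plain Python. This is a toy model of the concept only, not the Spark ML or Spark NLP API: the class names are hypothetical, and a list of strings stands in for a DataFrame. A Transformer exposes `transform`; an Estimator exposes `fit`, which returns a fitted Transformer.

```python
class Lowercase:
    """Transformer: exposes transform() only."""
    def transform(self, rows):
        return [r.lower() for r in rows]

class VocabularyEncoder:
    """Fitted Transformer produced by VocabularyEstimator."""
    def __init__(self, index):
        self.index = index
    def transform(self, rows):
        # Map known words to integer ids; unknown words are dropped.
        return [[self.index[w] for w in r.split() if w in self.index] for r in rows]

class VocabularyEstimator:
    """Estimator: fit() learns from the data and returns a Transformer."""
    def fit(self, rows):
        vocab = sorted({w for r in rows for w in r.split()})
        return VocabularyEncoder({w: i for i, w in enumerate(vocab)})

class PipelineModel:
    """A fitted pipeline: every stage is a Transformer."""
    def __init__(self, stages):
        self.stages = stages
    def transform(self, rows):
        for stage in self.stages:
            rows = stage.transform(rows)
        return rows

class Pipeline:
    def __init__(self, stages):
        self.stages = stages
    def fit(self, rows):
        fitted = []
        for stage in self.stages:
            if hasattr(stage, "fit"):
                stage = stage.fit(rows)   # Estimator becomes a fitted Transformer
            rows = stage.transform(rows)  # data flows on to the next stage
            fitted.append(stage)
        return PipelineModel(fitted)

model = Pipeline([Lowercase(), VocabularyEstimator()]).fit(["Spark NLP", "NLP pipelines"])
print(model.transform(["Spark pipelines rock"]))  # [[2, 1]]
```

Running the stages in order during `fit` matters: the estimator here learns its vocabulary from the already lowercased rows.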

The Future of Careers in Data Science & Analysis

KDnuggets

As the fields of data science and analysis continue to expand, the next crop of bright minds is always needed. Learn more about the nuances of these jobs and find where you can fit in for a rewarding and interesting career.

Markov Chains: How to Train Text Generation to Write Like George R. R. Martin

KDnuggets

Read this article on training Markov chains to generate George R. R. Martin style text.
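
A minimal word-level sketch of the technique, with assumed function names and a placeholder corpus rather than the article's code or data: the chain maps each state (the last `order` words) to the list of words observed to follow it, and generation repeatedly samples a follower.

```python
import random
from collections import defaultdict

def build_chain(text, order=1):
    """Map each tuple of `order` consecutive words to the words that follow it."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        state = tuple(words[i:i + order])
        chain[state].append(words[i + order])
    return chain

def generate(chain, start, length=10, seed=None):
    """Walk the chain from `start`, sampling followers; stop early at a dead end."""
    rng = random.Random(seed)
    state = tuple(start)
    out = list(state)
    for _ in range(length):
        followers = chain.get(state)
        if not followers:
            break
        out.append(rng.choice(followers))
        state = tuple(out[-len(state):])
    return " ".join(out)

chain = build_chain("the night is dark and the night is long")
print(generate(chain, ["the"], length=8, seed=0))
```

Because followers are stored with repetition, frequent continuations are sampled proportionally more often. A higher `order` produces more coherent text but needs a larger corpus.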

Content-based Recommender Using Natural Language Processing (NLP)

KDnuggets

A guide to building a content-based movie recommender model based on NLP.
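
One simple way to realize a content-based recommender, shown here as a sketch under stated assumptions rather than the guide's method: represent each movie description as a bag-of-words vector and recommend the titles with the highest cosine similarity. The catalog, titles, and function names are hypothetical.

```python
import math
from collections import Counter

def vectorize(text):
    """Bag-of-words vector: word -> count."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def recommend(title, catalog, k=1):
    """Return the k titles whose descriptions are most similar to `title`'s."""
    target = vectorize(catalog[title])
    scores = {t: cosine(target, vectorize(d)) for t, d in catalog.items() if t != title}
    return sorted(scores, key=scores.get, reverse=True)[:k]

# Hypothetical catalog of descriptions.
catalog = {
    "Movie A": "space adventure robots",
    "Movie B": "space robots war",
    "Movie C": "romantic comedy paris",
}
print(recommend("Movie A", catalog))  # ['Movie B']
```

Replacing raw counts with TF-IDF weights is a common refinement, since it down-weights words that appear in most descriptions.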

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

Can Neural Networks Develop Attention? Google Thinks they Can

KDnuggets

Google recently published some work about modeling attention mechanisms in deep neural networks.

Task-based effectiveness of basic visualizations

KDnuggets

This is a summary of a recent paper on an age-old topic: what visualisation should I use? No prizes for guessing “it depends!” Is this the paper to finally settle the age-old debate surrounding pie-charts??

AXA, State Auto and Hippo on how to fuse innovative tech to your company

KDnuggets

Join this live webinar: Fast, Agile, Service-Driven Insurance: Fuse Innovative Tech to Your Company DNA - AI, Chatbots, Automation and More, Dec 11 at 10:00am EST, to get actionable insight to develop your strategy.

KDnuggets™ News 19:n45, Nov 27: Interpretable vs black box models; Advice for New and Junior Data Scientists

KDnuggets

This week: Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead; Advice for New and Junior Data Scientists; Python Tuples and Tuple Methods; Can Neural Networks Develop Attention? Google Thinks they Can; Three Methods of Data Pre-Processing for Text Classification.

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

Cartoon: Thanksgiving, Big Data, and Turkey Data Science…

KDnuggets

A classic KDnuggets Thanksgiving cartoon examines the predicament of one group of fowl Data Scientists.

Top Stories, Nov 18-24: How to Speed up Pandas by 4x with one line of code; Python, Selenium & Google for Geocoding Automation: Free and Paid

KDnuggets

Also: Automated Machine Learning Project Implementation Complexities; Text Encoding: A Review; The Notebook Anti-Pattern; Data Science for Managers: Programming Languages; 10 Free Must-read Books on AI.