Sat.Nov 28, 2020 - Fri.Dec 04, 2020

article thumbnail

A Data Scientist in Engineering Wonderland

Team Data Science

As a data scientist, I always felt a missing link between my developed models and putting them in the production process. Yes, I can create a pipeline, write a model, get results, and interpret the results, but if I cannot scale it, these all will sit on my Jupiter notebooks. This thought led me to my data engineering adventure. I am confident that learning data engineering will make me a better data scientist.

article thumbnail

Streaming Data Integration Without The Code at Equalum

Data Engineering Podcast

Summary The first stage of every good pipeline is to perform data integration. With the increasing pace of change and the need for up to date analytics the need to integrate that data in near real time is growing. With the improvements and increased variety of options for streaming data engines and improved tools for change data capture it is possible for data teams to make that goal a reality.

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Project Metamorphosis Month 8: Complete Apache Kafka in Confluent Cloud

Confluent

This is the eighth and final month of Project Metamorphosis: an initiative that brings the best characteristics of modern cloud-native data systems to the Apache Kafka® ecosystem, served from Confluent […].

Kafka 97
article thumbnail

2020 Data Impact Award Winner Spotlight: Rush University Medical Center

Cloudera

After a tumultuous year, the final award category at the Data Impact Awards was a much needed pick me up for everyone in attendance. Showcasing some of the most inspiring and uplifting use cases of Cloudera’s technology, The Data for Good category recognizes organizations that are tackling the challenging issues affecting society and the planet — and we all know there are plenty of them in 2020!

Medical 76
article thumbnail

From Developer Experience to Product Experience: How a Shared Focus Fuels Product Success

Speaker: Anne Steiner and David Laribee

As a concept, Developer Experience (DX) has gained significant attention in the tech industry. It emphasizes engineers’ efficiency and satisfaction during the product development process. As product managers, we need to understand how a good DX can contribute not only to the well-being of our development teams but also to the broader objectives of product success and customer satisfaction.

article thumbnail

Risk-Based Wealth Management: What the Insurance Industry Gets Wrong

Teradata

Product-centric processes degrade customer experience. Insurers must insulate consumers from internal & regulatory-driven controls by placing them in the center of the customer experience.

article thumbnail

Open Source Highlight: Klio

Data Council

Klio is a framework for easy large-scale processing and ML research on binary files, such as audio files -- its original use case. As a matter of fact, it was developed for audio intelligence at Spotify, which open-sourced it earlier this year at the 2020 International Society for Music Information Retrieval Conference.

Process 52

More Trending

article thumbnail

#ClouderaLife: Unplugged

Cloudera

It’s a trick as old as time… or at least as old as technology. We all know that step one to solving for any tech issue is to turn it off and then turn it back on again. But would it solve for issues in advance of them happening? And could this work not only for technology but for the people behind the technology? Our leadership team decided to explore that theory.

article thumbnail

Data and Strategic Alignment in the Bank of the Future

Teradata

Strategic alignment is a fundamental building block for the bank of the future. It must rest on integrated data & financial data analysis that inform each stage on the enterprise value chain.

Banking 52
article thumbnail

5 things you should know about Real-Time Analytics

A Cloud Guru: Data Engineering

Running analytics on real-time data is a challenge many data engineers are facing today. But not all analytics can be done in real time! Many are dependent on the volume of the data and the processing requirements. Even logic conditions are becoming a bottleneck. For example, think about join operations on huge tables with more […] The post 5 things you should know about Real-Time Analytics appeared first on A Cloud Guru.

article thumbnail

Ensure Data Quality and Data Evolvability with a Secured Schema Registry

Confluent

Organizations define standards and policies around the usage of data to ensure the following: Data quality: Data streams follow the defined data standards as represented in schemas Data evolvability: Schemas […].

Data 87
article thumbnail

Peak Performance: Continuous Testing & Evaluation of LLM-Based Applications

Speaker: Aarushi Kansal, AI Leader & Author and Tony Karrer, Founder & CTO at Aggregage

Software leaders who are building applications based on Large Language Models (LLMs) often find it a challenge to achieve reliability. It’s no surprise given the non-deterministic nature of LLMs. To effectively create reliable LLM-based (often with RAG) applications, extensive testing and evaluation processes are crucial. This often ends up involving meticulous adjustments to prompts.

article thumbnail

Cloudera Operational Database Infrastructure Planning Considerations

Cloudera

In this blog post, let us take a look at how you can plan your infrastructure planning that you may have to do when deploying an operational database cluster on a CDP Private Cloud Base deployment. Note that you may have to do some planning assumptions when designing your initial infrastructure, and it must be flexible enough to scale up or down based on your future needs. .

article thumbnail

Teradata at AWS re:Invent

Teradata

Teradata is participating in AWS re:Invent 2020, demonstrating our cloud-first stance as a Gold sponsor. Find out more.

AWS 59
article thumbnail

A Visual Tour of the Global COVID-19 Vaccine Efforts

Preset

In response to the COVID-19 pandemic, hundreds of countries, organizations, universities, and companies came together to fund many vaccine candidates.

Data 40
article thumbnail

Getting Started with Spring Cloud Data Flow and Confluent Cloud

Confluent

Data is the currency of competitive advantage in today’s digital age. All organizations struggle with their data due to the sheer variety of data types and ways that it can […].

Cloud 57
article thumbnail

How to Build an Experimentation Culture for Data-Driven Product Development

Speaker: Margaret-Ann Seger, Head of Product, Statsig

Experimentation is often seen as an aspirational practice, especially at smaller, fast-moving companies who are strapped for time and resources. So, how can you get your team making decisions in a more data-driven way while continuing to remain lean and maintaining ship velocity? In this webinar, Margaret-Ann Seger, Head of Product at Statsig, will teach you how to build an experimentation culture from the ground-up, graduating from just getting started with data-driven development to operating

article thumbnail

Coffee with Cloudera: Cindy Maike, VP of Industry Solutions

Cloudera

Meet Cindy Maike, VP of Industry Solutions at Cloudera. Cindy has led the Industry Solutions team for over 3 years, with 6 years with Cloudera, and has been at the forefront of developing targeted vertical solutions for our customers and partners. Cindy is an exceptional female leader and we hope this blog gives you insight into the great work Cindy is doing with the Industry Solutions team!

article thumbnail

Intertoys

Teradata

Toy retailer uses Vantage on Azure, the modern cloud data analytics platform, as the building blocks for agility and cost-savings.

Retail 52
article thumbnail

How to Tackle Data Skew

Teradata

Learn how to use use Teradata's Global Space Accounting to counter our biggest villain: data skew.

Data 52
article thumbnail

How to configure clients to connect to Apache Kafka Clusters securely – Part 1: Kerberos

Cloudera

This is the first installment in a short series of blog posts about security in Apache Kafka. In this article we will explain how to configure clients to authenticate with clusters using different authentication mechanisms. Secured Apache Kafka clusters can be configured to enforce authentication using different methods, including the following: SSL – TLS client authentication.

Kafka 70
article thumbnail

Entity Resolution Checklist: What to Consider When Evaluating Options

Are you trying to decide which entity resolution capabilities you need? It can be confusing to determine which features are most important for your project. And sometimes key features are overlooked. Get the Entity Resolution Evaluation Checklist to make sure you’ve thought of everything to make your project a success! The list was created by Senzing’s team of leading entity resolution experts, based on their real-world experience.

article thumbnail

Making Privacy an Essential Business Process

Cloudera

Canada is poised to become a world-leader in privacy regulation and with new regulation comes record-breaking fines for those who can’t keep up. . In November, Canada introduced the Digital Charter Implementation Act. If passed, companies could face fines of up to five percent of global revenue or $25 million CAD — whichever is greater — for violating Canadians’ privacy.

Process 70
article thumbnail

2020 Data Impact Award Winner Spotlight: Telkomsel

Cloudera

2020 is a year that’s been defined by transformation. The way we work, how businesses operate, and even serve customers have all transformed in order to cope with the challenges that have been thrown our way. Amongst the chaos, some organizations have excelled. The Industry Transformation category at our Data Impact Awards celebrates these organizations— the ones that have looked digital transformation in the eye and said “bring it on!