Remove tag
article thumbnail

The Importance of Distributed Tracing for Apache-Kafka-Based Applications

Confluent

Apache-Kafka ® -based applications stand out for their ability to decouple producers and consumers using an event log as an intermediate layer. This article describes how to instrument Kafka-based applications with distributed tracing capabilities in order to make dataflows between event-based components more visible.

Kafka 111
article thumbnail

Building Real-Time Recommendations with Kafka, S3, Rockset and Retool

Rockset

When building a real-time customer 360 app, you’ll definitely need event data from a streaming data source, like Kafka. We’ll be building a basic version of this using Kafka, S3, Rockset, and Retool. We’ll integrate with Kafka and S3 through Rockset’s data connectors. user_purchases_v1 These are purchases made by the customer.

Kafka 52
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Introducing Vector Search on Rockset: How to run semantic search with OpenAI and Rockset

Rockset

Before vector search, search experiences primarily relied on keyword search, which frequently involved manually tagging data to identify and deliver relevant results. As an example, if we wanted to search for tagged keywords to deliver product results, we would need to manually tag “Fortnite” as a ”survival game” and ”multiplayer game.”

article thumbnail

EC2 & Session Manager (Toronto Project)

Team Data Science

select the ssm role You'll have the option to add tags to describe the role as well, but in a simple project in a brand new account like this I have opted not to do so. While I have already created the role 'MyEC2Role', you can do the same by clicking beside it on "Create New IAM Role". click create role 2.Select

Project 130
article thumbnail

Upgrade Journey: The Path from CDH to CDP Private Cloud

Cloudera

The customer also wanted to utilize the new features in CDP PvC Base like Apache Ranger for dynamic policies, Apache Atlas for lineage, comprehensive Kafka streaming services and Hive 3 features that are not available in legacy CDH versions. Support Kafka connectivity to HDFS, AWS S3 and Kafka Streams. Kafka, SRM, SMM.

Cloud 130
article thumbnail

How to Use Terraform with Rockset

Rockset

json" } } Kafka Collection Next we’ll setup a collection from a Confluent Cloud source, and add an ingest transformation that summarizes the data. file to pin the stable tag to the current version, so that the stable doesn’t change until we have properly tested it. This way the credentials will never have to be exposed to a human.

AWS 52
article thumbnail

Fraud Detection with Cloudera Stream Processing Part 1

Cloudera

We discussed how Cloudera Stream Processing (CSP) with Apache Kafka and Apache Flink could be used to process this data in real time and at scale. If the fraud score is above a certain threshold, NiFi immediately routes the transaction to a Kafka topic that is subscribed by notification systems that will trigger the appropriate actions.

Process 80