Wed. Oct 19, 2022

Rust for Data Engineering

Simon Späti

Will Rust kill Python for Data Engineers? If you only came here to know this, my answer is no. Betteridge’s Law strikes again! But then again, you have to ask: was Python made for Data Engineering in the first place? Rust may not replace Python outright, but it has taken over more and more of the JavaScript tooling, and a growing number of projects are trying to do the same for Python and Data Engineering.

Independent Anniversary

Jesse Anderson

I have a calendar reminder that tells me when I founded Big Data Institute. It just told me I founded the company eight years ago. The reminder is called “Independent Anniversary.” It’s the day I split off and executed my vision for an independent, big data consulting company. Independence has all sorts of manifestations. For you, it’s an independent look at technology and vendors from someone who’s worked at a vendor (Cloudera) and worked in distributed systems for even longer.

Frameworks for Approaching the Machine Learning Process

KDnuggets

This post is a summary of 2 distinct frameworks for approaching machine learning tasks, followed by a distilled third. Do they differ considerably (or at all) from each other, or from other such processes available?

Get Better Network Graphs & Save Analysts Time

Many organizations today are unlocking the power of their data by using graph databases to feed downstream analytics, enhance visualizations, and more. Yet, when different graph nodes represent the same entity, graphs get messy. Watch this essential video with Senzing CEO Jeff Jonas on how adding entity resolution to a graph database condenses network graphs to improve analytics and save your analysts time.

Public or On-Prem? Telco giants are optimizing the network with the Hybrid Cloud

Cloudera

The telecommunications industry continues to develop hybrid data architectures to support data workload virtualization and cloud migration. However, while the promise of the cloud remains essential (not just for data workloads but also for network virtualization and B2B offerings), the sheer volume and scale of data in the industry require careful management of the “journey to the cloud.”

KDnuggets News, October 19: 3 Valuable Skills That Have Doubled My Income as a Data Scientist • The Complete Free PyTorch Course for Deep Learning

KDnuggets

3 Valuable Skills That Have Doubled My Income as a Data Scientist • The Complete Free PyTorch Course for Deep Learning • 7 Free Platforms for Building a Strong Data Science Portfolio • Mathematics for Machine Learning: The Free eBook • 25 Advanced SQL Interview Questions for Data Scientists.

More Trending

Converting Text Documents to Token Counts with CountVectorizer

KDnuggets

The post explains the significance of CountVectorizer and demonstrates its implementation with Python code.
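
As a rough illustration of the idea behind converting documents to token counts, here is a minimal standard-library sketch. It is not scikit-learn's CountVectorizer and not the code from the linked post; the function name `count_vectorize`, the tokenization rule (lowercase, tokens of two or more word characters), and the sample sentences are all assumptions made for this example.

```python
import re
from collections import Counter


def count_vectorize(docs):
    """Return (vocabulary, matrix) of token counts for a list of documents.

    Hypothetical helper for illustration only; real vectorizers offer
    many more options (stop words, n-grams, sparse output).
    """
    # Assumed tokenization: lowercase, keep runs of 2+ word characters.
    tokenized = [re.findall(r"\b\w\w+\b", doc.lower()) for doc in docs]
    # The vocabulary is the sorted set of all tokens seen in any document.
    vocabulary = sorted({tok for toks in tokenized for tok in toks})
    matrix = []
    for toks in tokenized:
        counts = Counter(toks)
        # One row per document, one column per vocabulary term.
        matrix.append([counts[term] for term in vocabulary])
    return vocabulary, matrix


docs = ["The cat sat on the mat", "The dog chased the cat"]
vocabulary, matrix = count_vectorize(docs)
print(vocabulary)
for row in matrix:
    print(row)
```

Each row of the matrix corresponds to one document and each column to one vocabulary term, which is the general shape of a document-term count matrix.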

Apache Hop 2.1.0 is available

know.bi

The Apache Hop team just released version 2.1.0. This new release is the result of four and a half months of work on over 200 tickets and comes packed with new functionality, bug fixes and improvements.

Designing Events and Event Streams: Introduction and Best Practices

Confluent

Using Kafka Connect Securely in the Cloudera Data Platform

Cloudera

In this post I will demonstrate how Kafka Connect is integrated in the Cloudera Data Platform (CDP), allowing users to manage and monitor their connectors in Streams Messaging Manager while also touching on security features such as role-based access control and sensitive information handling. If you are a developer moving data in or out of Kafka, an administrator, or a security expert this post is for you.

Understanding User Needs and Satisfying Them

Speaker: Scott Sehlhorst

We know we want to create products which our customers find to be valuable. Whether we label it as customer-centric or product-led depends on how long we've been doing product management. There are three challenges we face when doing this. The obvious challenge is figuring out what our users need; the non-obvious challenges are in creating a shared understanding of those needs and in sensing if what we're doing is meeting those needs.

PostgreSQL vs. MySQL: 10 Key Differences 

Meltano

PostgreSQL and MySQL are among the most popular open-source relational database management systems (RDBMSs) worldwide. Both RDBMSs enable businesses to organize and interlink large amounts of data, allowing for effective data management. For all of their similarities, PostgreSQL and MySQL differ from one another in many ways. In this PostgreSQL vs. MySQL comparison, we analyze crucial differences between the two database management systems to discover how they work and when to use them.

DIY User Interview Tips for Product Managers

U-Next

Modern product managers know that they must speak with their users regularly. I started following this maxim diligently many years ago. However, far too frequently the thrill of speaking to a customer trumped the value I got from it. I am glad to say that as time has passed, I feel I have gotten better at talking to users. Helped by the fortunate opportunity to shadow a trained and brilliant qualitative research expert for a significant time, and forced by circumstances to depend solel