Tag: Amazon MSK

Real Time Streaming with Kafka and Flink – Introduction

This series updates a real-time analytics app from an AWS workshop that was originally based on Amazon Kinesis. Data is instead ingested from multiple sources into a Kafka cluster, and Flink (PyFlink) apps are used extensively for data ingestion and processing. As an introduction, this post compares the original architecture with the new one; the app itself will be implemented in subsequent posts.

Kafka Connect for AWS Services Integration – Part 1 Introduction

Kafka Connect is a tool for scalably and reliably streaming data between Apache Kafka and other systems. It can be used to build real-time data pipelines on AWS effectively. In this post, I introduce the Kafka connectors available mainly for integrating AWS services. Developing and deploying some of them will be covered in later posts.

Integrate Glue Schema Registry with Your Python Kafka App – Part 2

Glue Schema Registry provides a centralized repository for managing and validating schemas for topic message data, and it can be utilized by many AWS services when building streaming apps. In this series, we discuss how to integrate Python Kafka producer and consumer apps in AWS Lambda with the Glue Schema Registry. In part 2, the Kafka apps are deployed and their behaviour is discussed.

Integrate Glue Schema Registry with Your Python Kafka App – Part 1

Glue Schema Registry provides a centralized repository for managing and validating schemas for topic message data, and it can be utilized by many AWS services when building streaming apps. In this series, we discuss how to integrate Python Kafka producer and consumer apps in AWS Lambda with the Glue Schema Registry. In part 1, I illustrate the infrastructure and the Kafka apps.

Simplify Streaming Ingestion on AWS – Part 2 MSK and Athena

Streaming ingestion from Kafka (MSK) into Redshift and Athena can be much simpler now that both support direct integration. In part 2, we discuss an end-to-end streaming ingestion solution using EventBridge, Lambda, MSK and Athena. We also use AWS SAM integrated with Terraform to develop a Lambda function locally.

Simplify Streaming Ingestion on AWS – Part 1 MSK and Redshift

Streaming ingestion from Kafka (MSK) into Redshift and Athena can be much simpler now that both support direct integration. In part 1 of the Simplify Streaming Ingestion on AWS series, we discuss an end-to-end streaming ingestion solution using EventBridge, Lambda, MSK and Redshift. We also use AWS SAM integrated with Terraform to develop a Lambda function locally.