Inside The Hive with Coco Li

Learn about Coco Li’s career journey, what she loves about working at Cevo, and her tips for other women entering the IT industry.

In AI We Trust – Part 1

This post explores the key ingredients of a successful machine learning use case and the importance of finding the right balance.

How to configure Kafka consumers to seek offsets by timestamp

We discuss how to configure a Kafka consumer to seek offsets by timestamp when topic partitions are dynamically assigned by subscription. Docker Compose is used to build a single-node Kafka cluster and run multiple consumer instances.

Data Build Tool (dbt) for Effective Data Transformation on AWS – Part 5 Athena

The data build tool (dbt) is an effective data transformation tool that supports key AWS analytics services: Redshift, Glue, EMR and Athena. In part 5 of the dbt on AWS series, we discuss data transformation pipelines using dbt on Amazon Athena. Subsets of IMDb data are used as the source, and data models are developed in multiple layers according to dbt best practices.