Skip to content

BHT-Math/Data-platform-for-real-time-data-analytics

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Techs

Apache Kafka Apache Flink Apache Iceberg Trino Apache Pinot Docker Kubernetes Prometheus Grafana

Prerequisites

  • Python (version >= 3.8)
  • Docker
  • Make
  • k3d
  • helm

About

Building a real-time analytics data platform using modern techs: Apache Kafka, Apache Flink, Apache Iceberg, Trino, and Apache Pinot, which are widely adopted in production by large-scale companies: Netflix, Uber, LinkedIn and Airbnb, etc.. to handle high-volume, low-latency data processing and analytics.

The goal of this project is to understanding of how these components work together as a complete data stack, including data ingestion, stream processing, storage, and real-time querying. It also demonstrates how to deploy, run, and test the platform locally using docker-compose and kubernetes via k3d with Helm charts.

About

Data platform for real-time analytics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 91.0%
  • Python 5.0%
  • Dockerfile 2.3%
  • Makefile 1.5%
  • Go Template 0.2%