web-stat

Basic web user statistics processing with Apache Kafka and Apache Spark structured steaming.

Getting Started

Clone repository

git clone https://github.com/NicolasPA/web-stat.git

In a new console, start Zookeeper server, Kafka server and create Kafka topics "user-visit" and "user-stat":
```
bash bin/start-kafka.sh
```
In a new console, start the Spark streaming job that reads "user-visit" and writes in "user-stat":
```
bash bin/start-spark.sh
```
In a new console, feed the Kafka input topic "user-visit" with some messages:
```
bash bin/feed-kafka.sh
```

In a new console, open a simple Kafka consumer to see what's written to "user-stat":

$KAFKA_HOME/bin/kafka-console-consumer.sh --bootstrap-server localhost:9092 --topic user-stat --from-beginning