ReAGEnT

Realtime Analysis of German Election noting tweets

Collects, sorts, analyzes and presents tweets from german politicians regarding the election 2021.

API Wrapper

The API Wrapper is responsible for collecting tweets from the Twitter API v2 endpoint using Spark Structured Streaming. The data is sorted, analyzed and saved to Mongo DB according to rules set in main class.

Written in Scala.

Twint API Wrapper

Historical data from Twitter is loaded with the help of the inofficial Twint API

Twint API GitHub

Written in Python.

SparkML

Raw data from the Mongo DB is loaded and used to train a model with the help of the Spark Machine Learning library (MLlib).

MLlib Guide

Written in Scala.

Spark (Backend)

This part is responsible for taking the raw information from the Mongo DB and computing the information for the frontend. Thereafter saving it again in the Mongo DB.

Apache Spark

Written in Scala.

AkkaHTTP

Routes Mongo DB content to the frontend.

Written in Scala.

Frontend v1

Web representation of analyzed data.

Written in JavaScript.

Frontend v2

Web representation of analyzed data. We are using the micro front end architecture, this project / repository acts as the container project that "contains" and loads the individual micro frontends. This container app is built with React, and it loads the micro front end with two different approaches:

Loading the bundled JS file from the local folder (src/wc)
Loading the bundled JS file from a remote source (in this case, from site hosted on Github Pages)

All the individual micro front ends are bundled into a single JS file and converted into a web component, so the project could be framework agnostic.

Please take a look into our Proof of Concept project for a more simplified example.

Dependencies that we used are:

Written in TypeScript.

Search Engine (Kafka-Elasticsearch)

Engine to search for tweets by keyword and filter them by party.

Elasticsearch provides the tweets as well as the search functionality. An Apache Kafka Producer extracts tweets via the Twitter Api in real time and inserts them into the Elasticsearch dataset.

Written in Java.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
ReAGEnT_presentation.pdf		ReAGEnT_presentation.pdf
ReAGEnT_sequence_diagram_v1.1.png		ReAGEnT_sequence_diagram_v1.1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ReAGEnT

API Wrapper

Twint API Wrapper

SparkML

Spark (Backend)

AkkaHTTP

Frontend v1

Frontend v2

Search Engine (Kafka-Elasticsearch)

About

Releases

Packages

ReAGEnT-WiSe2021-22/Wiki

Folders and files

Latest commit

History

Repository files navigation

ReAGEnT

About

Resources

Stars

Watchers

Forks