Apache Cassandra: Building a Fast Data Platform

This repository contains my talk about Apache Cassandra and Fast Data.

Docker Containers

This talk covers the steps necessary to build a fast data platform. I have provided some sample containers that illustrate this. It is possible to run these on a single machine providing it has sufficient memory and CPUs. I used a virutal machine running Ubuntu 16.04 with 4 CPUs and 8GB of RAM.

The supporting containers can be built using the command:

./build

This will download all necessary files and create local containers.

Please note that none of these containers should be considered production worthy

Source Code

The services that ingest data, process the stream and then make the results avaialable are also available. These are found beneath the services subfolder.

You can build these individually with the sbt command:

sbt assembly

Building the docker containers (detailed above) will also create containers for the services.

The services are not considered production worthy either

Deploying the containers

There is another script that can be used to deploy everything. In order to access tweets through the Twitter API you need to update env.sh to contain your own Twitter credentials.

I also run the following command to prevent them accidentally getting added to git:

git update-index --assume-unchanged env.sh

Once this is done, just run

./deploy

When you have finished you can then run

./cleanup

This wil delete any provisioned containers but leave the images.

Presentation

The presentation can be found in the presentation subfolder.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
cassandra		cassandra
kafka		kafka
notebooks		notebooks
presentation		presentation
services		services
spark-cassandra-node		spark-cassandra-node
spark-node		spark-node
spark-server		spark-server
spark		spark
zeppelin		zeppelin
zookeeper		zookeeper
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
build		build
cleanup		cleanup
deploy		deploy
env.sh		env.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Apache Cassandra: Building a Fast Data Platform

Docker Containers

Source Code

Deploying the containers

Presentation

About

Releases

Packages

Languages

License

simondale/fast-data

Folders and files

Latest commit

History

Repository files navigation

Apache Cassandra: Building a Fast Data Platform

Docker Containers

Source Code

Deploying the containers

Presentation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages