Implemntation of DBSCAN and K-means clustering algorithms in Scala using Spark framework. Algorithms deal only with two dimensional (x and y) data.
Program arguments: <input_file> <min_points_in_cluster> <epsilon>
Program arguments: <input_file> <number_of_clusters> <converge_distance>
Sample dataset file is included - data.txt.
- When launching on a cluster refer to Spak official documentation.
- In order to run on local machine use
-Dspark.master=local
VM option.