-
loading data from tables (excel files, csvs)
-
cleaning misformed data and missing values
-
filtering data using keywords and logical constraints
-
computing summary statistics
-
aggregating data via pivot tables
-
visualizing timeseries:
- delta plots for collections + envelope
- #collections by year
-
Normalization
-
Projection
-
Clustering (unsupervised learning)
-
Classification (supervised learning)