Collection of utilities to manage files, mainly regarding detecting duplicate files.
The process to find duplicate files is simple:
- Get the files to compare.
- For each file, generate the SHA256 hash code from its content.
- Compare the generated codes and report the files with the same hash code.
The aim of the project is to learn different ways of handling files, using different languages.
The project is composed of the following implementations:
Results using a laptop with an i7-7700HQ CPU, 8GB RAM, under Fedora 42. Dataset contains 882 images. All tools returning the same number of duplicate files (450).
Implementation | Total time |
---|---|
15 seconds | |
18 seconds | |
34 seconds |