Skip to content

Latest commit

 

History

History
37 lines (27 loc) · 787 Bytes

README.md

File metadata and controls

37 lines (27 loc) · 787 Bytes

File hash scraper

This is a Python tool for crawling directories to find and list any files identical by MD5 hash, regardless of filename.

Installation

Source is available from GitHub and the package is available on pip.

pip install file-hash-scraper

Or user install

pip install --user file-hash-scraper

Or install from GitHub clone.

git clone https://github.com/eroberson/file_hash_scraper.git
git checkout vN.N.N # Choose highest version tag instead of vN.N.N

pip install .

Usage

file_hash_scraper --dir d1 --dir d2 --dir d3 --dir d4 > identical_files.txt
file_hash_scraper --dir d1 --loglevel DEBUG 1>identical_files.txt 2>log
file_hash_scraper --dir d1 --readbuffer 4096 > identical_files.txt