Skip to content

Tool for scanning directories and listing identical files (by MD5 hash)

License

Notifications You must be signed in to change notification settings

eroberson/file_hash_scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

File hash scraper

This is a Python tool for crawling directories to find and list any files identical by MD5 hash, regardless of filename.

Installation

Source is available from GitHub and the package is available on pip.

pip install file-hash-scraper

Or user install

pip install --user file-hash-scraper

Or install from GitHub clone.

git clone https://github.com/eroberson/file_hash_scraper.git
git checkout vN.N.N # Choose highest version tag instead of vN.N.N

pip install .

Usage

file_hash_scraper --dir d1 --dir d2 --dir d3 --dir d4 > identical_files.txt
file_hash_scraper --dir d1 --loglevel DEBUG 1>identical_files.txt 2>log
file_hash_scraper --dir d1 --readbuffer 4096 > identical_files.txt

About

Tool for scanning directories and listing identical files (by MD5 hash)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages