Ugh... I can feel your pain.
You should read up on the 'find' utility. Between find, md5sum and sort, you should be able to get a list of filenames sorted by checksum. Then you can look for duplicates in that list.
Find has the ability to recurse into a directory tree and call a utility or a script on each file in the tree.
Something like this:
I haven't tried the above, but it may(!) do something like what you need - at least it should give you an idea. Copy some files to a different spot and run your experiments there, before letting it go on the real data.Code:find /photos -name \*.jp* -print -exec md5sum {} \;|sort>filelist
Cheers,
Herman



Adv Reply




Bookmarks