|
|
A perl script to list duplicate files
This program does a lot more work than it needs, because it doesn't first collate lists of files by length (cheap to compute) before looking at the contents for the MD5 (expensive to compute). As an alternative approach, comparing files pair-wise using File::Compare (once determining that the length is the same) is often faster, as it reads only enough of the file to determine the first difference.
A perl script to list duplicate files
Your antecedent reference wasn't clear: File::Compare as an alternative to Digest::MD5?
revised version
A revised version of the script that follows your suggestion to do an initial collating by data-fork size is now available in the forums thread.
This new version runs about twice as fast as the original when I test it on the AIFF loop folders on my machine.
The new version also reports on cases where the resource forks differ even though the data-forks are the same. |
SearchFrom our Sponsor...Latest Mountain Lion HintsWhat's New:HintsNo new hintsComments last 2 daysNo new commentsLinks last 2 weeksNo recent new linksWhat's New in the Forums?
Hints by TopicNews from Macworld
From Our Sponsors |
|
Copyright © 2014 IDG Consumer & SMB (Privacy Policy) Contact Us All trademarks and copyrights on this page are owned by their respective owners. |
Visit other IDG sites: |
|
|
|
Created this page in 0.19 seconds |
|