generate similarity hashes to find nearly duplicate files
One of the questions that it's nice to be able to answer about a pair of files
is the degree of similarity between them. This command-line tool is useful for
estimating the "degree of similarity" between a pair of nominally sequential
files such as textfiles. The tool uses Manassas's "shingleprinting" technique.
simhash - file similarity hash tool
simhash [ -s nshingles ] [ -f nfeatures ] [ file ]
simhash [ -s nshingles ] [ -f nfeatures ] -w [ file ] ...
simhash -c hashfile hashfile
This program is used to compute and compare similarity hashes
of files. A similarit
simhash (0.0.20090101-1) unstable; urgency=low
* Initial release (Closes: #520401)
-- Thomas Koch <email@example.com> Thu, 19 Mar 2009 10:59:35 +0100
This package was debianized by Thomas Koch <firstname.lastname@example.org> on
Thu, 19 Mar 2009 10:59:35 +0100.
It was downloaded from http://wiki.cs.pdx.edu/forge/simhash.html
Copyright (C) 2005-2009 Bart Massey
Redistribution and use in source and binary forms, with or without
modification, are permitted under the terms of the BSD Licen
Browse inside simhash_0.0.20090101-1_kfreebsd-i386.deb
Results 1 - 1 of 1Search over 15 billion files
© 1997-2017 FileWatcher.com