generate similarity hashes to find nearly duplicate files
One of the questions that it's nice to be able to answer about a pair of files
is the degree of similarity between them. This command-line tool is useful for
estimating the "degree of similarity" between a pair of nominally sequential
files such as textfiles. The tool uses Manassas's "shingleprinting" technique;
simhash - file similarity hash tool
simhash [ -s nshingles ] [ -f nfeatures ] [ file ]
simhash [ -s nshingles ] [ -f nfeatures ] -w file ...
simhash [ -s nshingles ] [ -f nfeatures ] -m file ...
simhash -c hashfile hashfile
This program is used to compute an
simhash (0.0.20110213-1) unstable; urgency=low
* new upstream snapshot (Git commit f624c6)
* updated standards (3.9.3) and debhelper (9) versions
* updated my email address
* switch from cdbs to dh
* remove link to deprecated BSD license file
* add Vcs-* fields to debian/control
-- Thomas Koch <email@example.com> Fri, 15 Jun 2012 08:22:06 +0200
simhash (0.0.20090101-1) unstable; urge
This package was debianized by Thomas Koch <firstname.lastname@example.org> on
Thu, 19 Mar 2009 10:59:35 +0100.
It was downloaded from http://wiki.cs.pdx.edu/forge/simhash.html
Copyright (C) 2005-2009 Bart Massey
Redistribution and use in source and binary forms, with or without
modification, are permitted under the terms of the BSD License.
Browse inside simhash_0.0.20110213-1_i386.deb
Results 1 - 1 of 1Search over 15 billion files
© 1997-2017 FileWatcher.com