# This file is part of the software similarity tester SIM.
# Written by Dick Grune, Vrije Universiteit, Amsterdam.
# $Id: README,v 2.10 2012-06-05 14:58:38 Gebruiker Exp $
These programs test for similar (or equal) stretches in one or more program
files and can be used to detect common code or plagiarism. See sim.1.
Checkers are available for C, Java, Pascal, Modula-2, Lisp, Miranda and
natural language.
This is SIM, a software and text similarity tester; the most recent revision
is by Dick Grune, Vrije Universiteit, Amsterdam, the Netherlands (email@example.com).
SIM tests lexical similarity in texts in C, Java, Pascal, Modula-2, Lisp,
Miranda and natural language. It can be used
- to detect potentially duplicated code fragments
2012-05-08 Dick Grune
* Changed to 16-bit tokens, for better resolution in sim_text and
with the -F option, and for UTF-8 input.
It was not worthwhile to keep the 8-bit token code: on serious
comparisons the increase in memory usage is about 10% (330 000 on a
maximum allocation of 3 030 976 when comparing the sources of MCD2).
2009-03-11 Dick Grune <firstname.lastname@example.org>
* newargs.c: added
Copyright (c) 1986, 2007, Dick Grune, Vrije Universiteit, The Netherlands
All rights reserved.