|File Search||Catalog||Content Search|
It can be used to recognize spam, and more generally sort incoming email into any number of categories such as work, play, and so on.
As a noise filter, it can be useful during the indexing of personal document collections.
DBACL and TREC 2005 This note explains how to use dbacl with the TREC 2005 Spam Filter Evaluation Toolkit (or spamjig for short). The spamjig is a system you can install to test and compare several spam filters with either public data or your own private data. It is/was developed as part of the NIST TREC 2005 conference. The TREC Spam Filter Evalutation Toolkit can be downloaded from the follow more»
DBACL - digramic Bayesian classifier PURPOSE dbacl is a command line program which can be used to categorize several types of text documents. Each document category is constructed as a maximum entropy language model, with respect to a reference measure based on digrams (character pairs). Before recognition can take place, a number of text corpora must be "learned". For example, an English ca more»
dbacl NEWS -- history of user-visible changes. From August 2004. Copyright (C) 2004, 2005 Laird Breyer. dbacl 1.12 Added the "Can spam filters play chess?" essay to the bundled documentation, look in the doc/chess directory. Added the TREC2005 options files to the TREC directory. Fixed some parsing bugs. There now is a new parser "-e char" which parses single characters. This isn't useful on i more»
BAYESOL(1) BAYESOL(1) NAME bayesol - a Bay more»
DBACL(1) DBACL(1) NAME dbacl - a digra more»
HMINE(1) HMINE(1) NAME hmine - a mail more»
HMINE(1) HMINE(1) NAME hypex - compu more»
MAILCROSS(1) MAILCROSS(1) NAME mailcross - a c more»