.\"Generated by db2man.xsl. Don't modify this, modify the source.
.de Sp \" Vertical space (when we can't use .PP)
.ie \\n(.$>=3 .ne \\$3
.TH "BOGOUTIL" 1 "" "" ""
bogoutil \- Dumps, loads, and maintains bogofilter database files
\fBbogoutil\fR [options] {\fB\-d\ \fIfile\fR\fR | \fB\-H\fR | \fB\-l\ \fIfile\fR\fR | \fB\-m\fR | \fB\-w\ \fIfile_or_dir\fR\fR | \fB\-p\ \fIfile_or_dir\fR\fR} \fIfile\&.db\fR
\fBbogoutil\fR {\fB\-r\fR | \fB\-R\fR} \fIdirectory\fR
\fBbogoutil\fR {\fB\-h\fR | \fB\-V\fR}
where \fBoptions\fR is
[\-v] [\-n] [\-D] [\-a\ \fIage\fR] [\-c\ \fIcount\fR] [\-s\ \fImin,max\fR] [\-y\ \fIdate\fR] [\-I\ \fIfile\fR] [\-x\ \fIflags\fR]
Bogoutil is part of the bogofilter Bayesian spam filter package\&.
It is used to dump and load bogofilter's Berkeley DB databases to and from text files, to perform database maintenance functions, and to display the values for specific words\&.
The \fB\-d \fIfile\fR\fR option tells bogoutil to print the contents of the database file to \fBstdout\fR\&.
The \fB\-H \fIfile_or_dir\fR\fR option tells bogoutil to print a histogram of the specified database file to \fBstdout\fR\&. The output is similar to that of bogofilter \-vv\&. Hapaxes (tokens which were seen only once) and pure tokens (tokens which were encountered only in ham or only in spam) are also counted\&.
The \fB\-l \fIfile\fR\fR option tells bogoutil to load the data from \fBstdin\fR into the database file\&.
The \fB\-m\fR option tells bogoutil to perform maintenance functions on the specified database, i\&.e\&. discard tokens that are older than desired, have counts that are too small, or sizes (lengths) that are too long or too short\&.
The \fB\-w \fIfile_or_dir\fR\fR option tells bogoutil to display token information from the database\&. The option takes an argument, which is either the name of the wordlist (usually wordlist\&.db) or the name of the directory containing it\&. Tokens can be listed on the command line or piped to bogoutil\&. When there are extra arguments on the command line, bogoutil will use them as the tokens to look up\&. If there are no extra arguments, bogoutil will read tokens from \fBstdin\fR\&.
The \fB\-p \fIfile_or_dir\fR\fR option tells bogoutil to display the database information for one or more tokens\&. The display includes a probability column with the token's spam score (computed using bogofilter's default values)\&. Option \fB\-p\fR takes the same arguments as option \fB\-w\fR\&.
The \fB\-r\fR option tells bogoutil to recalculate the ROBX value and print it as a six\-digit fraction\&.
The \fB\-R\fR option does the same as \fB\-r\fR, but prints more information and saves the result in the training database\&.
The \fB\-I \fIfile\fR\fR option tells bogoutil to read its input from \fIfile\fR rather than stdin\&.
The \fB\-v\fR option produces verbose output on \fBstderr\fR\&. This option is primarily useful for debugging\&.
The \fB\-D\fR option redirects debug output to stdout (it usually goes to stderr)\&.
The \fB\-x \fIflags\fR\fR option sets debugging flags\&.
Option \fB\-n\fR stands for "replace non\-ASCII characters"\&. It replaces characters that have the high bit (0x80) set with question marks\&. This can be useful if a word list has lots of unreadable tokens, for example from Asian spam\&. The "bad" characters are converted to question marks, and matching tokens are combined when \fB\-n\fR is used with '\-m' or '\-l', but not with '\-d'\&.
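The effect of \fB\-n\fR can be pictured with the following Python sketch\&. It is only an illustration, not bogoutil's implementation: the function names are hypothetical, tokens are assumed to be byte strings, and "combined" is assumed to mean that the counts of matching tokens are added together\&.

```python
def replace_high_bit(token):
    """Replace every byte that has the high bit (0x80) set with a question mark."""
    return bytes(0x3F if b & 0x80 else b for b in token)


def merge_counts(entries):
    """Combine tokens that become identical after replacement.

    Assumption: combining means summing the counts.
    """
    merged = {}
    for token, count in entries:
        key = replace_high_bit(token)
        merged[key] = merged.get(key, 0) + count
    return merged


# Two tokens that differ only in one high-bit byte, plus a plain ASCII token.
entries = [
    (bytes([0x61, 0xE9, 0x62]), 3),
    (bytes([0x61, 0xF1, 0x62]), 4),
    (b"plain", 2),
]
result = merge_counts(entries)
print(result)  # both high-bit tokens collapse to a?b with count 3 + 4 = 7
```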
Option \fB\-a age\fR indicates an acceptable token age, with older tokens being discarded\&. The age can be a date (in form YYYYMMDD) or a day count, i\&.e\&. discard tokens older than \fBage\fR days\&.
Option \fB\-c count\fR indicates that tokens with counts less than or equal to \fBcount\fR are to be discarded\&.
Option \fB\-s min,max\fR is used to discard tokens based on their size, i\&.e\&. length\&. All tokens shorter than \fBmin\fR or longer than \fBmax\fR will be discarded\&.
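The three discard rules above (\fB\-a\fR, \fB\-c\fR and \fB\-s\fR) can be summarized in a short Python sketch\&. This is an illustration under stated assumptions, not bogoutil's code: the names keep_token and parse_yyyymmdd are hypothetical, token length is measured in characters, the age is given as a day count relative to a fixed example date, and a token without a date is not discarded by the age rule\&.

```python
from datetime import date, timedelta


def parse_yyyymmdd(text):
    """Convert a YYYYMMDD string to a date."""
    return date(int(text[0:4]), int(text[4:6]), int(text[6:8]))


def keep_token(word, count, token_date, cutoff=None, min_count=None, size=None):
    """Return True if the token survives the -a, -c and -s rules."""
    if cutoff is not None and token_date is not None and token_date < cutoff:
        return False  # -a: token is older than the acceptable age
    if min_count is not None and count <= min_count:
        return False  # -c: count is less than or equal to the threshold
    if size is not None:
        shortest, longest = size
        if len(word) < shortest or len(word) > longest:
            return False  # -s: shorter than min or longer than max
    return True


today = date(2004, 1, 31)            # fixed example date, for reproducibility
cutoff = today - timedelta(days=30)  # corresponds to an age of 30 days

tokens = [
    ("viagra", 12, parse_yyyymmdd("20040120")),
    ("old", 40, parse_yyyymmdd("20031101")),
    ("rare", 1, parse_yyyymmdd("20040125")),
    ("x", 9, parse_yyyymmdd("20040125")),
]
kept = [t[0] for t in tokens
        if keep_token(*t, cutoff=cutoff, min_count=1, size=(3, 20))]
print(kept)
```

In this sample only the first token is kept: the second is too old, the third has a count that is not above the threshold, and the fourth is too short\&.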
Option \fB\-y date\fR specifies the date to give to tokens that don't have dates\&.
The \fB\-h\fR option prints the help message and exits\&.
The \fB\-V\fR option prints the version number and exits\&.
Bogoutil reads and writes text files where each nonblank line consists of a word, any amount of horizontal whitespace, a numeric word count, more whitespace, and (optionally) a date in form YYYYMMDD\&. Blank lines are skipped\&.
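As a rough illustration of the format just described, the following Python sketch parses such a text file\&. The function name parse_dump and the sample data are hypothetical and not part of bogoutil; the sketch also assumes that words themselves contain no whitespace\&.

```python
def parse_dump(text):
    """Parse lines of the form: word, whitespace, count, optional YYYYMMDD date."""
    records = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # blank lines are skipped
        if len(fields) not in (2, 3):
            raise ValueError("unexpected line: " + repr(line))
        word = fields[0]
        count = int(fields[1])
        stamp = fields[2] if len(fields) == 3 else None
        if stamp is not None and not (len(stamp) == 8 and stamp.isdigit()):
            raise ValueError("bad date: " + repr(stamp))
        records.append((word, count, stamp))
    return records


# Sample input: one dated entry, a blank line, and one entry without a date.
sample = """
free 12 20040120

money    7
"""
records = parse_dump(sample)
print(records)
```

Entries without a date are returned with None in the third position\&.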
0 for successful operation\&. 1 for most errors\&. 3 for I/O or other errors\&. Error 3 usually means that something is seriously wrong with the database files\&.
Gyepi Sam <gyepi@praxis\-sw\&.com>\&.
Matthias Andree <matthias\&.andree@gmx\&.de>\&.
David Relson <relson@osagesoftware\&.com>\&.
For updates, see the bogofilter project page: \fIhttp://bogofilter.sourceforge.net/\fR\&.