1
.TH CALAMARIS 1 "$Date: 2003/05/14 21:22:37 $" Handmade "Calamaris Manual"
4
calamaris \- generate statistic out of log files from Proxy-Cache-Servers
13
is used to produce statistical output from
16
.B Inktomi Traffic Server,
17
.B Oops! proxy server,
19
.B Cisco Content Engines
20
or related Proxy log files. The resulting output can be
24
It is possible to cache calculated data in a file to use them in later runs.
26
This manual page describes the options of
28
and gives a few examples.
36
all: extracts all reports available,
39
.IR "-D 10 -d 20 -P 60 -r 20 -s -t 20" .
43
domain report: switches the top level and the second level report on. The
44
data is derived from the URL. The output is limited by n. (-1 is unlimited)
48
peak report: Measures the peaks of the Proxy usage in requests per second,
49
per minute and per hour. It reports peaks for TCP, UDP and ALL requests. If
52
these values were calculated with the old slow method, if set to
54
the new faster (but still slow) method is used.
58
performance report: Shows the throughput of TCP requests for every n minutes.
62
requester report: Switches the UDP and TCP requester reports on. The output
63
is limited by n. (-1 is unlimited)
67
Requested report: adds to each line of the requester report the requested
68
URLs. The output is limited by n. (-1 is unlimited, and can result in very
72
Using this option breaks the privacy of your users! (see PRIVACY-Section
77
status report: alters the default behaviour of
79
and makes the status reports more verbose.
83
type report: switches the content type and the file extension report on. The
84
output is limited by n. (-1 is unlimited)
88
Distribution Histogram: shows size-based distribution of requested objects,
89
smaller numbers result in more verbose reports. (choose 2, 10 or 100 for
92
Allocation report: shows how different sized objects perform on your Proxy.
97
format: sets the type of input logfiles. If set to
101
tries to guess the input file format. This is the Default.
104
If the first line of your input file is corrupted, Calamaris will stop with
109
expects native logfile derived from
110
.B Squid V1.1.beta26-V2.x
116
expects native logfile derived from
117
.B Squid V1.1.alpha1-V2.x
123
.B Smartfilter-Patch.
124
or squid-style logfiles out of
125
.B Cisco Content Engines.
126
(This only enables parsing of these kind of logfile, the additional
127
data will be ignored.)
129
(Logging of MIME-Headers breaks the privacy of your users! (see
130
PRIVACY-Section below)
134
expects native logfile derived from
135
.B Squid V1.1.alpha1-V1.1.beta25.
139
expects Logfiles from
140
.B NetCache up to V4.x.
141
(Please see the README on this.)
145
expects Logfiles from
146
.B Inktomi Traffic Server.
150
expects Logfiles in Extended Logfile Format (i.e. from
152
.B Novell Internet Caching System
159
expects Logfiles in Netscape Extended-1 or Netscape Extended-2
161
.B Netscape/ipoint Proxy-Server
165
Standard output format is plain ascii with 80 chars width.
168
.I -F type[,type[,type[,...]]]
169
Format: sets the format of the output-report. If set to
172
adds a subject header to the beginning of the report.
175
all output is given in html with tables. Can be combined with
180
all output is given in html with tables without HTML-Headers. Useful for
181
Server-Side-Includes.
184
gives out the raw numbers seperated by spaces. Useful for re-using the output
185
in other scripts. If you use this along with
187
the byte values are calculated in the given Unit, and displayed without
188
indication along with the numbers. the indication moves up to the header of
193
Meta: adds a custom string into the <HEAD> of a HTML-Report. Useful if you
194
want to add Stylesheets or something to the Report.
198
works only in combination with
204
logo: add a custom string to a HTML-Report. It'll be added to a table on the
206
.I -l '<A HREF="http://cord.de/"><IMG BORDER=0 SRC="http://cord.de/Images/cord/cordlog2n.gif" ALT="Cord"></A>'
207
will add my logo with a link to the Report.
211
works only in combination with
217
Unit: You can define this to K(ilo), M(ega), G(iga) or T(era) for
221
.I -S n[,n[,n[,...]]]
222
Show: Shows only the defined reports in the specified order. Default is to
223
display the reports as they are defined through the report-switches above. The
224
following numbers are defined:
230
1 Incoming request peak per protocol
233
2 Incoming transfer volume per protocol
236
3 Incoming requests by method
239
4 Incoming UDP-requests by status
242
5 Incoming TCP-requests by status
245
6 Outgoing requests by status
248
7 Outgoing requests by destination
251
8 Request-destinations by 2ndlevel-domain
254
9 Request-destinations by toplevel-domain
257
10 TCP-Request-protocol
260
11 Requested content-type
263
12 Requested extensions
266
13 Incoming UDP-requests by host
269
14 Incoming TCP-requests by host
272
15 Distribution Histogram
275
16 Performance in minute steps
278
Using this doesn't make
280
any faster, the internal calculations will
281
be done as the report-switches were set (see above).
287
input file: You can reuse a cache file generated with
289
to add old data to a new report. Several files can be seperated with a ':'.
292
if you use more than one cache file, make sure they are chronologicaly
293
ordered (oldest first).
296
if you reuse cache-files, which were not created with
297
.I -d -1 -r -1 -t -1 -R -1
298
the number of 'others' is likely wrong everywhere.
304
stores a summary of the computed information in
306
and you can reuse it at a later time with
310
The output file can be the same as the input file: it is simply overwritten
311
after reading the data. It is not recommended to change the options between
312
different runs if you include older data as this can result in strange
319
Host name: The name for the title or subject of the output. If set to
322
looks up the host name of the system its been run on.
326
no lookup: Switches the IP lookup for the requesters off.
330
Order: Changes the sort order in the reports to request size, default is
331
sorting by number of requests.
335
Interval: defines which time-interval should be parsed. t has to be the
336
format yyyymmddhhmmss
339
omitting the beginning or ending date is allowed.
343
N-level: All URL-Host reports will be switched from 2nd-level to
344
N-level-reports. (-1 shows a full urlhost-report)
347
This option is only useful with activated domain-report.
351
Time: Adjust the Performancereport in minutes for non GMT-Timezoners.
355
case: Switch to case-insensitive. This is useful for
356
the 'Requested extensions' report.
360
user: Switches the usage of eventually available ident information for
361
requester reports on.
364
This breaks the privacy of your users! (see PRIVACY-Section below)
368
zero: Switches reading from standard input off. You can use this to merge
369
many cache files to one (see
373
or to generate a report out of cache files.
377
benchmark: A switch for the impatient as it prints a '#' for every n parsed
382
verbose: print more information about what is Calamaris is doing
387
Loop: prints the internal loop to STDERR. (for Debugging)
391
Copyright: Prints the copyright information of
396
help: Prints a brief description of the command line options.
400
Version: Prints out the Version-Number.
405
This example mails the daily statistics to root:
406
cat /var/log/squid/access.log | nice -39
408
-aH 'daily worf' -F mail | mail root
411
This one only caches a summary for later use:
412
cat /var/log/squid/access.log |
414
-a -o daily.`date +"%w"` > /dev/null
417
You can then use the caches to have weekly statistics:
419
if [ $DAYOFWEEK = "0" ]; then
421
-a -i daily.1:daily.2:daily.3:daily.4:daily.5:daily.6:daily.0 -z -F mail -H "weekly worf" | mail root ;
425
If you have a problem with
427
please make sure that you use the recent version of
429
(see VERSION below). Also check that your proxy works correctly and doesn't
430
produce invalid Logfiles. (see the README for buglist and pointers.)
432
If you're sure that you've encountered a bug in Calamaris, please report it to
433
Calamaris-bug@cord.de. This also applies if Calamaris itself says 'please
439
can be (mis-)used to track what users are requesting.
441
So please read the following and think about it, before using
448
If you don't trust your users than there is something more wrong
449
than the loss of productivity.
453
Squid has some nice acl-mechanisms. If you think that your users
454
don't use the net properly, don't let them use it. (You can also
455
open the net at specific times or to specific sites, if you want.)
459
If you still want to use Calamaris that way, let your vict^Wusers
460
know, that they'll be monitored. (in Germany you have to let them
467
Cord Beermann <Cord@Wunder-Nett.org>. There are also a lot of people who
468
contributed code, gave ideas or requested features. Look them up in the
471
This man page was written by Philipp Frauenfelder <pfrauenf@debian.org>,
472
maintainer of the Debian package. Maintenance is now taken over by
477
Version of this manpage: $Id: calamaris.1,v 1.28 2003/05/14 21:22:37 cord Exp $
479
It describes the usage of Calamaris V2.58 and later.
481
Information about new releases, mailing lists, and other related issues
482
can be found from the
484
home page at http://Calamaris.Cord.de/
488
comes with "absolutely no warranty".
491
Copyright (C) 1997, 1998, 1999, 2000, 2001, 2002, 2003 Cord Beermann
493
This program is free software; you can redistribute it and/or modify it under
494
the terms of the GNU General Public License as published by the Free Software
495
Foundation; either version 2 of the License, or (at your option) any later
498
(If you modify and want to publish it under the name
500
please ask me. I don't want to confuse the 'audience' with many different
501
versions of the same name and/or Version number. (This is not part of the
502
license, it is only a favour i asked of you.))
504
This program is distributed in the hope that it will be useful, but WITHOUT
505
ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS
506
FOR A PARTICULAR PURPOSE. See the GNU General Public License for more
509
You should have received a copy of the GNU General Public License along with
510
this program; if not, write to the Free Software Foundation, Inc., 59 Temple
511
Place - Suite 330, Boston, MA 02111-1307, USA.