pkg://WWWstats-1.55-2.noarch.rpm:28668/
usr/
doc/
WWWstats-1.55/WWWstats.html
info downloads
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML>
<HEAD>
<TITLE>WWWstats</TITLE>
<META NAME="Author" CONTENT="Lars Marius Garshol">
<META NAME="Generator" CONTENT="Homemade (http://birk105.studby.uio.no/hovedfag/pilot.html)">
<META NAME="Description" CONTENT="Home page of the WWWstats log analysis package.">
<LINK REL=StyleSheet HREF="standard.css" TYPE="text/css" MEDIA=screen>
<LINK REL=top HREF="../lmg.html" TITLE="Homepage: Lars Marius Garshol">
</HEAD>
<BODY>
<DIV CLASS=partof>
This page is a part of <A HREF="../lmg.html">Homepage: Lars Marius Garshol</A>.
</DIV>
<H1>WWWstats</H1>
<TABLE>
<TR><TD><STRONG>Version:</STRONG>
<TD>1.55
<TR><TD><STRONG>Author:</STRONG>
<TD><A HREF="http://www.stud.ifi.uio.no/~larsga/">Lars Marius Garshol</A> -
<A HREF="mailto:larsga@ifi.uio.no">larsga@ifi.uio.no</A>.
<TR><TD><STRONG>Last release:</STRONG>
<TD>13.May.98
</TABLE>
<P>
[<A HREF="#whatis">What is</A>]
[<A HREF="#sys_reqs">System reqs</A>]
[<A HREF="WWWdocs.html">Documentation</A>]
[<A HREF="stats/index.html">Samples</A>]
[<A HREF="#license">License</A>]
[<A HREF="#download">Download</A>]
[<A HREF="#changes">Changes</A>]
[<A HREF="#next">Next version</A>]
</P><H2><A NAME="whatis">What is WWWstats?</A></H2>
<P>
WWWstats are some simple Perl scripts that read the log of a WWW
server and extracts some statistics from it, presented as web
pages. They are meant for use by a single person, but could probably
be used by large sites as well.
</P>
<P>
I made these scripts to learn Perl, and also because I couldn't find
any log analyzers that I really liked and that were OS-independent. I
add new features when I decide I want them or if someone manages
to convince me that I should add them. If you have an idea, just <A
HREF="mailto:larsga@ifi.uio.no">let me know</A>.
</P>
<H2><A NAME="sys_reqs">What do I need to run WWWstats?</A></H2>
<P>
You need a Perl interpreter, and nothing else. (Perl interpreters exist for
<A HREF="http://language.perl.com/info/software.html">UNIX</A>,
<A HREF="http://www.ActiveState.com/">Win32</A> and
<A HREF="http://language.perl.com/CPAN/ports/">many other platforms as well</A>.)
</P>
<P>
This is really all you need. If you don't have access to the server logs there is a CGI script
included in the package that can generate the log for you if you insert a reference to a particular
image on each page. This is not recommended if you have access to the server log, but if you
don't it can help you out.
</P>
<P>
If you use the server log it must be in the NCSA Common Log
Format (CLF). (The browser and referrer reports need the NCSA Extended Common Log Format.) Apache uses
CLF by default, as do most other servers. Most of them can also be reconfigured to use the
extended format. I'll add more documentation on how later. In the meantime: email me if you
have any questions about this.
</P>
<P>
Information on how to set up WWWstats and use it is in the
<A HREF="WWWdocs.html">documentation</A>.
</P>
<H2><A NAME="samples">Sample reports</A></H2>
<P>
There are six different reports. (Note that these reports have been redirected to file, which is why
they are so fast.) Since I now use separate reports for each part of my site I've put up a separate
<A HREF="stats/">page with the reports</A>.
</P>
<H2><A NAME="license">License</A></H2>
<P>
WWWstats is freeware, so you can do as you like with it. If you change
it or have any problems with it, please
<A HREF="mailto:larsga@ifi.uio.no">let me know</A>.
</P>
<H2><A NAME="download">Download</A></H2>
<P>
The program is only available in a <A HREF="WWWstats.zip">zipped</A>
version that includes this HTML page. (If you need any other formats
or have any problems installing or using it, <A
HREF="mailto:larsga@ifi.uio.no">let me know</A>.)
</P>
<H2><A NAME="next">Future improvements</A></H2>
<P>
If you have any opinions on this subject, either things you'd like to see or
opinions on the improvements suggested here, please send me an
<A HREF="mailto:larsga@ifi.uio.no">email</A> about it. If you want notification
by email when the next version is ready you can
<A HREF="WWWmail.html">add yourself to the announcement list</A>.
</P>
<H3>Next minor version (1.60)</H3>
<UL>
<LI>WWWyear: list first and last accesses.
<LI>WWWstats can skip robots (Optional).
<LI>WWWrefer: lowercase referring host names and look up IP addresses
<LI>WWWstats: add a ToC at the top
<LI>Add SSI make_log variant
<LI>Don't turn off image caching in make_log
<LI>An error report.
<LI>Ability to read Apache agent_log and MS IIS log format.
<LI>Average hits/day for various reports.
<LI>Ability to read a log that's been split into several files.
</UL>
<H3>Next major version</H3>
<UL>
<LI>Incremental stats gathering? (Will speed up script execution enormously.)
<LI>Use Java applets for more difficult graphs
<LI>Special Apache version? (Adapts to logfile format.)
</UL><H2>New features in version 1.54</H2>
<P>
Version 1.55 had only three simple changes:
</P>
<UL>
<LI>A bug fix to the DNSFile feature that made it actually speed things up. (Thanks to Thore
Harald Hoye for reporting this bug and emailing me a fix.)
<LI>A simple speedup of wwwweek.pl that makes it only read entries for the current week.
<LI>config.pl now tests for a common configuration mistake.
</UL>
<P>
Version 1.54 introduced several improvements:
</P>
<UL>
<LI>WWWyear now supports the year option.
<LI>WWWbrowser now shows an MSIE version breakdown.
<LI>The "-r" option is no longer necessary in redirects.
<LI>WWWrefer URL interpretation improved. Some configuration options added.
<LI>WWWrefer now shows relative search engine referral percentage.
<LI>User-agent, OS and search engine lists updated.
<LI>uaunknown.pl now also lists OSes.
<LI>Even better documentation. (Config.pl documented in HTML.)
<LI>
</UL>
<P>
Version 1.53 improved the user agent and robot lists a little, as well as some other minor
improvements. Version 1.52 added several new features:
</P>
<UL>
<LI>A referrer report. (Sort of in beta, while I figure out what I (and maybe others) want.)
<LI>DNS lookup of accessing hosts.
<LI>A script that can generate a log if you don't have access to the server logs.
<LI>IP addresses are shown in the stats by host and domain lists.
</UL>
<P>
Version 1.51 fixed two bugs in the year handling of wwwyear.pl and wwwweek.pl.
</P>
<P>
The new features in 1.5x since 1.43 are:
</P>
<UL>
<LI>CSS versions of all reports. (Optional, of course :-)
<LI>URL grouping (report several different URLs as one, in case a document
has been moved or renamed).
<LI>wwwyear: same table as WWWweek, with a column for each month.
<LI>wwwweek: sum of requests on top of list.
<LI>wwwmonth: show page popularity, like in WWWstats.
<LI>wwwstats: Removal of non-existent domains from the domain list.
<LI>A batch mode for all reports with option to redirect to file.
<LI>A report that lists unknown user-agents so that these may be added
to the lists.
<LI>More detailed documentation.
</UL><P>
<A NAME="changes">A</A> <A HREF="WWWchanges.html">version history</A> is also available.
</P>
<HR>
<ADDRESS>
13.May.98 22:14,
<A HREF="../lmg.html">Lars Marius Garshol</A>,
<A HREF="mailto:larsga@ifi.uio.no">larsga@ifi.uio.no</A>. A part of
<A HREF="../lmg.html">Homepage: Lars Marius Garshol</A>.
</ADDRESS>
</BODY>
</HTML>