From: eldy <>
Date: Sat, 19 Oct 2002 16:55:53 +0000 (+0000)
Subject: Updated documentation
X-Git-Tag: AWSTATS_5_1_BETA~4
X-Git-Url: http://git.ipfire.org/cgi-bin/gitweb.cgi?a=commitdiff_plain;h=bc25cfdb0e24c67ab2a76b768c325977016e3122;p=thirdparty%2FAWStats.git
Updated documentation
---
diff --git a/docs/awstats.pdf b/docs/awstats.pdf
index 30fe1050..f1322390 100644
Binary files a/docs/awstats.pdf and b/docs/awstats.pdf differ
diff --git a/docs/awstats_compare.html b/docs/awstats_compare.html
index 938d82e0..7dcfe5cf 100644
--- a/docs/awstats_compare.html
+++ b/docs/awstats_compare.html
@@ -41,7 +41,7 @@
| Features/Softwares | AWStats | Analog | Webalizer | HitBox |
-| Version - Date | 5.0 - September 2002 | 5.24 - July 2002 | 2.01-10 - April 2002 | NA |
+| Version - Date | 5.1 - October 2002 | 5.24 - July 2002 | 2.01-10 - April 2002 | NA |
| Language | Perl | C | C | Embedded HTML tag |
| Available on all platforms | Yes | Yes | Yes | NA |
| Sources available | Yes | Yes | Yes | No |
@@ -61,9 +61,9 @@
| Not ordered records tolerance and reorder for visits | Yes | Visits not supported | No | ? |
| Statistics for visits are based on | Pages ***** | Not supported | Pages ***** | Pages ***** |
| Statistics for unique visitors are based on | Pages ***** | Not supported | Not supported | Pages ***** |
-| Report domains/countries (nb detected) | Yes (259) | Yes (259) | Yes (252) | Yes (?) |
+| Report domains/countries (nb detected) | Yes (266) | Yes (266) | Yes (259) | Yes (?) |
| Report authenticated users | Yes | Yes | No | No |
-| Report/Filter robots (nb detected) | Yes/Yes (291**) | Yes / Yes (8) | No/No (0**) | No/No (0**) |
+| Report/Filter robots (nb detected) | Yes/Yes (304**) | Yes / Yes (8) | No/No (0**) | No/No (0**) |
| Report rush hours | Yes | Yes | Yes | Yes |
@@ -82,11 +82,11 @@
| Report by file size | No | Yes | No | No |
-| Report browsers (nb detected) | Yes (75*) | No | Yes (4*) | Yes (<20*) |
-| Report details of browsers versions | Yes | No | No | Yes |
+| Report browsers (nb detected) | Yes (76*) | No | Yes (4*) | Yes (<20*) |
+| Report details of browsers versions | Yes | No | Yes | Yes |
| Report OS (nb detected) | Yes (29) | Yes (25) | No (0) | ? |
-| Report search engines used (nb detected) | Yes (63***) | Yes (24) | No (0) | Yes (<20 ***) |
-| Report keywords/keyphrases used on search engines (nb detected) | Yes/Yes (65***) | Yes/No (29***) | No/Yes (14***) | Yes/No (<20***) |
+| Report search engines used (nb detected) | Yes (78***) | Yes (24) | No (0) | Yes (<20 ***) |
+| Report keywords/keyphrases used on search engines (nb detected) | Yes/Yes (81***) | Yes/No (29***) | No/Yes (14***) | Yes/No (<20***) |
| Report HTTP Errors | Yes
| Yes | Yes | No |
@@ -95,7 +95,7 @@
| Daily statistics | Yes | Yes | Yes | Yes |
| Monthly statistics | Yes | Yes | Yes | Yes |
| Yearly statistics | Yes | Yes | Yes | Yes |
-Benchmark with no DNS lookup in lines/seconds (full features enabled, with XLF format, on Athlon 1Ghz) | 4500**** | ?**** | 39000**** | NA No program to run |
+Benchmark with no DNS lookup in lines/seconds (full features enabled, with XLF format, on Athlon 1Ghz) | 4500**** | ?**** | 12000**** | NA No program to run |
Benchmark with DNS lookup in lines/seconds (full features enabled, with XLF format, on Athlon 1Ghz) | 80**** | 80**** | 80**** | NA No program to run |
| Main graphical statistics in one page | Yes | Yes | Yes | No |
@@ -104,17 +104,23 @@
* This number is not really the number of browsers detected. All browsers (known and unknown) can be detected by products
that support this feature (AWStats,Webalizer,HitBox). The number is only the number of known browsers for which different
-versions can be grouped in one browser name.
+versions are grouped by default in one browser name.
** AWStats can detect robots visits: All robots among the most common are detected, list is in robotslist.txt (250Kb). Products that are not able to do this give you false information, above all if your site
has few visitors. For example, if you're site was submitted to all famous search engines, robots can make 500 visits a month, to find updates or to see if your site is still online.
So, if you have only 2000 visits a month, products with no robot detection capabilities will report 2500 visits (A 25% error !). AWStats will report 500 visits from robots and 2000 visits from human visitors.
-*** AWStats has url syntax rules for the most popular search engines but AWStats has also an algorithm
-to detect keywords of unknown search engines with unknown url syntax rule. Those rules are updated with AWStats updates.
+*** AWStats has url syntax rules for the most popular search engines (that's the 'number detected').
+Those rules are updated with AWStats updates.
+But AWStats has also an algorithm to detect keywords of unknown search engines with unknown url syntax rules.
-**** This rate is for processing log records with full information (combined XLF/ELF or extended W3C log format) but
-you must keep in mind that all this times are without reverse DNS lookup. DNS lookup speed depends on your system, network and
+**** As you can see, some log analyzers have, by default, very poor (or not at all) robots, search engines, os or
+browsers detection capabilities. So to have a benchmark comparison that means something, some log analyzers features
+were 'enhanced' with AWStats databases, when it was possible (For example, Webalizer config file was completed with
+this file. With that, Webalizer features are a little bit close that thoose of AWStats.
+Without that, like using default conf file, Webalizer results are 3 times faster but with a lot of missing features).
+Benchmarks was made on a combined (XLF/CLF) log record on an Athlon 1GHz.
+You must keep in mind that all this times are without reverse DNS lookup. DNS lookup speed depends on your system, network and
Internet but not on the log analyzer you use. For this reason, DNS lookup is disabled in all log analyzer benchmarks.
Don't forget that DNS lookup is 95% (even with a lookup cache) of the time used by a log analyzer, so if your host is not
already resolved in log file and DNS lookup is enable, the total time of the process will be nearly the same whatever
diff --git a/wwwroot/cgi-bin/awstats.pl b/wwwroot/cgi-bin/awstats.pl
index 44560879..128c515b 100644
--- a/wwwroot/cgi-bin/awstats.pl
+++ b/wwwroot/cgi-bin/awstats.pl
@@ -4404,6 +4404,8 @@ if ($Debug) {
if ($Debug) { debug("UpdateStats is $UpdateStats",2); }
if ($UpdateStats && $FrameName ne "index" && $FrameName ne "mainleft") { # Update only on index page or when not framed to avoid update twice
+ print "Update for config '$FileConfig'\nWith data in log file '$LogFile'...\n";
+
my $lastprocessedyear=$lastyearbeforeupdate;
my $lastprocessedmonth=$ListOfYears{$lastyearbeforeupdate}||0;
my $lastprocessedyearmonth=sprintf("%04i%02i",$lastprocessedyear,$lastprocessedmonth);
@@ -4679,6 +4681,9 @@ if ($UpdateStats && $FrameName ne "index" && $FrameName ne "mainleft") { # Updat
if ($Debug) { debug("Open log file \"$LogFile\""); }
open(LOG,"$LogFile") || error("Error: Couldn't open server log file \"$LogFile\" : $!");
+ # Avoid premature EOF due to log files corrupted with \cZ or bin chars
+ binmode LOG;
+
my @field=();
my $counter=0;
# Reset counter for benchmark (first call to GetDelaySinceStart)