#
# Modified Webalizer configuration file
# Copyright 2003-2004 by Tristan Louis (see http://www.tnl.net for more info)
# Distributed under the GNU General Public License.
#
# Modifications include:
# - Better handling of user-agent strings
# - Better handling of search engine keywords
# - Some basic domain name grouping
# - Some basic referrer grouping
#
# Based on:
#
# Sample Webalizer configuration file
# Copyright 1997-2000 by Bradford L. Barrett (brad@mrunix.net)
#
# Distributed under the GNU General Public License. See the
# files "Copyright" and "COPYING" provided with the webalizer
# distribution for additional information.
#
# This is a sample configuration file for the Webalizer (ver 2.01)
# Lines starting with pound signs '#' are comment lines and are
# ignored. Blank lines are skipped as well. Other lines are considered
# as configuration lines, and have the form "ConfigOption Value" where
# ConfigOption is a valid configuration keyword, and Value is the value
# to assign that configuration option. Invalid keyword/values are
# ignored, with appropriate warnings being displayed. There must be
# at least one space or tab between the keyword and its value.
#
# As of version 0.98, The Webalizer will look for a 'default' configuration
# file named "webalizer.conf" in the current directory, and if not found
# there, will look for "/etc/webalizer.conf".
#

# LogFile defines the web server log file to use. If not specified
# here or on the command line, input will default to STDIN. If
# the log filename ends in '.gz' (ie: a gzip compressed file), it will
# be decompressed on the fly as it is being read.

LogFile /var/log/httpd/access_log

# LogType defines the log type being processed. Normally, the Webalizer
# expects a CLF or Combined web server log as input. Using this option,
# you can process ftp logs as well (xferlog as produced by wu-ftp and
# others), or Squid native logs. Values can be 'clf', 'ftp' or 'squid',
# with 'clf' the default.
#LogType clf

# OutputDir is where you want to put the output files. This should
# be a full path name, however relative ones might work as well.
# If no output directory is specified, the current directory will be used.

OutputDir /var/www/html/usage

# HistoryName allows you to specify the name of the history file produced
# by the Webalizer. The history file keeps the data for up to 12 months
# worth of logs, used for generating the main HTML page (index.html).
# The default is a file named "webalizer.hist", stored in the specified
# output directory. If you specify just the filename (without a path),
# it will be kept in the specified output directory. Otherwise, the path
# is relative to the output directory, unless absolute (leading /).

# Incremental processing allows multiple partial log files to be used
# instead of one huge one. Useful for large sites that have to rotate
# their log files more than once a month. The Webalizer will save its
# internal state before exiting, and restore it the next time run, in
# order to continue processing where it left off. This mode also causes
# The Webalizer to scan for and ignore duplicate records (records already
# processed by a previous run). See the README file for additional
# information. The value may be 'yes' or 'no', with a default of 'no'.
# The file 'webalizer.current' is used to store the current state data,
# and is located in the output directory of the program (unless changed
# with the IncrementalName option below). Please read at least the section
# on Incremental processing in the README file before you enable this option.

Incremental yes

# IncrementalName allows you to specify the filename for saving the
# incremental data in. It is similar to the HistoryName option where the
# name is relative to the specified output directory, unless an absolute
# filename is specified. The default is a file named "webalizer.current"
# kept in the normal output directory.
# If you don't specify "Incremental"
# as 'yes' then this option has no meaning.

# ReportTitle is the text to display as the title. The hostname
# (unless blank) is appended to the end of this string (separated with
# a space) to generate the final full title string.
# Default is (for english) "Usage Statistics for".

#ReportTitle Usage Stats for my site

# HostName defines the hostname for the report. This is used in
# the title, and is prepended to the URL table items. This allows
# clicking on URL's in the report to go to the proper location in
# the event you are running the report on a 'virtual' web server,
# or for a server different than the one the report resides on.
# If not specified here, or on the command line, webalizer will
# try to get the hostname via a uname system call. If that fails,
# it will default to "localhost".

#HostName www.yourhost.com

# HTMLExtension allows you to specify the filename extension to use
# for generated HTML pages. Normally, this defaults to "html", but
# can be changed for sites that need it (like for PHP embedded pages).

#HTMLExtension html

# PageType lets you tell the Webalizer what types of URL's you
# consider a 'page'. Most people consider html and cgi documents
# as pages, while not images and audio files. If no types are
# specified, defaults will be used ('htm*', 'cgi' and HTMLExtension
# if different for web logs, 'txt' for ftp logs).

PageType htm*
PageType cgi
#PageType phtml
PageType php
#PageType pl

# UseHTTPS should be used if the analysis is being run on a
# secure server, and links to urls should use 'https://' instead
# of the default 'http://'. If you need this, set it to 'yes'.
# Default is 'no'. This only changes the behaviour of the 'Top
# URL's' table.

#UseHTTPS no

# DNSCache specifies the DNS cache filename to use for reverse DNS lookups.
# This file must be specified if you wish to perform name lookups on any IP
# addresses found in the log file.
# If an absolute path is not given as
# part of the filename (ie: starts with a leading '/'), then the name is
# relative to the default output directory. See the DNS.README file for
# additional information.

DNSCache dns_cache.db

# DNSChildren allows you to specify how many "children" processes are
# run to perform DNS lookups to create or update the DNS cache file.
# If a number is specified, the DNS cache file will be created/updated
# each time the Webalizer is run, immediately prior to normal processing,
# by running the specified number of "children" processes to perform
# DNS lookups. If used, the DNS cache filename MUST be specified as
# well. The default value is zero (0), which disables DNS cache file
# creation/updates at run time. The number of children processes to
# run may be anywhere from 1 to 100, however a large number may affect
# normal system operations. Reasonable values should be between 5 and
# 20. See the DNS.README file for additional information.

DNSChildren 10

# HTMLPre defines HTML code to insert at the very beginning of the
# file. Default is the DOCTYPE line shown below. Max line length
# is 80 characters, so use multiple HTMLPre lines if you need more.

#HTMLPre

# HTMLHead defines HTML code to insert within the <HEAD></HEAD>
# block, immediately after the <TITLE> line. Maximum line length
# is 80 characters, so use multiple lines if needed.

#HTMLHead <META NAME="author" CONTENT="The Webalizer">

# HTMLBody defines the HTML code to be inserted, starting with the
# <BODY> tag. If not specified, the default is shown below. If
# used, you MUST include your own <BODY> tag as the first line.
# Maximum line length is 80 char, use multiple lines if needed.

HTMLBody <body bgcolor="#FFFFFF" text="#000000" link="#0000FF" vlink="#FF0000">

# HTMLPost defines the HTML code to insert immediately before the
# first <HR> on the document, which is just after the title and
# "summary period"-"Generated on:" lines.
# If anything, this should
# be used to clean up in case an image was inserted with HTMLBody.
# As with HTMLHead, you can define as many of these as you want and
# they will be inserted in the output stream in order of appearance.
# Max string size is 80 characters. Use multiple lines if you need to.

#HTMLPost <br clear="all">

# HTMLTail defines the HTML code to insert at the bottom of each
# HTML document, usually to include a link back to your home
# page or insert a small graphic. It is inserted as a table
# data element (ie: <TD> your code here </TD>) and is right
# aligned with the page. Max string size is 80 characters.

#HTMLTail <IMG SRC="msfree.png" ALT="100% Micro$oft free!">

# HTMLEnd defines the HTML code to add at the very end of the
# generated files. It defaults to what is shown below. If
# used, you MUST specify the </BODY> and </HTML> closing tags
# as the last lines. Max string length is 80 characters.

#HTMLEnd </body></html>

# The Quiet option suppresses output messages... Useful when run
# as a cron job to prevent bogus e-mails. Values can be either
# "yes" or "no". Default is "no". Note: this does not suppress
# warnings and errors (which are printed to stderr).

Quiet yes

# ReallyQuiet will suppress all messages including errors and
# warnings. Values can be 'yes' or 'no' with 'no' being the
# default. If 'yes' is used here, it cannot be overridden from
# the command line, so use with caution. A value of 'no' has
# no effect.

#ReallyQuiet no

# TimeMe allows you to force the display of timing information
# at the end of processing. A value of 'yes' will force the
# timing information to be displayed. A value of 'no' has no
# effect.

TimeMe yes

# GMTTime allows reports to show GMT (UTC) time instead of local
# time. Default is to display the time the report was generated
# in the timezone of the local machine, such as EDT or PST. This
# keyword allows you to have times displayed in UTC instead.
# Use only if you really have a good reason, since it will probably
# screw up the reporting periods by however many hours your local
# time zone is off of GMT.

GMTTime yes

# Debug prints additional information for error messages. This
# will cause webalizer to dump bad records/fields instead of just
# telling you it found a bad one. As usual, the value can be
# either "yes" or "no". The default is "no". It shouldn't be
# needed unless you start getting a lot of Warning or Error
# messages and want to see why. (Note: warning and error messages
# are printed to stderr, not stdout like normal messages).

#Debug no

# FoldSeqErr forces the Webalizer to ignore sequence errors.
# This is useful for Netscape and other web servers that cache
# the writing of log records and do not guarantee that they
# will be in chronological order. The use of the FoldSeqErr
# option will cause out of sequence log records to be treated
# as if they had the same time stamp as the last valid record.
# Default is to ignore out of sequence log records.

#FoldSeqErr no

# VisitTimeout allows you to set the default timeout for a visit
# (sometimes called a 'session'). The default is 30 minutes,
# which should be fine for most sites.
# Visits are determined by looking at the time of the current
# request, and the time of the last request from the site. If
# the time difference is greater than the VisitTimeout value, it
# is considered a new visit, and visit totals are incremented.
# Value is the number of seconds to timeout (default=1800=30min)

#VisitTimeout 1800

# IgnoreHist shouldn't be used in a config file, but it is here
# just because it might be useful in certain situations. If the
# history file is ignored, the main "index.html" file will only
# report on the current log file's contents. Useful only when you
# want to reproduce the reports from scratch. USE WITH CAUTION!
# Valid values are "yes" or "no". Default is "no".
#IgnoreHist no

# CountryGraph allows the usage by country graph to be disabled.
# Values can be 'yes' or 'no', default is 'yes'.

# DailyGraph and DailyStats allow the daily statistics graph
# and statistics table to be disabled (not displayed). Values
# may be "yes" or "no". Default is "yes".

# HourlyGraph and HourlyStats allow the hourly statistics graph
# and statistics table to be disabled (not displayed). Values
# may be "yes" or "no". Default is "yes".

# GraphLegend allows the color coded legends to be turned on or off
# in the graphs. The default is for them to be displayed. This only
# toggles the color coded legends, the other legends are not changed.
# If you think they are hideous and ugly, say 'no' here :)

# GraphLines allows you to have index lines drawn behind the graphs.
# I personally am not crazy about them, but a lot of people requested
# them and they weren't a big deal to add. The number represents the
# number of lines you want displayed. Default is 2, you can disable
# the lines by using a value of zero ('0'). [max is 20]
# Note, due to rounding errors, some values don't work quite right.
# The lower the better, with 1,2,3,4,6 and 10 producing nice results.

GraphLines 1

# The "Top" options below define the number of entries for each table.
# Defaults are Sites=30, URL's=30, Referrers=30 and Agents=15, and
# Countries=30. TopKSites and TopKURLs (by KByte tables) both default
# to 10, as do the top entry/exit tables (TopEntry/TopExit). The top
# search strings and usernames default to 20. Tables may be disabled
# by using zero (0) for the value.

TopSites 10
TopKSites 10
TopURLs 10
TopKURLs 10
TopReferrers 10
TopAgents 10
TopCountries 100
TopEntry 10
TopExit 10
TopSearch 10
TopUsers 0

# The All* keywords allow the display of all URL's, Sites, Referrers,
# User Agents, Search Strings and Usernames. If enabled, a separate
# HTML page will be created, and a link will be added to the bottom
# of the appropriate "Top" table.
# There are a couple of conditions
# for this to occur. First, there must be more items than will fit
# in the "Top" table (otherwise it would just be duplicating what is
# already displayed). Second, the listing will only show those items
# that are normally visible, which means it will not show any hidden
# items. Grouped entries will be listed first, followed by individual
# items. The value for these keywords can be either 'yes' or 'no',
# with the default being 'no'. Please be aware that these pages can
# be quite large in size, particularly the sites page, and separate
# pages are generated for each month, which can consume quite a lot
# of disk space depending on the traffic to your site.

AllSites yes
AllURLs yes
AllReferrers yes
AllAgents yes
AllSearchStr yes
AllUsers no

# The Webalizer normally strips the string 'index.' off the end of
# URL's in order to consolidate URL totals. For example, the URL
# /somedir/index.html is turned into /somedir/ which is really the
# same URL. This option allows you to specify additional strings
# to treat in the same way. You don't need to specify 'index.' as
# it is always scanned for by The Webalizer, this option is just to
# specify _additional_ strings if needed. If you don't need any,
# don't specify any as each string will be scanned for in EVERY
# log record... A bunch of them will degrade performance. Also,
# the string is scanned for anywhere in the URL, so a string of
# 'home' would turn the URL /somedir/homepages/brad/home.html into
# just /somedir/ which is probably not what was intended.

#IndexAlias home.htm
#IndexAlias homepage.htm

# The Hide*, Group*, Ignore* and Include* keywords allow you to
# change the way Sites, URL's, Referrers, User Agents and Usernames
# are manipulated. The Ignore* keywords will cause The Webalizer to
# completely ignore records as if they didn't exist (and thus not
# counted in the main site totals).
# The Hide* keywords will prevent
# things from being displayed in the 'Top' tables, but will still be
# counted in the main totals. The Group* keywords allow grouping
# similar objects as if they were one. Grouped records are displayed
# in the 'Top' tables and can optionally be displayed in BOLD and/or
# shaded. Groups cannot be hidden, and are not counted in the main
# totals. The Group* options do not, by default, hide all the items
# that they match. If you want to hide the records that match (so just
# the grouping record is displayed), follow with an identical Hide*
# keyword with the same value. (see example below) In addition,
# Group* keywords may have an optional label which will be displayed
# instead of the keyword's value. The label should be separated from
# the value by at least one 'white-space' character, such as a space
# or tab.
#
# The value can have either a leading or trailing '*' wildcard
# character. If no wildcard is found, a match can occur anywhere
# in the string. Given a string "www.yourmama.com", the values "your",
# "*mama.com" and "www.your*" will all match.
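
# A minimal sketch of the Group*/Hide* pairing described above, left
# commented out so it has no effect. The pattern "*.example.net" and
# the label "ISP: Example" are hypothetical placeholders, not entries
# from this file. The first line folds every matching site into one
# labeled group record; the second keeps the individual matching hosts
# out of the 'Top' table, so only the group record is displayed.
#GroupSite *.example.net ISP: Example
#HideSite *.example.net
#
# A value without a wildcard can match anywhere in the string, so this
# hypothetical entry would group both "www.example.net" and
# "mail.example.org" under one label:
#GroupSite example Example hosts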

# Your own site should be hidden
#HideSite tnl.net
HideSite localhost
#HideSite *.tnl.net
#HideSite tnl.net

# Your own site gives most referrals
#HideReferrer *tnl.net
#HideReferrer *.tnl.net
HideReferrer Direct
#HideReferrer tnl.net

# This one hides non-referrers ("-" Direct requests)
HideReferrer Request

# Usually you want to hide these
HideURL *.gif
HideURL *.GIF
HideURL *.jpg
HideURL *.JPG
HideURL *.png
HideURL *.PNG
HideURL *.ra
HideURL *.css
HideURL *.ico

# Hiding agents is kind of futile
#HideAgent RealPlayer

# You can also hide based on authenticated username
#HideUser root
#HideUser admin

# Grouping options
#GroupURL /cgi-bin/* CGI Scripts
#GroupURL /images/* Images

GroupSite 12.44.31.110 <a href=http://www.attbi.com>ISP: AT&T</a>
GroupSite *.012.net.il <a href=http://www.012.net.il>ISP: 012.net</a>
GroupSite *.adelphia.net <a href=http://www.adelphia.net>ISP: Adelphia</a>
GroupSite *.ameritech.net <a href=http://www.ameritech.net>ISP: SBC</a>
GroupSite *.aol.com <a href=http://www.aol.com>ISP: AOL</a>
GroupSite *.assertive.ca <a href=http://www.bchosting.com>ISP: BCHosting</a>
GroupSite *.attbi.com <a href=http://www.attbi.com>ISP: AT&T</a>
GroupSite *.att.net <a href=http://www.attbi.com>ISP: AT&T</a>
GroupSite *.bchosting.com <a href=http://www.bchosting.com>ISP: BCHosting</a>
GroupSite *.bellsouth.net <a href=http://www.bellsouth.net>ISP: BellSouth</a>
GroupSite *.bezeqint.net <a href=http://www.bezeqint.net>ISP: Bezeq</a>
GroupSite *.blueyonder.co.uk <a href=http://www.blueyonder.co.uk>ISP: BlueYonder</a>
GroupSite *.bredbandsbolaget.se <a href=http://www.bredbandsbolaget.se>ISP: Bredbandsbolaget</a>
GroupSite *.btopenworld.com <a href=http://www.btopenworld.com>ISP: BT Open World</a>
GroupSite charter <a href=http://www.charter.com>ISP: Charter</a>
GroupSite chello <a href=http://www.chello.nl>ISP: Chello</a>
GroupSite *.comcast.net <a href=http://www.comcast.net>ISP: Comcast</a>
GroupSite *.compuserve.com <a href=http://www.compuserve.com>ISP: Compuserve</a>
GroupSite *.covad.net <a href=http://www.covad.com/>ISP: Covad</a>
GroupSite *.cox.net <a href=http://www.cox.net>ISP: Cox Cable</a>
GroupSite direcpc <a href=http://www.direcpc.com>ISP: DirecPC</a>
GroupSite *.earthlink.net <a href=http://www.earthlink.net>ISP: Earthlink</a>
GroupSite *.euronet.nl <a href=http://www.euronet.nl>ISP: Euronet</a>
GroupSite *.inter.net.il <a href=http://www.msn.net.il>ISP: MSN</a>
GroupSite *.intermonde.net <a href=http://www.intermonde.net>ISP: Intermonde</a>
GroupSite *.level3.net <a href=http://www.level3.net>ISP: Level3</a>
GroupSite *.mercury.net <a href=http://www.mercury.net>ISP: Mercury</a>
GroupSite *.mindspring.com <a href=http://www.mindspring.com>ISP: Mindspring</a>
GroupSite *.netvision.net.il <a href=http://www.netvision.net.il>ISP: Netvision</a>
GroupSite *.ntl.com <a href=http://www.ntl.com>ISP: NTL</a>
GroupSite *.optonline.net <a href=http://www.optonline.net>ISP: Optimum Online</a>
GroupSite *.omnicast.net <a href=http://www.omnicast.net>ISP: Omnicast</a>
GroupSite *.pacbell.net <a href=http://www.pacbell.net>ISP: PacBell</a>
GroupSite prodigy <a href=http://www.prodigy.net>ISP: Prodigy</a>
GroupSite *.rasserver.net <a href=http://icgcomm.com/products/isp/ras.asp>ISP: ICG</a>
GroupSite *.rogers.com <a href=http://www.rogers.com>ISP: Rogers Broadband</a>
GroupSite *.rr.com <a href=http://www.rr.com>ISP: Road Runner</a>
GroupSite *.skyinet.net <a href=http://www.skyinet.net>ISP: Sky Internet</a>
GroupSite sonera <a href=http://www.telia.com>ISP: Telia</a>
GroupSite *.sonic.net <a href=http://www.sonic.net>ISP: Sonic.net</a>
GroupSite *.speakeasy.net <a href=http://www.speakeasy.net>ISP: SpeakEasy</a>
GroupSite sprint <a href=http://www.sprint.com>ISP: Sprint</a>
GroupSite *.swbell.net <a href=http://www.swbell.net>ISP: SouthWestern Bell</a>
GroupSite *.sympatico.ca <a href=http://www.sympatico.ca>ISP: Sympatico</a>
GroupSite *.t-dialin.net <a href=http://www.t-dialin.net>ISP: T-Mobile</a>
GroupSite telia <a href=http://www.telia.com>ISP: Telia</a>
GroupSite tiscali <a href=http://www.tiscali.com>ISP: Tiscali</a>
GroupSite *.rcn.com <a href=http://www.rcn.com>ISP: RCN</a>
GroupSite verizon <a href=http://www.verizon.net>ISP: Verizon</a>
GroupSite *.videotron.ca <a href=http://www.videotron.ca>ISP: Videotron</a>
GroupSite wanadoo <a href=http://www.wanadoo.fr>ISP: Wanadoo</a>
GroupSite *.webtv.net <a href=http://www.webtv.net>ISP: WebTV</a>

GroupSite *.ad.jp <a href=http://www.ocn.ne.jp>Spider: OCN</a>
GroupSite *.alexa.com <a href=http://www.alexa.com>Spider: Alexa</a>
GroupSite *.archive.org <a href=http://www.archive.org>Spider: archive.org</a>
GroupSite *.av.com <a href=http://www.av.com>Spider: Altavista</a>
GroupSite *.bloglines.com <a href=http://www.bloglines.com>Spider: Bloglines.com</a>
GroupSite *.fastsearch.net <a href=http://www.fastsearch.net>Spider: FAST</a>
GroupSite *.googlebot.com <a href=http://www.googlebot.com>Spider: Google</a>
GroupSite *.iit.cnr.it <a href=http://ubi.imc.pi.cnr.it/projects/ubicrawler/>Spider: UbiCrawler</a>
GroupSite *.inktomi.com <a href=http://www.inktomi.com/>Spider: Inktomi</a>
GroupSite *.inktomisearch.com <a href=http://www.inktomi.com/>Spider: Inktomi</a>
GroupSite *.looksmart.com <a href=http://www.looksmart.com>Spider: Looksmart</a>
GroupSite *.newsisfree.com <a href=http://www.newsisfree.com>Spider: Newsisfree.com</a>
GroupSite *.ocn.ne.jp <a href=http://www.ocn.ne.jp>Spider: OCN</a>
GroupSite *.oddpost.com <a href=http://www.oddpost.com>Spider: Oddpost</a>
GroupSite *.sztaki.hu <a href=http://www.sztaki.hu>Spider: sztaki.hu</a>
GroupSite *.teoma.com <a href=http://www.teoma.com>Spider: Teoma</a>
GroupSite *.turnitin.com <a href=http://www.turnitin.com>Spider: Turn it in</a>
GroupSite *.whois.sc <a href=http://www.whois.sc>Spider: Whois.sc</a>

GroupSite *.acnielsen.co.uk <a href=http://www.acnielsen.co.uk>AC Nielsen</a>
GroupSite *.adobe.com <a href=http://www.adobe.com>Adobe</a>
GroupSite *.apple.com <a href=http://www.apple.com>Apple</a>
GroupSite *.avantgo.com <a href=http://www.avantgo.com>AvantGo</a>
GroupSite *.bbc.co.uk <a href=http://www.bbc.co.uk>BBC</a>
GroupSite *.chevrontexaco.com <a href=http://www.chevrontexaco.com>Chevron/Texaco</a>
GroupSite *.cisco.com <a href=http://www.cisco.com>Cisco Systems</a>
GroupSite *.compaq.com <a href=http://www.compaq.com>Compaq</a>
GroupSite *.haestad.com <a href=http://haestad.com>Haestad Methods</a>
GroupSite *.hsbc.com <a href=http://www.hsbc.com>HSBC</a>
GroupSite *.ibi.com <a href=http://www.informationbuilders.com>Information Builders</a>
GroupSite *.ibm.com <a href=http://www.ibm.com>IBM</a>
GroupSite *.intel.com <a href=http://www.intel.com>Intel</a>
GroupSite *.internet.com <a href=http://www.internet.com/>Internet.com</a>
GroupSite *.microsoft.com <a href=http://www.microsoft.com>Microsoft</a>
GroupSite *.novell.com <a href=http://www.novell.com>Novell</a>
GroupSite *.nytimes.com <a href=http://www.nytimes.com>New York Times</a>
GroupSite *.pol.co.uk <a href=http://www.pol.co.uk>Energis (UK host)</a>
GroupSite *.rnb.com <a href=http://www.hsbc.com>HSBC</a>
GroupSite *.sun.com <a href=http://www.sun.com>Sun Microsystems</a>
GroupSite *.toshiba.co.jp <a href=http://www.toshiba.co.jp/worldwide/index.html>Toshiba</a>
GroupSite *.uu.net <a href=http://www.uu.net>UU-Net</a>

HideSite 12.44.31.110
HideSite *.012.net.il
HideSite *.acnielsen.co.uk
HideSite *.ad.jp
HideSite *.adelphia.net
HideSite *.adobe.com
HideSite *.alexa.com
HideSite *.ameritech.net
HideSite *.archive.org
HideSite *.assertive.ca
HideSite *.attbi.com
HideSite *.att.net
HideSite *.aol.com
HideSite *.apple.com
HideSite *.av.com
HideSite *.avantgo.com
HideSite *.bbc.co.uk
HideSite *.bchosting.com
HideSite *.bellsouth.net
HideSite *.bezeqint.net
HideSite *.blueyonder.co.uk
HideSite *.bloglines.com
HideSite *.bredbandsbolaget.se
HideSite *.btopenworld.com
HideSite charter
HideSite chello
HideSite *.chevrontexaco.com
HideSite *.cisco.com
HideSite *.comcast.net
HideSite *.compaq.com
HideSite *.compuserve.com
HideSite *.covad.net
HideSite *.cox.net
HideSite direcpc
HideSite *.earthlink.net
HideSite *.euronet.nl
HideSite *.fastsearch.net
HideSite *.googlebot.com
HideSite *.haestad.com
HideSite *.ibi.com
HideSite *.ibm.com
HideSite *.iit.cnr.it
HideSite *.inktomi.com
HideSite *.inktomisearch.com
HideSite *.intel.com
HideSite *.intermonde.net
HideSite *.internet.com
HideSite *.inter.net.il
HideSite *.level3.net
HideSite *.looksmart.com
HideSite *.mercury.net
HideSite *.microsoft.com
HideSite *.mindspring.com
HideSite *.netvision.net.il
HideSite *.newsisfree.com
HideSite *.novell.com
HideSite *.ntl.com
HideSite *.nytimes.com
HideSite *.ocn.ne.jp
HideSite *.oddpost.com
HideSite *.omnicast.net
HideSite *.optonline.net
HideSite *.pacbell.net
HideSite *.pol.co.uk
HideSite prodigy
HideSite *.rasserver.net
HideSite *.rcn.com
HideSite *.rogers.com
HideSite *.rnb.com
HideSite *.rr.com
HideSite *.skyinet.net
HideSite sonera
HideSite *.sonic.net
HideSite *.speakeasy.net
HideSite sprint
HideSite *.sun.com
HideSite *.swbell.net
HideSite *.sympatico.ca
HideSite *.sztaki.hu
HideSite *.t-dialin.net
HideSite telia
HideSite *.teoma.com
HideSite tiscali
HideSite *.tnl.net
HideSite *.toshiba.co.jp
HideSite *.turnitin.com
HideSite *.uu.net
HideSite verizon
HideSite *.videotron.ca
HideSite wanadoo
HideSite *.webtv.net
HideSite *.whois.sc

GroupReferrer 64.4.14.250 Email
GroupReferrer 207.68.162.250 Email
GroupReferrer aolmail Email
GroupReferrer dmailweb Email
GroupReferrer email Email
GroupReferrer exchange Email
GroupReferrer hotmail Email
GroupReferrer inbox Email
GroupReferrer Inbox Email
GroupReferrer inctools Email
GroupReferrer mail Email
GroupReferrer mailcity Email
GroupReferrer mailman Email
GroupReferrer msgview Email
GroupReferrer netaddress Email
GroupReferrer neomail Email
GroupReferrer newmail Email
GroupReferrer port-mail Email
GroupReferrer remarq Email
GroupReferrer rocketmail Email
GroupReferrer snoopmail Email
GroupReferrer webacc Email
GroupReferrer webmail Email

GroupReferrer 217.154.97.2 <a href=217.154.97.2>DayOfBirth.co.uk</a>
GroupReferrer dayofbirth.co.uk <a href=217.154.97.2>DayOfBirth.co.uk</a>
GroupReferrer feedster.com <a href=http://www.feedster.com>Feedster</a>
GroupReferrer internetweek.com/ <a href=http://www.internetweek.com>Internet Week</a>
GroupReferrer internetwk.com/ <a href=http://www.internetweek.com>Internet Week</a>
GroupReferrer macintouch.com/ <a href=http://www.macintouch.com>MacInTouch</a>
GroupReferrer macmerc.com/ <a href=http://www.macmerc.com>MacMerc</a>
GroupReferrer macnn.com/ <a href=http://www.macnn.com>MacNN</a>
GroupReferrer macslash <a href=http://www.macslash.com>MacSlash</a>
GroupReferrer macsurfer.com/ <a href=http://www.macsurfer.com>MacSurfer</a>
GroupReferrer mywireservice.com/ <a href=http://www.mywireservice.com>My Wire Service</a>
GroupReferrer syndic8.com/ <a href=http://www.syndic8.com>Syndic8</a>
GroupReferrer technorati.com <a href=http://www.technorati.com>Technorati</a>
GroupReferrer userland.com/ <a href=http://radio.userland.com>Radio Userland</a>
GroupReferrer zdnet ZDnet

GroupReferrer ipodlounge.com <a href=http://www.ipodlounge.com>Blog: IpodLounge</a>
GroupReferrer lockergnome <a href=http://www.lockergnome.com>Blog: LockerGnome</a>
GroupReferrer metafilter <a href=http://www.metafilter.com>Blog: Metafilter</a>
GroupReferrer newsforge <a href=http://www.newsforge.com>Blog: NewsForge</a>
GroupReferrer smartmobs <a href=http://www.smartmobs.com>Blog: Smart Mobs</a>

GroupReferrer 66.218.71.233 <a href=http://www.yahoo.com>Search Engine: Yahoo!</a>
GroupReferrer 216.109.117.135 <a href=http://www.yahoo.com>Search Engine: Yahoo!</a>
GroupReferrer 216.239. <a href=http://www.google.com>Search Engine: Google</a>
GroupReferrer alltheweb.com <a href=http://www.alltheweb.com>Search Engine: AllTheWeb</a>
GroupReferrer altavista.com <a href=http://www.altavista.com>Search Engine: Altavista</a>
GroupReferrer answerbus <a href=http://www.answerbus.com>Search Engine: Answerbus</a>
GroupReferrer aolsearch.aol.ca <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer search.aol.com <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer search.aol.com.au <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer aolsearch.aol.com <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer aolsearch.aol.co.uk <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer find.web.aol.com <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer teensearch.aol.com <a href=http://aolsearch.aol.com>Search Engine: AOL</a>
GroupReferrer ask <a href=http://www.ask.com>Search Engine: Ask Jeeves</a>
GroupReferrer bloogz.com <a href=http://www.bloogz.com>Search Engine: Bloogz</a>
GroupReferrer directhit <a href=http://www.directhit.com>Search Engine: Direct Hit</a>
GroupReferrer dogpile <a href=http://www.dogpile.com>Search Engine: Dogpile</a>
GroupReferrer excite <a href=http://www.excite.com>Search Engine: Excite</a>
GroupReferrer google <a href=http://www.google.com>Search Engine: Google</a>
GroupReferrer hotbot <a href=http://www.hotbot.com>Search Engine: Hotbot</a>
GroupReferrer infoseek <a href=http://www.infoseek.com>Search Engine: InfoSeek</a>
GroupReferrer kooqoo.com/ <a href=http://www.kooqoo.com>Search Engine: Kooqoo</a>
GroupReferrer lycos <a href=http://www.lycos.com>Search Engine: Lycos</a>
GroupReferrer mamma <a href=http://www.mamma.com>Search Engine: Mamma</a>
GroupReferrer mywebsearch.com/ <a href=http://www.myway.com>Search Engine: MyWay.com</a>
GroupReferrer myway.com/ <a href=http://www.myway.com>Search Engine: MyWay.com</a>
GroupReferrer search.fr.msn.be/ <a href=http://www.msn.com>Search Engine: MSN</a>
GroupReferrer search.latam.yupimsn.com/ <a href=http://www.msn.com>Search Engine: MSN</a>
GroupReferrer search.msn <a href=http://www.msn.com>Search Engine: MSN</a>
GroupReferrer search.t1msn.com.mx/ <a href=http://www.msn.com>Search Engine: MSN</a>
GroupReferrer search.xtramsn.co.nz/ <a href=http://www.msn.com>Search Engine: MSN</a>
GroupReferrer search.netscape.com/ <a href=http://www.msn.com>Search Engine: Netscape</a>
GroupReferrer overture.com/ <a href=http://www.overture.com>Search Engine: Overture</a>
GroupReferrer search.com/ <a href=http://www.search.com>Search Engine: Search.com</a>
GroupReferrer teoma <a href=http://www.teoma.com>Search Engine: Teoma</a>
GroupReferrer webcrawler.com/ <a href=http://www.webcrawler.com>Search Engine: WebCrawler</a>
GroupReferrer websearch.com/ <a href=http://www.websearch.com>Search Engine: WebSearch.com</a>
GroupReferrer yahoo <a href=http://www.yahoo.com>Search Engine: Yahoo!</a>
GroupReferrer search Search Engine: Other search engines

HideReferrer 64.4.14.250
HideReferrer 66.218.71.233
HideReferrer 207.68.162.250
HideReferrer 216.109.117.135
HideReferrer 216.239.
HideReferrer 217.154.97.2
HideReferrer altavista.com
HideReferrer answerbus
HideReferrer search.aol.com
HideReferrer search.aol.com.au
HideReferrer aolsearch.aol.ca
HideReferrer aolsearch.aol.com
HideReferrer aolsearch.aol.co.uk
HideReferrer find.web.aol.com
HideReferrer teensearch.aol.com
HideReferrer alltheweb.com
HideReferrer ask
HideReferrer avantbrowser.com
HideReferrer lb.bcentral.com
HideReferrer bloogz.com
HideReferrer dayofbirth.co.uk
HideReferrer directhit
HideReferrer dogpile
HideReferrer email
HideReferrer excite
HideReferrer feedster.com
HideReferrer google
HideReferrer hotbot
HideReferrer infoseek
HideReferrer ipodlounge.com
HideReferrer kooqoo.com
HideReferrer lockergnome
HideReferrer lycos
HideReferrer macintouch.com
HideReferrer macmerc.com
HideReferrer macnn.com
HideReferrer macslash
HideReferrer macsurfer.com
HideReferrer mail
HideReferrer mamma.com
HideReferrer metafilter
HideReferrer search.fr.msn.be
HideReferrer search.latam.yupimsn.com
HideReferrer search.msn
HideReferrer search.t1msn.com.mx
HideReferrer search.xtramsn.co.nz
HideReferrer myway.com
HideReferrer mywebsearch.com
HideReferrer mywireservice.com
HideReferrer search.netscape.com
HideReferrer newsforge
HideReferrer overture.com
HideReferrer search.com
HideReferrer search
HideReferrer smartmobs
HideReferrer syndic8.com
HideReferrer technorati.com
HideReferrer teoma
HideReferrer userland.com
HideReferrer webcrawler.com
HideReferrer webmail
HideReferrer websearch.com
HideReferrer yahoo.com

#GroupUser root Admin users
#GroupUser admin Admin users
#GroupUser wheel Admin users

# The following is a great way to get an overall total
# for browsers, and not display all the detail records.
# (You should use MangleAgent to refine further...)
GroupAgent Check&Get Program: Check&Get (Bookmark Manager)
GroupAgent Custo Program: Custo (Offline Browser http://www.netwu.com)
GroupAgent DLW Program: Download Wonder (Download Manager)
GroupAgent Downloader Program: Website Downloader (Download Manager)
GroupAgent EyeOnSite Program: EyeOnSite (Site Connectivity Monitor - http://www.virdi-software.com/eyeonsite/)
GroupAgent eXactSite Program: eXactSite (HTML authoring. stupid user!)
GroupAgent FavOrg Program: FavOrg (Bookmark Manager)
GroupAgent Fetch Program: Fetch (Offline browser)
GroupAgent GetRight Program: GetRight (Download Manager)
GroupAgent HTTrack Program: HTTrack (Website Copier)
GroupAgent Lachesis Program: Packet Loss Report (ftp.intel.com)
GroupAgent lachesis Program: Packet Loss Report (ftp.intel.com)
GroupAgent MSFrontPage Programming: Microsoft FrontPage (stupid user!)
GroupAgent Offline Program: Offline Explorer (Offline Browser)
GroupAgent Pix Program: Mister Pix (Image Downloader)
GroupAgent pixfinder Program: PixFinder (Image Downloader)
GroupAgent Powermarks Program: Powermarks (Bookmark Manager)
GroupAgent SuperBot Program: SuperBot (Web Site Copier)
GroupAgent SmartDownload Program: SmartDownload (Download Manager)
GroupAgent Teleport Program: Teleport Pro (Offline Browser tenmax.com)
GroupAgent TweakMASTER Program: TweakMASTER (Download Accelerator tweakmaster.com)
GroupAgent UCmore Program: UCmore Toolbar (search accelerator)
GroupAgent WebCopier Program: Web Copier (Offline Browser)
GroupAgent WebReaper Program: WebReaper (Offline Browser)
GroupAgent WebStripper Program: WebStripper (Offline Browser)
GroupAgent WebZIP Program: WebZIP (Offline Browser)
GroupAgent Xenu Program: Xenu (Link Checker)

GroupAgent Alcatel- Device: Alcatel Mobile Phone
GroupAgent AvantGo Device: AvantGo (Offline Browser)
GroupAgent Blazer Device: Blazer (PalmOS browser)
GroupAgent BlackBerry Device: BlackBerry
GroupAgent DoCoMo Device: I-mode Compatible Mobile Phone
GroupAgent Elaine Device: Palm browser
GroupAgent Ericsson Device: Ericsson Mobile Phone
GroupAgent iSiloX Device: iSiloX (PocketPC and PalmPC offline reader)
GroupAgent jBrowser Device: WAP Browser jBrowser (built by Jataayu)
GroupAgent JPluck Device: JPluck (PalmOS offline reader)
GroupAgent LG- Device: LG Electronics Mobile Phone
GroupAgent Liberate Device: Liberate (Digital TV)
GroupAgent Mitsu Device: Mitsubishi Mobile Phone
GroupAgent MOT- Device: Motorola Mobile Phone
GroupAgent Netpliance Device: Netpliance I-Opener
GroupAgent Nokia Device: Nokia Mobile Phone
GroupAgent Panasonic Device: Panasonic Mobile Phone
GroupAgent PHILIPS- Device: Philips Mobile Phone
GroupAgent PowerTV Device: PowerTV Set top boxes
GroupAgent SAGEM- Device: SAGEM Mobile Phone
GroupAgent SAMSUNG- Device: Samsung Mobile Phone
GroupAgent SEC- Device: Samsung Mobile Phone
GroupAgent SHARP- Device: Sharp Mobile Phone
GroupAgent SIE- Device: Siemens Mobile Phone
GroupAgent SonyEricsson Device: Sony/Ericsson Mobile Phone
GroupAgent www.wapsilon.com Device: www.wapsilon.com (WAP browser)
GroupAgent Wapalizer Device: Wapalizer (WAP Browser)
GroupAgent WebGo Device: Offline Browser WebGo (Windows/CE)
GroupAgent WebTV Device: WebTV

GroupAgent AmphetaDesk RSS: AmphetaDesk
GroupAgent Awasu RSS: Awasu
GroupAgent Bloglines RSS: Bloglines.com
GroupAgent BlogStreetBot RSS: BlogStreet.com
GroupAgent BottomFeeder RSS: BottomFeeder (http://www.cincomsmalltalk.com/BottomFeeder/)
GroupAgent effnews.py RSS: EFFnews (http://effbot.org/zone/effnews-exe.htm)
GroupAgent fastbuzz.com RSS: Fastbuzz.com
GroupAgent FeedDemon RSS: Feed Demon
GroupAgent Feedreader RSS: FeedReader
GroupAgent FeedOnFeeds RSS: FeedOnFeeds Reader (http://minutillo.com/steve/feedonfeeds/)
GroupAgent FeedValidator RSS: Archive.org Feed Validator
GroupAgent intraVnews RSS: intraVnews
GroupAgent Liferea RSS: Liferea (http://liferea.sourceforge.net/)
GroupAgent MagpieRSS RSS: MagpieRSS (PHP-based reader)
GroupAgent MovableType RSS: Movable Type
GroupAgent MyHeadlines RSS: MyHeadlines (http://www.jmagar.com/myh4)
GroupAgent NetNewsWire RSS: NetNewsWire
GroupAgent NewsGator RSS: NewsGator
GroupAgent Newz RSS: Newz Crawler
GroupAgent nntp//rss RSS: nntp//rss (http://www.methodize.org/nntprss/)
GroupAgent Radio* RSS: Radio Userland
GroupAgent RssReader RSS: Rss Reader (http://www.RssReader.com)
GroupAgent Oddbot RSS: OddPost.com
GroupAgent PocketFeed RSS: PocketFeed (Pocket PC RSS reader)
GroupAgent PostNuke RSS: PostNuke CMS
GroupAgent PubSub.com RSS: PubSub.com
GroupAgent RssBandit RSS: RSS Bandit
GroupAgent RSSNewsTicker RSS: RSSNewsTicker.com
GroupAgent searchguild RSS: RSS Voyager (http://www.searchguild.com)
GroupAgent SharpReader RSS: SharpReader
GroupAgent Syndigator RSS: Syndigator
GroupAgent Syndirella RSS: Syndirella
GroupAgent UltraLiberalFeedParser RSS: Ultra Liberal Feed Parser from Mark Pilgrim
GroupAgent Urchin RSS: Urchin (http://urchin.sourceforge.net/)
GroupAgent Wildgrape RSS: Wildgrape NewsDesk
GroupAgent wlannews-spider RSS: Wlannews (http://wlannews.otaku42.de/)

GroupAgent china SpamBot: china local browse 2.6
GroupAgent China SpamBot: china local browse 2.6
GroupAgent cloakBrowser SpamBot: Fantoma
GroupAgent compatible) SpamBot: Pretends to be Mozilla 3.0
GroupAgent Dattatec.com-Sitios-Top SpamBot: Referrer Spam for Dattatec.com
GroupAgent DTS SpamBot: Beijing Express Email Address Extractor
GroupAgent EmailSiphon SpamBot: EmailSiphon
GroupAgent fantomBrowser SpamBot: Fantoma
GroupAgent fantomCrew SpamBot: Fantoma
GroupAgent Franklin SpamBot: Franklin Locator
GroupAgent Finder SpamBot: Mac Finder
GroupAgent iaea.org SpamBot: Atomic Harvester 2000
GroupAgent Industry SpamBot: Industry Program
GroupAgent IUFW SpamBot: IUFW Web
GroupAgent IUPUI SpamBot: IUPUI Research Bot
GroupAgent Lincoln SpamBot: Lincoln State Web Browser
GroupAgent LinkSweeper SpamBot: LinkSweeper
GroupAgent Microcomputers SpamBot: Franklin Locator
GroupAgent Missauga SpamBot: Missauga Locate
GroupAgent Missigua SpamBot: Missauga Locate
GroupAgent NationalDirectory SpamBot: National Directory Email Harvester
GroupAgent Rainbow SpamBot: Under the Rainbow
GroupAgent Shareware SpamBot: Program Shareware
GroupAgent stealthBrowser SpamBot: Fantoma
GroupAgent Sweeper SpamBot: Mail Sweeper
GroupAgent WEP SpamBot: WEP Search

GroupAgent 348NorthNews Spider: 348north.com
GroupAgent agricola Spider: http://www.conoze.com/agricola
GroupAgent almaden.ibm.com/cs/crawler Spider: almaden.ibm.com
GroupAgent amzn_assoc Spider: Amazon Associates
GroupAgent AnswerBus Spider: AnswerBus.com
GroupAgent antibot Spider: Antidot.net http://www.antidot.net/Welcome/jsp/robots.html
GroupAgent Art-Online Spider: Art-Online.com
GroupAgent http://Ask.24x.Info/ Spider: MnogoSearch.org
GroupAgent ASPseek Spider: ASPseek.org free search engine software
GroupAgent aspseek Spider: ASPseek.org free search engine software
GroupAgent augurfind Spider: augurnet.ch (Swiss Search Engine)
GroupAgent Baiduspider Spider: Baidu.com
GroupAgent BarraHomeCrawler Spider: Barrahome.org
GroupAgent BBot Spider: http://www.otthon.net/search/
GroupAgent Bilbo Spider: wise-guys.nl
GroupAgent blo.gs Spider: blo.gs
GroupAgent BlogBot Spider: Blogdex.net
GroupAgent Blogdigger Spider: Blogdigger.com
GroupAgent Blogosphere Spider: Blogosphere.us
GroupAgent BlogPulse Spider: Blogpulse.com
GroupAgent blogrunner Spider: Blogrunner.com
GroupAgent BlogShares Spider: BlogShares.com
GroupAgent BlogVisionEye Spider: Blogvision.com
GroupAgent Blogwise Spider: Blogwise.com
GroupAgent bloogz Spider: bloogz.com
GroupAgent boitho.com Spider: boitho.com
GroupAgent bookwatch@onfocus.com Spider: OnFocus.com Weblog BookWatch
GroupAgent books@onfocus.com Spider: OnFocus.com Weblog BookWatch
GroupAgent brainoff.com/geoblog/ Spider: The World as a Blog (brainoff.com/geoblog/)
GroupAgent www.business-socket.com Spider: www.business-socket.com
GroupAgent CJNetworkQuality Spider: CommissionJunction.com
GroupAgent combine Spider: http://www.lub.lu.se/combine/
GroupAgent COMBINE Spider: http://www.lub.lu.se/combine/
GroupAgent Computer_and_Automation_Research_Institute_Crawler Spider: sztaki.hu
GroupAgent CoolBot Spider: www.suchmaschine21.de (German Search Engine)
GroupAgent CoologFeedSpider Spider: CoolLog http://www.webfanatic.lunarpages.com/coolog/
GroupAgent CopyHunter Spider: AWstats referrer log analyzer
GroupAgent Cowbot Spider: Naver.com (Korean Search Engine)
GroupAgent cyberalert Spider: cyberalert.com
GroupAgent daypopbot Spider: DayPop.com
GroupAgent DeepIndex Spider: DeepIndex.com
GroupAgent DiceBot Spider: Dicelared.com
GroupAgent Ecosystem/development Spider: Blogging Ecosystem
GroupAgent EgotoBot Spider: Egoto.com
GroupAgent ETS Spider: Freetranslation.com
GroupAgent exactseek.com Spider: exactseek.com
GroupAgent Exalead Spider: Exalead.com (AOL France)
GroupAgent FAST Spider: All The Web
GroupAgent Fast Spider: All The Web
GroupAgent Feedster Spider: Feedster.com
GroupAgent FlickBot Spider: DivX Networks FlickBot
GroupAgent Fluffy Spider: Searchhippo.com
GroupAgent Frontier Spider: Weblogs.com
GroupAgent Gaisbot Spider: GAIS (http://gais.cs.ccu.edu.tw/ )
GroupAgent GalaxyBot Spider: Galaxy.com
GroupAgent Genome Spider: Waypath.com
GroupAgent GeonaBot Spider: Geona.com
GroupAgent GERT! Spider: http://www.gert.erichsystems.com/
GroupAgent Gigabot Spider: Gigablast.com
GroupAgent Google* Spider: Google.com
GroupAgent gossamer-threads.com Spider: Links SQL
GroupAgent grub-client Spider: Grub.org
GroupAgent gsa-crawler Spider: Google Search Appliance
GroupAgent Hatena Spider: Hatena (http://a.hatena.ne.jp/ )
GroupAgent HelpSpy Spider: HelpSpy.com
GroupAgent htdig Spider: htdig (Open Source Search Engine)
GroupAgent ia_archiver Spider: Archive.org
GroupAgent INGRID/3.0 Spider: ilse.nl (Dutch search engine)
GroupAgent InternetSeer Spider: InternetSeer.com (Web Site Monitoring)
GroupAgent internetseer Spider: InternetSeer.com (Web Site Monitoring)
GroupAgent IXE Spider: ideare.com
GroupAgent janes-blogosphere Spider: BlogMatrix.com
GroupAgent jiffe Spider: jiffe.com
GroupAgent k2spider Spider: Verity Spider
GroupAgent KooQoo Spider: KooQoo.com
GroupAgent kozono Spider: OCN (http://www.ocn.ne.jp)
GroupAgent larbin Spider: larbin (http://sourceforge.net/projects/larbin/)
GroupAgent Leknor.com Spider: Leknor.com GZIP Tester
GroupAgent Linkbot Spider: Linkbot link monitoring tool (Watchfire.com)
GroupAgent LinkHype Spider: LinkHype.com
GroupAgent LinksManager.com Spider: LinksManager.com
GroupAgent LinkWalker Spider: seventwentyfour.com
GroupAgent lithopssoft Spider: HTML Link Validator (http://www.lithopssoft.com)
GroupAgent LNSpiderguy Spider: Lexis-Nexis
GroupAgent Megite Spider: Megite.com
GroupAgent metamark Spider: Metamark.net
GroupAgent MnogoSearch Spider: MnogoSearch.org
GroupAgent mogimogi Spider: www.goo.ne.jp (Japanese Search Engine)
GroupAgent MSNBOT Spider: MSN.com
GroupAgent msnbot Spider: MSN.com
GroupAgent MyWireServiceBot Spider: MyWireService.com
GroupAgent NaverRobot Spider: Naver.com (Korean Search Engine)
GroupAgent NaverBot Spider: Naver.com (Korean Search Engine)
GroupAgent Netcraft Spider: Netcraft Web Survey
GroupAgent NetResearchServer Spider: Look.com
GroupAgent NIF Spider: Newsisfree.com
GroupAgent NG/1.0 Spider: Exalead.com (AOL France)
GroupAgent NITLE Spider: Blogcensus.net
GroupAgent NPBot Spider: NameProtect.com
GroupAgent NRK-bruker Spider: NRK.no
GroupAgent Nutch Spider: Nutch (nutch.org)
GroupAgent Openbot Spider: OpenFind (http://www.openfind.com.tw/)
GroupAgent Oracle Spider: Oracle Ultra Search
GroupAgent Overture-WebCrawler Spider: Overture
GroupAgent PerMan Spider: Big Globe (http://search.biglobe.ne.jp/)
GroupAgent PolyBot Spider: PolyBot (http://cis.poly.edu/polybot/)
GroupAgent Pompos Spider: Dir.com
GroupAgent Popdexter Spider: Popdex.com
GroupAgent psbot Spider: Picsearch.com
GroupAgent QuepasaCreep Spider: Quepasa.com (Spanish site)
GroupAgent Robozilla Spider: Link Checker for Dmoz.org
GroupAgent Scooter Spider: Altavista
GroupAgent Scrubby Spider: scrubtheweb.com
GroupAgent search.ch Spider: search.ch
GroupAgent searchspider.com Spider: searchspider.com
GroupAgent SearchSpider.com Spider: searchspider.com
GroupAgent semanticdiscovery Spider: semanticdiscovery.com
GroupAgent SideWinder Spider: Infoseek
GroupAgent slurp@inktomi.com Spider: Inktomi
GroupAgent Speedy Spider: Entireweb.com
GroupAgent spider@spider.ilab.sztaki.hu Spider: http://www.ilab.sztaki.hu/websearch/
GroupAgent SpiderKU Spider: Kasetsart University (Thailand. http://www.cpe.ku.ac.th/ )
GroupAgent Spinne Spider: webauskunft.at
GroupAgent Steeler Spider: Kitsuregawa Laboratory, The University of Tokyo
GroupAgent SurveyBot Spider: whois.sc
GroupAgent Syndic8 Spider: Syndic8.com
GroupAgent Tagword Spider: Tagword - http://tagword.com/dmoz_survey.php
GroupAgent Teoma Spider: Teoma
GroupAgent Teradex Spider: Teradex.com (directory)
GroupAgent Terrar Spider: Terrar (http://www.terrar.com)
GroupAgent Technoratibot Spider: Technorati
GroupAgent T-H-U-N-D-E-R-S-T-O-N-E Spider: Webinator (http://www.thunderstone.com/texis/site/pages/webinator.html)
GroupAgent timboBot Spider: BreakingBlogs.com
GroupAgent Tkensaku Spider: Tkensaku (http://www.tkensaku.com/q.html)
GroupAgent TurnitinBot Spider: Turnitin.com
GroupAgent http://www.tutorgig.com/ Spider: tutorgig.com
GroupAgent UbiCrawler Spider: UbiCrawler (http://ubi.imc.pi.cnr.it/projects/ubicrawler/)
GroupAgent Vagabondo Spider: kobala.nl
GroupAgent verzamelgids Spider: verzamelgids.nl
GroupAgent vias.ncsa.uiuc.edu Spider: VIAS (http://vias.ncsa.uiuc.edu)
GroupAgent Vivante Spider: Vivante.com
GroupAgent VoilaBot Spider: Voila.com
GroupAgent W3C_Validator Spider: W3C Validator
GroupAgent www.walhello.com Spider: Walhello.com
GroupAgent Waypath Spider: Waypath.com
GroupAgent WebCapture Spider: WebCapture.biz
GroupAgent Webclipping Spider: Webclipping.com
GroupAgent WebFilter Spider: http://www.ils.unc.edu/webfilter/
GroupAgent WebGather Spider: City Polytechnic of Hong Kong
GroupAgent WebRACE Spider: WebRACE (University of Cyprus, Distributed Crawler)
GroupAgent WebSearchBench Spider: WebSearchBench (http://websearchbench.cs.uni-dortmund.de/)
GroupAgent websitealert.net Spider: websitealert.net (Monitoring System)
GroupAgent Wotbox Spider: Wotbox.com
GroupAgent WWWeasel Spider: WWWeasel.de
GroupAgent Zao Spider: Zao (http://www.kototoi.org/zao/)
GroupAgent Zealbot Spider: Looksmart.com
GroupAgent ZyBorg Spider: WiseNut.com

GroupAgent curl Programming: curl library (PHP)
GroupAgent Incutio Programming: Incutio httpclient (PHP class - http://scripts.incutio.com/httpclient/)
GroupAgent Indy Programming: Indy (Delphi-based client)
GroupAgent Java Programming: Java-based client
GroupAgent Jakarta Programming: Jakarta (Java)
GroupAgent libwww-perl Programming: LIB-WWW (Perl library)
GroupAgent LWP: Programming: LWP::Simple (Perl library)
GroupAgent NEWT Programming: Newt Active X SDK
GroupAgent OPWV-SDK Programming: OpenWave Mobile Development SDK
GroupAgent PEAR Programming: PEAR Library (PHP)
GroupAgent PHP Programming: PHP-based client
GroupAgent Python-urllib Programming: URLLIB (Python library)
GroupAgent rdflib Programming: rdflib (Python RDF library)
GroupAgent RPT-HTTPClient Programming: RPT-HTTP (Java)
GroupAgent Snoopy Programming: Snoopy (PHP class - http://snoopy.sourceforge.net/ )
GroupAgent SOFTWING_TEAR_AGENT Programming: Softwing Tear Agent (Active Server Pages)
GroupAgent W3CRobot Programming: Webbot Library (http://www.w3.org/Robot/)
GroupAgent Wget Programming: Wget library (http://www.gnu.org/software/wget/wget.html)
GroupAgent WinHttp.WinHttpRequest Program: WinHttp.WinHttpRequest library (Visual Basic)
GroupAgent Works! Programming: IP*Works! (Windows library - http://nsoftware.com)

GroupAgent Bison Proxy: Proxomitron (Proxomitron.info)
GroupAgent BorderManager Proxy: Novell Border Manager Security Suite
GroupAgent CE-Preload Proxy: Cisco Content Engine
GroupAgent DA Proxy: DA
GroupAgent junkbuster Proxy: junkbuster (junkbusters.com)

GroupAgent AppleWebKit Safari (OSX)
GroupAgent avantbrowser Avant Browser (http://www.avantbrowser.com)
GroupAgent BFS_method BeOS browser
GroupAgent Camino Mozilla-based browser Camino (OSX)
GroupAgent danux GNU/Linux (http://daniel.slashlog.org/)
GroupAgent Firebird Mozilla Firebird
GroupAgent iCab iCab (Mac)
GroupAgent Konqueror Konqueror
GroupAgent Links Links (Text-based browser)
GroupAgent Lynx* Lynx (Text-based browser)
GroupAgent Lotus-Notes Lotus Notes
GroupAgent NCBrowser NCBrowser (RISC OS)
GroupAgent NetPositive NetPositive (BeOS)
GroupAgent Opera Opera
GroupAgent SlimBrowser SlimBrowser (http://www.flashpeak.com/sbrowser/sbrowser.htm)
GroupAgent w3m w3m (Text-based browser - http://w3m.sourceforge.net/ )
GroupAgent rv:1.4 Mozilla 1.4
GroupAgent 3.01 Navigator 3.01 (16-bit version)
GroupAgent 3.02 Internet Explorer 3.02
GroupAgent 4.01 Internet Explorer 4.01
GroupAgent 5.01 Internet Explorer 5.01
GroupAgent 5.0 Internet Explorer 5.0
GroupAgent 5.23 Internet Explorer (Mac)
GroupAgent 5.22 Internet Explorer (Mac)
GroupAgent 5.21 Internet Explorer (Mac)
GroupAgent 5.17 Internet Explorer (Mac)
GroupAgent 5.16 Internet Explorer (Mac)
GroupAgent 5.15 Internet Explorer (Mac)
GroupAgent 5.14 Internet Explorer (Mac)
GroupAgent 5.13 Internet Explorer (Mac)
GroupAgent 5.12 Internet Explorer (Mac)
GroupAgent 5.5 Internet Explorer 5.5 (Windows)
GroupAgent 6.0 Internet Explorer 6.0 (Windows)
GroupAgent Moozilla Moozilla (http://www.moo.ca/moozilla)
GroupAgent Mozilla/1.1 Mozilla 1.1
GroupAgent Mozilla/3.04 Netscape 3.04
GroupAgent Mozilla/3.04Gold Netscape 3.04
GroupAgent Mozilla/4.04 Netscape 4
GroupAgent Mozilla/4.05 Netscape 4
GroupAgent Mozilla/4.06 Netscape 4
GroupAgent Mozilla/4.08 Netscape 4
GroupAgent Mozilla/4.51 Netscape 4.5
GroupAgent Mozilla/4.5 Netscape 4.5
GroupAgent Mozilla/4.61 Netscape 4.6
GroupAgent Mozilla/4.6 Netscape 4.6
GroupAgent Mozilla/4.7 Netscape 4.7
GroupAgent Mozilla/4.8 Netscape 4.8
GroupAgent MSIE Internet Explorer
GroupAgent Mozilla Netscape

HideAgent rv:1.4
HideAgent 3.01
HideAgent 3.02
HideAgent 4.01
HideAgent 5.0
HideAgent 5.01
HideAgent 5.12
HideAgent 5.13
HideAgent 5.14
HideAgent 5.15
HideAgent 5.16
HideAgent 5.17
HideAgent 5.21
HideAgent 5.22
HideAgent 5.23
HideAgent 5.5
HideAgent 6.0
HideAgent 348NorthNews
HideAgent agricola
HideAgent Alcatel-
HideAgent almaden.ibm.com/cs/crawler
HideAgent AmphetaDesk
HideAgent amzn_assoc
HideAgent AnswerBus
HideAgent Art-Online
HideAgent antibot
HideAgent AppleWebKit
HideAgent http://Ask.24x.Info/
HideAgent ASPseek
HideAgent aspseek
HideAgent augurfind
HideAgent avantbrowser
HideAgent AvantGo
HideAgent Awasu
HideAgent Baiduspider
HideAgent BarraHomeCrawler
HideAgent BBot
HideAgent BFS_method
HideAgent Bilbo
HideAgent Bison
HideAgent BlackBerry
HideAgent Blazer
HideAgent blo.gs
HideAgent BlogBot
HideAgent Blogdigger
HideAgent Bloglines
HideAgent Blogosphere
HideAgent BlogPulse
HideAgent blogrunner
HideAgent BlogShares
HideAgent BlogStreetBot
HideAgent BlogVisionEye
HideAgent Blogwise
HideAgent bloogz
HideAgent boitho.com
HideAgent bookwatch@onfocus.com
HideAgent books@onfocus.com
HideAgent BorderManager
HideAgent BottomFeeder
HideAgent brainoff.com/geoblog/
HideAgent www.business-socket.com
HideAgent Camino
HideAgent CE-Preload
HideAgent Check&Get
HideAgent china
HideAgent China
HideAgent CJNetworkQuality
HideAgent cloakBrowser
HideAgent combine
HideAgent COMBINE
HideAgent compatible)
HideAgent Computer_and_Automation_Research_Institute_Crawler
HideAgent CoolBot
HideAgent CoologFeedSpider
HideAgent CopyHunter
HideAgent Cowbot
HideAgent curl
HideAgent Custo
HideAgent cyberalert
HideAgent DA
HideAgent danux
HideAgent Dattatec.com-Sitios-Top
HideAgent daypopbot
HideAgent DeepIndex
HideAgent DiceBot
HideAgent DLW
HideAgent DoCoMo
HideAgent Downloader
HideAgent DTS
HideAgent Ecosystem/development
HideAgent effnews.py
HideAgent EgotoBot
HideAgent Elaine
HideAgent EmailSiphon
HideAgent Ericsson
HideAgent ETS
HideAgent eXactSite
HideAgent Exalead
HideAgent exactseek.com
HideAgent EyeOnSite
HideAgent fantomBrowser
HideAgent fantomCrew
HideAgent FAST
HideAgent Fast
HideAgent fastbuzz.com
HideAgent FavOrg
HideAgent FeedDemon
HideAgent Feedreader
HideAgent FeedOnFeeds
HideAgent Feedster
HideAgent FeedValidator
HideAgent Fetch
HideAgent Finder
HideAgent Firebird
HideAgent FlickBot
HideAgent Fluffy
HideAgent Franklin
HideAgent Frontier
HideAgent Gaisbot
HideAgent GalaxyBot
HideAgent Genome
HideAgent GeonaBot
HideAgent GERT!
HideAgent GetRight
HideAgent Gigabot
HideAgent grub-client
HideAgent Google*
HideAgent gossamer-threads.com
HideAgent gsa-crawler
HideAgent Hatena
HideAgent HelpSpy
HideAgent htdig
HideAgent HTTrack
HideAgent ia_archiver
HideAgent iaea.org
HideAgent iCab
HideAgent Incutio
HideAgent Industry
HideAgent Indy
HideAgent INGRID/3.0
HideAgent InternetSeer
HideAgent internetseer
HideAgent intraVnews
HideAgent iSiloX
HideAgent IUFW
HideAgent IUPUI
HideAgent IXE
HideAgent Jakarta
HideAgent janes-blogosphere
HideAgent Java
HideAgent jBrowser
HideAgent jiffe
HideAgent junkbuster
HideAgent JPluck
HideAgent k2spider
HideAgent KooQoo
HideAgent kozono
HideAgent Lachesis
HideAgent lachesis
HideAgent larbin
HideAgent Leknor.com
HideAgent LG-
HideAgent Liberate
HideAgent libwww-perl
HideAgent Liferea
HideAgent Lincoln
HideAgent Linkbot
HideAgent LinkHype
HideAgent Links
HideAgent LinksManager.com
HideAgent LinkSweeper
HideAgent LinkWalker
HideAgent lithopssoft
HideAgent LNSpiderguy
HideAgent Lotus-Notes
HideAgent Lynx*
HideAgent MagpieRSS
HideAgent Megite
HideAgent metamark
HideAgent Microcomputers
HideAgent Missauga
HideAgent Missigua
HideAgent Mitsu
HideAgent mogimogi
HideAgent Moozilla
HideAgent MOT-
HideAgent MovableType
HideAgent Mozilla/1.1
HideAgent Mozilla/3.04
HideAgent Mozilla/3.04Gold
HideAgent Mozilla/4.04
HideAgent Mozilla/4.05
HideAgent Mozilla/4.06
HideAgent Mozilla/4.08
HideAgent Mozilla/4.5
HideAgent Mozilla/4.51
HideAgent Mozilla/4.6
HideAgent Mozilla/4.61
HideAgent Mozilla/4.7
HideAgent Mozilla/4.8
HideAgent MSFrontPage
HideAgent MSNBOT
HideAgent msnbot
HideAgent MyHeadlines
HideAgent MyWireServiceBot
HideAgent NationalDirectory
HideAgent NaverBot
HideAgent NaverRobot
HideAgent NCBrowser
HideAgent Netcraft
HideAgent NetNewsWire
HideAgent Netpliance
HideAgent NetPositive
HideAgent NetResearchServer
HideAgent NewsGator
HideAgent NEWT
HideAgent Newz
HideAgent NG/1.0
HideAgent NIF
HideAgent NITLE
HideAgent nntp//rss
HideAgent Nokia
HideAgent NPBot
HideAgent NRK-bruker
HideAgent Nutch
HideAgent Openbot
HideAgent Opera
HideAgent Oddbot
HideAgent Offline
HideAgent OPWV-SDK
HideAgent Oracle
HideAgent Overture-WebCrawler
HideAgent Panasonic
HideAgent PEAR
HideAgent PerMan
HideAgent PHILIPS-
HideAgent PHP
HideAgent pixfinder
HideAgent Pix
HideAgent PocketFeed
HideAgent PolyBot
HideAgent Pompos
HideAgent Popdexter
HideAgent PostNuke
HideAgent Powermarks
HideAgent PowerTV
HideAgent psbot
HideAgent PubSub.com
HideAgent Python-urllib
HideAgent QuepasaCreep
HideAgent Radio*
HideAgent Rainbow
HideAgent rdflib
HideAgent Robozilla
HideAgent RssBandit
HideAgent RSSNewsTicker
HideAgent RssReader
HideAgent RPT-HTTPClient
HideAgent SAGEM-
HideAgent SAMSUNG
HideAgent Scrubby
HideAgent search.ch
HideAgent SHARP-
HideAgent SideWinder
HideAgent slurp@inktomi.com
HideAgent Scooter
HideAgent searchguild
HideAgent searchspider.com
HideAgent SearchSpider.com
HideAgent SEC-
HideAgent semanticdiscovery
HideAgent SIE-
HideAgent SharpReader
HideAgent Shareware
HideAgent SlimBrowser
HideAgent Snoopy
HideAgent SOFTWING_TEAR_AGENT
HideAgent SonyEricsson
HideAgent spider@spider.ilab.sztaki.hu
HideAgent SpiderKU
HideAgent Spinne
HideAgent SmartDownload
HideAgent Speedy
HideAgent stealthBrowser
HideAgent Steeler
HideAgent SuperBot
HideAgent SurveyBot
HideAgent Sweeper
HideAgent Syndic8
HideAgent Syndirella
HideAgent Syndigator
HideAgent Tagword
HideAgent Technoratibot
HideAgent Teleport
HideAgent Teoma
HideAgent Teradex
HideAgent Terrar
HideAgent T-H-U-N-D-E-R-S-T-O-N-E
HideAgent timboBot
HideAgent Tkensaku
HideAgent TurnitinBot
HideAgent http://www.tutorgig.com/
HideAgent TweakMASTER
HideAgent UbiCrawler
HideAgent UCmore
HideAgent UltraLiberalFeedParser
HideAgent Urchin
HideAgent Vagabondo
HideAgent verzamelgids
HideAgent vias.ncsa.uiuc.edu
HideAgent Vivante
HideAgent VoilaBot
HideAgent W3C_Validator
HideAgent W3CRobot
HideAgent w3m
HideAgent www.walhello.com
HideAgent www.wapsilon.com
HideAgent Wapalizer
HideAgent Waypath
HideAgent WebCapture
HideAgent Webclipping
HideAgent WebCopier
HideAgent WebFilter
HideAgent WebGather
HideAgent WebGo
HideAgent WebRACE
HideAgent WebReaper
HideAgent WebSearchBench
HideAgent websitealert.net
HideAgent WebStripper
HideAgent WebTV
HideAgent WebZIP
HideAgent WEP
HideAgent Wget
HideAgent Wildgrape
HideAgent WinHttp.WinHttpRequest
HideAgent wlannews-spider
HideAgent Works!
HideAgent Wotbox
HideAgent WWWeasel
HideAgent Xenu
HideAgent Zao
HideAgent Zealbot
HideAgent ZyBorg

# HideAllSites allows forcing individual sites to be hidden in the
# report. This is particularly useful when used in conjunction
# with the "GroupDomains" feature, but could be useful in other
# situations as well, such as when you only want to display grouped
# sites (with the GroupSite keywords...). The value for this
# keyword can be either 'yes' or 'no', with 'no' the default,
# allowing individual sites to be displayed.

#HideAllSites no

# The GroupDomains keyword allows you to group individual hostnames
# into their respective domains. The value specifies the level of
# grouping to perform, and can be thought of as 'the number of dots'
# that will be displayed.
# For example, if a visiting host is named
# cust1.tnt.mia.uu.net, a domain grouping of 1 will result in just
# "uu.net" being displayed, while a 2 will result in "mia.uu.net".
# The default value of zero disables this feature. Domains will only
# be grouped if they do not match any existing "GroupSite" records,
# which allows overriding this feature with your own if desired.

GroupDomains 1

# The GroupShading keyword allows grouped rows to be shaded in the report.
# Useful if you have lots of groups and individual records that
# intermingle in the report, and you want to differentiate the group
# records a little more. Value can be 'yes' or 'no', with 'yes'
# being the default.

GroupShading yes

# GroupHighlight allows the group record to be displayed in BOLD.
# Can be either 'yes' or 'no' with the default 'yes'.

GroupHighlight yes

# The Ignore* keywords allow you to completely ignore log records based
# on hostname, URL, user agent, referrer or username. I hesitated in
# adding these, since the Webalizer was designed to generate _accurate_
# statistics about a web server's performance. By choosing to ignore
# records, the accuracy of reports becomes skewed, negating why I wrote
# this program in the first place. However, due to popular demand, here
# they are. Use the same as the Hide* keywords, where the value can have
# a leading or trailing wildcard '*'. Use at your own risk ;)

#IgnoreSite bad.site.net
#IgnoreURL /test*
IgnoreURL /editor*
IgnoreURL /other*
#IgnoreReferrer file:/*
#IgnoreAgent RealPlayer
#IgnoreUser root

# The Include* keywords allow you to force the inclusion of log records
# based on hostname, URL, user agent, referrer or username. They take
# precedence over the Ignore* keywords. Note: Using Ignore/Include
# combinations to selectively process parts of a web site is _extremely
# inefficient_!!! Avoid doing so if possible (ie: grep the records to a
# separate file if you really want that kind of report).

# Example: Only show stats on Joe User's pages...
#IgnoreURL *
#IncludeURL ~joeuser*

# Or based on an authenticated username
#IgnoreUser *
IgnoreUser tristan
#IncludeUser someuser

# The MangleAgents keyword allows you to specify how much, if any, The
# Webalizer should mangle user agent names. This allows several levels of
# detail to be produced when reporting user agent statistics. There are six
# levels that can be specified, which define different levels of detail
# suppression. Level 5 shows only the browser name (MSIE or Mozilla)
# and the major version number. Level 4 adds the minor version number
# (single decimal place). Level 3 displays the minor version to two
# decimal places. Level 2 will add any sub-level designation (such
# as Mozilla/3.01Gold or MSIE 3.0b). Level 1 will attempt to also add
# the system type if it is specified. The default Level 0 displays the
# full user agent field without modification and produces the greatest
# amount of detail. User agent names that can't be mangled will be
# left unmodified.

MangleAgents 0

# The SearchEngine keywords allow specification of search engines and
# their query strings on the URL. These are used to locate and report
# what search strings are used to find your site. The first word is
# a substring to match in the referrer field that identifies the search
# engine, and the second is the URL variable used by that search engine
# to define its search terms.

SearchEngine 348north.com search=
SearchEngine abcsearch.com terms=
SearchEngine alltheweb.com q=
SearchEngine altavista.com q=
SearchEngine antisearch.net KEYWORDS=
SearchEngine aolsearch query=
SearchEngine ask.com ask=
SearchEngine ask.co.uk ask=
SearchEngine augurnet.ch q=
SearchEngine baidu.com word=
SearchEngine barrahome.org query=
SearchEngine bbc. q=
SearchEngine blogdex.net q=
SearchEngine blogdigger.com queryString=
SearchEngine blogosphere.us s=
SearchEngine blogmatrix.com search=
SearchEngine blogwise.com query=
SearchEngine boitho.com query=
SearchEngine btopenworld query=
SearchEngine buscador.ya.com q=
SearchEngine by.com query=
SearchEngine daypop.com q=
SearchEngine dir.com req=
SearchEngine dmoz.org search=
SearchEngine dogpile.com q=
SearchEngine dpxml qkw=
SearchEngine egoto.com keywords=
SearchEngine elf8888.at query0=
SearchEngine eureka. q=
SearchEngine excite search=
SearchEngine feedster.com q=
SearchEngine fireball.de q=
SearchEngine freeserve q=
SearchEngine gais.cs.ccu.edu.tw q=
SearchEngine galaxy.com k=
SearchEngine gigablast.com q=
SearchEngine google q=
SearchEngine goo.ne.jp MT=
SearchEngine hotbot. query=
SearchEngine infoseek. qt=
SearchEngine ixquick.com query=
SearchEngine iwon searchfor=
SearchEngine kobala.nl qr=
SearchEngine lycos. query=
SearchEngine look.com q=
SearchEngine looksmart key=
SearchEngine mamma. query=
SearchEngine metacrawler q=
SearchEngine msn. q=
SearchEngine msxml qkw=
SearchEngine mysearch.com searchfor=
SearchEngine naver.com query=
SearchEngine netscape. query=
SearchEngine northernlight. qr=
SearchEngine ntlworld.com q=
SearchEngine openfind query=
SearchEngine overture. Keywords=
SearchEngine picsearch.com q=
SearchEngine popdex query=
SearchEngine quepasa.com q=
SearchEngine search.com qt=
SearchEngine searchspider.com q=
SearchEngine search.earthlink q=
SearchEngine suchmaschine21.de search=
SearchEngine syndic8 ShowMatch=
SearchEngine technorati query=
SearchEngine teensearch query=
SearchEngine teoma q=
SearchEngine teradex.com q=
SearchEngine texis q=
SearchEngine ukplus key=
SearchEngine voila kw=
SearchEngine walhello key=
SearchEngine waypath.com key=
SearchEngine webcrawler searchText=
SearchEngine webfanatic.lunarpages.com q=
SearchEngine whois.sc q=
SearchEngine wisenut q=
SearchEngine yahoo p=

# The Dump* keywords allow the dumping of Sites, URLs, Referrers,
# User Agents, Usernames and Search strings to separate tab delimited
# text files, suitable for import into most database or spreadsheet
# programs.

# DumpPath specifies the path to dump the files. If not specified,
# it will default to the current output directory. Do not use a
# trailing slash ('/').

#DumpPath /var/log/httpd

# The DumpHeader keyword specifies if a header record should be
# written to the file. A header record is the first record of the
# file, and contains the labels for each field written. Normally,
# files that are intended to be imported into a database system
# will not need a header record, while spreadsheets usually do.
# Value can be either 'yes' or 'no', with 'no' being the default.

#DumpHeader no

# DumpExtension allows you to specify the dump filename extension
# to use. The default is "tab", but some programs are picky about
# the filenames they use, so you may change it here (for example,
# some people may prefer to use "csv").

#DumpExtension tab

# These control the dumping of each individual table. The value
# can be either 'yes' or 'no'; the default is 'no'.

DumpSites no
DumpURLs no
DumpReferrers no
DumpAgents no
DumpUsers no
DumpSearchStr no

# End of configuration file... Have a nice day!
FoldSeqErr yes
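
# Example (a sketch only, not part of the original configuration): the
# Dump* keywords described above could be combined as follows to export
# the referrer and search string tables for a spreadsheet. The path is a
# hypothetical placeholder and is an assumption; note that it has no
# trailing slash. DumpExtension only changes the file name extension,
# so the files stay tab delimited. Uncomment the lines to try it.
#DumpPath /var/www/html/usage/dumps
#DumpHeader yes
#DumpExtension txt
#DumpReferrers yes
#DumpSearchStr yes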