<<< Importing Files
Exporting Results >>>
Below you may find the most typical questions that may be answered using the log file analysis feature.
You can review some general information on log file analysis and a couple of other features here . Below you may take a closer look at examples that provide answers to the questions listed above.
Let's take the web site www.my_web_site_com for an example. The log files for the period of one week between Apr.1st 2003 and Apr.7th 2003 were imported. The web site is administered with Apache 2.0.x web server. If we do not yet have a project for this web site, we need to create it. Importing log files is the first step in the analysis. Then we receive the tree of visits, which will be our main working tool.
Let's search for answers to the questions we listed above.
1)What spiders have indexed the main page of my web site? And which one has done it more frequently?
Since the most relevant parameter here is the page, and then spider, select the "Path-Spider-Date" pattern. Now we rebuild the tree and open the node that corresponds to the home page ("/").
To find an answer to the second part of the question, you'll need to generate a report. Report type - report on indexed pages; grouping is set by Spider; and Summary column should be included into the report. The rest of parameters user defines at his own wish.
2)Has the Google search spider indexed the page « /en/download.html »?
In this case you also use the "Path-Spider-Date" pattern. Locate a node of nesting level 1 that is responsible for the page "/en/download.html", in the visits tree and open it. You can see that Googlebot spider has actually visited this page.
3)Which spider has indexed the largest number of pages? The largest number of forbidden pages?
Here again, you will have to generate a report.
Report type - spider visits report; grouping - date; date detalization - year. Also you should include Summary column into the report. If you want to know what spider has visited the most number of forbidden pages, also include Disallowed column.
4)Do spiders index *.css files?
If this question interests you, you ought to have .css files at your site. There are two ways to find out whether these files are indexed by spiders. First, you may find a node responsible for .css file in the visits tree, or second, you may create an Indexed Page Report and check it for any information on .css files.
5)Which spider has not addressed the robots.txt file?
To find an answer to this question, rebuild the tree using the "Spider-Path-Date" or "Spider-Date-Path" pattern . Spiders that had skipped the robots.txt file, will be marked with a special sign.
<<< Importing Files
Exporting Results >>>
|