<?xml version="1.0"?>
<!DOCTYPE wml PUBLIC "-//WAPFORUM//DTD WML 1.1//EN"
"http://www.wapforum.org/DTD/wml_1.1.xml">
<wml>
<card id="index" title="Text File" newcontext="true">
<p>
From: Cameron Kaiser &lt;spectre@floodgap.com&gt;
Message-Id: &lt;200306301521.IAA30688@floodgap.com&gt;
Subject: [gopher] Re: bot&#x27;s running
In-Reply-To: &lt;004001c33ef0$e45c1920$43da5982@killspy&gt; from Ruliz Galaxor at
 &quot;Jun 30, 3 12:18:02 pm&quot;
To: gopher@complete.org
Date: Mon, 30 Jun 2003 08:21:25 -0700 (PDT)
</p>
<p>&gt; Yes, I noticed... and I have a small question or actually favor to ask...
&gt; because I&#x27;m currently using the system in which the type of request should
&gt; also be present (e.g. 0/robots.txt).
&gt; so could the bot both check for robots.txt and 0/robots.txt? or is that a
&gt; problem?
</p>
<p>I think it will probably be okay. This is how it will work though:
</p>
<p>The bot will check for &quot;robots.txt&quot; first. If that works, it is
accepted and nothing else is tried.
If it fails, the bot will check for &quot;0/robots.txt&quot;. If that works, it is
accepted;
otherwise, no robots.txt is used for the site.
</p>
<p>The reason this is worth bringing up is that the two could potentially map to
different selectors/files depending on the server, so the behaviour needs to
be known. Thus selector &quot;robots.txt&quot; always takes precedence if found.
</p>
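<p>For anyone who wants to try the lookup order against their own server, here
is a rough sketch of it in Python. This is not the bot&#x27;s actual code. The names
gopher_fetch and find_robots, the host gopher.example.org, and the rule that an
empty reply or a reply starting with item type 3 counts as a failure are all
assumptions made for illustration.

```python
import socket

# Selectors tried in order; the first one that yields a file wins.
ROBOTS_SELECTORS = ("robots.txt", "0/robots.txt")


def gopher_fetch(host, selector, port=70, timeout=10.0):
    """Fetch one selector over Gopher; return its text, or None on failure."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            # A Gopher request is just the selector followed by CRLF.
            sock.sendall(selector.encode("ascii") + b"\r\n")
            chunks = []
            while True:
                data = sock.recv(4096)
                if not data:
                    break
                chunks.append(data)
    except OSError:
        return None
    text = b"".join(chunks).decode("latin-1")
    # Assumption: an empty reply, or one that begins with item type 3
    # (the Gopher error type), is treated as "no such file".
    if not text.strip() or text.startswith("3"):
        return None
    return text


def find_robots(fetch, selectors=ROBOTS_SELECTORS):
    """Return (selector, text) for the first selector that works, else None."""
    for selector in selectors:
        text = fetch(selector)
        if text is not None:
            return selector, text
    return None


if __name__ == "__main__":
    # Hypothetical host; substitute a real Gopher server to try it.
    result = find_robots(lambda sel: gopher_fetch("gopher.example.org", sel))
    print(result[0] if result else "no robots.txt used for this site")
```

Passing the fetch function in as an argument keeps the precedence logic separate
from the network code, so the order can be checked without a live server.
</p>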
<p>If nobody objects, I&#x27;ll take down the bot for a few minutes
this afternoon and add in the changes. Obviously whenever the bot restarts,
it refetches all robot exclusions; these are held in memory and not in
MySQL, since they&#x27;re transient anyway.
</p>
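<p>As a rough illustration of why a restart means a refetch: exclusions kept
only in process memory look something like the sketch below. The RobotsCache
class and its loader argument are hypothetical names, not the bot&#x27;s real
structures.

```python
class RobotsCache:
    """Per-host robot exclusions held only in memory (nothing is persisted)."""

    def __init__(self, loader):
        # loader(host) returns the exclusions for a host; assumed to do
        # the actual robots.txt fetch.
        self._loader = loader
        self._rules = {}

    def get(self, host):
        # Fetch once per host per process lifetime; a restart starts
        # with an empty dict, so everything is fetched again.
        if host not in self._rules:
            self._rules[host] = self._loader(host)
        return self._rules[host]
```

A fresh RobotsCache after a restart has no entries, so the first lookup for each
host goes back to the server.
</p>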
<p>&gt; greets and keep on the great work,
</p>
<p>Arigatoo :-)
</p>
<p>If people want to look up stats while the bot is crawling, go to
</p>
<p>	gopher://helsinki.floodgap.com/1/world/
</p>
<p>Refresh and watch the numbers change. Great for those coffee breaks.
</p>
<p>--
---------------------------------- personal: http://www.armory.com/~spectre/ --
 Cameron Kaiser, Floodgap Systems Ltd * So. Calif., USA * ckaiser@floodgap.com
-- Greek tailor shop: &quot;Euripedes?&quot; &quot;Yes -- Eumenides?&quot; ------------------------
</p>
</card>
</wml>
