<?xml version="1.0"?>
<!DOCTYPE wml PUBLIC "-//WAPFORUM//DTD WML 1.1//EN"
"http://www.wapforum.org/DTD/wml_1.1.xml">
<wml>
<card id="index" title="Text File" newcontext="true">
<p>
Received: with ECARTIS (v1.0.0; list gopher);
 Wed, 12 Oct 2005 18:46:47 -0500 (CDT)
Received: from netblock-66-159-214-137.dslextreme.com
	([66.159.214.137] helo=floodgap.com ident=nobody)
	by glockenspiel.complete.org with esmtp
	(Exim 4.50)
	id 1EPqJ2-0007g0-9K
	for gopher@complete.org; Wed, 12 Oct 2005 18:46:46 -0500
Received: (from spectre@localhost)
	by floodgap.com (6.6.6.666/2005.03.01) id QAA17070
	for gopher@complete.org; Wed, 12 Oct 2005 16:45:56 -0700
From: Cameron Kaiser &lt;spectre@floodgap.com&gt;
Message-Id: &lt;200510122345.QAA17070@floodgap.com&gt;
Subject: [gopher] Re: New Gopher Wayback Machine Bot
In-Reply-To: &lt;20051012180132.GA19083@complete.org&gt; from John Goerzen at &quot;Oct
 12, 5 01:01:32 pm&quot;
To: gopher@complete.org
Date: Wed, 12 Oct 2005 16:45:56 -0700 (PDT)
X-Mailer: ELM [version 2.4ME+ PL39 (25)]
MIME-Version: 1.0
Content-type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 8bit
X-Spam-Status: No (score 0.4): AWL=0.367, FORGED_RCVD_HELO=0.05
X-Virus-Scanned: by Exiscan on glockenspiel.complete.org at Wed,
 12 Oct 2005 18:46:46 -0500
X-archive-position: 1112
X-ecartis-version: Ecartis v1.0.0
Sender: gopher-bounce@complete.org
Errors-to: gopher-bounce@complete.org
X-original-sender: spectre@floodgap.com
Precedence: bulk
Reply-to: gopher@complete.org
List-help: &lt;mailto:ecartis@complete.org?Subject=help&gt;
List-unsubscribe: &lt;mailto:gopher-request@complete.org?Subject=unsubscribe&gt;
List-software: Ecartis version 1.0.0
List-Id: Gopher &lt;gopher.complete.org&gt;
X-List-ID: Gopher &lt;gopher.complete.org&gt;
List-subscribe: &lt;mailto:gopher-request@complete.org?Subject=subscribe&gt;
List-owner: &lt;mailto:jgoerzen@complete.org&gt;
List-post: &lt;mailto:gopher@complete.org&gt;
List-archive: &lt;http://www.complete.org/mailinglists/archives/&gt;
X-list: gopher
</p>
<p>&gt; Cameron, floodgap.com seems to have some sort of rate limiting and keeps
&gt; giving me a Connection refused error after a certain number of documents
&gt; have been spidered.
</p>
<p>I&#x27;m a little concerned about your project since I do host a number of large
subparts which are actually proxied services, and I think even a gentle bot
going methodically through them would not be pleasant for the other side
(especially if you mean to regularly update your snapshot).
</p>
<p>Veronica-2 doesn&#x27;t actually download content other than non-local selectors
in a directory to get around this problem since it doesn&#x27;t index the
content in any case, just the titles and selector data.
</p>
<p>I do support robots.txt, see
</p>
<p>	gopher.floodgap.com/0/v2/help/indexer
</p>
<p>--
---------------------------------- personal: http://www.armory.com/~spectre/ --
 Cameron Kaiser, Floodgap Systems Ltd * So. Calif., USA * ckaiser@floodgap.com
-- &quot;I&#x27;d love to go out with you, but I&#x27;m joining my split ends individually.&quot; -
</p>
<p></p>
</card>
</wml>
