Received: with ECARTIS (v1.0.0; list gopher); Wed, 23 Aug 2006 09:30:41 -0500 (CDT) Received: from outbound4.mail.tds.net ([216.170.230.94]) by glockenspiel.complete.org with esmtp (Exim 4.50) id 1GFtkb-0007SI-Qe for gopher@complete.org; Wed, 23 Aug 2006 09:30:40 -0500 Received: from outaamta01.mail.tds.net (outaamta01.mail.tds.net [216.170.230.31]) by outbound4.mail.tds.net (8.13.6/8.13.4) with ESMTP id k7NEUYaB010466 for ; Wed, 23 Aug 2006 09:30:34 -0500 Received: from [127.0.0.1] (really [69.21.205.10]) by outaamta01.mail.tds.net with ESMTP id <20060823143028.EECW5875.outaamta01.mail.tds.net@[127.0.0.1]> for ; Wed, 23 Aug 2006 09:30:28 -0500 Message-ID: <44EC663B.3020305@sdf.lonestar.org> Date: Wed, 23 Aug 2006 09:29:15 -0500 From: Benn Newman User-Agent: Thunderbird 1.5.0.5 (Windows/20060719) MIME-Version: 1.0 To: gopher@complete.org Subject: [gopher] Re: Gopherspace archive References: <33786.69.21.205.10.1155595127.squirrel@69.21.205.10> <20060815014741.GB5040@katherina.lan.complete.org> <44EC0986.9080504@route-add.net> <44EC4C25.3030008@route-add.net> <20060823140721.GD21150@excelhustler.com> In-Reply-To: <20060823140721.GD21150@excelhustler.com> Content-type: text/plain X-Spam-Status: No (score 0.0): none X-Virus-Scanned: by Exiscan on glockenspiel.complete.org at Wed, 23 Aug 2006 09:30:40 -0500 Content-Transfer-Encoding: 8bit X-archive-position: 1367 X-ecartis-version: Ecartis v1.0.0 Sender: gopher-bounce@complete.org Errors-to: gopher-bounce@complete.org X-original-sender: newmanbe@sdf.lonestar.org Precedence: bulk Reply-to: gopher@complete.org List-help: List-unsubscribe: List-software: Ecartis version 1.0.0 List-Id: Gopher X-List-ID: Gopher List-subscribe: List-owner: List-post: List-archive: X-list: gopher John Goerzen wrote: > OK, well there are about half a dozen people that would like a copy of > this. > > Do any of you that have expressed interest have the capability to put it > online where others can download it? > > Before I spend a weekend burning a whole stack of DVDs, perhaps we can > optimize this a bit. > > The 40GB is before compression. After compression with, say, tar.bz2, > it should be more manageable -- but still a significant amount of data. > > -- John I do not have a place to put it up for download; I could, however try to make a full text index (Using something even more portable then the full text search back-end I was using on my Gopher server, a porter's nightmere). I was reading a paper on the indexing system for refer (in a nutshell bibTeX for troff). It can also be used as a general indexer. The paper talks about (very impressive compared to using grep) an index of 32,000,000 bytes (~32megabytes) (apparently, that was all the English text they had on their system! Why do we need these big drives anyway! :)). With all the software and binary stuff taken out, I think it should (nearly) manageable. The index file shouldn't be nearly as big as the whole archive, I could then make a front end to that (yay for sed and awk!). -- Benn Newman -- Binary/unsupported file stripped by Ecartis -- -- Type: application/x-pkcs7-signature -- File: smime.p7s -- Desc: S/MIME Cryptographic Signature