Received: with ECARTIS (v1.0.0; list gopher);
 Fri, 28 Dec 2007 10:28:29 -0600 (CST)
Received: from static-71-170-11-156.dllstx.dsl-w.verizon.net ([71.170.11.156]
 helo=turquoise.pongonova.net)
	by glockenspiel.complete.org with esmtp
	(Exim 4.63)
	id 1J8I4Q-0002Q6-QI
	for gopher@complete.org; Fri, 28 Dec 2007 10:28:29 -0600
Received: by turquoise.pongonova.net (Postfix, from userid 1000)
	id 9B38C69C; Fri, 28 Dec 2007 10:29:23 -0600 (CST)
Date: Fri, 28 Dec 2007 10:29:23 -0600
From: brian@pongonova.net
To: gopher@complete.org
Subject: [gopher] Re: Improved binary file detection in Bucktooth 0.2.2
Message-ID: <20071228162923.GA26591@pongonova.net>
References: <20071228072339.GA25327@pongonova.net>
 <200712281349.lBSDnxwg011630@floodgap.com>
Mime-Version: 1.0
Content-type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <200712281349.lBSDnxwg011630@floodgap.com>
User-Agent: Mutt/1.5.5.1i
X-Spam-Status: No (score 0.6): AWL=0.000, NO_REAL_NAME=0.55
X-Virus-Scanned: by Exiscan on glockenspiel.complete.org at Fri,
 28 Dec 2007 10:28:29 -0600
Content-Transfer-Encoding: 8bit
X-archive-position: 1774
X-ecartis-version: Ecartis v1.0.0
Sender: gopher-bounce@complete.org
Errors-to: gopher-bounce@complete.org
X-original-sender: brian@pongonova.net
Precedence: bulk
Reply-to: gopher@complete.org
List-help: <mailto:ecartis@complete.org?Subject=help>
List-unsubscribe: <mailto:gopher-request@complete.org?Subject=unsubscribe>
List-software: Ecartis version 1.0.0
List-Id: Gopher <gopher.complete.org>
X-List-ID: Gopher <gopher.complete.org>
List-subscribe: <mailto:gopher-request@complete.org?Subject=subscribe>
List-owner: <mailto:jgoerzen@complete.org>
List-post: <mailto:gopher@complete.org>
List-archive: <http://www.complete.org/mailinglists/archives/>
X-list: gopher

On Fri, Dec 28, 2007 at 05:49:59AM -0800, Cameron Kaiser wrote:
> The other thing I might do is just expand the number of file extensions
> Bucktooth recognizes and generates item types for, since -B is the
> fall-through case and there will always be datasets falling in the tails
> of the bell curve.

There is a Perl module, File::Type, that identifies file types from
magic numbers, much as "file" does with /etc/magic.  That might be one
approach...
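
To make the idea concrete, here is a rough sketch of magic-number
sniffing feeding a gopher item type.  It is written in Python purely for
illustration; it is not Bucktooth code and does not use the File::Type
API.  The names (MAGIC, gopher_item_type, item_type_for_file), the short
signature table, and the NUL-byte fall-through are all my own
assumptions, standing in for whatever a real magic database would
provide.

```python
# Hypothetical sketch only: not Bucktooth code, not the File::Type API.
# A tiny hand-written signature table stands in for a real magic database.
MAGIC = [
    (b"GIF87a", "g"),             # GIF image
    (b"GIF89a", "g"),             # GIF image
    (b"\x89PNG\r\n\x1a\n", "I"),  # PNG, served as a generic image
    (b"\xff\xd8\xff", "I"),       # JPEG, served as a generic image
    (b"\x1f\x8b", "9"),           # gzip, served as binary
    (b"%PDF-", "9"),              # PDF, served as binary
]


def gopher_item_type(head: bytes) -> str:
    """Guess a gopher item type from the first bytes of a file."""
    for signature, item_type in MAGIC:
        if head.startswith(signature):
            return item_type
    # Assumed fall-through heuristic: a NUL byte means binary ('9'),
    # anything else is treated as plain text ('0').
    return "9" if b"\x00" in head else "0"


def item_type_for_file(path: str, sniff_len: int = 512) -> str:
    """Read only the start of the file and classify it."""
    with open(path, "rb") as fh:
        return gopher_item_type(fh.read(sniff_len))
```

The extension table would still get first crack at a file; something
like this would only run for names it does not recognize, ahead of the
final binary/text guess.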

  --Brian