From mboxrd@z Thu Jan  1 00:00:00 1970
X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/10864
Path: main.gmane.org!not-for-mail
From: Ken Raeburn <raeburn@cygnus.com>
Newsgroups: gmane.emacs.gnus.general
Subject: Re: timezone.el patterns in emacs 19.34
Date: 03 May 1997 19:44:25 -0400
Sender: raeburn@cygnus.com
Message-ID: <tx167x086jq.fsf@cygnus.com>
References: <199705020844.EAA06520@kr-laptop.cygnus.com> <kigwwphmk54.fsf@jagor.srce.hr> <tx1911xl0cp.fsf@cygnus.com> <kigafmdmc9n.fsf@jagor.srce.hr>
NNTP-Posting-Host: coloc-standby.netfonds.no
X-Trace: main.gmane.org 1035150665 26879 80.91.224.250 (20 Oct 2002 21:51:05 GMT)
X-Complaints-To: usenet@main.gmane.org
NNTP-Posting-Date: Sun, 20 Oct 2002 21:51:05 +0000 (UTC)
Return-Path: <ding-request@ifi.uio.no>
Original-Received: from ifi.uio.no (0@ifi.uio.no [129.240.64.2])
	by deanna.miranova.com (8.8.5/8.8.5) with SMTP id QAA15603
	for <steve@miranova.com>; Sat, 3 May 1997 16:52:09 -0700
Original-Received: from cygnus.com (cygnus.com [205.180.230.20]) by ifi.uio.no with ESMTP (8.6.11/ifi2.4) 
	id <BAA20323@ifi.uio.no> for <ding@ifi.uio.no> ; Sun, 4 May 1997 01:44:37 +0200
Original-Received: from tweedledumb.cygnus.com (tweedledumb.cygnus.com [192.80.44.1])
	by cygnus.com (8.8.5/8.8.5) with SMTP id QAA02364
	for <ding@ifi.uio.no>; Sat, 3 May 1997 16:44:29 -0700 (PDT)
Original-Received: from kr-laptop.cygnus.com by tweedledumb.cygnus.com (4.1/4.7) id AA26716; Sat, 3 May 97 19:44:27 EDT
Original-Received: (from raeburn@localhost)
          by kr-laptop.cygnus.com (8.8.5/8.8.4)
	  id TAA05781; Sat, 3 May 1997 19:44:27 -0400
Original-To: ding@ifi.uio.no
In-Reply-To: Hrvoje Niksic's message of 03 May 1997 06:04:52 +0200
Original-Lines: 54
X-Mailer: Gnus v5.4.37/Emacs 19.34
Xref: main.gmane.org gmane.emacs.gnus.general:10864
X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:10864

Hrvoje Niksic <hniksic@srce.hr> writes:

> > Besides, I think you'd probably wind up doing the regexp bit
> > anyways, just at the C level.  These patterns don't have to match
> > the *entire* string, just some substring.
> I would do no regexp matching.  If I had to resort to regexps, then I
> would rather leave it in Lisp, as it is.

Okay, so how would you get around the requirement of matching a
substring, without using regexps?  If you can do it, and get much
better performance than we've got now with the lisp code, great.  But
as I just indicated in other mail, my trivial little patch already
addresses the performance issue to some degree.  So you'll have to do
even better. :-)

And, by the way, if you do go ahead with this, I'd suggest returning
numbers instead of strings when practical.  They can be turned into
strings easily enough in lisp, and the gnus-dd-mmm usage just converts
them back to numbers and discards the strings, which means more
needless garbage to collect.  (Don't underestimate the cost of garbage
collection.  I've managed to speed up some code by tweaking it to
allocate less short-term heap storage.)

> NO!  I hate getdate.y.  First, because of the `.y' thingie. :-)

Aside from the substring issue, I think a parser would be well suited
to this task.  I see two advantages to it:
 * Tokenizes first, then looks for patterns in the tokens; this means
   any backtracking doesn't have to re-process every character.
 * Yacc sets up a FSM for matching multiple sequences in parallel (as
   do the more sophisticated regexp matchers), so if a match is going
   to be made, it'll be made in one pass; the hairy work is done at
   build time.
Certainly, yacc isn't the be-all and end-all of parsing technology, by
a long shot.  But it's up to this task.

>	  Then,
> there are copyright problems with it.

No, there aren't.  The version I'm looking at has a "public domain, no
copyright" label on it.  There would be no problem with incorporating
a copy into emacs.

>	  And, I'd like to have a fast
> routine that matches those 7 or so format strings.

Yes, that's the problem I see with it -- it'll handle more than we
want.  A date header saying "+23hours" is bogus, and should be treated
as such.  Though if getdate.y matches more "real" date formats than
timezone-parse-date, I don't think it would be a bad idea to support
them as well, since this is timezone-parse-date we're talking about,
and not gnus-parse-conforming-date-header.

Ken