From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/15160 Path: main.gmane.org!not-for-mail From: Hrvoje Niksic Newsgroups: gmane.emacs.gnus.general Subject: Re: Cool bug in URL parsing Date: 07 May 1998 17:25:08 +0200 Sender: owner-ding@hpc.uh.edu Message-ID: References: NNTP-Posting-Host: coloc-standby.netfonds.no X-Trace: main.gmane.org 1035154242 20529 80.91.224.250 (20 Oct 2002 22:50:42 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Sun, 20 Oct 2002 22:50:42 +0000 (UTC) Return-Path: Original-Received: from gwyn.tux.org (gwyn.tux.org [207.96.122.8]) by altair.xemacs.org (8.8.8/8.8.8) with ESMTP id JAA13707 for ; Thu, 7 May 1998 09:02:03 -0700 Original-Received: from sina.hpc.uh.edu (root@Sina.HPC.UH.EDU [129.7.3.5]) by gwyn.tux.org (8.8.8/8.8.8) with ESMTP id LAA00790 for ; Thu, 7 May 1998 11:28:12 -0400 Original-Received: from sina.hpc.uh.edu (lists@Sina.HPC.UH.EDU [129.7.3.5]) by sina.hpc.uh.edu (8.7.3/8.7.3) with ESMTP id KAH13361; Thu, 7 May 1998 10:27:19 -0500 (CDT) Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Thu, 07 May 1998 10:25:40 -0500 (CDT) Original-Received: from claymore.vcinet.com (claymore.vcinet.com [208.205.12.23]) by sina.hpc.uh.edu (8.7.3/8.7.3) with SMTP id KAA13343 for ; Thu, 7 May 1998 10:25:31 -0500 (CDT) Original-Received: (qmail 89 invoked by uid 504); 7 May 1998 15:25:12 -0000 Original-Received: (qmail 85 invoked from network); 7 May 1998 15:25:11 -0000 Original-Received: from jagor.srce.hr (hniksic@161.53.2.130) by claymore.vcinet.com with SMTP; 7 May 1998 15:25:10 -0000 Original-Received: (from hniksic@localhost) by jagor.srce.hr (8.8.8/8.8.6) id RAA01159; Thu, 7 May 1998 17:25:08 +0200 (MET DST) Original-To: ding@gnus.org X-Attribution: Hrvoje X-Face: Mie8:rOV<\c/~z{s.X4A{!?vY7{drJ([U]0O=W/xDi&N7XG KV^$k0m3Oe/)'e%3=$PCR&3ITUXH,cK>]bci&Ff%x_>1`T(+M2Gg/fgndU%k*ft [(7._6e0n-V%|%'[c|q:;}td$#INd+;?!-V=c8Pqf}3J In-Reply-To: Karl Kleinpaste's message of "07 May 1998 11:15:47 -0400" Original-Lines: 63 X-Mailer: Gnus v5.6.7/XEmacs 21.0(beta38) - "Repartida" Precedence: list X-Majordomo: 1.94.jlt7 Xref: main.gmane.org gmane.emacs.gnus.general:15160 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:15160 Karl Kleinpaste writes: > If you haven't stopped your setup from doing highlighting of URLs > embedded in text, here's an entertaining glitch to see. > > >From the end of this line containing the sequence to start a " everything will be highlighted as a supposed URL until, for example, > some quoted text shows up to provide the terminator. > > > Such as on this line here. > > Methinks there's a regexp that gets a /little/ too aggressive... What makes you think this is a bug? According to rfc1738: APPENDIX: Recommendations for URLs in Context URIs, including URLs, are intended to be transmitted through protocols which provide a context for their interpretation. In some cases, it will be necessary to distinguish URLs from other possible data structures in a syntactic structure. In this case, is recommended that URLs be preceeded with a prefix consisting of the characters "URL:". For example, this prefix may be used to distinguish URLs from other kinds of URIs. In addition, there are many occasions when URLs are included in other kinds of text; examples include electronic mail, USENET news messages, or printed on paper. In such cases, it is convenient to have a separate syntactic wrapper that delimits the URL and separates it from the rest of the text, and in particular from punctuation marks that might be mistaken for part of the URL. For this purpose, is recommended that angle brackets ("<" and ">"), along with the prefix "URL:", be used to delimit the boundaries of the URL. This wrapper does not form part of the URL and should not be used in contexts in which delimiters are already specified. In the case where a fragment/anchor identifier is associated with a URL (following a "#"), the identifier would be placed within the brackets as well. In some cases, extra whitespace (spaces, linebreaks, tabs, etc.) may need to be added to break long URLs across lines. The whitespace should be ignored when extracting the URL. No whitespace should be introduced after a hyphen ("-") character. Because some typesetters and printers may (erroneously) introduce a hyphen at the end of line when breaking a line, the interpreter of a URL containing a line break immediately after a hyphen should ignore all unencoded whitespace around the line break, and should be aware that the hyphen may or may not actually be part of the URL. Examples: Yes, Jim, I found it under but you can probably pick it up from . Note the warning in . -- Hrvoje Niksic | Student at FER Zagreb, Croatia --------------------------------+-------------------------------- I'm a Lisp variable -- bind me!