* URLs containing `?' are not fully comprehended
@ 2000-05-16 18:24 Karl Kleinpaste
2000-05-16 19:29 ` Arne Jørgensen
` (2 more replies)
0 siblings, 3 replies; 14+ messages in thread
From: Karl Kleinpaste @ 2000-05-16 18:24 UTC (permalink / raw)
The following is a legitimate URL.
mailto:him@over.there?subject=testing
It should induce a message buffer with Subject already filled in.
Technically, "?body=stuff" should work, too, but outside actual HTML
<a href> enclosures, I suspect no heuristic will work well because of
the likelihood of needing to catch spaces in body content, whereas
one-word subjects are pretty common.
Anyhow, "?anything" should still be considered part of the URL when
buttonizing and highlighting.
* Re: URLs containing `?' are not fully comprehended
2000-05-16 18:24 URLs containing `?' are not fully comprehended Karl Kleinpaste
@ 2000-05-16 19:29 ` Arne Jørgensen
2000-05-16 19:37 ` Bjørn Mork
2000-05-16 20:10 ` Simon Josefsson
2000-05-17 11:46 ` Per Abrahamsen
2000-05-17 14:23 ` Toby Speight
2 siblings, 2 replies; 14+ messages in thread
From: Arne Jørgensen @ 2000-05-16 19:29 UTC (permalink / raw)
Karl Kleinpaste <karl@charcoal.com> writes:
> The following is a legitimate URL.
>
> mailto:him@over.there?subject=testing
Are you sure? I thought it was just a Netscape and InternetExplorer
'feature' ?
In an HTTP request, `?' (and `&') can be part of the URL, though.
/arne
--
stud. scient. Arne Jørgensen
Børglumvej 2, v. 351, DK-8240 Risskov, Denmark
phone: +45 89 37 93 51, cellular: +45 21 65 01 13
email: arne@daimi.au.dk, http://www.daimi.au.dk/~arne/
* Re: URLs containing `?' are not fully comprehended
2000-05-16 19:29 ` Arne Jørgensen
@ 2000-05-16 19:37 ` Bjørn Mork
2000-05-16 20:10 ` Simon Josefsson
1 sibling, 0 replies; 14+ messages in thread
From: Bjørn Mork @ 2000-05-16 19:37 UTC (permalink / raw)
arne+usenet@daimi.au.dk (Arne Jørgensen) writes:
> Karl Kleinpaste <karl@charcoal.com> writes:
>
> > The following is a legitimate URL.
> >
> > mailto:him@over.there?subject=testing
>
> Are you sure? I thought it was just a Netscape and InternetExplorer
> 'feature' ?
From RFC 2368:
   Following the syntax conventions of RFC 1738 [RFC1738], a "mailto"
   URL has the form:
     mailtoURL  =  "mailto:" [ to ] [ headers ]
     to         =  #mailbox
     headers    =  "?" header *( "&" header )
     header     =  hname "=" hvalue
     hname      =  *urlc
     hvalue     =  *urlc
Bjørn
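As a rough illustration of the grammar quoted above, here is a minimal Python sketch that splits a mailto URL into its `to` and `headers` parts. The function name `parse_mailto` is hypothetical, and splitting recipients on commas is a simplification: it does not attempt full RFC 822 mailbox parsing (quoted strings, comments, and so on). Note that the `to` part is optional in the grammar, so the recipient list may be empty.

```python
from urllib.parse import unquote

def parse_mailto(url):
    """Split a mailto URL into (recipients, headers), following the
    mailtoURL grammar quoted from RFC 2368. Hypothetical helper."""
    scheme, sep, rest = url.partition(":")
    if not sep or scheme.lower() != "mailto":
        raise ValueError("not a mailto URL: %r" % url)
    # Everything before the first "?" is `to`; the remainder is `headers`.
    to_part, _, header_part = rest.partition("?")
    # Simplification: split on commas, then undo the percent-encoding.
    recipients = [unquote(m) for m in to_part.split(",") if m]
    headers = []
    for item in header_part.split("&"):
        if not item:
            continue
        name, _, value = item.partition("=")
        headers.append((unquote(name), unquote(value)))
    return recipients, headers

print(parse_mailto("mailto:him@over.there?subject=testing"))
```

For the URL from the original report this yields one recipient, `him@over.there`, and one header pair, `subject` / `testing`.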
* Re: URLs containing `?' are not fully comprehended
2000-05-16 19:29 ` Arne Jørgensen
2000-05-16 19:37 ` Bjørn Mork
@ 2000-05-16 20:10 ` Simon Josefsson
2000-05-16 21:00 ` Kai Großjohann
1 sibling, 1 reply; 14+ messages in thread
From: Simon Josefsson @ 2000-05-16 20:10 UTC (permalink / raw)
Cc: ding
arne+usenet@daimi.au.dk (Arne Jørgensen) writes:
> > The following is a legitimate URL.
> >
> > mailto:him@over.there?subject=testing
>
> Are you sure? I thought it was just a Netscape and InternetExplorer
> 'feature' ?
No, the syntax is described in rfc2368:
     mailtoURL  =  "mailto:" [ to ] [ headers ]
     to         =  #mailbox
     headers    =  "?" header *( "&" header )
     header     =  hname "=" hvalue
     hname      =  *urlc
     hvalue     =  *urlc
And #mailbox is of course one or more rfc822-"mailbox", so the problem
of recognizing mailto URLs is not simple.
* Re: URLs containing `?' are not fully comprehended
2000-05-16 20:10 ` Simon Josefsson
@ 2000-05-16 21:00 ` Kai Großjohann
2000-05-17 14:21 ` Toby Speight
0 siblings, 1 reply; 14+ messages in thread
From: Kai Großjohann @ 2000-05-16 21:00 UTC (permalink / raw)
Cc: Arne Jørgensen, ding
Simon Josefsson <jas@pdc.kth.se> writes:
> And #mailbox is of course one or more rfc822-"mailbox", so the problem
> of recognizing mailto URLs is not simple.
With percent and bang paths and stuff? Whee!
kai
--
Beware of flying birch trees.
* Re: URLs containing `?' are not fully comprehended
2000-05-16 21:00 ` Kai Großjohann
@ 2000-05-17 14:21 ` Toby Speight
2000-05-17 16:21 ` Andreas Fuchs
0 siblings, 1 reply; 14+ messages in thread
From: Toby Speight @ 2000-05-17 14:21 UTC (permalink / raw)
Kai> Kai Großjohann <URL:mailto:Kai.Grossjohann@CS.Uni-Dortmund.DE>
0> In article <200005162100.XAA29747@marcy.cs.uni-dortmund.de>, Kai
0> wrote:
Kai> Simon Josefsson <jas@pdc.kth.se> writes:
>> And #mailbox is of course one or more rfc822-"mailbox", so the
>> problem of recognizing mailto URLs is not simple.
Kai> With percent and bang paths and stuff?
You have to write your percent symbol as "%25", of course.
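A small Python sketch of that escaping, using the standard `urllib.parse` functions. The percent-hack address below is a made-up example, and keeping `@` and `!` unescaped is a choice made for this illustration only.

```python
from urllib.parse import quote, unquote

# Hypothetical "percent hack" address, used only for illustration.
address = "user%host@relay.example"

# Escape the literal "%" so it is not read as the start of an escape.
encoded = quote(address, safe="@!")
print("mailto:" + encoded)

# Decoding restores the original address.
print(unquote(encoded) == address)
```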
* Re: URLs containing `?' are not fully comprehended
2000-05-17 14:21 ` Toby Speight
@ 2000-05-17 16:21 ` Andreas Fuchs
0 siblings, 0 replies; 14+ messages in thread
From: Andreas Fuchs @ 2000-05-17 16:21 UTC (permalink / raw)
Today, Toby Speight <streapadair@gmx.net> wrote:
>>> And #mailbox is of course one or more rfc822-"mailbox", so the
>>> problem of recognizing mailto URLs is not simple.
Kai> With percent and bang paths and stuff?
> You have to write your percent symbol as "%25", of course.
And what was the C trigraph for '?' again?
kind regards,
--
Andreas Stefan Fuchs in Real Life aka
asf@acm.org, asfuchs@gmx.at, asf@ycom.at in NNTP and SMTP,
antifuchs in IRCNet and
Relf Herbstfresser, Male 1/2 Elf Priest in AD&D
* Re: URLs containing `?' are not fully comprehended
2000-05-16 18:24 URLs containing `?' are not fully comprehended Karl Kleinpaste
2000-05-16 19:29 ` Arne Jørgensen
@ 2000-05-17 11:46 ` Per Abrahamsen
2000-05-17 12:17 ` Karl Kleinpaste
2000-05-17 14:23 ` Toby Speight
2 siblings, 1 reply; 14+ messages in thread
From: Per Abrahamsen @ 2000-05-17 11:46 UTC (permalink / raw)
Karl Kleinpaste <karl@charcoal.com> writes:
> Anyhow, "?anything" should still be considered part of the URL when
> buttonizing and highlighting.
It isn't, but I don't understand why. Here is the regexp:
(defcustom gnus-button-url-regexp "\\b\\(\\(www\\.\\|\\(s?https?\\|ftp\\|file\\|gopher\\|news\\|telnet\\|wais\\|mailto\\):\\)\\(//[-a-zA-Z0-9_.]+:[0-9]*\\)?\\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+\\([-a-zA-Z0-9_=#$@~`%&*+|\\/]\\|\\w\\)\\)"
As far as I can see, the relevant part is
  \\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+
                  -^-
which *does* include question mark.
* Re: URLs containing `?' are not fully comprehended
2000-05-17 11:46 ` Per Abrahamsen
@ 2000-05-17 12:17 ` Karl Kleinpaste
2000-05-17 17:55 ` Per Abrahamsen
0 siblings, 1 reply; 14+ messages in thread
From: Karl Kleinpaste @ 2000-05-17 12:17 UTC (permalink / raw)
Per Abrahamsen <abraham@dina.kvl.dk> writes:
> As far as I can see, the relevant part is
> \\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+
That regexp always makes my eyes hurt, but I think the segment in
error is
[-a-zA-Z0-9_=#$@~`%&*+|\\/]
which lacks `?' (and `!', and a bunch of others...which raises the
question of why this segment isn't just like the preceding segment).
* Re: URLs containing `?' are not fully comprehended
2000-05-17 12:17 ` Karl Kleinpaste
@ 2000-05-17 17:55 ` Per Abrahamsen
0 siblings, 0 replies; 14+ messages in thread
From: Per Abrahamsen @ 2000-05-17 17:55 UTC (permalink / raw)
Karl Kleinpaste <karl@charcoal.com> writes:
> That regexp always makes my eyes hurt, but I think the segment in
> error is
> [-a-zA-Z0-9_=#$@~`%&*+|\\/]
> which lacks `?' (and `!', and a bunch of others...which raises the
> question of why this segment isn't just like the preceding segment).
Notice that there is no "*" or "+" after that section: it matches
exactly one character, and it is there to ensure that trailing
punctuation isn't considered part of the URL. This way, sentences like
  See http://www.dict.com, everything is there.
  Did you find it on http://www.google.com?
  I can't connect to http://slashdot.org.
will do "the right thing".
Now let's see if it does ;-)
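One way to check this outside Emacs is an approximate Python port of the quoted regexp. This is only a sketch: the names `URL_RE` and `find_urls` are hypothetical, and Python's `\b` and `\w` do not behave exactly like their Emacs counterparts, which depend on the buffer's syntax table. The two character classes are copied from the regexp quoted earlier in the thread.

```python
import re

# Approximate port of the gnus-button-url-regexp quoted in this thread.
URL_RE = re.compile(
    r"\b(?:www\.|(?:s?https?|ftp|file|gopher|news|telnet|wais|mailto):)"
    r"(?://[-a-zA-Z0-9_.]+:[0-9]*)?"
    r"[-\w=!?#$@~`%&*+|\\/:;.,]+"  # body: `?', `!', `.', `,' are allowed
    r"[-\w=#$@~`%&*+|\\/]"         # final character: no ! ? : ; . ,
)

def find_urls(text):
    """Return every substring of text that the ported regexp matches."""
    return [m.group(0) for m in URL_RE.finditer(text)]

for sentence in (
    "See http://www.dict.com, everything is there.",
    "Did you find it on http://www.google.com?",
    "I can't connect to http://slashdot.org.",
    "mailto:him@over.there?subject=testing",
):
    print(find_urls(sentence))
```

In this port the trailing comma, question mark, and full stop are left out of the match, while `?subject=testing` is kept because the `?' sits in the middle of the URL and the last character is a letter. Whether Gnus itself behaves the same way is not something this sketch can show.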
* Re: URLs containing `?' are not fully comprehended
2000-05-16 18:24 URLs containing `?' are not fully comprehended Karl Kleinpaste
2000-05-16 19:29 ` Arne Jørgensen
2000-05-17 11:46 ` Per Abrahamsen
@ 2000-05-17 14:23 ` Toby Speight
2000-05-17 15:40 ` Steinar Bang
2000-05-19 0:31 ` Russ Allbery
2 siblings, 2 replies; 14+ messages in thread
From: Toby Speight @ 2000-05-17 14:23 UTC (permalink / raw)
Karl> Karl Kleinpaste <URL:mailto:karl@charcoal.com>
0> In article <vxku2fytkso.fsf@mesquite.charcoal.com>, Karl wrote:
Karl> The following is a legitimate URL.
Karl>
Karl> mailto:him@over.there?subject=testing
While we're on URL highlighting, some folks use <...> instead of
<URL:...> to delimit multi-line URLs. Is it possible for us to
accommodate this use? (IME, it's becoming more common)
* Re: URLs containing `?' are not fully comprehended
2000-05-17 14:23 ` Toby Speight
@ 2000-05-17 15:40 ` Steinar Bang
2000-05-18 10:22 ` Toby Speight
2000-05-19 0:31 ` Russ Allbery
1 sibling, 1 reply; 14+ messages in thread
From: Steinar Bang @ 2000-05-17 15:40 UTC (permalink / raw)
>>>>> Toby Speight <streapadair@gmx.net>:
> While we're on URL highlighting, some folks use <...> instead of
> <URL:...> to delimit multi-line URLs. Is it possible for us to
> accommodate this use? (IME, it's becoming more common)
Well... Message-Ids also commonly reside inside <...>.
* Re: URLs containing `?' are not fully comprehended
2000-05-17 15:40 ` Steinar Bang
@ 2000-05-18 10:22 ` Toby Speight
0 siblings, 0 replies; 14+ messages in thread
From: Toby Speight @ 2000-05-18 10:22 UTC (permalink / raw)
Steinar> Steinar Bang <URL:mailto:sb@metis.no>
>>>>> Toby Speight <streapadair@gmx.net>:
>>
>> While we're on URL highlighting, some folks use <...> instead of
>> <URL:...> to delimit multi-line URLs. Is it possible for us to
>> accommodate this use? (IME, it's becoming more common)
0> In article <whbt25rxra.fsf@viffer.metis.no>, Steinar wrote:
Steinar> Well... Message-Ids also commonly reside inside <...>.
But they don't normally[1] begin with "http:", "ftp:", "mailto:", or
any of the other scheme names Gnus recognises. I'm not after matching
bare hostnames in this case (as I think they're evil anyway), just that
the existing[2] matches be multi-line if they're immediately preceded
by "<". I think we're reaching the limits of regexps with this stuff
now - but is there a sufficiently user-configurable alternative?
[1] I can handle _occasional_ false positives
[2] before the over-liberalisation that's caused so much talk here
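A minimal Python sketch of that idea, assuming the scheme list from the regexp quoted earlier in the thread. The names `SCHEMES`, `BRACKETED_RE`, and `bracketed_urls` are hypothetical, and the example.org URL is invented for the illustration. A match is accepted only when "<" (optionally followed by "URL:") is immediately followed by a known scheme, so Message-IDs are skipped; whitespace inside the brackets, including line breaks, is dropped.

```python
import re

# Scheme names taken from the regexp quoted earlier in the thread.
SCHEMES = r"(?:s?https?|ftp|file|gopher|news|telnet|wais|mailto)"

# "<" or "<URL:", then a known scheme, then anything up to ">".
# The negated class [^<>] also matches newlines, so URLs may span lines.
BRACKETED_RE = re.compile(r"<(?:URL:)?\s*(" + SCHEMES + r":[^<>]+)>")

def bracketed_urls(text):
    """Return bracketed URLs with embedded whitespace removed."""
    return ["".join(m.group(1).split()) for m in BRACKETED_RE.finditer(text)]

text = (
    "See <http://www.example.org/a/very/\n"
    "  long/path?x=1> and article <whbt25rxra.fsf@viffer.metis.no>,\n"
    "or write to <URL:mailto:sb@metis.no>."
)
print(bracketed_urls(text))
```

Here the Message-ID is ignored because it does not start with a scheme name, which is the distinction drawn above; a Message-ID that happened to begin with "news:" or similar would still be a false positive.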
* Re: URLs containing `?' are not fully comprehended
2000-05-17 14:23 ` Toby Speight
2000-05-17 15:40 ` Steinar Bang
@ 2000-05-19 0:31 ` Russ Allbery
1 sibling, 0 replies; 14+ messages in thread
From: Russ Allbery @ 2000-05-19 0:31 UTC (permalink / raw)
Toby Speight <streapadair@gmx.net> writes:
> While we're on URL highlighting, some folks use <...> instead of
> <URL:...> to delimit multi-line URLs. Is it possible for us to
> accommodate this use? (IME, it's becoming more common)
The <URL:...> form is basically dead, unfortunately, because it was a
recommendation of an RFC that's now been superseded by a newer RFC that
suggests just using <...> because that's what everyone is actually doing
and because almost no one used <URL:...>.
--
Russ Allbery (rra@stanford.edu) <http://www.eyrie.org/~eagle/>