Gnus development mailing list
* URLs containing `?' are not fully comprehended
@ 2000-05-16 18:24 Karl Kleinpaste
  2000-05-16 19:29 ` Arne Jørgensen
                   ` (2 more replies)
  0 siblings, 3 replies; 14+ messages in thread
From: Karl Kleinpaste @ 2000-05-16 18:24 UTC


The following is a legitimate URL.

mailto:him@over.there?subject=testing

It should induce a message buffer with Subject already filled in.

Technically, "?body=stuff" should work, too, but outside actual HTML
<a href> enclosures, I suspect no heuristic will work well because of
the likelihood of needing to catch spaces in body content, whereas
one-word subjects are pretty common.

Anyhow, "?anything" should still be considered part of the URL when
buttonizing and highlighting.




* Re: URLs containing `?' are not fully comprehended
  2000-05-16 18:24 URLs containing `?' are not fully comprehended Karl Kleinpaste
@ 2000-05-16 19:29 ` Arne Jørgensen
  2000-05-16 19:37   ` Bjørn Mork
  2000-05-16 20:10   ` Simon Josefsson
  2000-05-17 11:46 ` Per Abrahamsen
  2000-05-17 14:23 ` Toby Speight
  2 siblings, 2 replies; 14+ messages in thread
From: Arne Jørgensen @ 2000-05-16 19:29 UTC


Karl Kleinpaste <karl@charcoal.com> writes:

> The following is a legitimate URL.
> 
> mailto:him@over.there?subject=testing

Are you sure? I thought it was just a Netscape and InternetExplorer
'feature' ?

In an HTTP request, `?' (and `&') can be part of the URL, though.

        /arne
-- 
stud. scient. Arne Jørgensen
Børglumvej 2, v. 351, DK-8240 Risskov, Denmark
phone: +45 89 37 93 51, cellular: +45 21 65 01 13
email: arne@daimi.au.dk, http://www.daimi.au.dk/~arne/




* Re: URLs containing `?' are not fully comprehended
  2000-05-16 19:29 ` Arne Jørgensen
@ 2000-05-16 19:37   ` Bjørn Mork
  2000-05-16 20:10   ` Simon Josefsson
  1 sibling, 0 replies; 14+ messages in thread
From: Bjørn Mork @ 2000-05-16 19:37 UTC


arne+usenet@daimi.au.dk (Arne Jørgensen) writes:
> Karl Kleinpaste <karl@charcoal.com> writes:
> 
> > The following is a legitimate URL.
> > 
> > mailto:him@over.there?subject=testing
> 
> Are you sure? I thought it was just a Netscape and InternetExplorer
> 'feature' ?

From RFC 2368:

  Following the syntax conventions of RFC 1738 [RFC1738], a "mailto"
  URL has the form:
     mailtoURL  =  "mailto:" [ to ] [ headers ]
     to         =  #mailbox
     headers    =  "?" header *( "&" header )
     header     =  hname "=" hvalue
     hname      =  *urlc
     hvalue     =  *urlc
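
A minimal sketch of how the grammar above could be applied, written in
Python rather than Emacs Lisp so that it can be run standalone.  The
function name parse_mailto is hypothetical, and the sketch only splits
the URL into its parts; it does not validate the mailboxes against
RFC 822.

```python
from urllib.parse import urlsplit, unquote


def parse_mailto(url):
    """Split a mailto URL into (recipients, headers), following the grammar."""
    parts = urlsplit(url)
    if parts.scheme != "mailto":
        raise ValueError("not a mailto URL: %r" % (url,))
    # to = #mailbox: a comma-separated list; the grammar makes it optional.
    recipients = [unquote(addr) for addr in parts.path.split(",") if addr]
    # headers = "?" header *( "&" header ); header = hname "=" hvalue
    headers = {}
    for pair in parts.query.split("&"):
        if not pair:
            continue  # no headers at all, or an empty header between "&"s
        hname, _, hvalue = pair.partition("=")
        headers[unquote(hname).lower()] = unquote(hvalue)
    return recipients, headers


if __name__ == "__main__":
    print(parse_mailto("mailto:him@over.there?subject=testing"))
```

A mail client could use the returned headers to pre-fill Subject (and
body) in a new message buffer.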


Bjørn




* Re: URLs containing `?' are not fully comprehended
  2000-05-16 19:29 ` Arne Jørgensen
  2000-05-16 19:37   ` Bjørn Mork
@ 2000-05-16 20:10   ` Simon Josefsson
  2000-05-16 21:00     ` Kai Großjohann
  1 sibling, 1 reply; 14+ messages in thread
From: Simon Josefsson @ 2000-05-16 20:10 UTC
  Cc: ding

arne+usenet@daimi.au.dk (Arne Jørgensen) writes:

> > The following is a legitimate URL.
> > 
> > mailto:him@over.there?subject=testing
> 
> Are you sure? I thought it was just a Netscape and InternetExplorer
> 'feature' ?

No, the syntax is described in rfc2368:

 mailtoURL  =  "mailto:" [ to ] [ headers ]
 to         =  #mailbox
 headers    =  "?" header *( "&" header )
 header     =  hname "=" hvalue
 hname      =  *urlc
 hvalue     =  *urlc

And #mailbox is of course one or more rfc822-"mailbox", so the problem
of recognizing mailto URLs is not simple.




* Re: URLs containing `?' are not fully comprehended
  2000-05-16 20:10   ` Simon Josefsson
@ 2000-05-16 21:00     ` Kai Großjohann
  2000-05-17 14:21       ` Toby Speight
  0 siblings, 1 reply; 14+ messages in thread
From: Kai Großjohann @ 2000-05-16 21:00 UTC
  Cc: Arne Jørgensen, ding

Simon Josefsson <jas@pdc.kth.se> writes:

> And #mailbox is of course one or more rfc822-"mailbox", so the problem
> of recognizing mailto URLs is not simple.

With percent and bang paths and stuff?  Whee!

kai
-- 
Beware of flying birch trees.





* Re: URLs containing `?' are not fully comprehended
  2000-05-16 18:24 URLs containing `?' are not fully comprehended Karl Kleinpaste
  2000-05-16 19:29 ` Arne Jørgensen
@ 2000-05-17 11:46 ` Per Abrahamsen
  2000-05-17 12:17   ` Karl Kleinpaste
  2000-05-17 14:23 ` Toby Speight
  2 siblings, 1 reply; 14+ messages in thread
From: Per Abrahamsen @ 2000-05-17 11:46 UTC


Karl Kleinpaste <karl@charcoal.com> writes:

> Anyhow, "?anything" should still be considered part of the URL when
> buttonizing and highlighting.

It isn't, but I don't understand why.  Here is the regexp:

(defcustom gnus-button-url-regexp "\\b\\(\\(www\\.\\|\\(s?https?\\|ftp\\|file\\|gopher\\|news\\|telnet\\|wais\\|mailto\\):\\)\\(//[-a-zA-Z0-9_.]+:[0-9]*\\)?\\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+\\([-a-zA-Z0-9_=#$@~`%&*+|\\/]\\|\\w\\)\\)"

As far as I can see, the relevant part is

        \\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+
                        -^-

which *does* include question mark.
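
One way to experiment with this outside Emacs is a Python
transliteration of the regexp, sketched below.  This is an assumption-
laden approximation: Python's \b and \w are not driven by Emacs syntax
tables, so it cannot reproduce what Gnus actually does in an article
buffer.  The names BODY, LAST, URL_RE and first_url are made up for
the example.

```python
import re

# Character classes copied from gnus-button-url-regexp as quoted above.
BODY = r"[-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]"  # repeated segment, includes ?
LAST = r"[-a-zA-Z0-9_=#$@~`%&*+|\\/]"        # single final character

URL_RE = re.compile(
    r"\b("
    r"(?:www\.|(?:s?https?|ftp|file|gopher|news|telnet|wais|mailto):)"
    r"(?://[-a-zA-Z0-9_.]+:[0-9]*)?"
    r"(?:" + BODY + r"|\w)+"
    r"(?:" + LAST + r"|\w)"
    r")"
)


def first_url(text):
    """Return the first URL-like match in text, or None if there is none."""
    m = URL_RE.search(text)
    return m.group(1) if m else None


if __name__ == "__main__":
    print(first_url("mailto:him@over.there?subject=testing"))
```

In this Python approximation the whole mailto URL, query part
included, is matched, which is at least consistent with the reading
that the question mark is covered by the repeated segment.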




* Re: URLs containing `?' are not fully comprehended
  2000-05-17 11:46 ` Per Abrahamsen
@ 2000-05-17 12:17   ` Karl Kleinpaste
  2000-05-17 17:55     ` Per Abrahamsen
  0 siblings, 1 reply; 14+ messages in thread
From: Karl Kleinpaste @ 2000-05-17 12:17 UTC


Per Abrahamsen <abraham@dina.kvl.dk> writes:
> As far as I can see, the relevant part is
>         \\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+

That regexp always makes my eyes hurt, but I think the segment in
error is
        [-a-zA-Z0-9_=#$@~`%&*+|\\/]
which lacks `?' (and `!', and a bunch of others...which raises the
question of why this segment isn't just like the preceding segment).
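
For reference, the difference between the two bracket expressions can
be listed mechanically.  The Python sketch below simply compares the
two character sets as quoted in this thread; the names are
hypothetical and nothing here touches Gnus itself.

```python
import string

ALNUM = set(string.ascii_letters + string.digits)

# Punctuation copied from the two bracket expressions quoted above.
BODY_CHARS = ALNUM | set("-_=!?#$@~`%&*+|\\/:;.,")  # repeated segment
LAST_CHARS = ALNUM | set("-_=#$@~`%&*+|\\/")        # final segment


def missing_from_last():
    """Characters allowed inside a URL but not as its last character."""
    return sorted(BODY_CHARS - LAST_CHARS)


if __name__ == "__main__":
    print(missing_from_last())
```

The characters reported are exactly the sentence punctuation marks:
! , . : ; and ?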




* Re: URLs containing `?' are not fully comprehended
  2000-05-16 21:00     ` Kai Großjohann
@ 2000-05-17 14:21       ` Toby Speight
  2000-05-17 16:21         ` Andreas Fuchs
  0 siblings, 1 reply; 14+ messages in thread
From: Toby Speight @ 2000-05-17 14:21 UTC


Kai> Kai Großjohann <URL:mailto:Kai.Grossjohann@CS.Uni-Dortmund.DE>

0> In article <200005162100.XAA29747@marcy.cs.uni-dortmund.de>, Kai
0> wrote:

Kai> Simon Josefsson <jas@pdc.kth.se> writes:

>> And #mailbox is of course one or more rfc822-"mailbox", so the
>> problem of recognizing mailto URLs is not simple.

Kai> With percent and bang paths and stuff?

You have to write your percent symbol as "%25", of course.
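
A small Python illustration of that escaping, using the standard
urllib.parse functions.  The address below is a made-up "percent
hack" mailbox, and encode_mailbox is a hypothetical helper name.

```python
from urllib.parse import quote, unquote


def encode_mailbox(addr):
    """Percent-encode a mailbox for use in a mailto URL, keeping "@" as is."""
    return quote(addr, safe="@")


if __name__ == "__main__":
    encoded = encode_mailbox("user%relay@gateway.example")
    print("mailto:" + encoded)
    print(unquote(encoded))
```

The literal percent sign becomes %25 in the URL, and decoding
restores the original mailbox.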





* Re: URLs containing `?' are not fully comprehended
  2000-05-16 18:24 URLs containing `?' are not fully comprehended Karl Kleinpaste
  2000-05-16 19:29 ` Arne Jørgensen
  2000-05-17 11:46 ` Per Abrahamsen
@ 2000-05-17 14:23 ` Toby Speight
  2000-05-17 15:40   ` Steinar Bang
  2000-05-19  0:31   ` Russ Allbery
  2 siblings, 2 replies; 14+ messages in thread
From: Toby Speight @ 2000-05-17 14:23 UTC


Karl> Karl Kleinpaste <URL:mailto:karl@charcoal.com>

0> In article <vxku2fytkso.fsf@mesquite.charcoal.com>, Karl wrote:

Karl> The following is a legitimate URL.
Karl>
Karl> mailto:him@over.there?subject=testing

While we're on URL highlighting, some folks use <...> instead of
<URL:...> to delimit multi-line URLs.  Is it possible for us to
accommodate this use?  (IME, it's becoming more common)





* Re: URLs containing `?' are not fully comprehended
  2000-05-17 14:23 ` Toby Speight
@ 2000-05-17 15:40   ` Steinar Bang
  2000-05-18 10:22     ` Toby Speight
  2000-05-19  0:31   ` Russ Allbery
  1 sibling, 1 reply; 14+ messages in thread
From: Steinar Bang @ 2000-05-17 15:40 UTC


>>>>> Toby Speight <streapadair@gmx.net>:

> While we're on URL highlighting, some folks use <...> instead of
> <URL:...> to delimit multi-line URLs.  Is it possible for us to
> accommodate this use?  (IME, it's becoming more common)

Well... Message-Ids also commonly reside inside <...>.




* Re: URLs containing `?' are not fully comprehended
  2000-05-17 14:21       ` Toby Speight
@ 2000-05-17 16:21         ` Andreas Fuchs
  0 siblings, 0 replies; 14+ messages in thread
From: Andreas Fuchs @ 2000-05-17 16:21 UTC



Today, Toby Speight <streapadair@gmx.net> wrote:
>>> And #mailbox is of course one or more rfc822-"mailbox", so the
>>> problem of recognizing mailto URLs is not simple.
Kai> With percent and bang paths and stuff?
> You have to write your percent symbol as "%25", of course.

And what was the C trigraph for '?' again?

kind regards,
-- 
Andreas Stefan Fuchs                             in Real Life aka
asf@acm.org, asfuchs@gmx.at, asf@ycom.at         in NNTP and SMTP,
antifuchs                                        in IRCNet and
Relf Herbstfresser, Male 1/2 Elf Priest          in AD&D




* Re: URLs containing `?' are not fully comprehended
  2000-05-17 12:17   ` Karl Kleinpaste
@ 2000-05-17 17:55     ` Per Abrahamsen
  0 siblings, 0 replies; 14+ messages in thread
From: Per Abrahamsen @ 2000-05-17 17:55 UTC


Karl Kleinpaste <karl@charcoal.com> writes:

> That regexp always makes my eyes hurt, but I think the segment in
> error is
>         [-a-zA-Z0-9_=#$@~`%&*+|\\/]
> which lacks `?' (and `!', and a bunch of others...which raises the
> question of why this segment isn't just like the preceding segment).

Note that there is no "*" or "+" after that section.  It is there to
ensure that a trailing punctuation mark isn't considered part of the
URL.  This way, sentences like

        See http://www.dict.com, everything is there.

        Did you find it on http://www.google.com?

        I can't connect to http://slashdot.org.

will do "the right thing".

Now let's see if it does ;-)
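
The effect of the final single-character segment can be tried out
with a simplified Python sketch.  It keeps the two bracket
expressions from the regexp but assumes a shorter scheme list and
drops the "//host:port" part, and Python's \b and \w only approximate
the Emacs ones, so treat it as an illustration rather than as the
Gnus behaviour.  The names are hypothetical.

```python
import re

BODY = r"[-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]"  # may repeat; allows punctuation
LAST = r"[-a-zA-Z0-9_=#$@~`%&*+|\\/]"        # one char; no ! ? : ; . ,

# Simplified: a few schemes only, no "//host:port" part.
URL_RE = re.compile(
    r"\b((?:www\.|(?:https?|ftp|mailto):)"
    r"(?:" + BODY + r"|\w)+"
    r"(?:" + LAST + r"|\w))"
)


def urls_in(text):
    """Return every URL-like match in text, without trailing punctuation."""
    return [m.group(1) for m in URL_RE.finditer(text)]


if __name__ == "__main__":
    for sentence in (
        "See http://www.dict.com, everything is there.",
        "Did you find it on http://www.google.com?",
        "I can't connect to http://slashdot.org.",
    ):
        print(urls_in(sentence))
```

Because the repeated segment is greedy, the matcher backs up until the
last character satisfies the stricter class, which drops the comma,
question mark or full stop that ends the sentence while still allowing
those characters inside the URL.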




* Re: URLs containing `?' are not fully comprehended
  2000-05-17 15:40   ` Steinar Bang
@ 2000-05-18 10:22     ` Toby Speight
  0 siblings, 0 replies; 14+ messages in thread
From: Toby Speight @ 2000-05-18 10:22 UTC


Steinar> Steinar Bang <URL:mailto:sb@metis.no>
>>>>> Toby Speight <streapadair@gmx.net>:
>>
>> While we're on URL highlighting, some folks use <...> instead of
>> <URL:...> to delimit multi-line URLs.  Is it possible for us to
>> accommodate this use?  (IME, it's becoming more common)


0> In article <whbt25rxra.fsf@viffer.metis.no>, Steinar wrote:

Steinar> Well... Message-Ids also commonly reside inside <...>.

But they don't normally[1] begin with "http:", "ftp:", "mailto:", or
any of the other scheme names Gnus recognises.  I'm not after matching
bare hostnames in this case (as I think they're evil anyway), just that
the existing[2] matches be multi-line if they're immediately preceded
by "<".  I think we're reaching the limits of regexps with this stuff
now - but is there a sufficiently user-configurable alternative?

[1] I can handle _occasional_ false positives
[2] before the over-liberalisation that's caused so much talk here
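
A rough Python sketch of the idea, under the assumption that only
text starting with a known scheme name directly after "<" is treated
as a URL.  The scheme list is a hypothetical subset, the optional
"URL:" prefix is accepted as well, and bracketed_urls is a made-up
name; this is not how Gnus implements its buttons.

```python
import re

# Hypothetical subset of the scheme names mentioned in this thread.
SCHEMES = ("http", "https", "ftp", "mailto")

# "<", an optional "URL:", a scheme, then anything up to ">".
# The negated class [^<>] also matches newlines, so the URL may wrap.
BRACKETED = re.compile(r"<(?:URL:)?((?:" + "|".join(SCHEMES) + r"):[^<>]+)>")


def bracketed_urls(text):
    """Return URLs found inside <...>, rejoined if they span several lines."""
    return ["".join(m.group(1).split()) for m in BRACKETED.finditer(text)]


if __name__ == "__main__":
    article = (
        "See <http://www.example.org/a/rather/long/\n"
        "    path?x=1> as discussed in <whbt25rxra.fsf@viffer.metis.no>."
    )
    print(bracketed_urls(article))
```

The Message-ID is skipped because it does not begin with a scheme
name, and whitespace inside the brackets is removed when the wrapped
lines are rejoined.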





* Re: URLs containing `?' are not fully comprehended
  2000-05-17 14:23 ` Toby Speight
  2000-05-17 15:40   ` Steinar Bang
@ 2000-05-19  0:31   ` Russ Allbery
  1 sibling, 0 replies; 14+ messages in thread
From: Russ Allbery @ 2000-05-19  0:31 UTC


Toby Speight <streapadair@gmx.net> writes:

> While we're on URL highlighting, some folks use <...> instead of
> <URL:...> to delimit multi-line URLs.  Is it possible for us to
> accommodate this use?  (IME, it's becoming more common)

The <URL:...> form is basically dead, unfortunately, because it was a
recommendation of an RFC that's now been superseded by a newer RFC that
suggests just using <...> because that's what everyone is actually doing
and because almost no one used <URL:...>.

-- 
Russ Allbery (rra@stanford.edu)             <http://www.eyrie.org/~eagle/>



