From: Steinar Bang
Newsgroups: gmane.emacs.gnus.general
Subject: on matching more naked URLs in articles
Date: 22 Jan 2000 15:32:13 +0100
Organization: NCR METIS
User-Agent: Gnus/5.0804 (Gnus v5.8.4) XEmacs/20.4 (Emerald)

Before I waste time on it: has anyone looked into changing
gnus-button-url-regexp so that it matches naked URLs without a protocol field? I.e. URLs like dodrt.dod.no, vvv.no, imdb.com etc.

And is there a way to get the changed gnus-button-url-regexp into gnus-button-alist without restarting Gnus? (I suspect that ever longer and more cryptic regexps require quick cycles of changing and trying.)

Today's gnus-button-url-regexp is

  "\\b\\(s?https?\\|ftp\\|file\\|gopher\\|news\\|telnet\\|wais\\|mailto\\):\\(//[-a-zA-Z0-9_.]+:[0-9]*\\)?\\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+\\([-a-zA-Z0-9_=#$@~`%&*+|\\/]\\|\\w\\)"

I'm not sure where the best place to change it is. My first attempt would be something like this:

  "\\b\\(\\(s?https?\\|ftp\\|file\\|gopher\\|news\\|telnet\\|wais\\|mailto\\):\\(//[-a-zA-Z0-9_.]+:[0-9]*\\)?\\([-a-zA-Z0-9_=!?#$@~`%&*+|\\/:;.,]\\|\\w\\)+\\([-a-zA-Z0-9_=#$@~`%&*+|\\/]\\|\\w\\)\\|\\([A-Za-z]+\\.\\)+\\(com\\|org\\|no\\|se\\)\\(/[A-Za-z0-9/]+\\)?\\)"

I'm not sure if the top-level domain should come from a fixed list, or if we should just match [A-Za-z]. Are there efficiency reasons for doing it either way? Most URLs exchanged this way seem to be .com URLs or URLs in the native national domain of a particular newsgroup.
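
For the quick change-and-try cycle, something along these lines might do. It is only a sketch: the names my-naked-url-regexp, my-url-regexp-matches and my-gnus-swap-button-regexp are made up for illustration and are not part of Gnus. It also assumes that each gnus-button-alist entry is a list whose first element is the regexp as a literal string; entries that hold a symbol instead are left alone.

```elisp
;; Sketch only.  All `my-' names are hypothetical, not part of Gnus.
(require 'gnus-art)                     ; defines `gnus-button-alist'

(defvar my-naked-url-regexp
  "\\([A-Za-z]+\\.\\)+\\(com\\|org\\|no\\|se\\)\\(/[A-Za-z0-9/]+\\)?"
  "The protocol-less alternative from the proposal above, on its own.")

(defun my-url-regexp-matches (regexp text)
  "Return the first substring of TEXT matched by REGEXP, or nil."
  ;; Assumption: match case-insensitively, so \"IMDB.COM\" is found too.
  (let ((case-fold-search t))
    (if (string-match regexp text)
        (match-string 0 text))))

(defun my-gnus-swap-button-regexp (old new)
  "Replace OLD by NEW in every `gnus-button-alist' entry whose regexp is OLD.
Assumes each entry is a list whose car is the regexp string.
Return the number of entries changed."
  (let ((entries gnus-button-alist)
        (count 0))
    (while entries
      (if (and (stringp (car (car entries)))
               (string= (car (car entries)) old))
          (progn
            (setcar (car entries) new)  ; modify the entry in place
            (setq count (1+ count))))
      (setq entries (cdr entries)))
    count))

;; Trying a candidate regexp on sample text, without touching Gnus:
;;   (my-url-regexp-matches my-naked-url-regexp "see imdb.com for details")
;;   (my-url-regexp-matches my-naked-url-regexp "dodrt.dod.no")
;;
;; Installing a new value in a running session:
;;   (let ((old gnus-button-url-regexp))
;;     (setq gnus-button-url-regexp "...new regexp...")
;;     (my-gnus-swap-button-regexp old gnus-button-url-regexp))
```

One thing the string test makes easy to see: with a fixed TLD list and nothing anchoring the end of the match, the naked alternative also matches the "something.no" prefix of text like "something.nothing", so a trailing "\\b" or similar is probably worth adding whichever way the TLD part ends up being written.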