From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/28325 Path: main.gmane.org!not-for-mail From: Lloyd Zusman Newsgroups: gmane.emacs.gnus.general Subject: New error ... missing `char-charset' symbol. Date: 18 Dec 1999 16:08:17 -0500 Organization: Linux Hippopotamus Preserve Sender: owner-ding@hpc.uh.edu Message-ID: NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: main.gmane.org 1035165195 29316 80.91.224.250 (21 Oct 2002 01:53:15 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 21 Oct 2002 01:53:15 +0000 (UTC) Return-Path: Original-Received: from farabi.math.uh.edu (farabi.math.uh.edu [129.7.128.57]) by mailhost.sclp.com (Postfix) with ESMTP id 71BB7D051E for ; Sat, 18 Dec 1999 16:09:52 -0500 (EST) Original-Received: from sina.hpc.uh.edu (lists@Sina.HPC.UH.EDU [129.7.3.5]) by farabi.math.uh.edu (8.9.3/8.9.1) with ESMTP id PAB24501; Sat, 18 Dec 1999 15:09:03 -0600 (CST) Original-Received: by sina.hpc.uh.edu (TLB v0.09a (1.20 tibbs 1996/10/09 22:03:07)); Sat, 18 Dec 1999 15:08:54 -0600 (CST) Original-Received: from mailhost.sclp.com (postfix@sclp3.sclp.com [204.252.123.139]) by sina.hpc.uh.edu (8.9.3/8.9.3) with ESMTP id PAA07428 for ; Sat, 18 Dec 1999 15:08:41 -0600 (CST) Original-Received: from ljz.net (ljz.net [205.230.65.138]) by mailhost.sclp.com (Postfix) with ESMTP id 528AFD051F for ; Sat, 18 Dec 1999 16:08:19 -0500 (EST) Original-Received: (from gnus@localhost) by ljz.net (8.8.7/8.8.7) id QAA08853; Sat, 18 Dec 1999 16:08:17 -0500 Original-To: ding@gnus.org X-Face: "!ga1s|?LNLE3MeeeEYs(%LIl9q[xV9!j4#xf4!**BFW_ihlOb;:Slb>)vy>CJM writes:\n\n> Lloyd Zusman writes:\n> >\n> > [ ... ]\n> >\n> >There is a Perl HTML parser that accurately recognizes all the tags\n> >and the non-HTML. I can use it to write a 10-20 line script which\n> >easily could strip out all the HTML. [ ... ]\n> \n> Cool. Then from the command line I can write higher-level\n> shell utilities that use the Internet as an OS:\n> \n> * spell-check via cgi to dictionary.com\n> * check a domain for availability\n> * give me a list of URL's that altavista says contain \"EJB\"\n> * check a stock quote\n> * give me the weather in NYC\n> * etc...\n\nExactly!\n\nAttached is the Perl script. It's called `stripML' and it works like\na filter, just as you specified. The input is stripped of HTML tags,\nand the result goes to stdout. Also, it can take input from files on\nthe com m! ! and line, like most traditional Unix filter programs.\n\n usage: stripML [ file ... ]\n\nActually, `stripML' has 56 lines, but most of those are comments so\nthat you can follow the logic, or else white space. There really are\nless than 20 significant executable lines of code.\n\nThe HTML::Parser Perl module needs to be installed for this to work.\nIt's a standard module (one of 700 or more) that resides in the CPAN\narchive (Comprehensive Perl Archive Network) on the net. If there's a\nstandard installation of Perl, this module could be installed within\naround 5 minutes.\n\n\n> I wanted to revisit my idea of centralizing access to all\n> the net services. Doing it with CGI-GET's (I guess CGI-POST's\n> can't be done this way) is just a proof of concept.\n> \n> btw, how hard would it be to duplicate the Perl HTML parser in\n> Java? Considering that there are many XML parsers out there,\n> it sounds like something might already exist.\n\nA Java HTML parser probab l! ! y exists. I'll take a look to see if I can\nfind one. Or if not, perhaps we could find a Java SGML parser to use\nin its stead, since HTML is a subset of that.\n\n\n> Take a look at: http://www.bookmarklets.com/\n\nWill do.\n\n\n> [ ... ]\n\n- Lloyd\n\n")) (part (type . "application/octet-stream") (filename . "/usr/local/bin/stripML") (disposition . "attachment") (description . "stripML Perl script") (contents . "")) (part (type . "text/plain") (contents . "\n-- \n Lloyd Zusman\n ljz@crossroadstech.com\n (212) 482-5280 ext 223\n")))) mml-generate-mime() message-encode-message-body() message-send-mail(nil) message-send-via-mail(nil) message-send(nil) message-send-and-exit(nil) ljz-message-send-and-exit(nil) call-interactively(ljz-message-send-and-exit) Thanks in advance. -- Lloyd Zusman ljz@asfast.com