From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.user/759 Path: news.gmane.org!not-for-mail From: Jesper Harder Newsgroups: gmane.emacs.gnus.user Subject: Re: Removing HTML tags in console mode (was Hotmail HTML Mail) Date: Fri, 12 Jul 2002 04:33:38 +0200 Organization: http://purl.org/harder/ Message-ID: References: <87d6ttri13.fsf@happy.sherilyn.org.uk> NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1138667683 8293 80.91.229.2 (31 Jan 2006 00:34:43 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Tue, 31 Jan 2006 00:34:43 +0000 (UTC) Original-X-From: nobody Tue Jan 17 17:28:04 2006 Original-Path: quimby.gnus.org!news.ccs.neu.edu!news.dfci.harvard.edu!news.harvard.edu!iad-peer.news.verio.net!news.verio.net!news.tele.dk!small.news.tele.dk!193.162.153.118!news.tele.dk!not-for-mail Original-Newsgroups: gnu.emacs.gnus X-Face: ^RrvqCr7c,P$zTR:QED"@h9+BTm-"fjZJJ-3=OU7.)i/K]<.J88}s>'Z_$r; Sherilyn writes: > It should be quite easy to write a simple-minded wash option that just > replaces anything in angle brackets with a space and wraps any > resulting long lines. [..] Or you could pipe it through an external > program like Michael Haardt's dehtml utility using (shell-command). I think Oort Gnus includes something similar: ;;; html2text.el --- a simple html to plain text converter ;; Copyright (C) 2002 Free Software Foundation, Inc. ;; Author: Joakim Hove ;;; Commentary: ;; These functions provide a simple way to wash/clean html infected ;; mails. Definitely do not work in all cases, but some improvement ;; in readability is generally obtained. Formatting is only done in ;; the buffer, so the next time you enter the article it will be ;; "re-htmlized". ;; ;; The main function is "html2text"