From mboxrd@z Thu Jan 1 00:00:00 1970
Message-ID:
To: 9fans@cse.psu.edu
Subject: Re: [9fans] conversion of charsets in upas/fs
Date: Tue, 1 Feb 2005 18:37:54 +0000
From: rog@vitanuova.com
In-Reply-To:
MIME-Version: 1.0
Content-Type: text/plain; charset="US-ASCII"
Content-Transfer-Encoding: 7bit
Topicbox-Message-UUID: 3bbd9644-eace-11e9-9e20-41e7f4b1d025

> Probably the best thing to do is write a program that applies
> the transformations you want but works a byte at a time and
> is character-set-ignorant.

alternatively, a very quick solution might be to get pipeto.lib
to take an extra verbatim copy of the message before
running upas/fs on it.

e.g.

	# save and parse the mail file
	cat > $TMP.rawmsg
	sed '/^$/,$ s/^From / From /' < $TMP.rawmsg >$TMP.msg
	upas/fs -p -f $TMP.msg || exit $status

and in fn spool:

	$BIN/deliver $RECIP $D/from $_mbox < $TMP.rawmsg || exit $status

this does mean that more space is used, and that the
spam classification isn't strictly accurate for non-utf charsets,
but it does avoid having to write any code...

N.B. i haven't tried this out!
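
for comparison, the quoted suggestion (a byte-at-a-time, character-set-ignorant program) could look roughly like the sketch below. it is written in python rather than rc or C purely for illustration, the name escape_from_lines is made up, and it only mirrors the sed range expression in the example (escape "From " at the start of body lines, leave the headers alone). it never decodes the input, so bytes in any charset pass through untouched. like the rest of this, it is an untested-in-upas assumption, not what pipeto.lib actually does.

```python
import sys


def escape_from_lines(raw: bytes) -> bytes:
    """Prefix body lines starting with b"From " with a space.

    Operates purely on bytes: no decoding, so it is charset-ignorant.
    Hypothetical helper mirroring: sed '/^$/,$ s/^From / From /'
    """
    out = []
    in_body = False  # becomes True at the first empty line (end of headers)
    for line in raw.split(b"\n"):
        if not in_body and line == b"":
            in_body = True
        elif in_body and line.startswith(b"From "):
            line = b" " + line
        out.append(line)
    # split/join on b"\n" round-trips the input exactly, apart from the
    # inserted spaces
    return b"\n".join(out)


if __name__ == "__main__":
    # filter stdin to stdout as raw bytes
    sys.stdout.buffer.write(escape_from_lines(sys.stdin.buffer.read()))
```

run as a filter, this would stand in for the sed line; keeping the extra verbatim copy is then unnecessary only if everything downstream is equally byte-clean, which is not something checked here.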