9fans - fans of the OS Plan 9 from Bell Labs
 help / color / mirror / Atom feed
* Re: [9fans] mime
@ 2000-11-22 15:19 forsyth
  2000-11-22 15:23 ` Boyd Roberts
  0 siblings, 1 reply; 26+ messages in thread
From: forsyth @ 2000-11-22 15:19 UTC (permalink / raw)
  To: 9fans

>>nope.
>>that's the problem.  somewhere a gateway stripped out 'invalid' chars
>>(didn't even turn the e-acute into an 'i' which is what i would have
>>expected)

i think it's demon's smtp mailer that does it.  i lose utf-8 in messages sent
between here and home and both are on demon.

220 finch-post-10.mail.demon.net ESMTP Wed, 22 Nov 2000 15:12:40 +0000



^ permalink raw reply	[flat|nested] 26+ messages in thread
* Re: [9fans] mime
@ 2000-11-23 12:21 rog
  2000-11-23 19:49 ` Boyd Roberts
  0 siblings, 1 reply; 26+ messages in thread
From: rog @ 2000-11-23 12:21 UTC (permalink / raw)
  To: 9fans

> nah, i've done some good work in that pub [The Black Swan].

or was it the old white swan (goodramgate) or the swan (bishy
road, my fave), or the other white swan (blossom st) ?

lots of swans to choose from in york, those that haven't been
swept away by the floods....



^ permalink raw reply	[flat|nested] 26+ messages in thread
* Re: [9fans] mime
@ 2000-11-22 21:13 presotto
  0 siblings, 0 replies; 26+ messages in thread
From: presotto @ 2000-11-22 21:13 UTC (permalink / raw)
  To: 9fans

	However, I'm still perplexed as to what to put in the header
	of a multipart message.  Does the transfer encoding specified
	there pertain to the whole message or just to the

		nyah nyah, if you only had a MIME mailer you wouldn't see this

	message that starts every multipart message?

Just answered my own question.  Rfc2045 says:

	6.4.  Interpretation and Use

	If a Content-Transfer-Encoding header field appears as part of a
	message header, it applies to the entire body of that message.  If a
	Content-Transfer-Encoding header field appears as part of an entity's
	headers, it applies only to the body of that entity.  If an entity is
	of type "multipart" the Content-Transfer-Encoding is not permitted to
	have any value other than "7bit", "8bit" or "binary".

That makes my life much easier.  Someone must have been thinking.


^ permalink raw reply	[flat|nested] 26+ messages in thread
* Re: [9fans] mime
@ 2000-11-22 20:49 presotto
  2000-11-22 21:26 ` Boyd Roberts
  2000-11-23 10:18 ` Lyndon Nerenberg
  0 siblings, 2 replies; 26+ messages in thread
From: presotto @ 2000-11-22 20:49 UTC (permalink / raw)
  To: 9fans

[-- Attachment #1: Type: text/plain, Size: 1866 bytes --]

You don't have to know the transfer encoding when parsing the header.
According to 822,

	Each header field can be viewed as a single, logical  line  of
	ASCII  characters,  comprising  a field-name and a field-body.

Of course 821 states

	The maximum total length of a text line including the
	<CRLF> is 1000 characters (but not counting the leading
	dot duplicated for transparency).

To stay consistent with these, headers need to be in the
7bit domain.  However, rfc2047 provides the amazing
=?b?gobbledeygook?= syntax to allow non ascii tokens in the
header.

Therefore, the 'Content-Transfer-Encoding: {78}bit' header field
isn't telling you anything per se about the header since the
headers are predefined to be US-ASCII with the =???=
encoding providing an escape in case you want what's there
to be convertible to non-ASCII.

When upas/marshal says that a message (or message portion)
is either 7 or 8 bit, it's just stating a fact.  However, if
smtp sees an 8bit message and doesn't get an esmtp connection
to the remote host, it should encode the message (or message
part) in base64 (or quoted printable).

We either have to bite the bullet and encode all 8-bit data
presented to marshal OR change smtp to do the recoding OR
both.  However, if we're also willing to relay messages (which
I do) than the marshal change is insufficient since mail may
pass through us from an 8bit to a 7bit channel without passing
marshal.

Therefore, I'm going to make smtp (or a front end to it)
convert and you can all stop your whining.

However, I'm still perplexed as to what to put in the header
of a multipart message.  Does the transfer encoding specified
there pertain to the whole message or just to the

	nyah nyah, if you only had a MIME mailer you wouldn't see this

message that starts every multipart message?

[-- Attachment #2: Type: message/rfc822, Size: 2026 bytes --]

From: rog@vitanuova.com
To: 9fans@cse.psu.edu
Subject: Re: [9fans] mime
Date: Wed, 22 Nov 2000 13:12:38 0000
Message-ID: <20001122121028.38BD3199E1@mail.cse.psu.edu>

> What brain damaged piece of dog
> excrement program would throw away the 8th bit of every
> byte in today's world?

annoying thing is that you don't know what the 8th bit means until
you've parsed the Content-transfer-encoding header.  it would be nice
if we could convert the whole thing from utf initially, but some
people have a habit of sending latin1 which of course translates as
lots of broken utf runes (and presumably utf translates as broken
latin1 on many machines).

i don't think that the header is allowed 8 bit stuff no matter what.

is the idea that if we just send utf regardless, sooner or later
mailers will assume that 8bit chars in mail headers are utf?

i might hack up a version of marshal that conforms to the standards,
at least superficially.  it would be nice to be able to send mail that
people on other platforms could read. we're not so far away.

  rog.

^ permalink raw reply	[flat|nested] 26+ messages in thread
* Re: [9fans] mime
@ 2000-11-22 15:46 rog
  2000-11-22 15:02 ` Boyd Roberts
  2000-11-22 15:07 ` Boyd Roberts
  0 siblings, 2 replies; 26+ messages in thread
From: rog @ 2000-11-22 15:46 UTC (permalink / raw)
  To: 9fans

> that's how it goes. je dconne pas...
>
> did you see the e acute?

nope.
that's the problem.  somewhere a gateway stripped out 'invalid' chars
(didn't even turn the e-acute into an 'i' which is what i would have
expected)

> i'm not sure you want to do that.  it's a mimefield.

yeah, but we're already neck deep in the big muddy.
so the damn fool might might as well keep yellin' to push on.

  rog.



^ permalink raw reply	[flat|nested] 26+ messages in thread
* Re: [9fans] mime
@ 2000-11-22 13:12 rog
  2000-11-22 14:21 ` Boyd Roberts
  0 siblings, 1 reply; 26+ messages in thread
From: rog @ 2000-11-22 13:12 UTC (permalink / raw)
  To: 9fans

> What brain damaged piece of dog
> excrement program would throw away the 8th bit of every
> byte in today's world?

annoying thing is that you don't know what the 8th bit means until
you've parsed the Content-transfer-encoding header.  it would be nice
if we could convert the whole thing from utf initially, but some
people have a habit of sending latin1 which of course translates as
lots of broken utf runes (and presumably utf translates as broken
latin1 on many machines).

i don't think that the header is allowed 8 bit stuff no matter what.

is the idea that if we just send utf regardless, sooner or later
mailers will assume that 8bit chars in mail headers are utf?

i might hack up a version of marshal that conforms to the standards,
at least superficially.  it would be nice to be able to send mail that
people on other platforms could read. we're not so far away.

  rog.



^ permalink raw reply	[flat|nested] 26+ messages in thread
* [9fans] mime
@ 2000-11-21 19:37 presotto
  2000-11-21 20:04 ` Boyd Roberts
  2000-11-21 23:25 ` rob pike
  0 siblings, 2 replies; 26+ messages in thread
From: presotto @ 2000-11-21 19:37 UTC (permalink / raw)
  To: 9fans

If I were willing to do it correctly, I should have the
smtp program (my only MTA) determine the transfer-encoding
and not marshal.  It only needs to convert if it can't get
an esmtp conection.

Seriously though.  What brain damaged piece of dog
excrement program would throw away the 8th bit of every
byte in today's world?  Smtp's requirement for
7 bit clean was acknowledgement of systems of the time
that really did push messages across serial lines with
parity and/or used the 8th bit in mail files as a signalling
channel.  However, I really don't know of any such these
days.  Every smtp I call up seems perfectly happy to pass
on all 8 bits despite what the RFC says.


^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~2000-11-23 19:49 UTC | newest]

Thread overview: 26+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2000-11-22 15:19 [9fans] mime forsyth
2000-11-22 15:23 ` Boyd Roberts
2000-11-22 20:30   ` Boyd Roberts
  -- strict thread matches above, loose matches on Subject: below --
2000-11-23 12:21 rog
2000-11-23 19:49 ` Boyd Roberts
2000-11-22 21:13 presotto
2000-11-22 20:49 presotto
2000-11-22 21:26 ` Boyd Roberts
2000-11-23 10:18 ` Lyndon Nerenberg
2000-11-22 15:46 rog
2000-11-22 15:02 ` Boyd Roberts
2000-11-22 15:16   ` Nigel Roles
2000-11-22 23:28   ` Steve Kilbane
2000-11-22 22:37     ` Boyd Roberts
2000-11-22 15:07 ` Boyd Roberts
2000-11-22 13:12 rog
2000-11-22 14:21 ` Boyd Roberts
2000-11-21 19:37 presotto
2000-11-21 20:04 ` Boyd Roberts
2000-11-21 23:07   ` Steve Kilbane
2000-11-22 13:50     ` Boyd Roberts
2000-11-21 23:25 ` rob pike
2000-11-21 23:31   ` Boyd Roberts
2000-11-22 10:31     ` matt
2000-11-22 14:00       ` Boyd Roberts
2000-11-22 14:53   ` Boyd Roberts

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).