From mboxrd@z Thu Jan  1 00:00:00 1970
To: Fans of the OS Plan 9 from Bell Labs <9fans@9fans.net>
In-reply-to: Your message of "Mon, 05 Jan 2015 21:52:12 GMT." <5e85d81160785cdf717f01a8c0649731@quintile.net>
References: <5e85d81160785cdf717f01a8c0649731@quintile.net>
Date: Mon, 5 Jan 2015 14:31:00 -0800
From: Bakul Shah
Message-Id: <20150105223100.106B2B827@mail.bitblocks.com>
Subject: Re: [9fans] I don't understand utf8 (it seems)
Topicbox-Message-UUID: 395ed6aa-ead9-11e9-9d60-3106f5b1d025

On Mon, 05 Jan 2015 21:52:12 GMT "Steve Simon" wrote:
> I am trying to parse a stream from a tcp connection.
>
> I think the data is utf8, here is a sample
>
> 20 2d 20 c8 65 73 6b fd 20 72 6f 7a 68 6c 61 73
>
> which when I print it I get:
>
>   -    e s k    r o z h l a s
>      ^        ^
>   missing  missing
>
> there are two missing characters. Ok, bad UTF8 perhaps?

According to
	http://www.isthisthingon.org/unicode/index.php
this is an invalid UTF8 hex code.
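
The same check can be done locally. What follows is only a rough sketch in Python, not anything from the original messages: the helper name first_utf8_error is made up for illustration, and the ISO 8859-2 decode in the last line is purely a guess at the real encoding, since nothing in the thread so far says what the sender used.

```python
# Sample bytes copied from the quoted message.
SAMPLE = bytes.fromhex("20 2d 20 c8 65 73 6b fd 20 72 6f 7a 68 6c 61 73")


def first_utf8_error(data: bytes):
    """Return (offset, reason) of the first invalid UTF-8 sequence, or None.

    Hypothetical helper for illustration; it just wraps Python's strict
    UTF-8 decoder and reports where decoding stopped.
    """
    try:
        data.decode("utf-8")  # strict mode raises on the first bad byte
    except UnicodeDecodeError as err:
        return err.start, err.reason
    return None


if __name__ == "__main__":
    # Where does strict UTF-8 decoding fail, and why?
    print(first_utf8_error(SAMPLE))

    # Decode leniently: each undecodable byte becomes U+FFFD.
    print(repr(SAMPLE.decode("utf-8", errors="replace")))

    # ASSUMPTION: try ISO 8859-2 (Latin-2) as a candidate single-byte
    # encoding. This is a guess, not something stated in the thread.
    print(SAMPLE.decode("iso8859_2"))
```

Strict decoding stops at offset 3, the c8 byte: c8 is a lead byte for a two-byte sequence, so the next byte would have to lie in the range 80..bf, and 65 does not. The fd byte further on can never appear in UTF-8 at all. Those are the two positions that print as missing.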