Date: Fri, 13 Oct 2023 20:29:17 +0000
From: la-ninpre
To: 9fans@9fans.net
Subject: [9fans] utf-8 handling oddities
Message-ID: <1597A7B3-09D5-443F-B372-8B28F5F2B059@aaoth.xyz>

greetings, 9fans.

recently i have been studying utf-8 encoding and decided to look at how it is handled in plan 9.
i thought that since plan 9 was the first application of this encoding, it made sense to look at its implementation. the fact that this implementation was written by the designers of the encoding themselves only reinforced that decision.

so i grabbed the last release tarball from p9f.org and studied it. but when i was testing some other implementations to compare how each handles encoding/decoding errors, i noticed that the same code linked with plan9port's lib9 behaves differently (or, may i say, incorrectly) from the original plan 9 implementation when dealing with surrogate halves. i started digging through archived versions of the same code, only to find out that the implementation changed only after the release of the fourth edition. specifically, i looked at the /sys/src/libc/port/rune.c file. the version that i studied was taken from the so-called 'latest release' on the p9f page. the timestamp on that file says that it was last modified in 2013, while the rest of the code is timestamped 2002. the inferno os source code also had this change ported to it around the same time.

if i understand it correctly, unicode extended past the BMP in 1996 with the release of unicode 2.0. plan 9 had two editions released after that, but, assuming of course that the archives on p9f are indeed correct, the implementation didn't reflect the change until 2013 (and that's why the old code propagated to both plan9port and 9front). so, maybe someone knows why that is the case? i'd appreciate any input on this, or pointers to any information resources that you may know of.

best regards,
la ninpre.

------------------------------------------
Permalink: https://9fans.topicbox.com/groups/9fans/T8384b8174eb88096-M127761f645d18b8419fc4f9b
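
to make the surrogate-half difference mentioned above concrete, here is a minimal sketch in c. it is not taken from rune.c or from lib9: the names decode3_lenient, decode3_strict and BAD_RUNE are hypothetical, and the sketch only handles three-byte sequences. it assumes the two behaviours being compared are "let an encoded surrogate half through as a code point" versus "treat it as a decoding error", the latter being what the utf-8 definition in rfc 3629 requires for U+D800..U+DFFF.

```c
#include <assert.h>
#include <stdint.h>

/* U+FFFD, used in this sketch as the "decoding error" value (an assumption) */
#define BAD_RUNE 0xFFFDu

/*
 * hypothetical helper: decode exactly one three-byte utf-8 sequence.
 * checks the lead/continuation bit patterns and rejects overlong forms,
 * but does not look at the surrogate range.
 */
uint32_t
decode3_lenient(const unsigned char *p)
{
	uint32_t r;

	assert(p != 0);
	if((p[0] & 0xF0) != 0xE0 || (p[1] & 0xC0) != 0x80 || (p[2] & 0xC0) != 0x80)
		return BAD_RUNE;
	r = ((uint32_t)(p[0] & 0x0F) << 12)
	  | ((uint32_t)(p[1] & 0x3F) << 6)
	  |  (uint32_t)(p[2] & 0x3F);
	if(r < 0x800)		/* overlong: fits in fewer bytes */
		return BAD_RUNE;
	return r;
}

/*
 * hypothetical helper: same as above, but additionally rejects
 * surrogate halves U+D800..U+DFFF.
 */
uint32_t
decode3_strict(const unsigned char *p)
{
	uint32_t r;

	r = decode3_lenient(p);
	if(r >= 0xD800 && r <= 0xDFFF)
		return BAD_RUNE;
	return r;
}
```

for the bytes ED A0 80 the lenient variant yields 0xD800, while the strict one yields BAD_RUNE; for a well-formed sequence such as E2 82 AC both yield 0x20AC.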