From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/73196 Path: news.gmane.org!not-for-mail From: Russ Allbery Newsgroups: gmane.emacs.gnus.general Subject: Re: Invalid characters, or something else? Date: Sat, 16 Oct 2010 17:20:25 -0700 Organization: The Eyrie Message-ID: <87d3r9r7p2.fsf@windlord.stanford.edu> References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: dough.gmane.org 1287274876 10061 80.91.229.12 (17 Oct 2010 00:21:16 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Sun, 17 Oct 2010 00:21:16 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M21568@lists.math.uh.edu Sun Oct 17 02:21:15 2010 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1P7Gzy-0006PX-SI for ding-account@gmane.org; Sun, 17 Oct 2010 02:21:15 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1P7Gzp-0006x2-Kc; Sat, 16 Oct 2010 19:21:05 -0500 Original-Received: from mx2.math.uh.edu ([129.7.128.33]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1P7Gzn-0006wl-K7 for ding@lists.math.uh.edu; Sat, 16 Oct 2010 19:21:03 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx2.math.uh.edu with esmtp (Exim 4.72) (envelope-from ) id 1P7Gzj-0005zL-P0 for ding@lists.math.uh.edu; Sat, 16 Oct 2010 19:21:03 -0500 Original-Received: from smtp3.stanford.edu ([171.67.219.83] helo=smtp.stanford.edu) by quimby.gnus.org with esmtp (Exim 3.36 #1 (Debian)) id 1P7Gzi-0005sw-00 for ; Sun, 17 Oct 2010 02:20:58 +0200 Original-Received: from smtp.stanford.edu (localhost.localdomain [127.0.0.1]) by localhost (Postfix) with SMTP id D58001A0ED1 for ; Sat, 16 Oct 2010 17:20:26 -0700 (PDT) Original-Received: from windlord.stanford.edu (windlord.Stanford.EDU [171.67.225.134]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by smtp.stanford.edu (Postfix) with ESMTPS id 9D9141A0DCF for ; Sat, 16 Oct 2010 17:20:25 -0700 (PDT) Original-Received: by windlord.stanford.edu (Postfix, from userid 1000) id 743112F466; Sat, 16 Oct 2010 17:20:25 -0700 (PDT) In-Reply-To: (Lars Magne Ingebrigtsen's message of "Sun, 17 Oct 2010 02:05:09 +0200") User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.2 (gnu/linux) X-Spam-Score: -4.9 (----) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:73196 Archived-At: Lars Magne Ingebrigtsen writes: > Some Gwene groups have some characters that can't be parsed as utf-8 (or > something). It's usually dash characters or the like. > Like this: > --- > other easy activities=C2=97talking, chewing gum=C2=97and suggest=20 > --- > Is that valid? If not, what's the likely explanation where it's coming > from? They're from Windows-1252 (or 1250). Octal 227 is a dash in that charset. Microsoft users tend to send unlabelled Windows code pages. http://en.wikipedia.org/wiki/Windows-1252 --=20 Russ Allbery (rra@stanford.edu)