From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/13500 Path: news.gmane.org!not-for-mail From: John MACFARLANE Newsgroups: gmane.text.pandoc Subject: Re: pandoc: Cannot decode byte '\xa1': [....]: Invalid UTF-8 stream Date: Mon, 24 Aug 2015 10:06:37 -0700 Message-ID: <20150824170637.GA45262@D25Q40BGFY13.Berkeley.EDU> References: <97277b03-f86a-4120-a07d-eecbf08a17e3@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: quoted-printable X-Trace: ger.gmane.org 1440436023 1771 80.91.229.3 (24 Aug 2015 17:07:03 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 24 Aug 2015 17:07:03 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCJZJHG45QDBBKU65WXAKGQE46KM65Y-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Aug 24 19:06:53 2015 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-vk0-f57.google.com ([209.85.213.57]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1ZTvCu-0002mT-00 for gtp-pandoc-discuss@m.gmane.org; Mon, 24 Aug 2015 19:06:52 +0200 Original-Received: by vkif69 with SMTP id f69sf31531944vki.0 for ; Mon, 24 Aug 2015 10:06:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=from:date:to:subject:message-id:references:mime-version :content-type:content-disposition:content-transfer-encoding :in-reply-to:user-agent:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe; bh=uweEHUhZlnjjHhja5a2qclheI811q5bQxQWzpfkqQ4g=; b=P5M3p6+ifeHtFmYVLgezs/a/oqAP5KYt17McwsNCaA8lCK0wHcyJfN6aj4r3knOQTz m74ymi5IzHcIE7rZIziQCth1SqtziOfVr7G8q27GgOHz+9BmiBxYvp/9WrxtHfVXsfkv Pb2Z77liIxbOZiy/hTEcsltM1hMTwFgx5kaVMUktsAW/1BOYXFsi1qSMA+MYMktjGl/O AMWcgdylzt1TYH+AHUlePpkwxFF4CGSc6tS83myh4azebTHHufdwAa0BoG5HBYgTovW9 jUKLfPI99at7vB4+Z1kI4e2Xyk14mUy0edFxFN7K0 X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20130820; h=x-gm-message-state:from:date:to:subject:message-id:references :mime-version:content-type:content-disposition :content-transfer-encoding:in-reply-to:user-agent:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe; bh=uweEHUhZlnjjHhja5a2qclheI811q5bQxQWzpfkqQ4g=; b=cFy93BVp52iunD23x5uEVdEy53fC3qDhSmv4EAHHBRxGJ7pisZ1uS+rIXXsRZhhtGi cv8TUw4c/NzZ11Xrtx130ep7gl4VowkCu4LAwQZEbL2pFCnDHB/3aLK7yj4vYfVS5ZJ1 Sim5TTPNujFD77HWHLIUZ+OYLVnAPcBE/zs66phr3eNi9xNATNTDaWP2uNbni34qw2B9 fdL2hWbYYK0f+CFOISsnO5LUkQ6AEtCWFr8NZgWKhZRzd+r+lCBztvY8Yxnnyj6fCJDT hIbRLVoMcHGHUz+T+8OG X-Received: by 10.50.88.41 with SMTP id bd9mr261393igb.8.1440436011335; Mon, 24 Aug 2015 10:06:51 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.107.15.29 with SMTP id x29ls1509662ioi.99.gmail; Mon, 24 Aug 2015 10:06:50 -0700 (PDT) X-Received: by 10.67.11.97 with SMTP id eh1mr24269543pad.16.1440436010802; Mon, 24 Aug 2015 10:06:50 -0700 (PDT) Original-Received: from mail-pa0-f49.google.com (mail-pa0-f49.google.com. [209.85.220.49]) by gmr-mx.google.com with ESMTPS id c1si1784334pdg.0.2015.08.24.10.06.50 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 24 Aug 2015 10:06:50 -0700 (PDT) Received-SPF: pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 209.85.220.49 as permitted sender) client-ip=209.85.220.49; Original-Received: by mail-pa0-f49.google.com with SMTP id ti10so28152991pac.0 for ; Mon, 24 Aug 2015 10:06:50 -0700 (PDT) X-Gm-Message-State: ALoCoQn6aVE2+diPU693O7paMejCT+zOR6xuLeZoUF1bczpw1e+ULR47JG2RbJnwNOeHoE0bvg8d X-Received: by 10.66.161.105 with SMTP id xr9mr48081426pab.50.1440436010630; Mon, 24 Aug 2015 10:06:50 -0700 (PDT) Original-Received: from johnmacfarlane.net (li55-134.members.linode.com. [74.82.3.134]) by smtp.gmail.com with ESMTPSA id tt8sm18017268pbc.49.2015.08.24.10.06.48 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 24 Aug 2015 10:06:48 -0700 (PDT) Original-Received: by johnmacfarlane.net (Postfix, from userid 1000) id A8E9CA65A; Mon, 24 Aug 2015 13:06:37 -0400 (EDT) Content-Disposition: inline In-Reply-To: <97277b03-f86a-4120-a07d-eecbf08a17e3-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org> X-PGP-Key: http://johnmacfarlane.net/jgm.asc User-Agent: Mutt/1.5.23 (2014-03-12) X-Original-Sender: jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 209.85.220.49 as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Spam-Checked-In-Group: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:13500 Archived-At: The CSS file is 109,518 bytes, and contains only ASCII characters. This is the portion that causes the problem (confirmed by removing it): ``` @font-face{font-family:'Glyphicons Halflings';src:url(../fonts/glyphicons-h= alfl\ ings-regular.eot);src:url(../fonts/glyphicons-halflings-regular.eot?#iefix)= for\ mat('embedded-opentype'),url(../fonts/glyphicons-halflings-regular.woff) fo= rmat\ ('woff'),url(../fonts/glyphicons-halflings-regular.ttf) format('truetype'),= url(\ ../fonts/glyphicons-halflings-regular.svg#glyphicons_halflingsregular) form= at('\ svg')} ``` I expect, then, that the problem is the following: pandoc is expecting the source files (under `src`) to be UTF-8 encoded, and this is not the case for all of these font files. This needs further investigation. +++ kurt.pfeifle-gM/Ye1E23mwN+BqQ9rBEUg@public.gmane.org [Aug 24 15 07:01 ]: > When I try to create a self-contained HTML document which uses > bootstrap.min.css, I encounter the following error: > pandoc: Cannot decode byte '\xa1': Data.Text.Internal.Encoding.Fus= ion.st >reamUtf8: Invalid UTF-8 stream > > My command line includes --to html --self-contained --css > http://cups.org/css/bootstrap.min.css. > > I=E2=80=99m unable to locate the position of the bye '\xa1' (because = =E2=80=94 quite > untrue to its name! =E2=80=94 this CSS is almost 1 MByte in filesize! > > The problem occurs with any, even the most minimal Markdown input. > Changing the command to --standalone does get rid of the problem (as is > to be expected from the symptoms). > * Is this a problem with the specific CSS? > * Is it a problem with all CSS which are derived from bootstrap.css? > * Or is this a bug in Pandoc? > > =E2=80=8B > > -- > You received this message because you are subscribed to the Google > Groups "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send > an email to [1]pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To post to this group, send email to > [2]pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit > [3]https://groups.google.com/d/msgid/pandoc-discuss/97277b03-f86a-4120- > a07d-eecbf08a17e3%40googlegroups.com. > For more options, visit [4]https://groups.google.com/d/optout. > >References > > 1. mailto:pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org > 2. mailto:pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org > 3. https://groups.google.com/d/msgid/pandoc-discuss/97277b03-f86a-4120-= a07d-eecbf08a17e3-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org?utm_medium=3Demail&utm_source=3Dfooter > 4. https://groups.google.com/d/optout --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/20150824170637.GA45262%40D25Q40BGFY13.Berkeley.EDU. For more options, visit https://groups.google.com/d/optout.