From mboxrd@z Thu Jan 1 00:00:00 1970
X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/13925
From: John MacFarlane
Newsgroups: gmane.text.pandoc
Subject: Re: Plain text input
Date: Tue, 17 Nov 2015 08:16:47 -0800
Message-ID: <20151117161647.GC5230@MacBook-Air-2.local>
References: <8e9e08e4-1eb5-4620-9a0d-5afa1d31166b@googlegroups.com> <564B2F5C.9050504@gmail.com>
Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
User-Agent: Mutt/1.5.23 (2014-03-12)

If you just want the text as one big string, you can
try running this Haskell program:

-- plain.hs
import Text.Pandoc.Definition

-- Wrap all of stdin in a single literal Str, so nothing in it is parsed
-- as markdown, then print the document in pandoc's native format.
main = do
  inp <- getContents
  print $ Pandoc nullMeta [Para [Str inp]]

Run as:

runghc plain.hs < inputfile.txt | pandoc -f native -o my.pdf

But you probably want to retain some structural formatting,
e.g. paragraphs?

+++ Neil Youngman [Nov 17 15 06:15 ]:
> On Tuesday, 17 November 2015 13:45:09 UTC, lukshuntim wrote:
>
> > On 11/17/2015 06:12 PM, Neil Youngman wrote:
> > > I'm trying to find something that will convert CJK plain text to a
> > > graphical format. So far the only likely candidate I have found is
> > > Pandoc with GNU unifont to convert to PDF and then ghostscript to
> > > produce TIFF.
> > >
> > > Pandoc does not appear to have a plain text input format, so if I go
> > > down this route, I may have to preprocess the text and escape any
> > > markdown characters. Is there an option I've missed to turn off
> > > markdown completely, a pre-existing script to preprocess the text or
> > > some other more suitable program that I should be considering?
> >
> > Hi Neil,
> > There should be no problem for pandoc if your CJK text is in utf-8.
> > Regards,
> > ST
>
> The problem is making sure that it doesn't interpret any punctuation as
> markdown. Verbatim might be useful, but if I put backticks around
> Japanese characters they disappear from the output.
> An option to turn off markdown completely would be great, but as a
> non-Haskell programmer I can't just dive into the source.
> I am currently thinking I will wrap backticks around all characters in
> the ASCII range except backticks.
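
A minimal sketch of a paragraph-preserving variant of plain.hs, not something posted in the thread: it assumes paragraphs in the input are separated by blank lines, and that pandoc-types 1.20 or later is installed, where Str takes Text rather than String. The file name paras.hs and the helpers paragraphs and toBlock are made up for illustration. Each input line becomes its own literal Str, so punctuation is never read as markdown.

```haskell
-- paras.hs
-- Illustrative sketch only. Assumes pandoc-types >= 1.20 (Str takes Text).
import           Data.List (intersperse)
import qualified Data.Text as T
import qualified Data.Text.IO as TIO
import           Text.Pandoc.Definition

-- Group the input lines into paragraphs, splitting at runs of blank
-- (empty or whitespace-only) lines.
paragraphs :: T.Text -> [[T.Text]]
paragraphs = go . T.lines
  where
    go ls = case break isBlank (dropWhile isBlank ls) of
      ([], _)      -> []
      (para, rest) -> para : go rest
    isBlank = T.null . T.strip

-- One Para per paragraph; lines inside a paragraph are literal Str
-- values separated by hard line breaks.
toBlock :: [T.Text] -> Block
toBlock = Para . intersperse LineBreak . map Str

main :: IO ()
main = do
  inp <- TIO.getContents
  print $ Pandoc nullMeta (map toBlock (paragraphs inp))
```

This should run the same way as plain.hs:

runghc paras.hs < inputfile.txt | pandoc -f native -o my.pdf

Joining lines with LineBreak keeps the original line structure, which avoids inserting spaces between lines of CJK text; whether that is the right choice depends on the input.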