From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/32947 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: "'William Lupton' via pandoc-discuss" Newsgroups: gmane.text.pandoc Subject: Re: Handling binary format of an old word processor Date: Thu, 13 Jul 2023 19:28:57 +0100 Message-ID: References: <6019A3AE-5A17-4857-A428-5B2C5F7AB879@gmail.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="0000000000002d5d7e0600628212" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="12616"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCS4HJ6WSAHBB6MEYGSQMGQE2YLGEKA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu Jul 13 20:29:19 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lf1-f60.google.com ([209.85.167.60]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1qK142-00037y-UL for gtp-pandoc-discuss@m.gmane-mx.org; Thu, 13 Jul 2023 20:29:18 +0200 Original-Received: by mail-lf1-f60.google.com with SMTP id 2adb3069b0e04-4fb9087a677sf1000370e87.1 for ; Thu, 13 Jul 2023 11:29:18 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1689272958; cv=pass; d=google.com; s=arc-20160816; b=ECEjJKBlvYD4Tto/Os2N9IYs0zQ1s90Tw18Qtb2oeDraF9zA02I9jvyJRZ1xFvxdjv +Mc3Sc+KvOrxLo+JvqYVLNOC2gOEuxHqv3yepPUHPSz28QuIn+JwuWkdyUVurzFbpA31 gySQw3dFufFsmFQVwzNyb0FNQgKwiKvKeQ7EGxtkpqHw52d/tHwaVE9BUYr1DjPFq60B FEGwq4FYF81MiSCjHYRwtW+cESm692DxqQd1UsKpNCWuYj5QuxvW2Q/e16o1HhI7yopk ydQgfvr3GWZ5OlwQ5qbyaGFZc1SoTlkXkRgwZV84g/KyDDr2cUlOAn23eLwY+tK204DL jB9g== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:to:subject:message-id:date :from:in-reply-to:references:mime-version:dkim-signature; bh=2l52XU5bsSeP9WflUj2bTVX6p4cc7Oc/TzLf4Por2bI=; fh=emcR+utGLP2SmCxNDUAuodpNNvid3cufIiJqH2SvSmE=; b=VKysNG5u/aYysBzHlPOmB/jZ/sG/YDSGgzgWNj58BbJHO+pwzr5sTlg6g0CsyPyJdw DTxjebD1bBvsZsB3a+2fjqxhjg2VUdhPGRAOpgtY9WOKpzJxOkBG6WRenPLSA5RTg1jn /QnFqQWcRLS/tJRbvprV7nnllPq8D5rfjyllqSHSPBzzrk5XZmO81hR9xrGdKOXNNu71 DJgQOmRjx6aytEJP/hblctPGJRWCUtKzJ+z5OtvEME69gSP0aLBodPd57s6A1drW03dE R6FHNJTraoXBv1wehwkIj3hOQCIJqtqL+LeNXm1PYpBCoGHxMfqi5VUT+4M5i0v+3Re9 VgUg== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@broadband-forum.org header.s=google header.b="b/vm/oyT"; spf=pass (google.com: domain of wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org designates 2a00:1450:4864:20::532 as permitted sender) smtp.mailfrom=wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=broadband-forum.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20221208; t=1689272958; x=1691864958; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:to:subject :message-id:date:from:in-reply-to:references:mime-version:from:to:cc :subject:date:message-id:reply-to; bh=2l52XU5bsSeP9WflUj2bTVX6p4cc7Oc/TzLf4Por2bI=; b=Vc6wlLaeGnXTJOphAseBh/Hf06Mt5/4CZEYPV4H/EDthznpBWY/W1BpbWI5bek3czg l3kAnrPItPr2xMiWEOQY1sW1+Z1w+F0OnI6jxCPX/yA5xICUlnVvD3ujEi5Zl4WPtrZs OARlHm6WyIL5D6XDkVm9xK6vw+K9+F2J0yAgGmj6TkDOW+wd2YrrSASCpF5j2jlIApjt dDu1UJidDC505YrpxQkekRodEZcpfT9TSfZH5vfEVORC9jcVAhszUslzzx/2fMIGDHfD px0+44yvGx8As6T543NzwMqDcZgSjmfeyoawGvJOcAvbugC2U7Bie7D0P4s+2YRmqz85 m5 X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1689272958; x=1691864958; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:to:subject :message-id:date:from:in-reply-to:references:mime-version :x-beenthere:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=2l52XU5bsSeP9WflUj2bTVX6p4cc7Oc/TzLf4Por2bI=; b=iVS9YG8HSHpDdGo0MU9721W38FrF5nRClf6cMDghIdV2jhxGkHHsBEfa6P5jpaTDGM WsnvhJGTpLS79ODwLUh+KIX+BjlbNUG6HFqk9nY81vKNjrMiFcE19RpywS1vSWaPbdSC hq3C1KAaOEu5YNY06MKTSf5TVvrl6Rj82G/8NkwlN5RXG7iSBq6mnbVoTd/tSIHnM+E7 UM9OcnuqEHGIBUmYZXW/jYroPDTp74AZyAZEUuD8YNnrgAyc+vkQZ1h87TFLZsXuhrYf y6OaXoYhK6BzH X-Gm-Message-State: ABy/qLakhE8gho0OuNeLQoXyyooASyRQmrAFok5496byJmSghY3m0lM5 fzHa/5ZuSj1dXV5hUQkFJ+Y= X-Google-Smtp-Source: APBJJlFthkjuvrjt4c50tYpC3J2H6ZO8dtQJStgSk0ML/olAiUSckQU8iNb9kJIeXhdgHo5Wp4OWoA== X-Received: by 2002:a05:6512:234a:b0:4f9:58ed:7bba with SMTP id p10-20020a056512234a00b004f958ed7bbamr2359119lfu.16.1689272957571; Thu, 13 Jul 2023 11:29:17 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a19:675a:0:b0:4f9:5598:d2d0 with SMTP id e26-20020a19675a000000b004f95598d2d0ls310825lfj.2.-pod-prod-09-eu; Thu, 13 Jul 2023 11:29:12 -0700 (PDT) X-Received: by 2002:a2e:8082:0:b0:2b6:effc:8ee1 with SMTP id i2-20020a2e8082000000b002b6effc8ee1mr2161650ljg.5.1689272952123; Thu, 13 Jul 2023 11:29:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689272952; cv=none; d=google.com; s=arc-20160816; b=0MnP+vFDA3aWjERBAjYe77OUX5ltFkmkijACNTDUTsUbnyXG9cUkRjSIiDbxAc+PN9 294x2Fw4CoFwguZW3L9aocqo+iMMa234vkhsLJgQrAXTd3bG8OPuUkY6zfaT4Gw3Brwp xHMx3CUqUmT8mMdGV1Nz7or9wnsETmCeSIE0nXLpp0HZw8JuQoUUmk/AYGI+tlTJ4bKI Pk1TAFh6HQqDXbpp4Befnk/FhY2gKqFERCyrdDH/LIqe81VD2au5RF6EAL00raNiffpn oNMStLHCP53wIkPFfE72HOA4vEjC6GsAQUNeD4JNe+W9xjqCQRNZ7CfvHzdi50k3j4Kg ZDGg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=gZ6wFSO64EkTMXxr+8rnyjQm3I+Z/I+FT7R5SiqNj/w=; fh=ivnXo9bAAU2KTllGhxvfTVg/49Brd+hBVbQOiKwa8jI=; b=IluqUKkB1KiMkxDlzoHmFk+ZXfxSsHTzZmqwEl7qlQebuqCxw4P1rb55N+RV5g7JIF qSfN0Hs5QrSCnTVq/54SzyT7fFFNn+2wAjWFS2RuWbj3jknOawgXlEBifFiyalMcyr6M tWZJ/GcpAKHJ7OC6YFbYYL2XbwyEqzq8mKceu3FEYeuctVA010MeLPyuJKiE999Tum3F ts1CnMoj8MOacdIxDqzErQo8J8PrtN8DFy/0wKA4uEuKn7918OX3Idn0FsVGlpAZMSKz Ue2EUHqlyiI28CjcVOPpwYQZyTTQSOQb7Q6aBBcypFtaReVLxDahX5DD3pkIepwCUs31 AlyA== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@broadband-forum.org header.s=google header.b="b/vm/oyT"; spf=pass (google.com: domain of wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org designates 2a00:1450:4864:20::532 as permitted sender) smtp.mailfrom=wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=broadband-forum.org Original-Received: from mail-ed1-x532.google.com (mail-ed1-x532.google.com. [2a00:1450:4864:20::532]) by gmr-mx.google.com with ESMTPS id p4-20020a2eb104000000b002b6cdd8e7b6si627458ljl.6.2023.07.13.11.29.12 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Thu, 13 Jul 2023 11:29:12 -0700 (PDT) Received-SPF: pass (google.com: domain of wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org designates 2a00:1450:4864:20::532 as permitted sender) client-ip=2a00:1450:4864:20::532; Original-Received: by mail-ed1-x532.google.com with SMTP id 4fb4d7f45d1cf-51de9c2bc77so1297716a12.3 for ; Thu, 13 Jul 2023 11:29:12 -0700 (PDT) X-Received: by 2002:aa7:cd75:0:b0:51d:f74c:1d44 with SMTP id ca21-20020aa7cd75000000b0051df74c1d44mr2973767edb.31.1689272951054; Thu, 13 Jul 2023 11:29:11 -0700 (PDT) In-Reply-To: <6019A3AE-5A17-4857-A428-5B2C5F7AB879-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> X-Original-Sender: wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@broadband-forum.org header.s=google header.b="b/vm/oyT"; spf=pass (google.com: domain of wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org designates 2a00:1450:4864:20::532 as permitted sender) smtp.mailfrom=wlupton-QSt+ys/nuMyEUIsrzH9SikB+6BGkLq7r@public.gmane.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=broadband-forum.org X-Original-From: William Lupton Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:32947 Archived-At: --0000000000002d5d7e0600628212 Content-Type: text/plain; charset="UTF-8" I think you need to use a custom reader. Here's a link to the appropriate bit of the appropriate page. https://pandoc.org/custom-readers.html#bytestring-readers On Thu, 13 Jul 2023, 14:11 Trevor Jenkins, wrote: > I have a group of files created long ago with a word processor for the BBC > Micro. The word processor(WordWide/WordWisePlus), which originated in the > late 1980s, saved documents in a sort of binary format in that some 7-bit > printable characters have the eighth bit set to indicate that the character > was preceded on screen by a carriage return. Other old ASCII control codes > such as SI and SO were used for italics (I think). There are also > encapsulated Epson dot-matrix setup sequences at the start of the files. > Had a few read throughs of the Pandoc/Lua filter documentation but simply > cannot get my head around how these documents could be processed using a > Lua filter for pandoc to convert what is essentially a stream of 8-bit > bytes into a form that can be used to convert to a modern format. > > Would appreciate some pointers to how to handle this task using pandoc and > Lua? Otherwise I will have write some one-off throw away C code that does > not help me in using Lua for future tasks. > > Regards, Trevor. > > <>< Re: deemed! > > -- > You received this message because you are subscribed to the Google Groups > "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit > https://groups.google.com/d/msgid/pandoc-discuss/6019A3AE-5A17-4857-A428-5B2C5F7AB879%40gmail.com > . > -- You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/CAEe_xxgArqRo2zf5%2BQDvhhT7U3ZLz8gH4me9LXBiN_di2qF2DQ%40mail.gmail.com. --0000000000002d5d7e0600628212 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
I think you need to use a custom reader. Here's a lin= k to the appropriate bit of the appropriate page. https://pandoc.org/custom-read= ers.html#bytestring-readers

On Thu, 13 Jul 2023, 14:11 Trevor Jenkins, &= lt;bslwannabe-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wro= te:
I have a group of files created= long ago with a word processor for the BBC Micro. The word processor(WordW= ide/WordWisePlus), which originated in the late 1980s, saved documents in a= sort of binary format in that some 7-bit printable characters have the eig= hth bit set to indicate that the character was preceded on screen by a carr= iage return. Other old ASCII control codes such as SI and SO were used for = italics (I think). There are also encapsulated Epson dot-matrix setup seque= nces at the start of the files. Had a few read throughs of the Pandoc/Lua f= ilter documentation but simply cannot get my head around how these document= s could be processed using a Lua filter for pandoc to convert what is essen= tially a stream of 8-bit bytes into a form that can be used to convert to a= modern format.

Would appreciate some pointers to how to handle this task using pandoc and = Lua? Otherwise I will have write some one-off throw away C code that does n= ot help me in using Lua for future tasks.

Regards, Trevor.

<>< Re: deemed!

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe@googlegroups.= com.
To view this discussion on the web visit https://groups.google.com/= d/msgid/pandoc-discuss/6019A3AE-5A17-4857-A428-5B2C5F7AB879%40gmail.com= .

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://group= s.google.com/d/msgid/pandoc-discuss/CAEe_xxgArqRo2zf5%2BQDvhhT7U3ZLz8gH4me9= LXBiN_di2qF2DQ%40mail.gmail.com.
--0000000000002d5d7e0600628212--