From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/32245 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: Error caused by document length Date: Mon, 27 Feb 2023 08:08:25 -0800 Message-ID: <0AFB3E23-B7C1-49E8-9F8A-12716F6A2C40@gmail.com> References: <7ed278f7-071b-4bcc-9f9a-e9dd5c09ee55n@googlegroups.com> <8f11cfaf-7c36-4cc6-9866-aa3741d965a4n@googlegroups.com> <4bd152b5-32f7-4f4c-9a9b-0d20afebea84n@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3696.120.41.1.2\)) Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="20019"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDW7ZIEHTIIBB7FK6OPQMGQEBVK76AA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Feb 27 17:08:33 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-il1-f189.google.com ([209.85.166.189]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1pWg3E-00050u-1A for gtp-pandoc-discuss@m.gmane-mx.org; Mon, 27 Feb 2023 17:08:32 +0100 Original-Received: by mail-il1-f189.google.com with SMTP id k2-20020a056e0205a200b0031703f4bcabsf4105892ils.0 for ; Mon, 27 Feb 2023 08:08:31 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1677514111; cv=pass; d=google.com; s=arc-20160816; b=bs8ueA7B0HwaaOFK4w0UeyHSrMkAQDJYPqGxrVCoc/PDriwrYPlnFkTrIjCb4F//+6 dF3qAvd2w96SpvFDaaz7OlLb2oTm2K1PEnDxf0VpS7rwtBOIPMjQsFNUoCA2x3hh+Gp3 MKzYzEcUX3pg3dj3CaExa+CiTLV49n0oyBkSS51eXF5s05xEzkawyF7/vKfc47FMAPR6 V0W/AVPDkpzHURPyfYh01Za8GoyD7SMbAY4gR5CuAu5hPqVp+lyLbBZ4PkW/GTWLv/j2 KvgO7YtYqLOxFoTuNmZ1coDDsdL2FMIYKFBA6K2KIypt8olfCh3cub8S5uGzU9eRjvgr w/Zw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:message-id:in-reply-to:to :references:date:subject:mime-version:content-transfer-encoding:from :sender:dkim-signature:dkim-signature; bh=cJy0nIoqsFStLwWDjzC72uSRozkmaz6De4orsuShXzw=; b=vkDknzn/aLHLZkvtslLmCERBcGQUew8Cdrxnfx9WO85ezy0gcygav731pH8rYCGQzW +fejgMgYj+sFK/bB07L4Dutesyx5Ko3ZepIsEOV8ygyX0p3KvfIuJ3tD0p/ysv6GVmK1 0oD2ZC6PZJyvsnbHb16Ngq69AmFhsEpTAwaDuozUdXHj+8uRKmIIHF+7V4Fy9jrEsO79 XVA//VuVaBQv6jIMDrppypMSvqQmylTk/qUbDowe0jLy1qANpfZ8s1Y6goZShdq7/NNy rxjhE9SfezmZcVW+8Bwbhps8gd27dNoZX8yrLhlCOnHGAMvo6U/w1JBqJcPR6KKJz/w2 i9Dg== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b="PeZP3/sf"; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:sender:from:to:cc:subject:date :message-id:reply-to; bh=cJy0nIoqsFStLwWDjzC72uSRozkmaz6De4orsuShXzw=; b=hb28EwydoA3tDoT6RBtxlUxcCu+vLrVYtQy04gSA2Kca7L9B4LCARBX+mAdMQ7AVQR 42DTWix9+HJDdR4pvHg33VNUiHtdRYA5yDykRm8RJ8X78SY5irmUvraRg5KU8Q7eAYOp BV7AOKVJ2+5smAGPthizGHH5DnmZRdKQo3rS9dMpHg3umWO+qzydc0gAUJxyUuKuXSDc 6hkNPcTaQ+y5WQ4kkZJXNSD73bOLL+CNmb2x7FHD60Hi9fZDF3lGwZRWM2Df8Xz6GZZm dNSXEPOqHIfPz4y0RWcyxSy1/PTb7LW7rx4zSA46gRZ1/t7Q0quFycYM5pMGfaeXI DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:from:to:cc:subject:date:message-id :reply-to; bh=cJy0nIoqsFStLwWDjzC72uSRozkmaz6De4orsuShXzw=; b=bUC0p5hnJ/j38fAFzjj65q3qdSSJt4XPi10uFckdWWwoiZVeSwCLzG89eE+GbSffok QqgbLBA5umky4we+XiuZvT+twdQf6oZjsJRHKZO7wRaQOnPduKDWUUfqZ7VA0XQ7mXP7 pfNYQjTUz3J/Pvt7O0BORvrQpZxjU65/SwiVIRilY4863dC164AiUjeFfWeOup3UmLAt VzsuoHvuRRURlaWvq/JtXWTEZKYqO2aZxmY1/bKhUBVALIc0iSDRNfjmfaQwGGJLj2+h dx4zlqYjGdamy/LKhPq2JzSmse0QMtYs+nqe2/qbqWHKDhnHXuZ39HeP0fm5F65CO5dR Z X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:message-id :in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:x-gm-message-state:sender:from:to:cc :subject:date:message-id:reply-to; bh=cJy0nIoqsFStLwWDjzC72uSRozkmaz6De4orsuShXzw=; b=bpQu0fQw/9wf22uyLv3g7cYq/VPLhZlnDD+i7cmOV2zc3BK1y7Evps1FGVwwPwpB0w radfWrwsPRhBYkfg0YtkuBgfcIGDdQxy6CehXcDwvS097sUm9ZNelpQoT4Lc0YAtue3V Hk1SAZloYf8TAYwDrevpfneuuZhU15V9TJmxlEgmh98VxTzmI7DeJhTS14z9viiUfXWr 8yMXM7y67CBRHZuEOPQj0qhfzGsZe1Drr+iTdnQqYs98efh/5lUErADbZO0/WxK5jlAo EOH7eK7EWPQGjmF7ga49 Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AO0yUKUg4CWbCsHQ+keDJenyITnwqs7MbO6FkrCdmExyJ2Rjd6B+1rvu mR5HQPq+si7Dl7Zvq6HEGa8= X-Google-Smtp-Source: AK7set+VWGOZLceCdsF1mtsOQ66Pd7hPeysYergMmlXV6qO2nfwwdQ753kQbE4zlUhXE5/Bs9NYhlQ== X-Received: by 2002:a6b:8d56:0:b0:744:d7fc:7a4f with SMTP id p83-20020a6b8d56000000b00744d7fc7a4fmr6109390iod.1.1677514110902; Mon, 27 Feb 2023 08:08:30 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6e02:1290:b0:315:9b6d:97d8 with SMTP id y16-20020a056e02129000b003159b6d97d8ls4221817ilq.9.-pod-prod-gmail; Mon, 27 Feb 2023 08:08:28 -0800 (PST) X-Received: by 2002:a05:6e02:1c84:b0:317:641e:1088 with SMTP id w4-20020a056e021c8400b00317641e1088mr2403143ill.19.1677514108136; Mon, 27 Feb 2023 08:08:28 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1677514108; cv=none; d=google.com; s=arc-20160816; b=tC5o9szsuJs5zBsaypCvKbZlws7py34cOIWLHLyObKhsTa5r4MmzMEp+ftf8GKQ1x2 bX8fvd09x4TgWvgFku8JvxvXVz3Tl1w3dlu0SRRmR6VDNgWuaBe9T7Pzl8efnxzu/Dxs 70fRejH9XS4JALEobS6Bg7AnX432fT61gESIponrLImVPfwoTUJdV9Jl2n8srF37JFAD k6KmM5/FH/jeNhvfaUSxQHGSP6r+IDXIh26SI7VPnmluyFoXZ1syPWD7xCmRHPRtq1yY HcZ+TclCNkrdFyb6QZoNFPEaakwkgkhsa0JTx9ieb9g16pv/pvto9tCMlZiDQ4WdSeA6 KUUQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=message-id:in-reply-to:to:references:date:subject:mime-version :content-transfer-encoding:from:dkim-signature; bh=6oKXe+LMaTrBzWto2uXK50kNqNncsv9dXDyghRPsinI=; b=VezGwH6GxuxLnduVHMyuEh8M8Hs8bdn95pvHWw0oOjCYNPxFFMmNHvolBgdJ2P5HKq 6q3ksxAp08/kNAruBd19raYmribgoe6CAPre5lyA9lR0L3gJN75UNdOQUNOqo5OHmGuK ww6l0pKhhFk3UsJW50sA3CfL3zd7z9f7FM8i5taaG0dSFyuABTFuJDSVOvagZ8pCmPlU xPLiOIXEmGqoOlbeXeC1q1fL/tcTmYX1lby6pKoMpDqwduk1r+Y+6MMUXmA7tjFwtLDV MHz8l9lJfh9SQp+B+rDbtDvAyagix8lyuUFxSkzOPvj8wc6QQkea7elzooCzsyKDZPvn mrHg== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b="PeZP3/sf"; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Original-Received: from mail-pj1-x102d.google.com (mail-pj1-x102d.google.com. [2607:f8b0:4864:20::102d]) by gmr-mx.google.com with ESMTPS id m41-20020a026d29000000b003c5196a884dsi528338jac.5.2023.02.27.08.08.28 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 27 Feb 2023 08:08:28 -0800 (PST) Received-SPF: pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) client-ip=2607:f8b0:4864:20::102d; Original-Received: by mail-pj1-x102d.google.com with SMTP id oj5so2674369pjb.5 for ; Mon, 27 Feb 2023 08:08:28 -0800 (PST) X-Received: by 2002:a05:6a20:3d81:b0:cc:32aa:8570 with SMTP id s1-20020a056a203d8100b000cc32aa8570mr17652004pzi.14.1677514107358; Mon, 27 Feb 2023 08:08:27 -0800 (PST) Original-Received: from smtpclient.apple ([2601:644:4780:3350:f019:7ecb:dd5:ac38]) by smtp.gmail.com with ESMTPSA id v14-20020aa7808e000000b005dca6f0046dsm4592138pff.12.2023.02.27.08.08.26 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Mon, 27 Feb 2023 08:08:26 -0800 (PST) In-Reply-To: X-Mailer: Apple Mail (2.3696.120.41.1.2) X-Original-Sender: fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b="PeZP3/sf"; spf=pass (google.com: domain of fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2607:f8b0:4864:20::102d as permitted sender) smtp.mailfrom=fiddlosopher-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:32245 Archived-At: You could try running epubcheck on the epub produced by pandoc, to see if i= t points to anything. > On Feb 27, 2023, at 6:33 AM, 'Peter Vedal Utnes' via pandoc-discuss wrote: >=20 > I just did some further testing, and replaced the sections that I would o= therwise have removed with as many words and paragraphs, but no signs, only= "test test test" etc. The document then works. So I was wrong about the le= ngth: It must be some character or symbol producing the error (only with pa= ndoc, not other EPUB converters). Any idea how to further isolate it, or ho= w to circumvent with a pandoc command or template? >=20 > Thanks for the help so far, Bernardo. >=20 >=20 >=20 > mandag 27. februar 2023 kl. 15:23:57 UTC+1 skrev Peter Vedal Utnes: > I am not sure what you mean by normalize in this context. I'll elaborate = in case this is what you mean: In the interest of removing variables that m= ight interfere with troubleshooting, I have copied the text from research p= apers (not just one, but a few), pasted it in notepad, copied and pasted it= back into a new word-file (this is more thorough than "clear formatting"),= ran this "pure" file through pandoc and I get the error. If I then randoml= y shorten the file, the error disappears. This is not the case for my "test= " file, but only for research papers, which is baffling. I can only assume = that pandoc responds to something like a character or in-text references in= particular contexts, or as was my original hypothesis, the number of lines= or columns in the EPUB.=20 >=20 > mandag 27. februar 2023 kl. 15:17:10 UTC+1 skrev bernardov...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org: > Have you tried editing the original research paper in some minor way (add= ing or removing a couple of characters) and then running it? This is a comp= letely wild guess, but maybe the text in the file is getting normalized upo= n editing them, whereas the original research paper still contains the uned= ited, unnormalized text. >=20 > On Mon, Feb 27, 2023 at 10:48=E2=80=AFAM 'Peter Vedal Utnes' via pandoc-d= iscuss wrote: > I thank you for the suggestion. It is proving somewhat hard to (dis)confi= rm. I have made a testfile with just the word "test" pasted over and over a= gain, with and without various formatting and with the same length or longe= r as the proper papers. This file consistently works. But when I attempt to= do it with a regular research paper, it only works if I shorten it. Curiou= sly, I can remove either half of the main text, or indeed sections here and= there, randomly, and it works, but not with all of them present. I have co= mbed it for special characters or tags, but cannot find any.=20 >=20 > mandag 27. februar 2023 kl. 13:49:58 UTC+1 skrev Bernardo C. D. A. Vascon= celos: > I do not know the answer to this problem in particular, but perhaps it is= worth checking the main document and the bibliography for invisible contro= l characters (e.g. `\X{A0}`). They tend to cause all sorts of strange probl= ems that result in random error msgs. >=20 > On Monday, February 27, 2023 at 8:16:20=E2=80=AFAM UTC-3 Peter Vedal Utne= s wrote: > We have a workflow in Open Journal Systems where we use Pandoc to convert= word documents to EPUB, and then display them with an embedded EPUB app (B= ibi).=20 >=20 > Our resulting EPUBs work fine with both debuggers and viewers like calibr= e. They work in Bibi, but only when they are reduced to a certain length. W= henever the files exceed approx 100 lines or 600 words, Bibi claims: >=20 > TypeError: Cannot read properties of undefined (reading =E2=80=98getAttri= bute=E2=80=99) >=20 > Meanwhile, the same documents works when converted to EPUB using other co= nverters, or when I reduce the length (length, not size in bytes-- I've tri= ed with graphics, still works). It suddenly works when I reduce the length = by removing pure paragraph text, even though all the formatted elements (ab= stract, references, etc) are the same.=20 >=20 > I recognize that this problem is very specific to the interrelation pando= c <-> Bibi, but I'd be grateful for general troubleshooting suggestions.=20 >=20 > Thanks in advance,=20 >=20 > Peter >=20 >=20 > --=20 > You received this message because you are subscribed to a topic in the Go= ogle Groups "pandoc-discuss" group. > To unsubscribe from this topic, visit https://groups.google.com/d/topic/p= andoc-discuss/hPUa1uWGS_k/unsubscribe. > To unsubscribe from this group and all its topics, send an email to pando= c-discus...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/4bd152b5-32f7-4f4c-9a9b-0d20afebea84n%40googlegroups.com. >=20 > --=20 > You received this message because you are subscribed to the Google Groups= "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/bc147d77-69c9-4e5d-82a6-e149f662a823n%40googlegroups.com. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/0AFB3E23-B7C1-49E8-9F8A-12716F6A2C40%40gmail.com.