From: John MacFarlane
To: Gary Glass, pandoc-discuss
Newsgroups: gmane.text.pandoc
Subject: Re: File splitting bug
Date: Fri, 09 Jul 2021 11:34:44 -0700

What exact command line are you using?
Does pandoc --version work?

Gary Glass writes:

> Well for some reason that doesn't work. The pandoc.exe just hangs when I
> run it.
>
> On Wednesday, July 7, 2021 at 6:48:04 PM UTC+2 John MacFarlane wrote:
>
>>
>> You can try a nightly.
>> https://github.com/jgm/pandoc/actions/runs/1007239404
>>
>> Gary Glass writes:
>>
>> > Is there an installer for that rev? I'll be happy to test it.
>> >
>> > On Tuesday, July 6, 2021 at 7:26:26 PM UTC+2 John MacFarlane wrote:
>> >
>> >>
>> >> OK, I think I've fixed this in
>> >> commit f88ebf3ebf49e00ffa12778caf6817cc34459e6a
>> >>
>> >> John MacFarlane writes:
>> >>
>> >> > Another big clue: if you remove the <col> elements
>> >> > from the <colgroup>, it works again.
>> >> > It also works if you use
>> >> >
>> >> > instead of
>> >> >
>> >> > John MacFarlane writes:
>> >> >
>> >> >> Thank you for the minimal test case!
>> >> >> Actually one can see the issue just with
>> >> >>
>> >> >>     pandoc --section-divs bug.md
>> >> >>
>> >> >> At the end there is
>> >> >>
>> >> >> where you'd want
>> >> >>
>> >> >> The difference is that, with the colgroup, the <div> tags are
>> >> >> being parsed as raw HTML blocks, while without it, we get a
>> >> >> native Div in the AST (which is what we want in this case).
>> >> >>
>> >> >> Somehow the colgroup is interfering with parsing of the native
>> >> >> Div.
>> >> >>
>> >> >> If you don't mind reporting this at
>> >> >> https://github.com/jgm/pandoc/issues (including this information)
>> >> >> it will help us keep track. Looking at the code, I currently
>> >> >> have no idea why this is happening.
>> >> >>
>> >> >> Gary Glass writes:
>> >> >>
>> >> >>> Here's the simplest file I could make to repro the issue. The pandoc
>> >> >>> command is very simple:
>> >> >>>
>> >> >>>     pandoc --output=bug.epub --to=epub3 bug.md
>> >> >>>
>> >> >>> It produces an HTML file with a mismatched section tag.
>> >> >>>
>> >> >>> If you comment out the colgroup, the output is fine.
>> >> >>>
>> >> >>> On Friday, July 2, 2021 at 6:55:27 PM UTC+2 John MacFarlane wrote:
>> >> >>>
>> >> >>>>
>> >> >>>> Pandoc won't emit invalid HTML itself, but if you include
>> >> >>>> invalid HTML, it just dutifully passes it through verbatim.
>> >> >>>>
>> >> >>>> Checking HTML syntax is not pandoc's job. Use epubcheck
>> >> >>>> to verify the EPUB if you like.
>> >> >>>>
>> >> >>>> Gary Glass writes:
>> >> >>>>
>> >> >>>> > I figured out the source of the issue. I had an html table in the
>> >> >>>> > markdown and I added a colgroup to the table. The colgroup caused
>> >> >>>> > the problem. Removing it made it go away.
>> >> >>>> >
>> >> >>>> > Colgroup is not a commonly used tag (in my experience), but I think
>> >> >>>> > the bug is that pandoc shouldn't just emit invalid epub html when
>> >> >>>> > the source code is valid, even if it doesn't know what to do with
>> >> >>>> > it. Report an error or something!
>> >> >>>> > The html looked something like this:
>> >> >>>> >
>> >> >>>> > ...
>> >> >>>> > .........
>> >> >>>> >
>> >> >>>> > On Thursday, July 1, 2021 at 5:57:57 PM UTC+2 John MacFarlane wrote:
>> >> >>>> >
>> >> >>>> >>
>> >> >>>> >> No ideas. We'd have to see the actual files to know more.
>> >> >>>> >>
>> >> >>>> >
>> >> >>>> > --
>> >> >>>> > You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
>> >> >>>> > To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
>> >> >>>> > To view this discussion on the web visit
>> >> >>>> > https://groups.google.com/d/msgid/pandoc-discuss/38ac5d4c-8cba-4c23-a313-bf81e79779e7n%40googlegroups.com.
>> >> >>>>
>> >> >>>
>> >> >>> # header 1
>> >> >>>
>> >> >>>
>> >> >>> ## header 2
>> >> >>>
>> >> >>> abc
>> >> >>> xxxxxxxxx
>> >> >>>