From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/24952 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: John MacFarlane Newsgroups: gmane.text.pandoc Subject: Re: HTML =?utf-8?Q?=E2=86=92?= EPUB: Either "Out of memory" or "openBinaryFile: invalid argument (Invalid argument)" Date: Wed, 22 Apr 2020 08:58:21 -0700 Message-ID: References: <879425ff-d491-4d0b-8ffe-db24ad9cce23@googlegroups.com> <14c0eaf0-b920-477c-a735-dded7f1df0c5@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="121167"; mail-complaints-to="usenet@ciao.gmane.io" To: Heck Lennon , pandoc-discuss Original-X-From: pandoc-discuss+bncBCJZJHG45QDBBK6TQH2QKGQETRDFKFA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Wed Apr 22 17:58:39 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-pj1-f63.google.com ([209.85.216.63]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1jRHlm-000VN0-VM for gtp-pandoc-discuss@m.gmane-mx.org; Wed, 22 Apr 2020 17:58:38 +0200 Original-Received: by mail-pj1-f63.google.com with SMTP id l40sf2191542pjb.8 for ; Wed, 22 Apr 2020 08:58:38 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1587571117; cv=pass; d=google.com; s=arc-20160816; b=HZyeoLKaSnWjlXWFJuxA6lPUUvJx52oex16rQmo10ccGuV1H2UybVKZLSKYVgdEVY5 v65Nxe3QgOwDeZzBAK+/qe0ViudbOT4kjVas//NF7uo03DksT/pzw2Q+tCLHYQen90tB LIo+hX4lP3vTUfJyioITpEvJR3qLMWX89ouRWZ6/93w/JT9B9s8fD5pFdCEOZ1YPebut howfBnom4xJZf90hvHYwjHCdoRmjgqKTWOEcSAux8sJuZNOaMP4cFPcVftBCQIvfsJEa e1IIzHsy/4GlcDDE28oA4r2Grt3N93iUlTlLXSaOMj+KH/ZWwpI36NDlVRg1y8bg9sI/ bm9A== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-transfer-encoding :mime-version:message-id:date:references:in-reply-to:subject:to:from :sender:dkim-signature; bh=+vJU3Gp4p+LZrkmZdU6n3l+h6u0oT0xoztpCOqK731c=; b=Q27af6ZrmOaLGyHeLMZj+29trI65tpaMkgCiBNwY5toWiurSaLtBJF0HXhVLMSSLEg +MuJiJ6YS/M0Ud3drsy8/BONBDKBAOJmNvIr4VM6C5mggnP0M06gHjgk1tMQFnHsDlfq vC6M8MZ71O1UpT23KRdQSfTPYK3MWYd2nzwe/RHT+a6rd0D1xJatg+rLAy5zE1LzZ3XV 118/ViRXnFznPu3rZAWi27a+WJWmxBkHBP557A5DJLpvWAMuGc90UgShRbRaON1EcgDM ME6W0tj55Pnp4tXMGrBpJSFbKuUEnnSZl/ZpCaH066q5d9c86gM8Hcwo3j/cHA4O3YaH AljA== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20150623.gappssmtp.com header.s=20150623 header.b=g3dqpiB3; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102c as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:from:to:subject:in-reply-to:references:date:message-id :mime-version:content-transfer-encoding:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=+vJU3Gp4p+LZrkmZdU6n3l+h6u0oT0xoztpCOqK731c=; b=R7cauQFy91Dl4Sq/aIrXK9OyUAEJXhD/h6w3YjVn5vNY8wdzg+XhJ4NPv0zofneCcC 8NO0uIckgGF2E7AFodJ72PtsDwpCvUwgjA6yUkaiPry1mbMgCMLsmTkqEK2sVQyHpq3S UOgZ+PfAX0SIN8mX1JvVa/rkbTLJ/b2YCVo3AYWm3H1i/UqA7QZmpIaUULj7srTgFcJj YznpA6aKt3Ocevupdbivgr4uKWSLa33d6sEaZPNMJ7wRsA+0gAURbL+jvvaW1RiH0zrJ hBeoPiNEFcf3rGF5FTkHGemwCm7gBhZCKpWMGSxIrPX+6fZzz6rtCg9r9sAqusJdLIcC am+g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:from:to:subject:in-reply-to:references :date:message-id:mime-version:content-transfer-encoding :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=+vJU3Gp4p+LZrkmZdU6n3l+h6u0oT0xoztpCOqK731c=; b=AA4xYQ0NCSw/QNnsd2Ib6CfWo9r4qWn9LjZDwKFJuEiQ0NS25LIls4sJYS2soMVyxe cFn95zujuh1VIQijwE/qwbroMvLx6lYNbytXaOYdMWYdnsIUQUXlMtw697X7CgCdLcoA lVv62veqAQV+hIVJVYQ4n0VPjE9aWOQPNTbH81dGwYsElQEJ95AeP57rajePj1bHPAX0 uiIFjIqo4QKucWg8DYpbA0Xl4iF443w1OiIofk3xU46prv4rNDAAsWlyovRTaTOMOlST 2BhU16otAl5UN010WvF0RTELXiUaxLA9XDoi3n4qZpfH/ulcpD7YOe7QbQi761Lg1NE+ Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AGi0PubWYyOHwKc1nCoSBeLP3JkuG65IuLt+cgPKXKqoRD/iK2dlAlQA /cSYdKhZkFqOaBHXT9T1lDw= X-Google-Smtp-Source: APiQypL8oBXqtuDCZpuOmJZA923al4FUbiTeRnhWYEkpst4nG3tHWdYvC8LhuGAsCU5O77tiK1spWA== X-Received: by 2002:a17:902:ed03:: with SMTP id b3mr25795972pld.247.1587571117706; Wed, 22 Apr 2020 08:58:37 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a63:f1a:: with SMTP id e26ls1414953pgl.10.gmail; Wed, 22 Apr 2020 08:58:34 -0700 (PDT) X-Received: by 2002:a63:460a:: with SMTP id t10mr27691220pga.105.1587571114633; Wed, 22 Apr 2020 08:58:34 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1587571114; cv=none; d=google.com; s=arc-20160816; b=aC2UmBngphHL3gLqLpan5aIjtdgDmXm+DS7hZsXgDqoMTx6cR8pofmRkW8FzpFtk6j cpHB16scyD8tkEA30B3chRasJ3Sv+NNgmV7aJOUKw3xxXqRtrdEDctQbHRp28oRDebPs pUtMptfudSTO6epC1iuRfognCMmrnKYdWlD9Ulpheu4WJg9zqBDBwiDW1nmLJx1cO/aY ClhSr1v9KdX1I+TcI1Xi3SyNMyxvQ+gBzA719L24N0kajD55EwjqO1ilrOvFEYnJvtgW rF434XYaD4tcVXMsikk7nJ8WO1O6HFYSBtzStPpJE8iRDWvAf2LW4IagpXy9XBc39cVJ UDhg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:mime-version:message-id:date:references :in-reply-to:subject:to:from:dkim-signature; bh=ZajiJFOObrJdVMkg7SiFJ3gRMMAnxbTIPTNn2a7y+AQ=; b=aDW4hby1MXBehD2xEAQ2qQsKPwb9mvDoI7FxLZmT2gLPt/XNUC9Ihj/6JNEijX4DdK q7fFau03Z6M63ofvEhI8/jFOrDjEbh71xISgR+N+HCFDT5dA3CRvDSGYgxL5J6SIcLnt YXZ3oCsuQeHYn48b78maq7/Qym1oysYqa4Y8TSptvgAq0yMQLUDbw+jMBcksXu00s/Zq cWrKPtbziXk6C6p5YQX0L1h5//fZdpU+AQ1EIVwNJbOp7642skqeu6rDO54eHSrOBbRr PbwWWEvSJ+tcC1EwqX4s7hNneEeOKIY7mEfsL8o2glbpRfMsHtSqGONcn0vACBqHqpzX rFZQ== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20150623.gappssmtp.com header.s=20150623 header.b=g3dqpiB3; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102c as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Original-Received: from mail-pj1-x102c.google.com (mail-pj1-x102c.google.com. [2607:f8b0:4864:20::102c]) by gmr-mx.google.com with ESMTPS id 138si339819pfa.6.2020.04.22.08.58.34 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 22 Apr 2020 08:58:34 -0700 (PDT) Received-SPF: pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102c as permitted sender) client-ip=2607:f8b0:4864:20::102c; Original-Received: by mail-pj1-x102c.google.com with SMTP id y6so1115499pjc.4 for ; Wed, 22 Apr 2020 08:58:34 -0700 (PDT) X-Received: by 2002:a17:90a:9295:: with SMTP id n21mr468666pjo.195.1587571114183; Wed, 22 Apr 2020 08:58:34 -0700 (PDT) Original-Received: from johnmacfarlane.net (li55-134.members.linode.com. [74.82.3.134]) by smtp.gmail.com with ESMTPSA id h27sm5608810pgb.90.2020.04.22.08.58.32 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 22 Apr 2020 08:58:33 -0700 (PDT) Original-Received: by johnmacfarlane.net (Postfix, from userid 1000) id 348BCA256; Wed, 22 Apr 2020 11:58:22 -0400 (EDT) In-Reply-To: X-Original-Sender: jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@berkeley-edu.20150623.gappssmtp.com header.s=20150623 header.b=g3dqpiB3; spf=pass (google.com: domain of jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org designates 2607:f8b0:4864:20::102c as permitted sender) smtp.mailfrom=jgm-TVLZxgkOlNX2fBVCVOL8/A@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:24952 Archived-At: What pandoc version are you running on the linux box? This works fine for me. Heck Lennon writes: > Since I had a Linux host available, I went around that issue with Windows= =20 > and shell expansion. > > pandoc -f html -t epub3 -o output.epub input.html > > > pandoc ran successfully (no error message), but the EPUB can't be opened = in=20 > a Windows GUI application that supports EPUB files ("Error loading=20 > file.epub"). Likewise, I can't open the file after changing its extension= =20 > from EPUB to ZIP. > > Here's the input files (HTML + PNGs): > > https://we.tl/t-5EeGXML1rb > > Do I need extra options in the command line? > > Le mercredi 22 avril 2020 11:55:49 UTC+2, Heck Lennon a =C3=A9crit : >> >> Thanks everyone for the infos! >> >> Le mercredi 22 avril 2020 01:25:21 UTC+2, Kolen Cheung a =C3=A9crit : >>> >>> A side note, since your goal is to convert from PDF to ePub, you probab= ly=20 >>> will have better results using other tools. Eg I know it can be convert= ed=20 >>> to docx, and then from docx to ePub. There may he tool that can help yo= u=20 >>> convert that directly too. Essentially for the tools you choose, you=E2= =80=99d want=20 >>> to choose one preserving most information. And since pandoc focuses man= y on=20 >>> the structure of the document, much other information would be lost. Th= e=20 >>> choice of tool also depends on which ones you=E2=80=99re comfortable wi= th, Eg the=20 >>> PDF to docx I mentioned probably can be done by Adobe Acrobat and MS Wo= rd.=20 >>> But they are proprietary and difficult to run from the command line.=20 >>> >>> In your case, since you have a tool preconverted them to html already,= =20 >>> html to ePub can be done better by some other engines (since the 2 are= =20 >>> closely related.) may be you can try Calibre which also have a cli. >> >> > > --=20 > You received this message because you are subscribed to the Google Groups= "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an= email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit https://groups.google.com/d/msgi= d/pandoc-discuss/b3218bbb-9846-4e52-b201-7e4a1b8b09d6%40googlegroups.com. --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/m2tv1bfr6q.fsf%40johnmacfarlane.net.