From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/24936 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: Heck Lennon Newsgroups: gmane.text.pandoc Subject: =?UTF-8?Q?Re:_HTML_=E2=86=92_EPUB:_Either_"Out_of_memory"_or_"open?= =?UTF-8?Q?BinaryFile:_invalid_argument_(Invalid_argument)"?= Date: Tue, 21 Apr 2020 03:52:43 -0700 (PDT) Message-ID: References: <65ccb50b-6595-450d-86ca-c8103867e3bf@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_2293_2083479178.1587466363226" Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="110096"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDPJHXO6WIOBB7NA7P2AKGQE5JTY6FQ-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Apr 21 12:52:50 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-oo1-f57.google.com ([209.85.161.57]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1jQqWG-000SW0-TJ for gtp-pandoc-discuss@m.gmane-mx.org; Tue, 21 Apr 2020 12:52:48 +0200 Original-Received: by mail-oo1-f57.google.com with SMTP id y41sf3553245ooi.16 for ; Tue, 21 Apr 2020 03:52:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=eN/1JZo+JB/TIqfFGp6/OILTQpxEBVqfate9UOoanaM=; b=kIelLhaiC4cF/kywvU0NEHT4jgwDlT3b7/R+KWYBg85rFqexJtxcpTM3Tr1M2Koiel KaBhM3wLFHrkr/WmaxftmDqeKlAXdTuYG/AWCz6Uy7Yz+6xt7ammt9t8i2wvyQ7Nkbu9 Z3LF+NF10HFwvI9XbIT7LWLCygmrvRzCDhXJQsMYXIn8Bwm6w2HP+S2iUBGREXEvCORG cYSmG97qNuVD96NN9RfTsIDZeKyO/l921/0oUHphlGKmPyxevb8pUj4Q0zTFhe/CRYgO vSIgH4NrT7Tz9tJrstJA4kj5Z2FeYuhp0d8TcSJoG4lUx5T6GammmKZG8qxh7UeNGskD RfNQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=eN/1JZo+JB/TIqfFGp6/OILTQpxEBVqfate9UOoanaM=; b=sVd7TaOxEyElGkhi8uGnnTP7j56n5Zv8IgP/tHKcbvN0b9Dt3B4T7mbjj/pvSRjah1 Uq5gpBGI1tmi3ddhdERiH0qJKmIZ/rooUbs2Yw5CV7GVYJp0TzhcO3byD7gjaM/GgH+P yFPw1uz9DXAqZqypI14ckcacWEVKUd3Mdy17fG15dxb1udI9+fX+HSd83d1wzNKAOZLy G2PH+I/ioycAdaRQuBCFQ/9urfxcowzmEnAIcX/1wfhghg98vpXWtl/XVFI/S3KB7ldR uEjDpQF+zV0JHyr6AkN9hhHkihXskjMzkUzT7EjbWrZT9KOkQ08rjEBuYo5wA3Z2/CxJ S/+w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:in-reply-to :references:subject:mime-version:x-original-sender:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=eN/1JZo+JB/TIqfFGp6/OILTQpxEBVqfate9UOoanaM=; b=fdgSaONOwIqW2bpM8o/au05psJ3BTsr/sVZUMY/Frv75hoL22+FY9VhHXTWABFSVrH SXo1DlgA0Jz2DyX43ys2tkee4ETHATzKwtB71xuKU9q3kiAvthRS10B7fRkQT6IBkAx2 16srpmht+84macpgPKgcDetHUc3OyQ8l3bG4Wkpv952z+OuMsqdY+2QfTFAIwHjnlVzy AtnnD1WxRqf050jbL86Ru8KvAiCKOgQq5WD+/G3gbZ90BAWwasHFdZiRbtXaAs9AUeLO xTugO8fBczb17KJsNjScL09xacP4Ek6v2kQKQ3tWKKtv+RS3drVgO1gHmOG0J0zlZy25 EMNg== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AGi0PubyBrqtWzZfEX8YvNDYGJYoABuDeuZObxFFny1lEBQ75UhoDFcU zqyGAS9bv+neLdZWDbeRj1I= X-Google-Smtp-Source: APiQypLxDlCWueuxvcDixK9rV4yl0rU2yXMrC24WBKvFe/uCcxGruDro1Aou5heXRC9DpeqYQUa0TQ== X-Received: by 2002:aca:5014:: with SMTP id e20mr2762973oib.34.1587466367859; Tue, 21 Apr 2020 03:52:47 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:aca:3104:: with SMTP id x4ls3379539oix.0.gmail; Tue, 21 Apr 2020 03:52:45 -0700 (PDT) X-Received: by 2002:aca:aa8c:: with SMTP id t134mr2785915oie.103.1587466363832; Tue, 21 Apr 2020 03:52:43 -0700 (PDT) In-Reply-To: <65ccb50b-6595-450d-86ca-c8103867e3bf-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org> X-Original-Sender: frdtheman-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:24936 Archived-At: ------=_Part_2293_2083479178.1587466363226 Content-Type: multipart/alternative; boundary="----=_Part_2294_823355805.1587466363227" ------=_Part_2294_823355805.1587466363227 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Per this thread=E2=80=A6 https://groups.google.com/d/msg/pandoc-discuss/eMfCGU3Gn8E/bEhyLpUYBAAJ =E2=80=A6 I named the batch file pandoc.cmd, and re-ran the command thusly: echo output.epub | pandoc *.html - It runs for a few minutes, and ends with displaying some HTML=E2=80=A6 but = no .epub=20 can be found. I assume I'm not using the command correctly. Can pandoc use the standard= =20 input? Le mardi 21 avril 2020 12:10:30 UTC+2, Heck Lennon a =C3=A9crit : > > It's Windows (7, 32 bits) and pandoc 2.9.2.1. > > Le mardi 21 avril 2020 07:40:45 UTC+2, John MacFarlane a =C3=A9crit : >> >> >> That's extremely strange. Your shell should be expanding the *=20 >> in *.html before it even gets to pandoc. So if pandoc can see=20 >> the *, your shell hasn't done what it's supposed to.=20 >> >> What OS are you using, and what version of pandoc?=20 >> >> Heck Lennon writes:=20 >> >> > Hello=20 >> >=20 >> >=20 >> > On Windows (7, 32 bits), I'm trying to convert a ~450 page PDF into=20 >> EPUB.=20 >> >=20 >> >=20 >> > 1. I used "mutool draw" to convert the PDF into a single, ~10MB HTML:= =20 >> >=20 >> >=20 >> > pandoc -f html -t epub3 -o output.epub input.html=20 >> >=20 >> > (~10mn wait on my sluggish computer)=20 >> >=20 >> > "Out of memory":=20 >> >=20 >> >=20 >> > 2. Next, I reran "mutool draw" to convert the PDF as one page =3D one= =20 >> HTML=20 >> > page:=20 >> >=20 >> >=20 >> > pandoc -o output.epub *.html=20 >> >=20 >> > pandoc: *.html: openBinaryFile: invalid argument (Invalid argument)=20 >> >=20 >> >=20 >> > 3.Finally, I used pandoc to concatenate all the HTML files, but still= =20 >> got a=20 >> > "openBinaryFile: invalid argument (Invalid argument)".=20 >> >=20 >> >=20 >> > pandoc *.html > full.html=20 >> >=20 >> > pandoc: *.html: openBinaryFile: invalid argument (Invalid argument)=20 >> >=20 >> >=20 >> > What do you suggest I try?=20 >> >=20 >> >=20 >> > Thank you.=20 >> >=20 >> > --=20 >> > You received this message because you are subscribed to the Google=20 >> Groups "pandoc-discuss" group.=20 >> > To unsubscribe from this group and stop receiving emails from it, send= =20 >> an email to pandoc-...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org=20 >> > To view this discussion on the web visit=20 >> https://groups.google.com/d/msgid/pandoc-discuss/cfd086c1-9fe5-41bd-b735= -3cd8db7579d9%40googlegroups.com.=20 >> >> > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/f11a136c-0f32-4a59-b7cf-4aab865e1d68%40googlegroups.com. ------=_Part_2294_823355805.1587466363227 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Per this thread=E2=80=A6
https://groups.google.com/d/m= sg/pandoc-discuss/eMfCGU3Gn8E/bEhyLpUYBAAJ
=E2=80=A6 I named = the batch file pandoc.cmd, and re-ran the command thusly:

echo output.epub | pandoc *.html -

It runs= for a few minutes, and ends with displaying some HTML=E2=80=A6 but no .epu= b can be found.

I assume I'm not using the com= mand correctly. Can pandoc use the standard input?


Le mar= di 21 avril 2020 12:10:30 UTC+2, Heck Lennon a =C3=A9crit=C2=A0:
It's Windows (7, 32 = bits) and pandoc=C2=A02.9.2.1.

Le mardi 21 avril 2020 07:40:45 UTC+2= , John MacFarlane a =C3=A9crit=C2=A0:
frdt...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:

> Hello
>
>
> On Windows (7, 32 bits), I'm trying to convert a ~450 page PDF= into EPUB.
>
>
> 1. I used "mutool draw" to convert the PDF into a single= , ~10MB HTML:
>
>
> pandoc -f html -t epub3 -o output.epub input.html
>
> (~10mn wait on my sluggish computer)
>
> "Out of memory":
>
>
> 2. Next, I reran "mutool draw" to convert the PDF as one= page =3D one HTML=20
> page:
>
>
> pandoc -o output.epub =C2=A0*.html
>
> pandoc: *.html: openBinaryFile: invalid argument (Invalid argument= )=20
>
>
> 3.Finally, I used pandoc to concatenate all the HTML files, but st= ill got a=20
> "openBinaryFile: invalid argument (Invalid argument)".
>
>
> pandoc *.html > full.html
>
> pandoc: *.html: openBinaryFile: invalid argument (Invalid argument= )
>
>
> What do you suggest I try?
>
>
> Thank you.
>
> --=20
> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, = send an email to pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
> To view this discussion on the web visit https://groups.= google.com/d/msgid/pandoc-discuss/cfd086c1-9fe5-41bd-b735-3cd8db7= 579d9%40googlegroups.com.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/f11a136c-0f32-4a59-b7cf-4aab865e1d68%40googlegroups.co= m.
------=_Part_2294_823355805.1587466363227-- ------=_Part_2293_2083479178.1587466363226--