From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/24934 Path: news.gmane.io!.POSTED.ciao.gmane.io!not-for-mail From: Heck Lennon Newsgroups: gmane.text.pandoc Subject: =?UTF-8?Q?Re:_HTML_=E2=86=92_EPUB:_Either_"Out_of_memory"_or_"open?= =?UTF-8?Q?BinaryFile:_invalid_argument_(Invalid_argument)"?= Date: Tue, 21 Apr 2020 03:10:30 -0700 (PDT) Message-ID: <65ccb50b-6595-450d-86ca-c8103867e3bf@googlegroups.com> References: Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_2406_359362246.1587463830806" Injection-Info: ciao.gmane.io; posting-host="ciao.gmane.io:159.69.161.202"; logging-data="57384"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDPJHXO6WIOBBF4N7P2AKGQEMOGSSIA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Apr 21 12:10:36 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-oo1-f62.google.com ([209.85.161.62]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1jQprO-000End-Ov for gtp-pandoc-discuss@m.gmane-mx.org; Tue, 21 Apr 2020 12:10:34 +0200 Original-Received: by mail-oo1-f62.google.com with SMTP id w8sf3365231oov.0 for ; Tue, 21 Apr 2020 03:10:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=SAuSu9HgII+6V8OoCfvhib8DfBUNS0+BuZcerYnqzMk=; b=GRxVEA4gOpDeHUEhkuZ/JOBABLgJ1sPDAk0kYK39122P8np+bQ+uiPS5DzKCwqLUXK OpLROxQvPG7eNoD89IXrSUeaFUz/YRafbUWDRTWNX2wGHZimG8YMOVkyp557zSiWiZ1C sAR8lHW4zE6twyRizzEqjHjdM/u326U5lL6ze888LbW/zY8v6JdsS1G1lUA0JlPfhCs3 p6UfBtq6pEXpyTR05zcYHUS0hcuLBrHB4A452rGbuGs3OOf3Rv5BjWAtKllAOwzZ/fYI Qr//yzLsEcIgamh6ugoU2B1tFacmrrE/Y4nBTUJWTDmKJz3MmptimZEhFBwivXj/P0Z4 KtBg== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=SAuSu9HgII+6V8OoCfvhib8DfBUNS0+BuZcerYnqzMk=; b=udKdYQ87fhJtTHqn8rIFyKAwsa0j0VwOKTHmI5By0/hMq1vNE4Oj0BfXM5Q+bRT8UO cvt4sMEzXAo84Mz7+Pce6fItYe8+0RmT60XkDgXJFn8zTpNJx3uZLKOJRbnkWGs4HdcL Ym+qm0F/admEXOG1n9O9BFOc0XWymhDYXwp0Gi+8G7/4a3JyBbKpEFGq4BRpJqHx3Kqg 1xs93ahn4Reo/09EoCK5w0IVnLTjbrS6jaMUdinJ5/A2JJ2oLIn+rjJhuNY2CLcyv85e lJpIw2pU5xg4licqZnkGbbtHzTzKUN2yunsI45XbtOlF2j8LSvr6+EFmzOy3LQM8BoYC +C9w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:in-reply-to :references:subject:mime-version:x-original-sender:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=SAuSu9HgII+6V8OoCfvhib8DfBUNS0+BuZcerYnqzMk=; b=VTvkRXRuEqm07+fiKgcA6XVwVmIEMOGHjrRQbf23lqlEbX0HSCxb8SE88MXCKfdfZc A0dPh9zIYgAhTiYhU9E9x6Z7f9n0yrXcS5HJr2ACpe7tajWXnKJxJa7v+GJgVX/EUvLp PMVVrSHC5fYMrPRL0lRBYH3xXyecrGKHuZUyzk7/sMnQHYvg6pgUUzBaWqbwv/CizyxX Kzm21vbqhJxINTKztnImbTb6hTPcsgjFCY1Un14Z/O6XaK0bg8CHAmWzsACdtqulbjS5 5I6yMhZaewOdiTI+3Z5NiGRBNDPyuM+kxI9FK77hmFUJ620QT7F4pcN/C/sLVyfrqPdr T3qQ== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AGi0Pua2gLxe11BxSZZ4XZtI5B89Jd8ohYhZkLLcBv7pLf6xn35/gWuc /iprEhdfuI5IxGVD2aMLuEo= X-Google-Smtp-Source: APiQypLbvZlWOfE/Pcmgkw0Ep59kX1O56HKIsiWPVjDUizfEjUqyjEz1ekoeckIYa3gxFgOZ7mnHOw== X-Received: by 2002:a9d:5505:: with SMTP id l5mr13362222oth.29.1587463833833; Tue, 21 Apr 2020 03:10:33 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a4a:b4c7:: with SMTP id g7ls878822ooo.3.gmail; Tue, 21 Apr 2020 03:10:31 -0700 (PDT) X-Received: by 2002:a4a:accf:: with SMTP id c15mr15794814oon.29.1587463831451; Tue, 21 Apr 2020 03:10:31 -0700 (PDT) In-Reply-To: X-Original-Sender: frdtheman-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:24934 Archived-At: ------=_Part_2406_359362246.1587463830806 Content-Type: multipart/alternative; boundary="----=_Part_2407_608433999.1587463830807" ------=_Part_2407_608433999.1587463830807 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable It's Windows (7, 32 bits) and pandoc 2.9.2.1. Le mardi 21 avril 2020 07:40:45 UTC+2, John MacFarlane a =C3=A9crit : > > > That's extremely strange. Your shell should be expanding the *=20 > in *.html before it even gets to pandoc. So if pandoc can see=20 > the *, your shell hasn't done what it's supposed to.=20 > > What OS are you using, and what version of pandoc?=20 > > Heck Lennon > writes:=20 > > > Hello=20 > >=20 > >=20 > > On Windows (7, 32 bits), I'm trying to convert a ~450 page PDF into=20 > EPUB.=20 > >=20 > >=20 > > 1. I used "mutool draw" to convert the PDF into a single, ~10MB HTML:= =20 > >=20 > >=20 > > pandoc -f html -t epub3 -o output.epub input.html=20 > >=20 > > (~10mn wait on my sluggish computer)=20 > >=20 > > "Out of memory":=20 > >=20 > >=20 > > 2. Next, I reran "mutool draw" to convert the PDF as one page =3D one H= TML=20 > > page:=20 > >=20 > >=20 > > pandoc -o output.epub *.html=20 > >=20 > > pandoc: *.html: openBinaryFile: invalid argument (Invalid argument)=20 > >=20 > >=20 > > 3.Finally, I used pandoc to concatenate all the HTML files, but still= =20 > got a=20 > > "openBinaryFile: invalid argument (Invalid argument)".=20 > >=20 > >=20 > > pandoc *.html > full.html=20 > >=20 > > pandoc: *.html: openBinaryFile: invalid argument (Invalid argument)=20 > >=20 > >=20 > > What do you suggest I try?=20 > >=20 > >=20 > > Thank you.=20 > >=20 > > --=20 > > You received this message because you are subscribed to the Google=20 > Groups "pandoc-discuss" group.=20 > > To unsubscribe from this group and stop receiving emails from it, send= =20 > an email to pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org .=20 > > To view this discussion on the web visit=20 > https://groups.google.com/d/msgid/pandoc-discuss/cfd086c1-9fe5-41bd-b735-= 3cd8db7579d9%40googlegroups.com.=20 > > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/65ccb50b-6595-450d-86ca-c8103867e3bf%40googlegroups.com. ------=_Part_2407_608433999.1587463830807 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
It's Windows (7, 32 bits) and pandoc=C2=A02.9.2.1.
=
Le mardi 21 avril 2020 07:40:45 UTC+2, John MacFarlane a =C3=A9crit=C2= =A0:

That's extremely strange. =C2=A0Your shell should be expanding the = *
in *.html before it even gets to pandoc. =C2=A0So if pandoc can see
the *, your shell hasn't done what it's supposed to.

What OS are you using, and what version of pandoc?

Heck Lennon <frdt...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:

> Hello
>
>
> On Windows (7, 32 bits), I'm trying to convert a ~450 page PDF= into EPUB.
>
>
> 1. I used "mutool draw" to convert the PDF into a single= , ~10MB HTML:
>
>
> pandoc -f html -t epub3 -o output.epub input.html
>
> (~10mn wait on my sluggish computer)
>
> "Out of memory":
>
>
> 2. Next, I reran "mutool draw" to convert the PDF as one= page =3D one HTML=20
> page:
>
>
> pandoc -o output.epub =C2=A0*.html
>
> pandoc: *.html: openBinaryFile: invalid argument (Invalid argument= )=20
>
>
> 3.Finally, I used pandoc to concatenate all the HTML files, but st= ill got a=20
> "openBinaryFile: invalid argument (Invalid argument)".
>
>
> pandoc *.html > full.html
>
> pandoc: *.html: openBinaryFile: invalid argument (Invalid argument= )
>
>
> What do you suggest I try?
>
>
> Thank you.
>
> --=20
> You received this message because you are subscribed to the Google= Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, = send an email to pandoc-...@googlegroups.com.
> To view this discussion on the web visit https://groups.= google.com/d/msgid/pandoc-discuss/cfd086c1-9fe5-41bd-b735-3cd8db7= 579d9%40googlegroups.com.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/65ccb50b-6595-450d-86ca-c8103867e3bf%40googlegroups.co= m.
------=_Part_2407_608433999.1587463830807-- ------=_Part_2406_359362246.1587463830806--