From: Heck Lennon
Newsgroups: gmane.text.pandoc
To: pandoc-discuss
Subject: Re: HTML → EPUB: Either "Out of memory" or "openBinaryFile: invalid argument (Invalid argument)"
Date: Wed, 22 Apr 2020 02:55:49 -0700 (PDT)
Message-ID: <14c0eaf0-b920-477c-a735-dded7f1df0c5@googlegroups.com>
References: <879425ff-d491-4d0b-8ffe-db24ad9cce23@googlegroups.com>

Thanks everyone for the info!

On Wednesday, 22 April 2020 at 01:25:21 UTC+2, Kolen Cheung wrote:
>
> A side note: since your goal is to convert from PDF to ePub, you will
> probably get better results using other tools. E.g., I know a PDF can be
> converted to docx, and then from docx to ePub. There may be a tool that
> can convert it directly, too. Essentially, you'd want to choose the tool
> that preserves the most information. And since pandoc focuses mainly on
> the structure of the document, much other information would be lost. The
> choice of tool also depends on which ones you're comfortable with; e.g.,
> the PDF-to-docx conversion I mentioned can probably be done by Adobe
> Acrobat and MS Word, but they are proprietary and difficult to run from
> the command line.
>
> In your case, since you already have a tool that preconverted them to
> HTML, HTML to ePub can be done better by some other engines (since the
> two are closely related). Maybe you can try Calibre, which also has a CLI.

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/14c0eaf0-b920-477c-a735-dded7f1df0c5%40googlegroups.com.