EPUB to Markdown, how to extract images?

public inbox archive for pandoc-discuss@googlegroups.com
 help / color / mirror / Atom feed

* EPUB to Markdown, how to extract images?
@ 2018-06-25 18:08 CR
       [not found] ` <c4a62d93-17f0-4d16-a4aa-f50967b28ec1-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 3+ messages in thread
From: CR @ 2018-06-25 18:08 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 1106 bytes --]

I'm using this command line to convert an EPUB file to Markdown. 

pandoc --data-dir=data --read=EPUB --write=markdown_mmd --atx-headers --top-
level-division=chapter --output=myfile.md myepub.epub


It converts the text to a single Markdown file, complete with linked table 
of contents, which is fine. The Markdown file points to lots of images in a 
subdir called "images", but no images were actually extracted from the 
EPUB. 

How do I actually extract the images from the EPUB when I convert the EPUB 
to markdown?

Thank you!

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/c4a62d93-17f0-4d16-a4aa-f50967b28ec1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[-- Attachment #1.2: Type: text/html, Size: 4029 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: EPUB to Markdown, how to extract images?
       [not found] ` <c4a62d93-17f0-4d16-a4aa-f50967b28ec1-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2018-06-25 19:53   ` John MacFarlane
       [not found]     ` <871scuprr7.fsf-pgq/RBwaQ+zq8tPRBa0AtqxOck334EZe@public.gmane.org>
  0 siblings, 1 reply; 3+ messages in thread
From: John MacFarlane @ 2018-06-25 19:53 UTC (permalink / raw)
  To: CR, pandoc-discuss


Use --extract-media=dir

where dir is a directory of your choosing.

CR <chuckr69-Wuw85uim5zDR7s880joybQ@public.gmane.org> writes:

> I'm using this command line to convert an EPUB file to Markdown. 
>
> pandoc --data-dir=data --read=EPUB --write=markdown_mmd --atx-headers --top-
> level-division=chapter --output=myfile.md myepub.epub
>
>
> It converts the text to a single Markdown file, complete with linked table 
> of contents, which is fine. The Markdown file points to lots of images in a 
> subdir called "images", but no images were actually extracted from the 
> EPUB. 
>
> How do I actually extract the images from the EPUB when I convert the EPUB 
> to markdown?
>
> Thank you!
>
> -- 
> You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/c4a62d93-17f0-4d16-a4aa-f50967b28ec1%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: EPUB to Markdown, how to extract images?
       [not found]     ` <871scuprr7.fsf-pgq/RBwaQ+zq8tPRBa0AtqxOck334EZe@public.gmane.org>
@ 2018-06-26 12:51       ` CR
  0 siblings, 0 replies; 3+ messages in thread
From: CR @ 2018-06-26 12:51 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 637 bytes --]

Thank you! For this book, images are ending up in media\images\.

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/e06a6b72-cb03-4173-9b36-9ce64c69b3bf%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[-- Attachment #1.2: Type: text/html, Size: 1057 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-06-26 12:51 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-06-25 18:08 EPUB to Markdown, how to extract images? CR
     [not found] ` <c4a62d93-17f0-4d16-a4aa-f50967b28ec1-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2018-06-25 19:53   ` John MacFarlane
     [not found]     ` <871scuprr7.fsf-pgq/RBwaQ+zq8tPRBa0AtqxOck334EZe@public.gmane.org>
2018-06-26 12:51       ` CR

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).