public inbox archive for pandoc-discuss@googlegroups.com
 help / color / mirror / Atom feed
* Need pandoc to create a latex file from some html files
@ 2023-11-27  0:12 almaghfuur lahu
       [not found] ` <aaec9ee1-c839-4dd6-bd56-c41657991b6en-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 4+ messages in thread
From: almaghfuur lahu @ 2023-11-27  0:12 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 1308 bytes --]



How do we create a new PDF file by having pandoc to create a latex file 
from some html files under one directory merely, i.e. not recursively, as 
it's just acquired from a server directory to display a rather simple, 
small reference documentation?
$ ls
cover.png  errata.html  index.html  manual.html


index.html is actually containing table of contents, each line of which is 
a link to a location in the file manual.html, and also some words index , 
each line of which is a link to a location in manual.html

It has line:

<IMG SRC="cover.png" ALT="" TITLE="click to buy the book" BORDER=1 
ALIGN="left" HSPACE=12>

manual.html is the content or body of the reference manual

errata.html is the catch up of corrections needed to care in this reference 
manual
How to accomplish to obtain its one analogous latex file from which its PDF 
file can be created the best, most efficient way ?

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com.

[-- Attachment #1.2: Type: text/html, Size: 8496 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Need pandoc to create a latex file, from some html files
       [not found] ` <aaec9ee1-c839-4dd6-bd56-c41657991b6en-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2023-11-27  8:35   ` Bastien DUMONT
  2023-11-28  2:37     ` almaghfuur lahu
  2023-11-28 18:24   ` Need pandoc to create a latex file " BP Jonsson
  1 sibling, 1 reply; 4+ messages in thread
From: Bastien DUMONT @ 2023-11-27  8:35 UTC (permalink / raw)
  To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw

Did you try the basic command?

  pandoc index.html manual.html errata.html -o doc.pdf

More advanced options are documented in the manual.

Le Sunday 26 November 2023 à 04:12:58PM, almaghfuur lahu a écrit :
> How do we create a new PDF file by having pandoc to create a latex file from
> some html files under one directory merely, i.e. not recursively, as it's just
> acquired from a server directory to display a rather simple, small reference
> documentation?
> 
> $ ls
> cover.png  errata.html  index.html  manual.html
> 
> 
> index.html is actually containing table of contents, each line of which is a
> link to a location in the file manual.html, and also some words index , each
> line of which is a link to a location in manual.html
> 
> It has line:
> 
> <IMG SRC="cover.png" ALT="" TITLE="click to buy the book" BORDER=1 ALIGN="left"
> HSPACE=12>
> 
> manual.html is the content or body of the reference manual
> 
> errata.html is the catch up of corrections needed to care in this reference
> manual
> 
> How to accomplish to obtain its one analogous latex file from which its PDF
> file can be created the best, most efficient way ?
> 
> --
> You received this message because you are subscribed to the Google Groups
> "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send an email
> to [1]pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To view this discussion on the web visit [2]https://groups.google.com/d/msgid/
> pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com.
> 
> References:
> 
> [1] mailto:pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
> [2] https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com?utm_medium=email&utm_source=footer

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/ZWRU1LFk3ixeWoQg%40localhost.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Need pandoc to create a latex file, from some html files
  2023-11-27  8:35   ` Need pandoc to create a latex file, " Bastien DUMONT
@ 2023-11-28  2:37     ` almaghfuur lahu
  0 siblings, 0 replies; 4+ messages in thread
From: almaghfuur lahu @ 2023-11-28  2:37 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 2793 bytes --]

Result is they're all 3 merged by having static i.e. absolute constant 
links as it was
It must be changed to drop the filename so all become html internal links  

Any knowledge on that pandoc has ability/option to automatically adjust 
these such links string so to be compiled by latex immediately correct ?
On Monday, 27 November 2023 at 15:35:50 UTC+7 Bastien DUMONT wrote:

> Did you try the basic command?
>
> pandoc index.html manual.html errata.html -o doc.pdf
>
> More advanced options are documented in the manual.
>
> Le Sunday 26 November 2023 à 04:12:58PM, almaghfuur lahu a écrit :
> > How do we create a new PDF file by having pandoc to create a latex file 
> from
> > some html files under one directory merely, i.e. not recursively, as 
> it's just
> > acquired from a server directory to display a rather simple, small 
> reference
> > documentation?
> > 
> > $ ls
> > cover.png  errata.html  index.html  manual.html
> > 
> > 
> > index.html is actually containing table of contents, each line of which 
> is a
> > link to a location in the file manual.html, and also some words index , 
> each
> > line of which is a link to a location in manual.html
> > 
> > It has line:
> > 
> > <IMG SRC="cover.png" ALT="" TITLE="click to buy the book" BORDER=1 
> ALIGN="left"
> > HSPACE=12>
> > 
> > manual.html is the content or body of the reference manual
> > 
> > errata.html is the catch up of corrections needed to care in this 
> reference
> > manual
> > 
> > How to accomplish to obtain its one analogous latex file from which its 
> PDF
> > file can be created the best, most efficient way ?
> > 
> > --
> > You received this message because you are subscribed to the Google Groups
> > "pandoc-discuss" group.
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email
> > to [1]pandoc-discus...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> > To view this discussion on the web visit [2]
> https://groups.google.com/d/msgid/
> > pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com.
> > 
> > References:
> > 
> > [1] mailto:pandoc-discus...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org
> > [2] 
> https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com?utm_medium=email&utm_source=footer
>
>

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/186e2481-33ca-471a-b4e4-54e6ac6ec926n%40googlegroups.com.

[-- Attachment #1.2: Type: text/html, Size: 4584 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Need pandoc to create a latex file from some html files
       [not found] ` <aaec9ee1-c839-4dd6-bd56-c41657991b6en-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  2023-11-27  8:35   ` Need pandoc to create a latex file, " Bastien DUMONT
@ 2023-11-28 18:24   ` BP Jonsson
  1 sibling, 0 replies; 4+ messages in thread
From: BP Jonsson @ 2023-11-28 18:24 UTC (permalink / raw)
  To: pandoc-discuss

[-- Attachment #1: Type: text/plain, Size: 2792 bytes --]

You should be good just doing

pandoc index.html manual.html errata.html -o manual.pdf

or -o manual.tex (or manual.ltx if you prefer) if you want the LaTeX file.
Pandoc should be able to guess the formats from the extensions.

One caveat though: if the three HTML files contain external links to each
other you probably want to convert them to internal links. It could be done
with a filter but the easiest way is probably to convert to an intermediate
markdown file and polish that in a text editor, then convert that markdown
file to LaTeX/HTML. It also has the advantage that you can inspect what
Pandoc makes of the HTML.

Also it is probably better to let LaTeX (re)build the table of contents.



Den mån 27 nov. 2023 01:13almaghfuur lahu <budikusasi-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> skrev:

> How do we create a new PDF file by having pandoc to create a latex file
> from some html files under one directory merely, i.e. not recursively, as
> it's just acquired from a server directory to display a rather simple,
> small reference documentation?
> $ ls
> cover.png  errata.html  index.html  manual.html
>
>
> index.html is actually containing table of contents, each line of which
> is a link to a location in the file manual.html, and also some words index
> , each line of which is a link to a location in manual.html
>
> It has line:
>
> <IMG SRC="cover.png" ALT="" TITLE="click to buy the book" BORDER=1
> ALIGN="left" HSPACE=12>
>
> manual.html is the content or body of the reference manual
>
> errata.html is the catch up of corrections needed to care in this
> reference manual
> How to accomplish to obtain its one analogous latex file from which its
> PDF file can be created the best, most efficient way ?
>
> --
> You received this message because you are subscribed to the Google Groups
> "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com
> <https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c839-4dd6-bd56-c41657991b6en%40googlegroups.com?utm_medium=email&utm_source=footer>
> .
>

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/CAFC_yuRTCLE6Ja85ELO4rUXz1q5aUtO0MBTPXwbXyiVOPTy0Ew%40mail.gmail.com.

[-- Attachment #2: Type: text/html, Size: 10046 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2023-11-28 18:24 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2023-11-27  0:12 Need pandoc to create a latex file from some html files almaghfuur lahu
     [not found] ` <aaec9ee1-c839-4dd6-bd56-c41657991b6en-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2023-11-27  8:35   ` Need pandoc to create a latex file, " Bastien DUMONT
2023-11-28  2:37     ` almaghfuur lahu
2023-11-28 18:24   ` Need pandoc to create a latex file " BP Jonsson

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).