From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/33414 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: BP Jonsson Newsgroups: gmane.text.pandoc Subject: Re: Need pandoc to create a latex file from some html files Date: Tue, 28 Nov 2023 19:24:19 +0100 Message-ID: References: Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/alternative; boundary="0000000000007a3f5f060b3a87e6" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="32522"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDIY76M674FRBX7ATCVQMGQEKLV5H4Y-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Nov 28 19:24:37 2023 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-wr1-f63.google.com ([209.85.221.63]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1r82lA-0008E5-Pc for gtp-pandoc-discuss@m.gmane-mx.org; Tue, 28 Nov 2023 19:24:37 +0100 Original-Received: by mail-wr1-f63.google.com with SMTP id ffacd0b85a97d-33308815448sf1194379f8f.3 for ; Tue, 28 Nov 2023 10:24:36 -0800 (PST) ARC-Seal: i=2; a=rsa-sha256; t=1701195876; cv=pass; d=google.com; s=arc-20160816; b=UcpoSun9QchuSJfhnDCW8JFTmgSwAX2RZ15tWmVK7hvFZJXfLmyR3Ks347P6TjvMmy tj/xol0NHxxMki92Nh6xZeHq/SdYlqhrguK/QINalT9Z6V5yldRpZ5lAr3awVbfL2h6m jObm7fP5So0MtPnjurIw1gi2A3sHRqmchgip1fAH5i1rAl99bcz5Y1q7tKHyeDg5a/Qq YsAjdXpZUMyp8q24CqrrkP+Dp6igwomJwDhvinDW3WyGji+lO6+Uf0HKERG/CWnq6Jt5 owZppfNkgEs7T1t6oChDmLBK9uOBFyJyXDD8ZAHOVTDYJdgJi76z5wQAWqZ6Hymt0SRq zWLw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:to:subject:message-id:date :from:in-reply-to:references:mime-version:sender:dkim-signature :dkim-signature; bh=n1PzoJDoSyPV+gxaaYUTfTFlnonbLgk8KOqkNnr6OZU=; fh=4cPfTtzleA5nPUC1EQtk197aIUeaT1ew1v/oILbLT2I=; b=QwCyyPKnsFUqqB7SHid1KAaV6yVznulBmW4rEZrII0ajkncsz2kgRBe1oja+El6W6D d2EHWe9aVX5tNiuveuQCKim+Y0Y5Wr9XVuSwW4CMQHWb2ZjBB/MzmycKjepbW+NGHoJX qVQXQjihgcK/wXq1mVbm+JKdnBgeBkg2YOroeka+WeJf3xlsLvQlZPNE5h88E6g6tQkj 7k0TWTYaC8QFkgvpC6GKE9sRMHLflmzLmOea54E8qF/wQbPuHoKo/p3erndrp8iLZ64x nklzFfC9q6lfi+qofq+T5uWVVmGgYc78vwNeR1Fhzx94lAGihVgkZNn71rsnJPYbFyGw BBPA== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=PzwC9iBR; spf=pass (google.com: domain of bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::12a as permitted sender) smtp.mailfrom=bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20230601; t=1701195876; x=1701800676; darn=m.gmane-mx.org; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:to:subject :message-id:date:from:in-reply-to:references:mime-version:sender :from:to:cc:subject:date:message-id:reply-to; bh=n1PzoJDoSyPV+gxaaYUTfTFlnonbLgk8KOqkNnr6OZU=; b=sk4gMTLtf8S6/4Bpj/c7MVXc+kN89ae178WeCYf5A/gEYrGFOAIC2u80eX30llBzhB 0QoJk72GZU79O3jTYczKCfcLT2nE3iBSWlVmuHNEJq3TD/S7Gd8nn5hB4JuocKQuNiyf Qde+w0uoDo5cMrNRfslIpVoRQQnvblETyvJpWcjmw7pRJMdsCeXu3AoZAAHKSe8dviMt JMiLUFRr8BSwBg0mUXYfJ7mYnqtVJsjQQKp6ZDrZnJ7RsOUSqpJrWnNwa0qVXNuzXfUp 6XI5NGRbrCApSDDLp2Z2/sbo3gNskcQlAZ/HE2p6W7RjyGQO5sGR DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1701195876; x=1701800676; darn=m.gmane-mx.org; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:to:subject :message-id:date:from:in-reply-to:references:mime-version:from:to:cc :subject:date:message-id:reply-to; bh=n1PzoJDoSyPV+gxaaYUTfTFlnonbLgk8KOqkNnr6OZU=; b=mXVnSckAdSIN0PXPxf42010FuZ/61lO+NwedL+JWY2XvngpyIXULdbYd2jhdgV7suq tfPQwivWlEpW31Kq4uAQwIb7Z84HZJsE0qr0fuKjQh18OcsDeO45mobHtMbfRlMGrjaH H3rFH5w7eiOTg+akivgF/8BzomlSXgRkv3pp6X40C/ZonG/7kQe1QBXYWo/rIPKCDwNw 52yL/RieW/WwZipd4YaI9y0Jh4Xdt0vDHLO8dnta16lz7prxgn3d4KTfVvmsm6oNoVcR 8GpBOtwe3mbmQeCDr9ci8hAqQV2n3FyypophOMW2DwCx9yemIDvnE/0bLGS/ga0uOP X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1701195876; x=1701800676; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :x-spam-checked-in-group:list-id:mailing-list:precedence:reply-to :x-original-authentication-results:x-original-sender:to:subject :message-id:date:from:in-reply-to:references:mime-version :x-beenthere:x-gm-message-state:sender:from:to:cc:subject:date :message-id:reply-to; bh=n1PzoJDoSyPV+gxaaYUTfTFlnonbLgk8KOqkNnr6OZU=; b=H/ySwPHvCoHzKjMZZH4HKEK1M6t/nut5JzXUAr5bw3KtcFhZ3E6gP0+mDwCwQVjYyw 43W7ppRaDtlR/Lb14IBgHDa9DvtAoRFlQMpPbhUXcILYZuWCrh3kSFe0IvCOVHFKrDBK GV0501kJtdIr9p9GovyshC3su0GFrfajZVJ6yhUzxRTJZZTf0FEfTzrpui7sU6qOH7d7 nAvd2/QrFznyvVrMd0yEykVLiSEt/QZTA12UqVabS4+hfLxFdF11PEcbmie0ZuQsZ/Jz W1+XsY Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOJu0YwOdmeiQvC37UOYrszaDuaik1ZB1MvcCv8pkClaWFGcfQ8i1Cf5 lrcsCAJhAmvK30ViKDxlZ4Q= X-Google-Smtp-Source: AGHT+IH6CIp2zJN2xdktL72LD1lW/NAF8CPU+/XNUo0en8Efc6pQHpZ5JCuTGQAXORJqpJ/h22FwUw== X-Received: by 2002:adf:ec52:0:b0:332:d319:5955 with SMTP id w18-20020adfec52000000b00332d3195955mr10902384wrn.35.1701195875519; Tue, 28 Nov 2023 10:24:35 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:adf:fdcd:0:b0:332:ffc0:4860 with SMTP id i13-20020adffdcd000000b00332ffc04860ls948270wrs.1.-pod-prod-05-eu; Tue, 28 Nov 2023 10:24:30 -0800 (PST) X-Received: by 2002:a5d:6443:0:b0:332:e777:696e with SMTP id d3-20020a5d6443000000b00332e777696emr10806999wrw.64.1701195870015; Tue, 28 Nov 2023 10:24:30 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701195869; cv=none; d=google.com; s=arc-20160816; b=qTHYM/w0BID3wnUZehTWJpQZ/f4zymhP2I2GwYkBcyiXloo9Q+J8X8OaK7cSqrg7UZ LIwQLE6w0KJ3LudkPIrpdcP1qafNFWmdDeiavSe3rJDOyTz9wgvuffet4/HDIK4Vd9AQ GSc/hDWBtlPiq8o2iXPEHTayAOnWMK7WEyzBk9g5SOb0fwo9Ty5ZCdvC3zRi4FCMs1wh IrMzcbhFzzWutrXkAxX2IlJAd4/R89xFUhvjR4aDfgn+seGsw+6+Q3nMy0gge6LV40n8 DpSlGtOhF3LedboQKLRNtUkP8vY//U9YEcWh/BPwk/i/tzt2Fj5PDXp0E2zIKzq2yb3g RXGA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=H/dFeWi59XG84gtr2pRcfEWhQz4RaFywQkRz9QVxc9w=; fh=4cPfTtzleA5nPUC1EQtk197aIUeaT1ew1v/oILbLT2I=; b=ijlCcMhkybmLRGSbRNzB8lOIb86GswG7/VeL/mNknjhs6Aiw0RdKiTryTDUd2bwpA0 G/q0QvGectqD6hVIVgYbU0TwLppplcVWlCyN8IxSjvdQKIDJziV5rFyOYz1+3xE1sD3D 0uFALIdzKTSJZsjopj1vLHT36J5eGPjKwyoeTwG2mExskvksEGCC8AJyNeo6RzxVsTA7 ucQlK+23kSFCFjo2vUQZrA8zNK5sn59gjHcocUi3TQ88U4qs9+0NykZIFDPim7ugP8T2 ZvI7QbdX5ePKYeLLhZqoJ7JRvyIdDMkuuGYVQ/6U0GF+LmmeQn2Pi6vTcS0pf3jRcu41 4V5A== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=PzwC9iBR; spf=pass (google.com: domain of bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::12a as permitted sender) smtp.mailfrom=bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Original-Received: from mail-lf1-x12a.google.com (mail-lf1-x12a.google.com. [2a00:1450:4864:20::12a]) by gmr-mx.google.com with ESMTPS id d24-20020adf9b98000000b00332c094fc56si1022803wrc.5.2023.11.28.10.24.29 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 28 Nov 2023 10:24:29 -0800 (PST) Received-SPF: pass (google.com: domain of bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::12a as permitted sender) client-ip=2a00:1450:4864:20::12a; Original-Received: by mail-lf1-x12a.google.com with SMTP id 2adb3069b0e04-5094727fa67so8420310e87.3 for ; Tue, 28 Nov 2023 10:24:29 -0800 (PST) X-Received: by 2002:a05:6512:3990:b0:507:a8cd:6c90 with SMTP id j16-20020a056512399000b00507a8cd6c90mr14162747lfu.51.1701195869207; Tue, 28 Nov 2023 10:24:29 -0800 (PST) In-Reply-To: X-Original-Sender: bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.s=20230601 header.b=PzwC9iBR; spf=pass (google.com: domain of bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 2a00:1450:4864:20::12a as permitted sender) smtp.mailfrom=bpjonsson-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:33414 Archived-At: --0000000000007a3f5f060b3a87e6 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable You should be good just doing pandoc index.html manual.html errata.html -o manual.pdf or -o manual.tex (or manual.ltx if you prefer) if you want the LaTeX file. Pandoc should be able to guess the formats from the extensions. One caveat though: if the three HTML files contain external links to each other you probably want to convert them to internal links. It could be done with a filter but the easiest way is probably to convert to an intermediate markdown file and polish that in a text editor, then convert that markdown file to LaTeX/HTML. It also has the advantage that you can inspect what Pandoc makes of the HTML. Also it is probably better to let LaTeX (re)build the table of contents. Den m=C3=A5n 27 nov. 2023 01:13almaghfuur lahu skrev= : > How do we create a new PDF file by having pandoc to create a latex file > from some html files under one directory merely, i.e. not recursively, as > it's just acquired from a server directory to display a rather simple, > small reference documentation? > $ ls > cover.png errata.html index.html manual.html > > > index.html is actually containing table of contents, each line of which > is a link to a location in the file manual.html, and also some words inde= x > , each line of which is a link to a location in manual.html > > It has line: > > 3D"" ALIGN=3D"left" HSPACE=3D12> > > manual.html is the content or body of the reference manual > > errata.html is the catch up of corrections needed to care in this > reference manual > How to accomplish to obtain its one analogous latex file from which its > PDF file can be created the best, most efficient way ? > > -- > You received this message because you are subscribed to the Google Groups > "pandoc-discuss" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org > To view this discussion on the web visit > https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c839-4dd6-bd56-= c41657991b6en%40googlegroups.com > > . > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/CAFC_yuRTCLE6Ja85ELO4rUXz1q5aUtO0MBTPXwbXyiVOPTy0Ew%40mail.g= mail.com. --0000000000007a3f5f060b3a87e6 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
You should be good just doing

<= /div>
pandoc index.html manual.html errata.html -o manual.= pdf

or -o manual.tex (or= manual.ltx if you prefer) if you want the LaTeX file. Pandoc should be abl= e to guess the formats from the extensions.

One caveat though: if the three HTML files contain exte= rnal links to each other you probably want to convert them to internal link= s. It could be done with a filter but the easiest way is probably to conver= t to an intermediate markdown file and polish that in a text editor, then c= onvert that markdown file to LaTeX/HTML. It also has the advantage that you= can inspect what Pandoc makes of the HTML.

Also it is probably better to let LaTeX (re)build the t= able of contents.



Den m=C3=A5n 27 nov. 2023 = 01:13almaghfuur lahu <budikusasi= @gmail.com> skrev:

How do we create a new PDF file b= y having pandoc to create a latex file from some html files under one direc= tory merely, i.e. not recursively, as it's just acquired from a server = directory to display a rather simple, small reference documentation?

$=C2=A0ls
cover.png=C2=A0 errata= .html=C2=A0 index.html=C2=A0 manual.html


index.html=C2=A0is actually contain= ing table of contents, each line of which is a link to a location in the fi= le manual.html, and also some words index , each line of which is a link to= a location in manual.html

It has line:

<IMG SRC=3D"cover.png&quo= t; ALT=3D"" TITLE=3D"click to buy the book" BORDER=3D1 = ALIGN=3D"left" HSPACE=3D12>

manual.html=C2=A0is= the content or body of the reference manual

errata.html=C2=A0is the catch up of co= rrections needed to care in this reference manual

How to accomplish to obtain its one analogous latex file from wh= ich its PDF file can be created the best, most efficient way ?

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh4Ykp1iOSErHA@public.gmane.org= m.
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/aaec9ee1-c= 839-4dd6-bd56-c41657991b6en%40googlegroups.com.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.= google.com/d/msgid/pandoc-discuss/CAFC_yuRTCLE6Ja85ELO4rUXz1q5aUtO0MBTPXwbX= yiVOPTy0Ew%40mail.gmail.com.
--0000000000007a3f5f060b3a87e6--