From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/28197 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: =?UTF-8?Q?wolfgang_h=C3=A4felinger?= Newsgroups: gmane.text.pandoc Subject: Re: HTML => ODT - almost identical conversion possible? Date: Mon, 19 Apr 2021 08:42:40 -0700 (PDT) Message-ID: References: <60ca164c-f868-412d-bdcf-9f75e36f5317n@googlegroups.com> <9708c88a-0378-4a49-a7a9-20003d16e301n@googlegroups.com> <6b87196f-8262-3147-fa5d-9f9f4a55ad83@gmail.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_9888_1062059911.1618846960135" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="7287"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBCPOJAPS2MNRB4OJ62BQMGQEVZRVYCY-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Apr 19 17:42:43 2021 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-oi1-f187.google.com ([209.85.167.187]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1lYW2t-0001le-5F for gtp-pandoc-discuss@m.gmane-mx.org; Mon, 19 Apr 2021 17:42:43 +0200 Original-Received: by mail-oi1-f187.google.com with SMTP id a2-20020a544e020000b02901864ae76be1sf1881335oiy.7 for ; Mon, 19 Apr 2021 08:42:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=Vn2BAYAr4Nk34+d4fdfQcnvZv8E5UU3uRmK4xjUC0Rc=; b=CK6o8lT1zbznTwScaaxgdLGUqWAYd1DgHhqwcOyDHvmCSUeDNpDAKqR3Zp7G/GKurz Iu47X8S6ixN5a9/Fw7npitTqknuMyrD8IkZ5lkV9qOTH6DvSF9S8/PhU4Ag6RMLfOboL DFZFWBwX3UTeYXxVNypUa7clhcv4AOjDHuESPeLpcC/hEO+AT00xPO7DgvE5KWBGcynI JwD8G9ybE/aX6CKh5yNsfTPM66aOLP8JGTgTBAaGpygFj4EHPa7cRy1qMFReyc+d7bkT /7yqA6BMdi3tOgggXjiMV7XH12M0VzXttBJ546EfcBhi8jBGi1DYeQLoZRiveZPg8h4T C+GA== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=Vn2BAYAr4Nk34+d4fdfQcnvZv8E5UU3uRmK4xjUC0Rc=; b=kueqFVfncmitqUBpU3uY1LfFRISeWZloI8AQvVH08yAfdGKC769TS4Tv39ndVuaEG+ rgkDUj8BnJWo4ZHzySY+qii+yKN03DxgOWcvWQRac65rVYQ3I7ZatI2enJ5MzW9hlVci D1koRJOqbNQAL3HxLgCuLgO36671OauQeZp6QqpTGaSRFZ3xHjS9kUjXt/dPz3HsY5C/ 7ejMeeYGST50YMG7JcwYipg5us7xUnPIUhowSRpGTlt6Rlp7Wv0P4N6fGCfJ1YzMnsV7 unvsMWtG654kJ1Gw+KP7yo9OU+3wKIuQKPCVwGdAk4Cc1+UKQPQ5wxgoCWwBp9IG9sHu ObsA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:in-reply-to :references:subject:mime-version:x-original-sender:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=Vn2BAYAr4Nk34+d4fdfQcnvZv8E5UU3uRmK4xjUC0Rc=; b=B0tKl/2bij5V8nd5+9XtO1VaJXpAVGnn3JSVQpXjUo/+1M5Bbux49c/zdr1b1ttE43 EBIn7BbgRyrqFl+FplyOzwwW+d1EcwvJynWjj+QSG7WMGlrbM3ny/iB2qMRpiWp+wlFv +SVsPq190AJLa2CidDSigj1yZUGbzMC7DRGp2lby/PnmFqahMW3EISugBurG/MXvXuxv 62gf1gfh17BCnHLRToTvXNhJ7LA0gOJlgkx6bhpUM2NPRyzOJ6Uirtw21f4GE94TzmiN KMU9KNBB5hFLkUhiHHr0plw+i1V6h7RCUMbNP0jXU/6b0SJMOZwWrQVDx9sh7C6M+ZY7 fsVQ== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM530BkH0gTr6eqs+5HGFKsdChOGUBkgZijsufslhfIyIiH1BsNXVm mZ4Y+va6kGz+xhNJ3gSaHvE= X-Google-Smtp-Source: ABdhPJx0xLw/h4BNtbhSuhvdoni/Jc+qPQaT+Q+s0vV7N89GAMWlO38O9BpugtPPXI3IHlyQocxhrg== X-Received: by 2002:a4a:8c4f:: with SMTP id v15mr13923663ooj.25.1618846962248; Mon, 19 Apr 2021 08:42:42 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:aca:758d:: with SMTP id q135ls908366oic.0.gmail; Mon, 19 Apr 2021 08:42:41 -0700 (PDT) X-Received: by 2002:a05:6808:1482:: with SMTP id e2mr16719548oiw.138.1618846960789; Mon, 19 Apr 2021 08:42:40 -0700 (PDT) In-Reply-To: <6b87196f-8262-3147-fa5d-9f9f4a55ad83-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> X-Original-Sender: whaefelinger-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:28197 Archived-At: ------=_Part_9888_1062059911.1618846960135 Content-Type: multipart/alternative; boundary="----=_Part_9889_981355508.1618846960135" ------=_Part_9889_981355508.1618846960135 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable > Why would a recruiter make such requests? There's no reason for them to= =20 change your design, right? Damned right you are. Of course they don't change my design. They just put= =20 *their* company logo on each page ;-) > How about thinking outside the box. I did that years ago where my base resume document has been odt/docx. Then= =20 I did some XPATH madness things to translate into ASCIIDOC and from there to DOCBOOK 4 and then via dblatex= =20 to xelatex to (beautifull) PDF.=20 ASCIIDOCTOR and PANDOC did not exist at that time. > That way the docx/odt would look as you intended, but if someone=20 copy-pastes its content, there's no formatting to screw something up.=20 Win-win. Hmm, good idea. Kind of "write protected" word document. However, the point= =20 is that they expect a "unprotected" Word document (see above). Perhaps I should do it this way: Just expose a public service that renders my resume into a PDF. By default= =20 it renders my style. However, a interested recruiter an upload images and (CSS) styles for customization. Voila. Who wants to receive a Word document= =20 anyway? On Thursday, 15 April 2021 at 19:50:23 UTC+2 monk...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org wrote: > > > The idea is to generate ODT or DOCX out of such a resume cause DOCX is= =20 > > often required by some recruiters). > > How about thinking outside the box. > > Why would a recruiter make such requests? There's no reason for them to= =20 > change your design, right? So the only reason could be to get at your=20 > raw data to input them into some system. They just don't have a simple=20 > way to communicate that. (For anything else I'm assuming unreasonable=20 > until proven innocent for now.) > > But if what they really need is raw data, asciidoc should be even=20 > better. So=E2=80=A6 what if the docx/odt contained your plain asciidoc. A= nd what=20 > if this asciidoc was hidden behind pictures of the pdf/html. That way=20 > the docx/odt would look as you intended, but if someone copy-pastes its= =20 > content, there's no formatting to screw something up. Win-win. > > So, I propose: > > 1. generate html/pdf from asciidoc via pandoc > > 2. turn each page of html/pdf into an image via unknown means > > 3. generate docx/odt from asciidoc plus images via pandoc plus a filter > > Why use a crowbar on pandoc if pandoc can be your crowbar ;) > > --=20 You received this message because you are subscribed to the Google Groups "= pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/= pandoc-discuss/a075f737-75ea-42de-aa1f-ee965e05e92bn%40googlegroups.com. ------=_Part_9889_981355508.1618846960135 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
> Why would a recruiter make such requests? There's no reason for t= hem to change your design, right?

Damned right you= are. Of course they don't change my design. They just put *their* company = logo on each page ;-)

> How about thinking = outside the box.

I did that years ago where my= base resume document has been odt/docx. Then I did some XPATH madness thin= gs
to translate into ASCIIDOC and from there to DOCBOOK 4 and the= n via dblatex to xelatex to (beautifull) PDF. 
ASCIIDOCTOR a= nd PANDOC did not exist at that time.

> Th= at way the docx/odt would look as you intended, but if someone copy-pastes = its content, there's no formatting to screw something up. Win-win.

Hmm, good idea. Kind of "write protected" word docum= ent. However, the point is that they expect a "unprotected"  Word docu= ment (see above).

Perhaps I should do it this way:=
Just expose a public service that renders my resume into a PDF. = By default it renders my style. However, a interested recruiter an upload i= mages and
(CSS) styles for customization. Voila. Who wants to rec= eive a Word document anyway?


On Thursday, 15 April 2021 at 1= 9:50:23 UTC+2 monk...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org wrote:

> The idea is to generate ODT or DOCX out of such a resume cause DOC= X is=20
> often required by some recruiters).

How about thinking outside the box.

Why would a recruiter make such requests? There's no reason for the= m to=20
change your design, right? So the only reason could be to get at your= =20
raw data to input them into some system. They just don't have a sim= ple=20
way to communicate that. (For anything else I'm assuming unreasonab= le=20
until proven innocent for now.)

But if what they really need is raw data, asciidoc should be even=20
better. So=E2=80=A6 what if the docx/odt contained your plain asciidoc.= And what=20
if this asciidoc was hidden behind pictures of the pdf/html. That way= =20
the docx/odt would look as you intended, but if someone copy-pastes its= =20
content, there's no formatting to screw something up. Win-win.

So, I propose:

1. generate html/pdf from asciidoc via pandoc

2. turn each page of html/pdf into an image via unknown means

3. generate docx/odt from asciidoc plus images via pandoc plus a filter

Why use a crowbar on pandoc if pandoc can be your crowbar ;)

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d= /msgid/pandoc-discuss/a075f737-75ea-42de-aa1f-ee965e05e92bn%40googlegroups.= com.
------=_Part_9889_981355508.1618846960135-- ------=_Part_9888_1062059911.1618846960135--