public inbox archive for pandoc-discuss@googlegroups.com
 help / color / mirror / Atom feed
* HTML => ODT - almost identical conversion possible?
@ 2021-04-14 14:55 wolfgang häfelinger
       [not found] ` <60ca164c-f868-412d-bdcf-9f75e36f5317n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 5+ messages in thread
From: wolfgang häfelinger @ 2021-04-14 14:55 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 907 bytes --]

Dear all,

I generated an example resume (see resume.html) from an ASCIIDOC input. Now 
I wonder whether I can use PANDOC to convert that resume into ODT or DOCX.

I tried 

pandoc -f html resume.html -t odt -o resume.odt

which produced attached ODT file. Below is a screenshot showing the result. 
Obviously the layout is not the same, icons are far to big and so on.

Is there a (easy) way to instruct PANDOC to get a 1:1 convertion?

Thanks.



[image: Screenshot 2021-04-14 at 16.50.44.png]

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/60ca164c-f868-412d-bdcf-9f75e36f5317n%40googlegroups.com.

[-- Attachment #1.2: Type: text/html, Size: 1539 bytes --]

[-- Attachment #2: Screenshot 2021-04-14 at 16.50.44.png --]
[-- Type: image/png, Size: 344369 bytes --]

[-- Attachment #3: resume.html --]
[-- Type: text/html, Size: 7107 bytes --]

[-- Attachment #4: resume.odt --]
[-- Type: application/x-zip, Size: 67098 bytes --]

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: HTML => ODT - almost identical conversion possible?
       [not found] ` <60ca164c-f868-412d-bdcf-9f75e36f5317n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2021-04-14 16:11   ` John MacFarlane
       [not found]     ` <m2tuo8hmm0.fsf-jF64zX8BO08an7k8zZ43ob9bIa4KchGshsV+eolpW18@public.gmane.org>
  0 siblings, 1 reply; 5+ messages in thread
From: John MacFarlane @ 2021-04-14 16:11 UTC (permalink / raw)
  To: wolfgang häfelinger, pandoc-discuss


You shouldn't expect exact duplication in formatting details.
See the beginning of the manual.

As for the images:  are you explicitly specifying sizes
for the images in the input html file?  If so, how?

If not, you can resize the images or just change their dpi
attribute.

You could also try different settings for --dpi with pandoc.

wolfgang häfelinger <whaefelinger-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:

> Dear all,
>
> I generated an example resume (see resume.html) from an ASCIIDOC input. Now 
> I wonder whether I can use PANDOC to convert that resume into ODT or DOCX.
>
> I tried 
>
> pandoc -f html resume.html -t odt -o resume.odt
>
> which produced attached ODT file. Below is a screenshot showing the result. 
> Obviously the layout is not the same, icons are far to big and so on.
>
> Is there a (easy) way to instruct PANDOC to get a 1:1 convertion?
>
> Thanks.
>
>
>
> [image: Screenshot 2021-04-14 at 16.50.44.png]
>
> -- 
> You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/60ca164c-f868-412d-bdcf-9f75e36f5317n%40googlegroups.com.
>
> Chargée de projet culturel
>
> Chargée de projet culturel
>
> photo
>
> Léa Rumiz
>
> * 
>  phone
>  06 98 60 15 36
>
> * 
>  mail
>  lea.rumiz-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org
>
> * Permis B
>
> * Gestion de projet
>
> * Communication
>
> * Médiation culturelle
>
> * Anglais courant
>
> Compétences complémentaires
>
> * Photoshop, InDesign
>
> * QuarkXPress
>
> * Sphinx
>
> * Bases HTML 5 / CSS 3
>
> * WordPress
>
> Qualités personnelles
>
> * Rigueur
>
> * Autonomie
>
> * Esprit d’équipe
>
> * Dynamisme
>
> Centres d’intérêt
>
> * Couture
>
> * Littérature islandaise
>
> * Pilates / Yoga
>
> Exprience professionnelle
>
> 2015 - 2018 : Chargée de mission Métropole
>
> Festival Lumière - Lyon (69)
>
> * Coordonner les événements programmés dans plus de 20 communes de la Métropole.
>
> * Faire le lien entre le festival et ses partenaires institutionnels et culturels.
>
> * Assurer la visibilité des événements programmés
>
> * Promouvoir le festival dans la presse locale en lien avec l’attachée de presse.
>
> * Évaluer les retombées médiatiques et réaliser une revue de presse.
>
> * Concevoir des outils de communication (programme, flyers, communiqués) avec
>  l’équipe graphique.
>
> * Rédiger du contenu de communication web et mettre en place un calendrier de
>  diffusion.
>
> * Accompagner des invités du festival (acteurs, réalisateurs, professionnels du cinéma).
>
> * Traduire des interventions (anglais à français).
>
> * Présenter des séances au public.
>
> * Encadrer une personne en stage.
>
> 2013 - 2014 : Responsable de la programmation culturelle
>
> Le Rize - Ville de Villeurbanne (69)
>
> * Définir la programmation culturelle.
>
> * Organiser des événements (pièces de théâtre, conférences, concerts).
>
> * Définir les moyens humains, matériels et financiers d’un projet.
>
> * Rédiger des contrats.
>
> * Accueillir les artistes et intervenants extérieurs.
>
> * Coordonner des actions culturelles avec les autres services.
>
> * Animer une réunion.
>
> 2013 : Animatrice nature
>
> Direction Paysage et nature - Ville de Villeurbanne (69)
>
> * Concevoir des animations pédagogiques de vulgarisation scientifique
>  (nature/environnement).
>
> * Organiser la séance d’animation et préparer le matériel et l’espace d’animation.
>
> * Animer des groupes jeune public et adultes.
>
> * Guider les participants lors de la réalisation de l’activité et l’adapter selon leurs
>  besoins.
>
> * Réaliser le bilan du projet d’animation et proposer des axes d’évolution.
>
> Stages et projet professionnel
>
> 2012 : Assistante programmation scientifique et culturelle
>
> Musée des Confluences - Lyon (69)
>
> * Réaliser une veille de la programmation de musées, en France et à l’étranger.
>
> * Proposer des pistes de programmation par thématiques et publics cibles.
>
> 2009 - 2010 : Chef de projet - valorisation d’un lieu patrimonial
>
> Grange du Clou - Saint-Cyr sur Menthon (01)
>
> * Concevoir et mettre en place une exposition d’art contemporain.
>
> * Créer des outils de communication, de relations presse et de médiation culturelle.
>
> 2009 : Médiatrice culturelle
>
> Fort du Bruissin, Centre d’art contemporain - Francheville (69)
>
> * Accompagner des groupes lors de visites guidées.
>
> * Accueillir et informer le public.
>
> 2009 : Chargée de communication
>
> Le Pavé Dans La Mare, Centre d’art contemporain - Besançon (25)
>
> * Créer des outils de communication et de relations presse.
>
> Formation
>
> 2012 : Master 2 Communication, Culture et Institutions
>
> Sciences Po Lyon (69)
>
> 2010 : Licence Médiation Culturelle
>
> EAC, Formation supérieure aux métiers de la culture - Lyon (69)

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/m2tuo8hmm0.fsf%40MacBook-Pro.hsd1.ca.comcast.net.


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: HTML => ODT - almost identical conversion possible?
       [not found]     ` <m2tuo8hmm0.fsf-jF64zX8BO08an7k8zZ43ob9bIa4KchGshsV+eolpW18@public.gmane.org>
@ 2021-04-15  6:35       ` wolfgang häfelinger
       [not found]         ` <9708c88a-0378-4a49-a7a9-20003d16e301n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 5+ messages in thread
From: wolfgang häfelinger @ 2021-04-15  6:35 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 6695 bytes --]

Well I have been reading the intro, especially "Swiss Army Knife". Opening 
a beer bottle with such a knife still renders the beer digestable and does 
not destill it into pure water :-)

Anyway, thanks for replying. 

Images: "auto-generated" in HTML using asciidoctor-web-pdf: asciidoc => 
{html, pdf}  - The idea is to generate ODT or DOCX out of such a resume 
cause DOCX is often required by some recruiters).
 

On Wednesday, 14 April 2021 at 18:12:07 UTC+2 John MacFarlane wrote:

>
> You shouldn't expect exact duplication in formatting details.
> See the beginning of the manual.
>
> As for the images: are you explicitly specifying sizes
> for the images in the input html file? If so, how?
>
> If not, you can resize the images or just change their dpi
> attribute.
>
> You could also try different settings for --dpi with pandoc.
>
> wolfgang häfelinger <whaefe...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:
>
> > Dear all,
> >
> > I generated an example resume (see resume.html) from an ASCIIDOC input. 
> Now 
> > I wonder whether I can use PANDOC to convert that resume into ODT or 
> DOCX.
> >
> > I tried 
> >
> > pandoc -f html resume.html -t odt -o resume.odt
> >
> > which produced attached ODT file. Below is a screenshot showing the 
> result. 
> > Obviously the layout is not the same, icons are far to big and so on.
> >
> > Is there a (easy) way to instruct PANDOC to get a 1:1 convertion?
> >
> > Thanks.
> >
> >
> >
> > [image: Screenshot 2021-04-14 at 16.50.44.png]
> >
> > -- 
> > You received this message because you are subscribed to the Google 
> Groups "pandoc-discuss" group.
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> > To view this discussion on the web visit 
> https://groups.google.com/d/msgid/pandoc-discuss/60ca164c-f868-412d-bdcf-9f75e36f5317n%40googlegroups.com
> .
> >
> > Chargée de projet culturel
> >
> > Chargée de projet culturel
> >
> > photo
> >
> > Léa Rumiz
> >
> > * 
> > phone
> > 06 98 60 15 36
> >
> > * 
> > mail
> > lea....-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org
> >
> > * Permis B
> >
> > * Gestion de projet
> >
> > * Communication
> >
> > * Médiation culturelle
> >
> > * Anglais courant
> >
> > Compétences complémentaires
> >
> > * Photoshop, InDesign
> >
> > * QuarkXPress
> >
> > * Sphinx
> >
> > * Bases HTML 5 / CSS 3
> >
> > * WordPress
> >
> > Qualités personnelles
> >
> > * Rigueur
> >
> > * Autonomie
> >
> > * Esprit d’équipe
> >
> > * Dynamisme
> >
> > Centres d’intérêt
> >
> > * Couture
> >
> > * Littérature islandaise
> >
> > * Pilates / Yoga
> >
> > Exprience professionnelle
> >
> > 2015 - 2018 : Chargée de mission Métropole
> >
> > Festival Lumière - Lyon (69)
> >
> > * Coordonner les événements programmés dans plus de 20 communes de la 
> Métropole.
> >
> > * Faire le lien entre le festival et ses partenaires institutionnels et 
> culturels.
> >
> > * Assurer la visibilité des événements programmés
> >
> > * Promouvoir le festival dans la presse locale en lien avec l’attachée 
> de presse.
> >
> > * Évaluer les retombées médiatiques et réaliser une revue de presse.
> >
> > * Concevoir des outils de communication (programme, flyers, communiqués) 
> avec
> > l’équipe graphique.
> >
> > * Rédiger du contenu de communication web et mettre en place un 
> calendrier de
> > diffusion.
> >
> > * Accompagner des invités du festival (acteurs, réalisateurs, 
> professionnels du cinéma).
> >
> > * Traduire des interventions (anglais à français).
> >
> > * Présenter des séances au public.
> >
> > * Encadrer une personne en stage.
> >
> > 2013 - 2014 : Responsable de la programmation culturelle
> >
> > Le Rize - Ville de Villeurbanne (69)
> >
> > * Définir la programmation culturelle.
> >
> > * Organiser des événements (pièces de théâtre, conférences, concerts).
> >
> > * Définir les moyens humains, matériels et financiers d’un projet.
> >
> > * Rédiger des contrats.
> >
> > * Accueillir les artistes et intervenants extérieurs.
> >
> > * Coordonner des actions culturelles avec les autres services.
> >
> > * Animer une réunion.
> >
> > 2013 : Animatrice nature
> >
> > Direction Paysage et nature - Ville de Villeurbanne (69)
> >
> > * Concevoir des animations pédagogiques de vulgarisation scientifique
> > (nature/environnement).
> >
> > * Organiser la séance d’animation et préparer le matériel et l’espace 
> d’animation.
> >
> > * Animer des groupes jeune public et adultes.
> >
> > * Guider les participants lors de la réalisation de l’activité et 
> l’adapter selon leurs
> > besoins.
> >
> > * Réaliser le bilan du projet d’animation et proposer des axes 
> d’évolution.
> >
> > Stages et projet professionnel
> >
> > 2012 : Assistante programmation scientifique et culturelle
> >
> > Musée des Confluences - Lyon (69)
> >
> > * Réaliser une veille de la programmation de musées, en France et à 
> l’étranger.
> >
> > * Proposer des pistes de programmation par thématiques et publics cibles.
> >
> > 2009 - 2010 : Chef de projet - valorisation d’un lieu patrimonial
> >
> > Grange du Clou - Saint-Cyr sur Menthon (01)
> >
> > * Concevoir et mettre en place une exposition d’art contemporain.
> >
> > * Créer des outils de communication, de relations presse et de médiation 
> culturelle.
> >
> > 2009 : Médiatrice culturelle
> >
> > Fort du Bruissin, Centre d’art contemporain - Francheville (69)
> >
> > * Accompagner des groupes lors de visites guidées.
> >
> > * Accueillir et informer le public.
> >
> > 2009 : Chargée de communication
> >
> > Le Pavé Dans La Mare, Centre d’art contemporain - Besançon (25)
> >
> > * Créer des outils de communication et de relations presse.
> >
> > Formation
> >
> > 2012 : Master 2 Communication, Culture et Institutions
> >
> > Sciences Po Lyon (69)
> >
> > 2010 : Licence Médiation Culturelle
> >
> > EAC, Formation supérieure aux métiers de la culture - Lyon (69)
>

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/9708c88a-0378-4a49-a7a9-20003d16e301n%40googlegroups.com.

[-- Attachment #1.2: Type: text/html, Size: 8683 bytes --]

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: HTML => ODT - almost identical conversion possible?
       [not found]         ` <9708c88a-0378-4a49-a7a9-20003d16e301n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2021-04-15 17:50           ` MarLinn
       [not found]             ` <6b87196f-8262-3147-fa5d-9f9f4a55ad83-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
  0 siblings, 1 reply; 5+ messages in thread
From: MarLinn @ 2021-04-15 17:50 UTC (permalink / raw)
  To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw


> The idea is to generate ODT or DOCX out of such a resume cause DOCX is 
> often required by some recruiters).

How about thinking outside the box.

Why would a recruiter make such requests? There's no reason for them to 
change your design, right? So the only reason could be to get at your 
raw data to input them into some system. They just don't have a simple 
way to communicate that. (For anything else I'm assuming unreasonable 
until proven innocent for now.)

But if what they really need is raw data, asciidoc should be even 
better. So… what if the docx/odt contained your plain asciidoc. And what 
if this asciidoc was hidden behind pictures of the pdf/html. That way 
the docx/odt would look as you intended, but if someone copy-pastes its 
content, there's no formatting to screw something up. Win-win.

So, I propose:

1. generate html/pdf from asciidoc via pandoc

2. turn each page of html/pdf into an image via unknown means

3. generate docx/odt from asciidoc plus images via pandoc plus a filter

Why use a crowbar on pandoc if pandoc can be your crowbar ;)

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/6b87196f-8262-3147-fa5d-9f9f4a55ad83%40gmail.com.


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: HTML => ODT - almost identical conversion possible?
       [not found]             ` <6b87196f-8262-3147-fa5d-9f9f4a55ad83-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
@ 2021-04-19 15:42               ` wolfgang häfelinger
  0 siblings, 0 replies; 5+ messages in thread
From: wolfgang häfelinger @ 2021-04-19 15:42 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 2795 bytes --]

> Why would a recruiter make such requests? There's no reason for them to 
change your design, right?

Damned right you are. Of course they don't change my design. They just put 
*their* company logo on each page ;-)

> How about thinking outside the box.

I did that years ago where my base resume document has been odt/docx. Then 
I did some XPATH madness things
to translate into ASCIIDOC and from there to DOCBOOK 4 and then via dblatex 
to xelatex to (beautifull) PDF. 
ASCIIDOCTOR and PANDOC did not exist at that time.

> That way the docx/odt would look as you intended, but if someone 
copy-pastes its content, there's no formatting to screw something up. 
Win-win.

Hmm, good idea. Kind of "write protected" word document. However, the point 
is that they expect a "unprotected"  Word document (see above).

Perhaps I should do it this way:
Just expose a public service that renders my resume into a PDF. By default 
it renders my style. However, a interested recruiter an upload images and
(CSS) styles for customization. Voila. Who wants to receive a Word document 
anyway?


On Thursday, 15 April 2021 at 19:50:23 UTC+2 monk...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org wrote:

>
> > The idea is to generate ODT or DOCX out of such a resume cause DOCX is 
> > often required by some recruiters).
>
> How about thinking outside the box.
>
> Why would a recruiter make such requests? There's no reason for them to 
> change your design, right? So the only reason could be to get at your 
> raw data to input them into some system. They just don't have a simple 
> way to communicate that. (For anything else I'm assuming unreasonable 
> until proven innocent for now.)
>
> But if what they really need is raw data, asciidoc should be even 
> better. So… what if the docx/odt contained your plain asciidoc. And what 
> if this asciidoc was hidden behind pictures of the pdf/html. That way 
> the docx/odt would look as you intended, but if someone copy-pastes its 
> content, there's no formatting to screw something up. Win-win.
>
> So, I propose:
>
> 1. generate html/pdf from asciidoc via pandoc
>
> 2. turn each page of html/pdf into an image via unknown means
>
> 3. generate docx/odt from asciidoc plus images via pandoc plus a filter
>
> Why use a crowbar on pandoc if pandoc can be your crowbar ;)
>
>

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/a075f737-75ea-42de-aa1f-ee965e05e92bn%40googlegroups.com.

[-- Attachment #1.2: Type: text/html, Size: 3582 bytes --]

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2021-04-19 15:42 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-04-14 14:55 HTML => ODT - almost identical conversion possible? wolfgang häfelinger
     [not found] ` <60ca164c-f868-412d-bdcf-9f75e36f5317n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2021-04-14 16:11   ` John MacFarlane
     [not found]     ` <m2tuo8hmm0.fsf-jF64zX8BO08an7k8zZ43ob9bIa4KchGshsV+eolpW18@public.gmane.org>
2021-04-15  6:35       ` wolfgang häfelinger
     [not found]         ` <9708c88a-0378-4a49-a7a9-20003d16e301n-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2021-04-15 17:50           ` MarLinn
     [not found]             ` <6b87196f-8262-3147-fa5d-9f9f4a55ad83-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
2021-04-19 15:42               ` wolfgang häfelinger

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).