public inbox archive for pandoc-discuss@googlegroups.com
 help / color / mirror / Atom feed
* Pandoc html to doc(Not Docx) conversion issue
@ 2018-06-07 13:26 Bhaskar Sanjeevamoorthy
       [not found] ` <4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 4+ messages in thread
From: Bhaskar Sanjeevamoorthy @ 2018-06-07 13:26 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 1719 bytes --]


Hello, 

Anyone came across below issue, Please let me know if there is a solution 
for this:-

*I have html(s) like below it has base64 encoded image. When i convert it 
into doc file it doesnt render the image. Please suggest do i need to 
change anything in the html format.*

<html>

<body>
<img id="123" 
style="background-image:url(data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7);width:300px;height:300px;"/>

</body>
</html>

or

<html>
<body>
<img id="123" style="width:300px;height:300px;" src 
="data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"/>
</body>
</html>

Thanks,
Bhaskar

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[-- Attachment #1.2: Type: text/html, Size: 2496 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Pandoc html to doc(Not Docx) conversion issue
       [not found] ` <4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2018-06-07 17:49   ` John MacFarlane
       [not found]     ` <yh480kvaautrkr.fsf-pgq/RBwaQ+zq8tPRBa0AtqxOck334EZe@public.gmane.org>
  0 siblings, 1 reply; 4+ messages in thread
From: John MacFarlane @ 2018-06-07 17:49 UTC (permalink / raw)
  To: Bhaskar Sanjeevamoorthy, pandoc-discuss

Bhaskar Sanjeevamoorthy <mailtobhaskars-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
writes:

You say doc, not docx.  Pandoc doesn't directly
support doc.  If you specify an output file with a
.doc extension, it will default to HTML. You can
create a docx and then use Word to convert it to doc.

For docx, the second one will work, but not the first
(because in general styles don't come across).

> Hello, 
>
> Anyone came across below issue, Please let me know if there is a solution 
> for this:-
>
> *I have html(s) like below it has base64 encoded image. When i convert it 
> into doc file it doesnt render the image. Please suggest do i need to 
> change anything in the html format.*
>
> <html>
>
> <body>
> <img id="123" 
> style="background-image:url(data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7);width:300px;height:300px;"/>
>
> </body>
> </html>
>
> or
>
> <html>
> <body>
> <img id="123" style="width:300px;height:300px;" src 
> ="data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"/>
> </body>
> </html>
>
> Thanks,
> Bhaskar
>
> -- 
> You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/yh480kvaautrkr.fsf%40johnmacfarlane.net.
For more options, visit https://groups.google.com/d/optout.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Pandoc html to doc(Not Docx) conversion issue
       [not found]     ` <yh480kvaautrkr.fsf-pgq/RBwaQ+zq8tPRBa0AtqxOck334EZe@public.gmane.org>
@ 2018-06-08 12:23       ` Bhaskar Sanjeevamoorthy
       [not found]         ` <91ef9043-57fa-497f-a2e2-1859efe246d7-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
  0 siblings, 1 reply; 4+ messages in thread
From: Bhaskar Sanjeevamoorthy @ 2018-06-08 12:23 UTC (permalink / raw)
  To: pandoc-discuss


[-- Attachment #1.1: Type: text/plain, Size: 3400 bytes --]

Thanks for the reply.

you suggested to use docx file instead doc.
But we have lot of inline style tags,css classes and complex tables in our 
actual html file, that is why we switched to doc format.

Can you plz suggest how to solve this inline style tags issue for docx 
output, so that we can use this approach & drop off .doc approach

Thanks & regards,
Bhaskar

On Thursday, June 7, 2018 at 11:19:40 PM UTC+5:30, John MacFarlane wrote:
>
> Bhaskar Sanjeevamoorthy <mailtob...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org <javascript:>> 
> writes: 
>
> You say doc, not docx.  Pandoc doesn't directly 
> support doc.  If you specify an output file with a 
> .doc extension, it will default to HTML. You can 
> create a docx and then use Word to convert it to doc. 
>
> For docx, the second one will work, but not the first 
> (because in general styles don't come across). 
>
> > Hello, 
> > 
> > Anyone came across below issue, Please let me know if there is a 
> solution 
> > for this:- 
> > 
> > *I have html(s) like below it has base64 encoded image. When i convert 
> it 
> > into doc file it doesnt render the image. Please suggest do i need to 
> > change anything in the html format.* 
> > 
> > <html> 
> > 
> > <body> 
> > <img id="123" 
> > 
> style="background-image:url(data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7);width:300px;height:300px;"/> 
>
> > 
> > </body> 
> > </html> 
> > 
> > or 
> > 
> > <html> 
> > <body> 
> > <img id="123" style="width:300px;height:300px;" src 
> > 
> ="data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"/> 
>
> > </body> 
> > </html> 
> > 
> > Thanks, 
> > Bhaskar 
> > 
> > -- 
> > You received this message because you are subscribed to the Google 
> Groups "pandoc-discuss" group. 
> > To unsubscribe from this group and stop receiving emails from it, send 
> an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org <javascript:>. 
> > To post to this group, send email to pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org 
> <javascript:>. 
> > To view this discussion on the web visit 
> https://groups.google.com/d/msgid/pandoc-discuss/4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1%40googlegroups.com. 
>
> > For more options, visit https://groups.google.com/d/optout. 
>

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/91ef9043-57fa-497f-a2e2-1859efe246d7%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

[-- Attachment #1.2: Type: text/html, Size: 5653 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: Pandoc html to doc(Not Docx) conversion issue
       [not found]         ` <91ef9043-57fa-497f-a2e2-1859efe246d7-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
@ 2018-06-08 15:26           ` John MacFarlane
  0 siblings, 0 replies; 4+ messages in thread
From: John MacFarlane @ 2018-06-08 15:26 UTC (permalink / raw)
  To: Bhaskar Sanjeevamoorthy, pandoc-discuss


Pandoc just converts the structure and content of the document.
It pays no attention to inline style tags or CSS.
And it generally can only handle simple tables (e.g. without
row/colspans).

Bhaskar Sanjeevamoorthy <mailtobhaskars-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> writes:

> Thanks for the reply.
>
> you suggested to use docx file instead doc.
> But we have lot of inline style tags,css classes and complex tables in our 
> actual html file, that is why we switched to doc format.
>
> Can you plz suggest how to solve this inline style tags issue for docx 
> output, so that we can use this approach & drop off .doc approach
>
> Thanks & regards,
> Bhaskar
>
> On Thursday, June 7, 2018 at 11:19:40 PM UTC+5:30, John MacFarlane wrote:
>>
>> Bhaskar Sanjeevamoorthy <mailtob...-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org <javascript:>> 
>> writes: 
>>
>> You say doc, not docx.  Pandoc doesn't directly 
>> support doc.  If you specify an output file with a 
>> .doc extension, it will default to HTML. You can 
>> create a docx and then use Word to convert it to doc. 
>>
>> For docx, the second one will work, but not the first 
>> (because in general styles don't come across). 
>>
>> > Hello, 
>> > 
>> > Anyone came across below issue, Please let me know if there is a 
>> solution 
>> > for this:- 
>> > 
>> > *I have html(s) like below it has base64 encoded image. When i convert 
>> it 
>> > into doc file it doesnt render the image. Please suggest do i need to 
>> > change anything in the html format.* 
>> > 
>> > <html> 
>> > 
>> > <body> 
>> > <img id="123" 
>> > 
>> style="background-image:url(data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7);width:300px;height:300px;"/> 
>>
>> > 
>> > </body> 
>> > </html> 
>> > 
>> > or 
>> > 
>> > <html> 
>> > <body> 
>> > <img id="123" style="width:300px;height:300px;" src 
>> > 
>> ="data:image;base64,R0lGODlhEAAQAMQAAORHHOVSKudfOulrSOp3WOyDZu6QdvCchPGolfO0o/XBs/fNwfjZ0frl3/zy7////wAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACH5BAkAABAALAAAAAAQABAAAAVVICSOZGlCQAosJ6mu7fiyZeKqNKToQGDsM8hBADgUXoGAiqhSvp5QAnQKGIgUhwFUYLCVDFCrKUE1lBavAViFIDlTImbKC5Gm2hB0SlBCBMQiB0UjIQA7"/> 
>>
>> > </body> 
>> > </html> 
>> > 
>> > Thanks, 
>> > Bhaskar 
>> > 
>> > -- 
>> > You received this message because you are subscribed to the Google 
>> Groups "pandoc-discuss" group. 
>> > To unsubscribe from this group and stop receiving emails from it, send 
>> an email to pandoc-discus...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org <javascript:>. 
>> > To post to this group, send email to pandoc-...-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org 
>> <javascript:>. 
>> > To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/pandoc-discuss/4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1%40googlegroups.com. 
>>
>> > For more options, visit https://groups.google.com/d/optout. 
>>
>
> -- 
> You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
> To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
> To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/91ef9043-57fa-497f-a2e2-1859efe246d7%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

-- 
You received this message because you are subscribed to the Google Groups "pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org
To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/m2d0x1padq.fsf%40johnmacfarlane.net.
For more options, visit https://groups.google.com/d/optout.


^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2018-06-08 15:26 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-06-07 13:26 Pandoc html to doc(Not Docx) conversion issue Bhaskar Sanjeevamoorthy
     [not found] ` <4968239f-7a2b-41f0-bf08-b5bcdcfe7cc1-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2018-06-07 17:49   ` John MacFarlane
     [not found]     ` <yh480kvaautrkr.fsf-pgq/RBwaQ+zq8tPRBa0AtqxOck334EZe@public.gmane.org>
2018-06-08 12:23       ` Bhaskar Sanjeevamoorthy
     [not found]         ` <91ef9043-57fa-497f-a2e2-1859efe246d7-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org>
2018-06-08 15:26           ` John MacFarlane

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).