From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/11649 Path: news.gmane.org!not-for-mail From: Jesse Rosenthal Newsgroups: gmane.text.pandoc Subject: Re: Docx reader ; style picking algorithm Date: Thu, 1 Jan 2015 08:05:33 -0800 (PST) Message-ID: <224dcd62-a503-40a9-bd4b-a7009abd559a@googlegroups.com> References: <4b2f3216-fc6c-4552-8bf0-3cd263ebc143@googlegroups.com> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_9157_305106692.1420128333877" X-Trace: ger.gmane.org 1420128342 6075 80.91.229.3 (1 Jan 2015 16:05:42 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 1 Jan 2015 16:05:42 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBDF7DMU574PBBTXASWSQKGQEMCV7WTA-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Thu Jan 01 17:05:36 2015 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-qg0-f58.google.com ([209.85.192.58]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1Y6iFj-0005ES-Ap for gtp-pandoc-discuss@m.gmane.org; Thu, 01 Jan 2015 17:05:35 +0100 Original-Received: by mail-qg0-f58.google.com with SMTP id z60sf2902614qgd.13 for ; Thu, 01 Jan 2015 08:05:34 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :content-type:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe; bh=jUZ6IWDI5EHEGlD6R7r9uGrS51JMryvPHfbyodJ8rRU=; b=C6xiVp4qI+8wLuP5D7jVTW9V5q0cAyj4W5ArCyv0Oze3VpnOFLYx+FyiFdHsqU1CAI BctGGCY2t5SxMdV8sRT8FgUBkE5Eld8fp4Dhm3q+2Zw9XWLYLEUR6U5DJRWgCdozz+s8 H7Dq+zgK1XusTyqXSUD3cN57AleSL+DRVbkzqCdA6RXyKlSDLO90D2ayNNrNR81Ylu6i rzqoVizmKc9wWLNnmHpAkXnyDeRUYwbnFwetNP9rmv5iATUqCEFsD6O57aX4DxkIt1Ah Is4KhqFJdvvQ/BgQH79gZmiFUGjIU4xFaYlNMUkK+echjoBQGrjrXk/vBz5RXZrWbGXi 4CiQ== X-Received: by 10.140.34.206 with SMTP id l72mr431997qgl.0.1420128334678; Thu, 01 Jan 2015 08:05:34 -0800 (PST) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.140.29.7 with SMTP id a7ls6502192qga.87.gmail; Thu, 01 Jan 2015 08:05:34 -0800 (PST) X-Received: by 10.140.34.206 with SMTP id l72mr431996qgl.0.1420128334334; Thu, 01 Jan 2015 08:05:34 -0800 (PST) In-Reply-To: X-Original-Sender: jrosenthal-4GNroTWusrE@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.org gmane.text.pandoc:11649 Archived-At: ------=_Part_9157_305106692.1420128333877 Content-Type: multipart/alternative; boundary="----=_Part_9158_1884707055.1420128333877" ------=_Part_9158_1884707055.1420128333877 Content-Type: text/plain; charset=UTF-8 Yep -- a few major styles (headers, block quotes) have been addressed in the reader to make sure they work in different languages. As far as the question of user-defined paragraph styles go -- the main to-do now is parsing the style file for paragraph styles similarly to how we already do it for character styles. There are additional details and complications which make it a bit harder, but it's doable, and just waiting on developer time. Eventually, user-defined styles that inherit from some set of base semantic styles will work. (This is all in the reader.) What will probably never work is being able to guess that your user defined style for headers which is just big and bold is really for headers. In other words, paragraph styles will have to inherit from something semantic, and not just be be a collection of visual character styles. On Thursday, January 1, 2015 9:50:49 AM UTC-5, Ghlen Livid wrote: > > I think what you are referring to is related to > https://github.com/jgm/pandoc/issues/1607 and > https://github.com/jgm/pandoc/issues/1692, please see revelant > discussions. > Basically, docx is a mess. -- You received this message because you are subscribed to the Google Groups "pandoc-discuss" group. To unsubscribe from this group and stop receiving emails from it, send an email to pandoc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFF+G/Ez6ZCGd0@public.gmane.org To view this discussion on the web visit https://groups.google.com/d/msgid/pandoc-discuss/224dcd62-a503-40a9-bd4b-a7009abd559a%40googlegroups.com. For more options, visit https://groups.google.com/d/optout. ------=_Part_9158_1884707055.1420128333877 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Yep -- a few major styles (headers, block quotes) have bee= n addressed in the reader to make sure they work in different languages.
As far as the question of user-defined paragraph styles go= -- the main to-do now is parsing the style file for paragraph styles simil= arly to how we already do it for character styles. There are additional det= ails and complications which make it a bit harder, but it's doable, and jus= t waiting on developer time. Eventually, user-defined styles that inherit f= rom some set of base semantic styles will work. (This is all in the reader.= )

What will probably never work is being able to g= uess that your user defined style for headers which is just big and bold is= really for headers. In other words, paragraph styles will have to inherit = from something semantic, and not just be be a collection of visual characte= r styles.

On Thursday, January 1, 2015 9:50:49 AM UTC-5, Ghlen Livid= wrote:
I think what you are re= ferring to is related to https://github.com/jgm/pandoc/issues/= 1607 and https://github.com/jgm/pandoc/issues/1692, pl= ease see revelant discussions.=20
Basically, docx is a mess.

--
You received this message because you are subscribed to the Google Groups &= quot;pandoc-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to pand= oc-discuss+unsubscribe-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To post to this group, send email to pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org.
To view this discussion on the web visit https://groups.google.com/d/= msgid/pandoc-discuss/224dcd62-a503-40a9-bd4b-a7009abd559a%40googlegroups.co= m.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_9158_1884707055.1420128333877-- ------=_Part_9157_305106692.1420128333877--