From: Kjetil Flovild-Midtlie
Newsgroups: gmane.text.pandoc
Subject: Docx reader ; style picking algorithm
Date: Tue, 30 Dec 2014 13:48:17 -0800 (PST)
Message-ID: <4b2f3216-fc6c-4552-8bf0-3cd263ebc143@googlegroups.com>

Sometimes authors rename styles to locale variants or other non-standard names. When I tested, pandoc skipped these when sectioning the output.

Could the docx reader check the underlying style/class name if an element has a non-standard style name? Does Word even keep this 'parent class' info in the docx elements?

(I had a quick look at the docx reader source file. I have also been reading up on Haskell again like mad this Christmas, and sadly I realize I need more time "there", and more time reading some raw docx files...)

Kjetil
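
To make the question concrete, here is a minimal sketch of the kind of lookup being asked about. It is not pandoc's actual algorithm. It assumes that word/styles.xml records each style's parent in a w:basedOn element and its display name in w:name, and that a built-in heading carries a name of the form "heading N". The styles fragment, the style IDs, and the helper names load_styles and heading_level are all hypothetical, made up for illustration.

```python
import re
import xml.etree.ElementTree as ET

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}

# Hypothetical styles.xml fragment: a renamed style based on a heading style.
STYLES_XML = f"""
<w:styles xmlns:w="{W}">
  <w:style w:type="paragraph" w:styleId="Heading1">
    <w:name w:val="heading 1"/>
  </w:style>
  <w:style w:type="paragraph" w:styleId="MyChapterTitle">
    <w:name w:val="My Chapter Title"/>
    <w:basedOn w:val="Heading1"/>
  </w:style>
  <w:style w:type="paragraph" w:styleId="BodyText">
    <w:name w:val="Body Text"/>
  </w:style>
</w:styles>
"""


def load_styles(xml_text):
    """Map each styleId to (style name, parent styleId or None)."""
    root = ET.fromstring(xml_text)
    styles = {}
    for style in root.findall("w:style", NS):
        style_id = style.get(f"{{{W}}}styleId")
        name = style.find("w:name", NS)
        based_on = style.find("w:basedOn", NS)
        styles[style_id] = (
            name.get(f"{{{W}}}val") if name is not None else "",
            based_on.get(f"{{{W}}}val") if based_on is not None else None,
        )
    return styles


def heading_level(style_id, styles):
    """Walk the basedOn chain until a style named 'heading N' is found."""
    seen = set()  # guards against a cyclic basedOn chain
    while style_id in styles and style_id not in seen:
        seen.add(style_id)
        name, parent = styles[style_id]
        match = re.fullmatch(r"heading\s*(\d)", name, flags=re.IGNORECASE)
        if match:
            return int(match.group(1))
        style_id = parent
    return None  # no heading ancestor: treat as an ordinary paragraph


styles = load_styles(STYLES_XML)
print(heading_level("MyChapterTitle", styles))
print(heading_level("BodyText", styles))
```

With a lookup like this, a paragraph whose w:pStyle points at a renamed style could still be sectioned as a heading, provided the renamed style is based on a heading style in the first place.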