From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/28149 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: "S. Manning" Newsgroups: gmane.text.pandoc Subject: Side Effects from HTML to HTML conversion Date: Mon, 12 Apr 2021 23:57:47 -0700 Message-ID: <40bf250d3cff42be22088054dc3fa618@ageofdatini.info> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8"; format=flowed Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="28217"; mail-complaints-to="usenet@ciao.gmane.io" User-Agent: Roundcube Webmail/1.4.11 To: Pandoc discuss Original-X-From: pandoc-discuss+bncBDJ5LMOP34CRB3MB2WBQMGQERZZMD2I-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Tue Apr 13 08:57:52 2021 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-lf1-f63.google.com ([209.85.167.63]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1lWCzf-0007Dy-VW for gtp-pandoc-discuss@m.gmane-mx.org; Tue, 13 Apr 2021 08:57:51 +0200 Original-Received: by mail-lf1-f63.google.com with SMTP id j26sf5308105lfm.23 for ; Mon, 12 Apr 2021 23:57:51 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1618297071; cv=pass; d=google.com; s=arc-20160816; b=KQEcc9hq8Kjv3OYOJwO1Eq3vhwrl7pSEUIZ7/fTxGQdIVbVspjEhziEwvRbontNMkS 9c6hP8rY3sCtQc6WLSEY+t6BHA3W90ulKhyvJqtWedry8LU9VbrnpMCoY3/MYuMto08m GZhzOyQOFIlR4rNHD9VNbpYDsDjQT7gWjiV+AesN0KgabhEEcFhoEosjbdEvTn4mILqL vpKBNPVIBJLNF2gfmwPuTtnO3OZqPHj1W1hsoPMvmSFPWnp9CBIxReoNSpLzpgu1dB8s DroSj5tCWD5CwlEF5FEAV4ITTKbqmhMSswmG9yBDoNdEVeehaGBHMCYQqEuGZOrVV3BO FAtw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:message-id:user-agent :subject:to:from:date:mime-version:sender:dkim-signature; bh=wFQrxlO9VA8HVrcY4oZPwVVO4QdocUP19BqGsdgVvxw=; b=vQm0Am4c5kU1RLucgIQ5xI2lEDDL6+yRpvzt2JAu0iSJEltjyXxpfcgooymFaHiviu 2cHiIhV4oZQ70VUNO34oMa335G5pVXl6UbcmZRwUGM+vQ99KpAi/WdiFpjcYPhlB7isP FKArPYAB/d4LhT8lMc5fOFpaBzqbt+inAkgKeEX4NfsSCU80TRjK9Gb7Tp1D9xi+3dLW Jo1Ua1NFkZyPAdi22QDJNnnJPzaCryGn6ZTTcDbvydTwJb8TqHHeqn7GLzpaqfd5nj1y WWMID4w24agRRJnlBG64Q2fkuWLtNQ0WwXKxdOGcw7bASQG9B9yfKhnmi3Bwv4hHxAIN yJeQ== ARC-Authentication-Results: i=2; gmr-mx.google.com; spf=neutral (google.com: 213.145.224.20 is neither permitted nor denied by best guess record for domain of scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org) smtp.mailfrom=scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:mime-version:date:from:to:subject:user-agent:message-id :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=wFQrxlO9VA8HVrcY4oZPwVVO4QdocUP19BqGsdgVvxw=; b=fEszAot1L+tzxYh/yQD6LKPxJoBNLqVS0wR8/5UyBYvhOZGKLisxAqig+6Y53BO7z6 13hQP0P5tNz8NARXiyWth4U5MFyGDxOE6mdExCVF2ODR2chnYlVnmoZk3UUSKkZg6JTK PSuOIJNA/k6RSuLM6fCZFScNZf9BEyZw07f8yHuM07pLRy6gD3eKOHZZ7JVZjMTlBFnc Mh6AklzJw1EhIedFngDV7BnT4c4C/PMJihBt2Mf/mOMB44LDgqr/+O7OYxD/CVtE+fA7 3mbSHLuNzQ+L4S77zAyWZTVfdXXaA5oRM+rHqE1TSMHERMEefkwg1DcG0ZMTCnkzS3tP 0Cyw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:mime-version:date:from:to:subject :user-agent:message-id:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=wFQrxlO9VA8HVrcY4oZPwVVO4QdocUP19BqGsdgVvxw=; b=UZLQwt3E+sq6i2fSvAN+QvFq/PZzRjuD2x5AOrfT0R0rqLJ0tTZFDz9RKJgcqS2HXR 5p6aw7EuoLPQkxvxONc7JPHoAKx6sYjZkKfi4AvBClsZkr7YAEKOVe5RvFC5ay1VK1l1 JAZf2MS5aDhNlIgBHwURh+AwQxBJjig0wzLIj3ShVtsKIU+CxU4bQPKXgaY3ndjnfau+ Dq3eI7lceKMV7GP7z0B1mKVx39CcKz/DP8dkp1ATerSiMKKKYGsiMtvmPJ/GazduxD25 RguYpURDqEiGMg12EM6UjsYgA3Up17aNLMYj5jPJnlweIoeAlRiYlZVaVyOC8FUrcgq1 79AQ== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM532TzkalpqsDtLH4khWo8nFfHyoxAKhvjVcDP5TbpXveVzT4RWnt DVY1LVxSZGtKcQ3Imy3RZxw= X-Google-Smtp-Source: ABdhPJwQFg1oivigS9zhnfWOAZf1d/BmuV2hzjRiqpo9Wdn8W6jsYlirJDCIT5nkieHxRTOKP0lxGg== X-Received: by 2002:ac2:418f:: with SMTP id z15mr21421118lfh.2.1618297071562; Mon, 12 Apr 2021 23:57:51 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a05:6512:6c2:: with SMTP id u2ls2065121lff.3.gmail; Mon, 12 Apr 2021 23:57:49 -0700 (PDT) X-Received: by 2002:ac2:538d:: with SMTP id g13mr13928683lfh.661.1618297069036; Mon, 12 Apr 2021 23:57:49 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1618297069; cv=none; d=google.com; s=arc-20160816; b=sTYYANPiy0ZrV/dgKdS5YE6s7VhxBNymKMkAALUYbbQWg8OwyPseaQc+MtFI0/a3T9 5QOKBG3qHCKdyeXvns+Yyw+2kqOFhtlsgKE4zoVnC5gNQrdEuLKqF7Rbj62bNaB9X1cN UeQbn+NZfax+qws30piH/Yw7QWJFjHl0idUrkvrapkssPRFTPNOx61BteCt4FfJxokFs ksIGvpc3riWLILw5pyDcO/R3VC+unblBXZxeXuQdLVlqVeu3gPb5Z/26vS4mdd9ocIHK ODdSVzeBMN+KCgiQbuCvT0WbxXmIvk2D+2Y6/M2/Rv/KRlSaz/5zJXFUT+3tKuuICg8x s5ww== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:message-id:user-agent:subject:to:from :date:mime-version; bh=3fOdIPcI3NNFrya8s9Zy1i4tRZ/GZ32w9tAMuBhBLG0=; b=no5PXh7nNMSf5kldYtFHIfpgWvm1zGZkgSozLhDn5QJN6ylYw0IcqLdMNluiVmzHrE NvU2lCs+14QJDoRqBFT0FuvqLnyzMwbXiuWiDA+RvSmZ1LNKiNTjdUC+FjkCwGF5pF4S M6FUwjeHQkMAeaoQ/0NlTcpjzmAaTMiWVFt3bR5BhTzxm3IWV0MDq7LzQxDWmfL56wNg A7GfhEhnbueGVt8/qrtqpJ70m1AlujUMNqWw6oAWf9XyFbjFEcqY8CmganBMa0m7igQW JarMGF0/u2svJXJWrtzyzq2w0KnyWIEh+SFMpXnSgClLu3qBqm8DkZJKqp4I8Bp1iW4j cQ1g== ARC-Authentication-Results: i=1; gmr-mx.google.com; spf=neutral (google.com: 213.145.224.20 is neither permitted nor denied by best guess record for domain of scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org) smtp.mailfrom=scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org Original-Received: from ssl01.alldomains.hosting (ssl01.alldomains.hosting. [213.145.224.20]) by gmr-mx.google.com with ESMTPS id n13si902892lfi.5.2021.04.12.23.57.48 for (version=TLS1_2 cipher=ECDHE-ECDSA-AES128-GCM-SHA256 bits=128/128); Mon, 12 Apr 2021 23:57:48 -0700 (PDT) Received-SPF: neutral (google.com: 213.145.224.20 is neither permitted nor denied by best guess record for domain of scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org) client-ip=213.145.224.20; Original-Received: (qmail 977697 invoked by uid 7799); 13 Apr 2021 08:57:48 +0200 Original-Received: by simscan 1.4.0 ppid: 977675, pid: 977693, t: 0.0524s scanners: clamav: 0.101.5/m:59/d:26138 Original-Received: from ssl01.alldomains.hosting (scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org@213.145.224.20) by ssl01.alldomains.hosting with SMTP [49750]; 13 Apr 2021 08:57:48 +0200 X-Sender: scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org X-Original-Sender: scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: 213.145.224.20 is neither permitted nor denied by best guess record for domain of scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org) smtp.mailfrom=scriptor-aFO/2INALiozYggVrLCuDg@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:28149 Archived-At: I seem to still be getting side effects when I take HTML as input and output to HTML (so all I use pandoc for is to take some variables and wrap the contents in header and footer code with the variables inserted in the appropriate places). Passages like the following in the input:
a mysterious machine 
sticking out of a cardboard shipping box
become like so in the output:
One of this proud 
company's most famous products, the type 37 widget ...
I lose the tag and I lose the contents of the alt attribute (good alt text is not the same as a good caption! The caption tells you how to interpret the picture, the alt text tells you what the picture would be if you could see it). Are there any ways of avoiding these side effects? If any of you can suggest a more appropriate tool than pandoc for my use case (take a HTML fragment and some metadata, wrap the fragment in header and footer text with some values inserted from the metadata to create a valid HTML file) I will consider it.