From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/26142 Path: news.gmane.io!.POSTED.blaine.gmane.org!not-for-mail From: Gwern Branwen Newsgroups: gmane.text.pandoc Subject: Re: pandoc as a linkchecker? Date: Mon, 14 Sep 2020 09:23:51 -0400 Message-ID: References: Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Injection-Info: ciao.gmane.io; posting-host="blaine.gmane.org:116.202.254.214"; logging-data="31530"; mail-complaints-to="usenet@ciao.gmane.io" To: pandoc-discuss Original-X-From: pandoc-discuss+bncBDFJXQMSYMIRBDG67X5AKGQEN2PFK5I-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Sep 14 15:24:34 2020 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane-mx.org Original-Received: from mail-pj1-f64.google.com ([209.85.216.64]) by ciao.gmane.io with esmtps (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1kHoTA-00082o-Sf for gtp-pandoc-discuss@m.gmane-mx.org; Mon, 14 Sep 2020 15:24:32 +0200 Original-Received: by mail-pj1-f64.google.com with SMTP id n19sf6263914pjt.1 for ; Mon, 14 Sep 2020 06:24:32 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1600089871; cv=pass; d=google.com; s=arc-20160816; b=SJBn43AYSi0FJg2b/Jc7U2Bw4KfvA3SDggWRN+RE0oKeqscdMO8UmxRBAvdznTrrTT rYhESEcm7UiqFVgZ0ZyrAbl/jpkvR3tlN40l0BDmfy1QAOSVrzslRObHmv+Z2HBXp2+i A06dSnY+HFHOBMtNLjcVb0Y06Op0qApJaFKYtIcp/5p/dGE0fca4xBwaHF5O7D1MS/tW mGv89HH4jOd+IdxVT4GYfdlnsn+AkKlDoAIZVC8WHIQXe4A8h6kK9GWC8PCbKFjZV0gE e55lH54eRViTMwco3wIYcOrGDJ9HbkEcTqWmLSVSPILz7u5eXgzb6VK7Nvwk9T5SF+I8 JZKg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:to:subject:message-id:date :from:in-reply-to:references:mime-version:sender:dkim-signature; bh=+EeTPwOhAcjXyPv4iuX4x0uS1ulUBd3LVZkFnQTz/PQ=; b=M7Ka9To1Ona9xLewOxpgEAoFl1w/EX9DpwBLWkTk+hyY8sZ1M7ODxINCqoLVSM3LhB Q6rKEFLdWov+wiLeEMILhGJlV0+xm23i6njnKciwoes/Zf9usRme+kq3Ue/5xkwL6Pwn zdSzAQlgbYf9xg2tbwmj3JdZvJziZ+FCtg7Q0wmi363NGkS/FHBDiaPx+UJD0BHVszEO H6lunZZ+F0Gs77IA/m4XsU/741HHb3mlHrSxsNYz5HR4sFNl4CqRrR31pXA6B1GFDwZ7 lt35w8TNXpALWUZOWYxxZ5PnWi5ku9WVeV/ym2ac8BnOyGjaOhKGx4TtDWx3tkEWGjUD 5FMg== ARC-Authentication-Results: i=2; gmr-mx.google.com; spf=pass (google.com: domain of gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 209.85.166.174 as permitted sender) smtp.mailfrom=gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:mime-version:references:in-reply-to:from:date:message-id :subject:to:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=+EeTPwOhAcjXyPv4iuX4x0uS1ulUBd3LVZkFnQTz/PQ=; b=BNw+ZtbY6c76ZnneBoq4SEStLCzmo4uj6hTll/Wx58do3iP+HRL6B4eRj+x4soQkTg i/S8eDli7ETR2FWeUbsw8NAstJoXUvaaoQTi+2BqkZytf6+TRJsndkBlNt4fmRjnmuIs 4UnY0ful4UQrtQm0/7ep4SFN4/3YMVBCf3/WiSo5cXbYzvss9xW67wtRco+NcM3zHd+e wuGtreipjmjt4G6wCzpIH6/AZmN5uQc46e3iYCtNoMicZbSYBl/5Whlv60n619EA0HBq f7GFxEiwx/jWacxGgMhs8ahj7Gc//NXeprDbA8BlrCreK2SX2RVbL8uNm6A3b4AeglxD 7SAQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:mime-version:references:in-reply-to:from :date:message-id:subject:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=+EeTPwOhAcjXyPv4iuX4x0uS1ulUBd3LVZkFnQTz/PQ=; b=bSzcNAw4gfTBXOoxc/UXn6OYDWoo/EElnQJxU+dw5si8fCizWxyr9t0oaP/5hmbWN5 iJlsYs5klq6Lt7etzYaPJkvMFBeSMbdjEfAhVO/9lDq1TS+nuaatfUsRuMTc8XoXKPp9 9j/FOVoCsoCdxWK2lIqTjmtsTs5pmje+qLWUzD7OVOe8Uo0APi9E9BWbGYcCP+JpdSI3 rAb2wcfpjUWHvWm+KA2YMg6gOL65DldVsrG0u5/GQ99wvknaL7CBFfLaW+JGiNqWR/Wv FRX9Y4hDpJ5ZVu673+/ouQeXAK8PlZvWteTi8WoB4rGNZBIavdKFAlxJr6AJPC0wUItP xTyA== Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org X-Gm-Message-State: AOAM5311A98Yi3CfFP9ulIJJQ8zbGKyC1Jig/spsDfASrwXFFdY+xuaK GghgtJRmj5J0hIbVHFQGDsY= X-Google-Smtp-Source: ABdhPJx7PkIa/DjYSBLm9LR2Z+2BX2/IgKCLI9ld5kHRddkE29zfGTzaX781G9nIlKTZ7AqL70Z3cQ== X-Received: by 2002:a63:36cc:: with SMTP id d195mr11207482pga.426.1600089871654; Mon, 14 Sep 2020 06:24:31 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 2002:a17:902:8d89:: with SMTP id v9ls4936588plo.4.gmail; Mon, 14 Sep 2020 06:24:27 -0700 (PDT) X-Received: by 2002:a17:90a:4ec4:: with SMTP id v4mr13756986pjl.62.1600089867685; Mon, 14 Sep 2020 06:24:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1600089867; cv=none; d=google.com; s=arc-20160816; b=o4ojq9csBQ4+yJlHGLpcTtL1mKi+SEg6RSZQrpRG7wN0RomJ+4kX/IZYqg3v0jOXak A9pdVhn8zRQkPumurd6OHk1bBg0nsqUetGdV0nDrWqxqsP/HjLeRARLce/rtudHnp9HR 2JdGhO/r33lhQpdFEVIIAsqiUW10Do2ri27lMNZQIrsoyYd7uAfnlzcTxJrd701fP5jA HbW8DHmy30rXEK6RCZkF1eUisCpngHn6cOG3RX/ueS8NhCdEyfmnqNKS2oQLGlfYt4ZR 26wnDKxGCOIC8n4oMaObv9qSGYQnyuMz0hdmGDcolm2rONgqlbPx+B5PW2axbO65coQu SqnA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=to:subject:message-id:date:from:in-reply-to:references:mime-version; bh=/4PQNkyaXMKLjqtlwg7CiaCBjtJk6xrr+QW4G8hMYqw=; b=Oj0FckAvfL9umW//+437RnPZsLbWQ65B3kbVrMfAOl4V/t94AwXHzdfsrAFHRJLaBd PosKcKiriGlidzMDVrRBlbpj22NRz/dGVQ4BNIeP9sIb3u/oj8bXVKKhchIBmts+T/KE +sc4RUTVNxUr2WKuoZq+0ZHTOUik2pki2Dw696XJadHIyKKCrHWV6SK/np/+QPQ2NiqX Sro4RjHNlfo2lEJpfD2DbvSRNwI73AKwfhWml2pqHAEw1fGKy/R2nnfur3sCpTDeDt6U izf+/bpm3iMg1U0vxhuOu5VYHhHeYOM4MTledQp0cAOrFroZfTm68ahnwj8REiw83FP6 Axrw== ARC-Authentication-Results: i=1; gmr-mx.google.com; spf=pass (google.com: domain of gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 209.85.166.174 as permitted sender) smtp.mailfrom=gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Original-Received: from mail-il1-f174.google.com (mail-il1-f174.google.com. [209.85.166.174]) by gmr-mx.google.com with ESMTPS id lj12si681447pjb.0.2020.09.14.06.24.27 for (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Mon, 14 Sep 2020 06:24:27 -0700 (PDT) Received-SPF: pass (google.com: domain of gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 209.85.166.174 as permitted sender) client-ip=209.85.166.174; Original-Received: by mail-il1-f174.google.com with SMTP id s88so6386426ilb.6 for ; Mon, 14 Sep 2020 06:24:27 -0700 (PDT) X-Received: by 2002:a05:6e02:138c:: with SMTP id d12mr9380421ilo.97.1600089867004; Mon, 14 Sep 2020 06:24:27 -0700 (PDT) In-Reply-To: X-Original-Sender: gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org designates 209.85.166.174 as permitted sender) smtp.mailfrom=gwern0-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Xref: news.gmane.io gmane.text.pandoc:26142 Archived-At: You could just work around it by scripting. Use a linkchecker to get broken URLs, and then search each broken URL in the original file. Even grep has `--line-number`. -- gwern https://www.gwern.net