From: Gwern Branwen
Newsgroups: gmane.text.pandoc
Subject: Re: pandoc as a linkchecker?
Date: Sat, 12 Sep 2020 15:35:05 -0400
To: pandoc-discuss

Which kinds of links?

Pandoc may chase some links in order to inline them ("linked scripts, stylesheets, images, and videos"), but those are not most links. Trying to check hyperlinks in full generality is quite complex and difficult (look at the complexity of what I use for dead-link finding, https://github.com/linkchecker/linkchecker ), and in some cases not possible.*

* Consider relative or absolute links on a website: if I link to '/About' on a gwern.net page, that is a valid link when deployed, but it will break for every linkchecking tool which doesn't assume that the link is relative to 'https://www.gwern.net'. How does Pandoc know that?

-- gwern
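
To make the '/About' case concrete, here is a minimal Python sketch. It is not how Pandoc or linkchecker work internally; `resolve_link` is a hypothetical helper, and the `base` argument stands for the deployment URL that a checker would have to be told, since the document itself does not record it. Only the standard library's `urllib.parse` is used.

```python
from urllib.parse import urljoin, urlsplit


def resolve_link(href, base=None):
    """Return an absolute URL for href, or None if it cannot be resolved.

    `base` is the URL the page is assumed to be deployed at. It is outside
    knowledge: nothing in the source document supplies it.
    """
    if urlsplit(href).scheme:
        # Already carries a scheme (e.g. 'https://...'), so it can be
        # checked without knowing where the page lives.
        return href
    if base is None:
        # Relative link and no deployment URL: there is nothing to check.
        return None
    # Resolve the relative reference against the assumed deployment URL.
    return urljoin(base, href)


# Without a base, the site-relative link cannot be turned into a URL.
print(resolve_link("/About"))
# With the deployment URL supplied, the same link becomes checkable.
print(resolve_link("/About", base="https://www.gwern.net/"))
# Fully qualified links need no extra information.
print(resolve_link("https://github.com/linkchecker/linkchecker"))
```

The point of the sketch is only that the second call needs information the first one lacks; actually fetching the resolved URL and interpreting the response is the part that takes a dedicated tool.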