From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <musl-return-20550-ml=inbox.vuxu.org@lists.openwall.com>
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org
X-Spam-Level: 
X-Spam-Status: No, score=-2.9 required=5.0 tests=DKIM_ADSP_CUSTOM_MED,
	DKIM_INVALID,DKIM_SIGNED,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM,
	HEADER_FROM_DIFFERENT_DOMAINS,HTML_MESSAGE,MAILING_LIST_MULTI,
	NORMAL_HTTP_TO_IP,NUMERIC_HTTP_ADDR,RCVD_IN_DNSWL_MED,
	RCVD_IN_MSPIKE_H4,RCVD_IN_MSPIKE_WL,T_SCC_BODY_TEXT_LINE,WEIRD_PORT
	autolearn=ham autolearn_force=no version=3.4.4
Received: from second.openwall.net (second.openwall.net [193.110.157.125])
	by inbox.vuxu.org (Postfix) with SMTP id 4CEF626390
	for <ml@inbox.vuxu.org>; Fri,  8 Mar 2024 03:07:07 +0100 (CET)
Received: (qmail 7562 invoked by uid 550); 8 Mar 2024 02:03:06 -0000
Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm
Precedence: bulk
List-Post: <mailto:musl@lists.openwall.com>
List-Help: <mailto:musl-help@lists.openwall.com>
List-Unsubscribe: <mailto:musl-unsubscribe@lists.openwall.com>
List-Subscribe: <mailto:musl-subscribe@lists.openwall.com>
List-ID: <musl.lists.openwall.com>
Reply-To: musl@lists.openwall.com
Received: (qmail 7529 invoked from network); 8 Mar 2024 02:03:06 -0000
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=gmail.com; s=20230601; t=1709863615; x=1710468415; darn=lists.openwall.com;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:from:to:cc:subject:date:message-id:reply-to;
        bh=YPUQGyBk5Ad/+DwTJFm+p9c/ipRxvqVDm8wfQtWjxgs=;
        b=LUIPyM1fq970ggICHx7Mjo9teXVqjWQKnnMdcJIsS1CGd4PMhVMBQgNldDbs4D9kJv
         nOR6BKk1SG61Jt4EcRwBLuLFPIlG17YuYkW9wyh8WdwqYkJnyxXzRIMDbhvNUys/py6j
         JuG+8pj8F4uocp+dhZlmVCCZ32tBNLB5qJhxKeWE4HKmD4D7p76gxMXWbEtKgkgbyJyD
         1ghifFywWJBN4AJjHYjjhaDuYr5GW+Kkq5tX4tn7ZLSv1z60ETbVgtrvHgfFIiuZPC2C
         tOzAn4bbUgB8dOuSvH9eSdqs46U0x3iydeSLlhwyTqMd3bk30e8ba8xfKs3178cH+y4k
         lVXg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20230601; t=1709863615; x=1710468415;
        h=cc:to:subject:message-id:date:from:in-reply-to:references
         :mime-version:x-gm-message-state:from:to:cc:subject:date:message-id
         :reply-to;
        bh=YPUQGyBk5Ad/+DwTJFm+p9c/ipRxvqVDm8wfQtWjxgs=;
        b=Gqouww0uCct7jKQaxF5AhJcIGNSQVnldGvGgAPs+infS6wi82e+uDRMW1KMQtJD7EY
         MrUwj98QkND/71x4XeWYW+yotjIY4XvRFQoh/FouP5+xxmIGuIfHL2AVJWe38jJxesiF
         q9kBea8JJ5q5c2Rv+t8gltST8bD4l3WANlLtfdZQMhE0H5Pr2ZZhm+cn2s17nCQcsrgD
         ZSiW9YH4HmqZ57UgAg+NC0YiqVwb+EwJIKXPfYKFVmSqYjj0p/z4xZs7EYFXMNm88XIY
         Xe9bEcKBKDWdhUK+LhRiyg/YU9WvXGRPh0mFbURwhycBw9YXXhgI57KpowMyaAJ1ypII
         HrKQ==
X-Gm-Message-State: AOJu0YyB+oB15ONznUTarwBozCK34MWvOhvU59ELczhWYcGo4Li577b7
	aksOt/QmpLzq7Q4h75m48Fh5I5nu1EoNRZNQqcPd7jrTBWs4knHiC0TFmF4GH7q0wwSWGbMYWC8
	z50AZyGYvkIMxcK5rmBRfgLY7pTP5xyo9BPI=
X-Google-Smtp-Source: AGHT+IGqEMg/0EAegeUfWziApYAQAHa4GrTuEkzgnZAEzTQBSZwRLWuDH+Vl7Zet7pe9lD6jw80oiVJiNprrLmlicUA=
X-Received: by 2002:a17:906:4558:b0:a40:189d:c5b5 with SMTP id
 s24-20020a170906455800b00a40189dc5b5mr13437653ejq.38.1709863614777; Thu, 07
 Mar 2024 18:06:54 -0800 (PST)
MIME-Version: 1.0
References: <CAPDSy+52ffN_Rb8JsL8=F5oeTqGVWFcDVk0F-W_H8DvsWY8RCw@mail.gmail.com>
 <20240306161544.GH4163@brightrain.aerifal.cx> <CAPDSy+7_=Dm4A=ub37QZNLD2hxY-5yxJX9mJTqvn_Hhha4=T=A@mail.gmail.com>
 <20240307024316.GI4163@brightrain.aerifal.cx> <CAPDSy+4m+m5xjc63t-3Jnt-bdA8BQKHt60fTMnx1LgCyXnO2yA@mail.gmail.com>
 <20240308000818.GJ4163@brightrain.aerifal.cx> <CAPDSy+5wvsmCSxrz_6W0kEMT=xR_SoasQKFuKNV+TZn+JMy00g@mail.gmail.com>
In-Reply-To: <CAPDSy+5wvsmCSxrz_6W0kEMT=xR_SoasQKFuKNV+TZn+JMy00g@mail.gmail.com>
From: David Schinazi <dschinazi.ietf@gmail.com>
Date: Thu, 7 Mar 2024 18:06:43 -0800
Message-ID: <CAPDSy+7KUJ6EOhONeHww9AhkziBovdjYMOa-x4knsoC2RoC1yw@mail.gmail.com>
To: Rich Felker <dalias@libc.org>
Cc: musl@lists.openwall.com
Content-Type: multipart/alternative; boundary="0000000000005f942606131ca5c6"
Subject: Re: [musl] mDNS in musl

--0000000000005f942606131ca5c6
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Oh, one more thing: we might be able to use sendmsg and IP_PKTINFO to
select the outgoing interface for each send call instead of binding and
requiring multiple sockets.
David

On Thu, Mar 7, 2024 at 5:30=E2=80=AFPM David Schinazi <dschinazi.ietf@gmail=
.com>
wrote:

>
>
> On Thu, Mar 7, 2024 at 4:08=E2=80=AFPM Rich Felker <dalias@libc.org> wrot=
e:
>
>> On Thu, Mar 07, 2024 at 02:50:53PM -0800, David Schinazi wrote:
>> > On Wed, Mar 6, 2024 at 6:42=E2=80=AFPM Rich Felker <dalias@libc.org> w=
rote:
>> >
>> > > On Wed, Mar 06, 2024 at 04:17:44PM -0800, David Schinazi wrote:
>> > > > As Jeffrey points out, when the IETF decided to standardize mDNS,
>> they
>> > > > published it (RFC 6762) at the same time as the Special-Use Domain
>> > > Registry
>> > > > (RFC 6761) which created a process for reserving domain names for
>> custom
>> > > > purposes, and ".local" was one of the initial entries into that
>> registry.
>> > > > The UTF-8 vs punycode issue when it comes to mDNS and DNS is
>> somewhat of
>> > > a
>> > > > mess. It was discussed in Section 16 of RFC 6762 but at the end of
>> the
>> > > day
>> > > > punycode won. Even Apple's implementation of getaddrinfo will
>> perform
>> > > > punycode conversion for .local instead of sending the UTF-8. So in
>> > > practice
>> > > > you wouldn't need to special-case anything here.
>> > >
>> > > OK, these are both really good news!
>> > >
>> > > > There's also very much a policy matter of what "locally over
>> > > > > multicast" means (what the user wants it to mean). Which
>> interfaces
>> > > > > should be queried? Wired and wireless ethernet? VPN links or oth=
er
>> > > > > sorts of tunnels? Just one local interface (which one to
>> prioritize)
>> > > > > or all of them? Only if the network is "trusted"? Etc.
>> > > > >
>> > > >
>> > > > You're absolutely right. Most mDNS systems try all non-loopback
>> non-p2p
>> > > > multicast-supporting interfaces, but sending to the default route
>> > > interface
>> > > > would be a good start, more on that below.
>> > >
>> > > This is really one thing that suggests a need for configurability
>> > > outside of what libc might be able to offer. With normal DNS lookups=
,
>> > > they're something you can block off and prevent from going to the
>> > > network at all by policy (and in fact they don't go past the loopbac=
k
>> > > by default, in the absence of a resolv.conf file). Adding mDNS that'=
s
>> > > on-by-default and not configurable would make a vector for network
>> > > traffic being generated that's probably not expected and that could =
be
>> > > a privacy leak.
>> > >
>> >
>> > Totally agree. I was thinking through this both in terms of RFCs and i=
n
>> > terms of minimal code changes, and had a potential idea. Conceptually,
>> > sending DNS to localhost is musl's IPC mechanism to a more feature-ric=
h
>> > resolver running in user-space. So when that's happening, we don't wan=
t
>> to
>> > mess with it because that could cause a privacy leak. Conversely, when
>> > there's a non-loopback IP configured in resolv.conf, then musl acts as=
 a
>> > DNS stub resolver and the server in resolv.conf acts as a DNS recursiv=
e
>> > resolver. In that scenario, sending the .local query over DNS to that
>> other
>> > host violates the RFCs. This allows us to treat the configured resolve=
r
>> > address as an implicit configuration mechanism that allows us to
>> > selectively enable this without impacting anyone doing their own DNS
>> > locally.
>>
>> This sounds like an odd overloading of one thing to have a very
>> different meaning, and would break builtin mDNS for anyone doing
>> DNSSEC right (which requires validating nameserver on localhost).
>> Inventing a knob that's an overload of an existing knob is still
>> inventing a knob, just worse.
>>
>
> Sorry, I was suggesting the other way around: to only enable the mDNS mod=
e
> if resolver !=3D 127.0.0.1. But on the topic of DNSSEC, that doesn't real=
ly
> make sense in the context of mDNS because the names aren't globally uniqu=
e
> and signed. In theory you could exchange DNSSEC keys out of band and use
> DNSSEC with mDNS, but I've never heard of anyone doing that. At that poin=
t
> people exchange TLS certificates out of band and use mTLS. But overall I
> can't argue that overloading configs to mean multiple things is janky :-)
>
> > > > When you do that, how do you control which interface(s) it goes ove=
r?
>> > > > > I think that's an important missing ingredient.
>> > > >
>> > > > You're absolutely right. In IPv4, sending to a link-local multicas=
t
>> > > address
>> > > > like this will send it over the IPv4 default route interface. In
>> IPv6,
>> > > the
>> > > > interface needs to be specified in the scope_id. So we'd need to
>> pull
>> > > that
>> > > > out of the kernel with rtnetlink.
>> > >
>> > > There's already code to enumerate interfaces, but it's a decent bit =
of
>> > > additional machinery to pull in as a dep for the stub resolver,
>> >
>> >
>> > Yeah we'd need lookup_name.c to include netlink.h - it's not huge
>> though,
>> > netlink.c is 50 lines long and statically linked anyway right?
>>
>> I was thinking in terms of using if_nameindex or something, but indeed
>> that's not desirable because it's allocating. So it looks like it
>> wouldn't share code but use netlink.c directly if it were done this
>> way.
>>
>> BTW if there's a legacy ioctl that tells you the number of interfaces
>> (scope_ids), it sems like you could just iterate over the whole
>> numeric range without actually doing netlink enumeration.
>>
>
> That would also work. The main limitation I was working around was that
> you can only pass around MAXNS (3) name servers around without making mor=
e
> changes.
>
> > > and
>> > > it's not clear how to do it properly for IPv4 (do scope ids work wit=
h
>> > > v4-mapped addresses by any chance?)
>> > >
>> >
>> > Scope IDs unfortunately don't work for IPv4. There's the SO_BINDTODEVI=
CE
>> > socket option, but that requires elevated privileges. For IPv4 I'd jus=
t
>> use
>> > the default route interface.
>>
>> But the default route interface is almost surely *not* the LAN where
>> you expect .local things to live except in the case where there is
>> only one interface. If you have a network that's segmented into
>> separate LAN and outgoing interfaces, the LAN, not the route to the
>> public internet, is where you would want mDNS going.
>>
>
> In the case of a router, definitely. In the case of most end hosts or VMs
> though, they often have only one or two routable interfaces, and the
> default route is also the LAN.
>
> With that said, SO_BINDTODEVICE is not the standard way to do this,
>> and the correct/standard way doesn't need root. What it does need is
>> binding to the local address on each device, which is still rather
>> undesirable because it means you need N sockets for N interfaces,
>> rather than one socket that can send/receive all addresses.
>>
>
> Oh you're absolutely right, I knew there was a non-privileged way to do
> this but couldn't remember it earlier.
>
> This is giving me an idea though: we could use the "connect UDP socket to
> get a route lookup" trick. Let's say we're configured with a nameserver
> that's not 127.0.0.1 (which is the case where I'd like to enable this)
> let's say the nameserver is set to 192.0.2.33, then today foobar.local
> would be sent to 192.0.2.33 over whichever interface has a route to it (i=
n
> most cases the default interface, but not always). We could open an
> AF_INET/SOCK_DGRAM socket, connect it to 192.0.2.33:53, and then
> use getsockname to get the local address - we then close that socket. We
> can then create a new socket, bind it to that local address. That would
> ensure that we send the mDNS traffic on the same interface where we would
> have sent the unicast query. Downside is that since all queries share the
> same socket, we'd bind everything to the interface of the first resolver,
> or need multiple sockets.
>
> You answered for v4, but are you sure scope ids don't work for
>> *v4-mapped*? That is, IPv6 addresses of the form
>> ::ffff:aaa.bbb.ccc.ddd. I guess I could check this. I'm not very
>> hopeful, but it would be excellent if this worked to send v4 multicast
>> to a particular interface.
>>
>
> Huh I hadn't thought of that, worth a try? RFC 4007 doesn't really allow
> using scope IDs for globally routable addresses but I'm not sure if Linux
> does.
>
> > Another issue you haven't mentioned: how does TCP fallback work with
>> > > mDNS? Or are answers too large for standard UDP replies just illegal=
?
>> > >
>> >
>> > Good point, I hadn't thought of that. That handling for mDNS is define=
d
>> in
>> > [1]. In the ephemeral query mode that we'd use here, it works the same
>> as
>> > for regular DNS: when you receive a response with the TC bit, retry th=
e
>> > query with TCP. The slight difference is that you send the TCP to the
>> > address you got the response from (not to the multicast address that y=
ou
>> > sent the original query to). From looking at the musl code, we'd need =
a
>> > small tweak to __res_msend_rc() to use that address. Luckily that code
>> > already looks at the sender address so we don't need any additional
>> calls
>> > to get it.
>>
>> Yes, that's what I figured you might do. I guess that works reasonably
>> well.
>>
>> > > > Reason for that is that that is the most generic way to support an=
y
>> > > > > other name service besides DNS. It avoids the dependency on
>> dynamic
>> > > > > loading that something like glibc's nsswitch would create, and
>> would
>> > > > > avoid having multiple backends in libc. I really don't think
>> anyone
>> > > > > wants to open that particular door. Once mDNS is in there,
>> someone will
>> > > > > add NetBIOS, just you wait.
>> > > >
>> > > >
>> > > > I'm definitely supportive of the slippery slope argument, but I
>> think
>> > > > there's still a real line between mDNS and NetBIOS. mDNS uses a
>> different
>> > > > transport but lives inside the DNS namespace, whereas NetBIOS is
>> really
>> > > its
>> > > > own thing - NetBIOS names aren't valid DNS hostnames.
>> > > >
>> > > > Let me know what you think of the above. If you think of mDNS as
>> its own
>> > > > beast then I can see how including it wouldn't really make sense.
>> But if
>> > > > you see it as an actual part of the DNS, then it might be worth a
>> small
>> > > > code change :-)
>> > >
>> > > I'm not worried about slippery slopes to NetBIOS. :-P I am concerned
>> > > about unwanted network traffic that can't be suppressed, privacy
>> > > leaks, inventing new configuration knobs, potentially pulling in mor=
e
>> > > code & more fragility, getting stuck supporting something that turns
>> > > out to have hidden problems we haven't thought about, etc.
>> > >
>> >
>> > Those are great reasons, and I totally agree with those goals. If we
>> scope
>> > the problem down with the details higher up in this email, we have a
>> way to
>> > turn this off (set the resolver to localhost), we avoid privacy leaks =
in
>> > cases where the traffic wasn't going out in the first place, we don't
>> have
>> > to add more configuration knobs because we're reusing an existing one,
>> and
>>
>> As mentioned above, I don't think "reusing an existing one" is an
>> improvement.
>>
>
> Fair, my goal was minimizing change size, but that's not the only goal.
>
> > the amount of added code would be quite small. Limiting things to the
>> > default interface isn't a full multi-network solution, but for those I
>> > think it makes more sense to recommend running your own resolver on
>> > loopback (you'd need elevated privileges to make this work fully
>> anyway).
>> > Coding wise, I think this would be pretty robust. The only breakage I
>> > foresee is cases where someone built a custom resolver that runs on a
>> > different machine and somehow handles .local differently than what the
>> RFCs
>> > say. That config sounds like a bad idea, and a violation of the RFCs,
>> but
>> > that doesn't mean there isn't someone somewhere who's doing it. So
>> there's
>> > a non-zero risk there. But to me that's manageable risk.
>> >
>> > What do you think?
>>
>> I think a more reasonable approach might be requiring an explicit knob
>> to enable mDNS, in the form of an options field like ndots, timeout,
>> retries, etc. in resolv.conf. This ensures that it doesn't become
>> attack surface/change-of-behavior in network environments where peers
>> are not supposed to be able to define network names.
>>
>
> That would work. I'm not sure who maintains the list of options though.
> From a quick search it looks like they came out of 4.3BSD like many
> networking features, but it's unclear if POSIX owns it or just no one doe=
s
> (which would be the same, POSIX is not around as a standard body any more=
).
>
> One further advantage of such an approach is that it could also solve
>> the "which interface(s)" problem by letting the answer just be
>> "whichever one(s) the user configured" (with the default list being
>> empty). That way we wouldn't even need netlink, just if_nametoindex to
>> convert interface name strings to scope ids, or alternatively (does
>> this work for v6 in the absence of an explicit scope_id?) one or more
>> local addresses to bind and send from.
>>
>
> I definitely would avoid putting local addresses in the config, because i=
t
> would break for any non-static addresses like DHCP or v6 RAs. The interfa=
ce
> name would require walking the getifaddrs list to map it to a correspondi=
ng
> source address but it would work if the interface name is stable.
>
> I guess we're looking at two ways to go about this:
>
> (1) the simpler but less clean option - where we key off of "resolver !=
=3D
> 127.0.0.1" - very limited code size change, but only handles a small subs=
et
> of scenarios
>
> (2) the cleaner option that involves more work - new config option, need
> multiple sockets - would be cleaner design-wise, but would change quite a
> bit more code
>
> Another aspect to consider is the fact that in a lot of cases resolv.conf
> is overwritten by various components like NetworkManager, so we'd need to
> modify them to also understand the option.
>
> I'm always in favor of doing the right thing, unless the right thing ends
> up being so much effort that it doesn't happen. Then I'm a fan of doing t=
he
> easy thing ;-)
>
> David
>

--0000000000005f942606131ca5c6
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">Oh, one more thing: we might be able to use sendmsg and IP=
_PKTINFO to select the outgoing interface for each send call instead of bin=
ding and requiring multiple sockets.<div>David</div></div><br><div class=3D=
"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Thu, Mar 7, 2024 at =
5:30=E2=80=AFPM David Schinazi &lt;<a href=3D"mailto:dschinazi.ietf@gmail.c=
om">dschinazi.ietf@gmail.com</a>&gt; wrote:<br></div><blockquote class=3D"g=
mail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204=
,204,204);padding-left:1ex"><div dir=3D"ltr"><div dir=3D"ltr"><br></div><br=
><div class=3D"gmail_quote"><div dir=3D"ltr" class=3D"gmail_attr">On Thu, M=
ar 7, 2024 at 4:08=E2=80=AFPM Rich Felker &lt;<a href=3D"mailto:dalias@libc=
.org" target=3D"_blank">dalias@libc.org</a>&gt; wrote:<br></div><blockquote=
 class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px so=
lid rgb(204,204,204);padding-left:1ex">On Thu, Mar 07, 2024 at 02:50:53PM -=
0800, David Schinazi wrote:<br>
&gt; On Wed, Mar 6, 2024 at 6:42=E2=80=AFPM Rich Felker &lt;<a href=3D"mail=
to:dalias@libc.org" target=3D"_blank">dalias@libc.org</a>&gt; wrote:<br>
&gt; <br>
&gt; &gt; On Wed, Mar 06, 2024 at 04:17:44PM -0800, David Schinazi wrote:<b=
r>
&gt; &gt; &gt; As Jeffrey points out, when the IETF decided to standardize =
mDNS, they<br>
&gt; &gt; &gt; published it (RFC 6762) at the same time as the Special-Use =
Domain<br>
&gt; &gt; Registry<br>
&gt; &gt; &gt; (RFC 6761) which created a process for reserving domain name=
s for custom<br>
&gt; &gt; &gt; purposes, and &quot;.local&quot; was one of the initial entr=
ies into that registry.<br>
&gt; &gt; &gt; The UTF-8 vs punycode issue when it comes to mDNS and DNS is=
 somewhat of<br>
&gt; &gt; a<br>
&gt; &gt; &gt; mess. It was discussed in Section 16 of RFC 6762 but at the =
end of the<br>
&gt; &gt; day<br>
&gt; &gt; &gt; punycode won. Even Apple&#39;s implementation of getaddrinfo=
 will perform<br>
&gt; &gt; &gt; punycode conversion for .local instead of sending the UTF-8.=
 So in<br>
&gt; &gt; practice<br>
&gt; &gt; &gt; you wouldn&#39;t need to special-case anything here.<br>
&gt; &gt;<br>
&gt; &gt; OK, these are both really good news!<br>
&gt; &gt;<br>
&gt; &gt; &gt; There&#39;s also very much a policy matter of what &quot;loc=
ally over<br>
&gt; &gt; &gt; &gt; multicast&quot; means (what the user wants it to mean).=
 Which interfaces<br>
&gt; &gt; &gt; &gt; should be queried? Wired and wireless ethernet? VPN lin=
ks or other<br>
&gt; &gt; &gt; &gt; sorts of tunnels? Just one local interface (which one t=
o prioritize)<br>
&gt; &gt; &gt; &gt; or all of them? Only if the network is &quot;trusted&qu=
ot;? Etc.<br>
&gt; &gt; &gt; &gt;<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; You&#39;re absolutely right. Most mDNS systems try all non-l=
oopback non-p2p<br>
&gt; &gt; &gt; multicast-supporting interfaces, but sending to the default =
route<br>
&gt; &gt; interface<br>
&gt; &gt; &gt; would be a good start, more on that below.<br>
&gt; &gt;<br>
&gt; &gt; This is really one thing that suggests a need for configurability=
<br>
&gt; &gt; outside of what libc might be able to offer. With normal DNS look=
ups,<br>
&gt; &gt; they&#39;re something you can block off and prevent from going to=
 the<br>
&gt; &gt; network at all by policy (and in fact they don&#39;t go past the =
loopback<br>
&gt; &gt; by default, in the absence of a resolv.conf file). Adding mDNS th=
at&#39;s<br>
&gt; &gt; on-by-default and not configurable would make a vector for networ=
k<br>
&gt; &gt; traffic being generated that&#39;s probably not expected and that=
 could be<br>
&gt; &gt; a privacy leak.<br>
&gt; &gt;<br>
&gt; <br>
&gt; Totally agree. I was thinking through this both in terms of RFCs and i=
n<br>
&gt; terms of minimal code changes, and had a potential idea. Conceptually,=
<br>
&gt; sending DNS to localhost is musl&#39;s IPC mechanism to a more feature=
-rich<br>
&gt; resolver running in user-space. So when that&#39;s happening, we don&#=
39;t want to<br>
&gt; mess with it because that could cause a privacy leak. Conversely, when=
<br>
&gt; there&#39;s a non-loopback IP configured in resolv.conf, then musl act=
s as a<br>
&gt; DNS stub resolver and the server in resolv.conf acts as a DNS recursiv=
e<br>
&gt; resolver. In that scenario, sending the .local query over DNS to that =
other<br>
&gt; host violates the RFCs. This allows us to treat the configured resolve=
r<br>
&gt; address as an implicit configuration mechanism that allows us to<br>
&gt; selectively enable this without impacting anyone doing their own DNS<b=
r>
&gt; locally.<br>
<br>
This sounds like an odd overloading of one thing to have a very<br>
different meaning, and would break builtin mDNS for anyone doing<br>
DNSSEC right (which requires validating nameserver on localhost).<br>
Inventing a knob that&#39;s an overload of an existing knob is still<br>
inventing a knob, just worse.<br></blockquote><div><br></div><div><div>Sorr=
y, I was suggesting the other way around: to only enable the mDNS mode if r=
esolver !=3D 127.0.0.1. But on the topic of DNSSEC, that doesn&#39;t really=
 make sense in the context of mDNS because the names aren&#39;t globally un=
ique and signed. In theory you could exchange DNSSEC keys out of band and u=
se DNSSEC with mDNS, but I&#39;ve never heard of anyone doing that. At that=
 point people exchange TLS certificates out of band and use mTLS. But overa=
ll I can&#39;t argue that overloading configs to mean multiple things is ja=
nky :-)</div></div><div><br></div><blockquote class=3D"gmail_quote" style=
=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding=
-left:1ex">
&gt; &gt; &gt; When you do that, how do you control which interface(s) it g=
oes over?<br>
&gt; &gt; &gt; &gt; I think that&#39;s an important missing ingredient.<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; You&#39;re absolutely right. In IPv4, sending to a link-loca=
l multicast<br>
&gt; &gt; address<br>
&gt; &gt; &gt; like this will send it over the IPv4 default route interface=
. In IPv6,<br>
&gt; &gt; the<br>
&gt; &gt; &gt; interface needs to be specified in the scope_id. So we&#39;d=
 need to pull<br>
&gt; &gt; that<br>
&gt; &gt; &gt; out of the kernel with rtnetlink.<br>
&gt; &gt;<br>
&gt; &gt; There&#39;s already code to enumerate interfaces, but it&#39;s a =
decent bit of<br>
&gt; &gt; additional machinery to pull in as a dep for the stub resolver,<b=
r>
&gt; <br>
&gt; <br>
&gt; Yeah we&#39;d need lookup_name.c to include netlink.h - it&#39;s not h=
uge though,<br>
&gt; netlink.c is 50 lines long and statically linked anyway right?<br>
<br>
I was thinking in terms of using if_nameindex or something, but indeed<br>
that&#39;s not desirable because it&#39;s allocating. So it looks like it<b=
r>
wouldn&#39;t share code but use netlink.c directly if it were done this<br>
way.<br>
<br>
BTW if there&#39;s a legacy ioctl that tells you the number of interfaces<b=
r>
(scope_ids), it sems like you could just iterate over the whole<br>
numeric range without actually doing netlink enumeration.<br></blockquote><=
div><br></div><div>That would also work. The main limitation I was working =
around was that you can only pass around=C2=A0MAXNS (3) name servers around=
 without making more changes.</div><div><br></div><blockquote class=3D"gmai=
l_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,20=
4,204);padding-left:1ex">
&gt; &gt; and<br>
&gt; &gt; it&#39;s not clear how to do it properly for IPv4 (do scope ids w=
ork with<br>
&gt; &gt; v4-mapped addresses by any chance?)<br>
&gt; &gt;<br>
&gt; <br>
&gt; Scope IDs unfortunately don&#39;t work for IPv4. There&#39;s the SO_BI=
NDTODEVICE<br>
&gt; socket option, but that requires elevated privileges. For IPv4 I&#39;d=
 just use<br>
&gt; the default route interface.<br>
<br>
But the default route interface is almost surely *not* the LAN where<br>
you expect .local things to live except in the case where there is<br>
only one interface. If you have a network that&#39;s segmented into<br>
separate LAN and outgoing interfaces, the LAN, not the route to the<br>
public internet, is where you would want mDNS going.<br></blockquote><div><=
br></div><div>In the case of a router, definitely. In the case of most end =
hosts or VMs though, they often have only one or two routable=C2=A0interfac=
es, and the default route is also the LAN.</div><div><br></div><blockquote =
class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px sol=
id rgb(204,204,204);padding-left:1ex">
With that said, SO_BINDTODEVICE is not the standard way to do this,<br>
and the correct/standard way doesn&#39;t need root. What it does need is<br=
>
binding to the local address on each device, which is still rather<br>
undesirable because it means you need N sockets for N interfaces,<br>
rather than one socket that can send/receive all addresses.<br></blockquote=
><div><br></div><div>Oh you&#39;re absolutely right, I knew there was a non=
-privileged way to do this but couldn&#39;t remember it earlier.</div><div>=
<br></div><div>This is giving me an idea though: we could use the &quot;con=
nect UDP socket to get a route lookup&quot; trick. Let&#39;s say we&#39;re =
configured with a nameserver that&#39;s not 127.0.0.1 (which is the case wh=
ere I&#39;d like to enable this) let&#39;s say the nameserver is set to 192=
.0.2.33, then today foobar.local would be sent to 192.0.2.33 over whichever=
 interface has a route to it (in most cases the default interface, but not =
always). We could open an AF_INET/SOCK_DGRAM socket, connect it to <a href=
=3D"http://192.0.2.33:53" target=3D"_blank">192.0.2.33:53</a>, and then use=
=C2=A0getsockname to get the local address - we then close that socket. We =
can then create a new socket, bind it to that local address. That would ens=
ure that we send the mDNS traffic on the same interface where we would have=
 sent the unicast query. Downside is that since all queries share the same =
socket, we&#39;d bind everything to the interface of the first resolver, or=
 need multiple sockets.</div><div><br></div><blockquote class=3D"gmail_quot=
e" style=3D"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204)=
;padding-left:1ex">
You answered for v4, but are you sure scope ids don&#39;t work for<br>
*v4-mapped*? That is, IPv6 addresses of the form<br>
::ffff:aaa.bbb.ccc.ddd. I guess I could check this. I&#39;m not very<br>
hopeful, but it would be excellent if this worked to send v4 multicast<br>
to a particular interface.<br></blockquote><div><br></div><div>Huh I hadn&#=
39;t thought of that, worth a try? RFC 4007 doesn&#39;t really allow using =
scope IDs for globally routable addresses but I&#39;m not sure if Linux doe=
s.</div><div><br></div><blockquote class=3D"gmail_quote" style=3D"margin:0p=
x 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
&gt; Another issue you haven&#39;t mentioned: how does TCP fallback work wi=
th<br>
&gt; &gt; mDNS? Or are answers too large for standard UDP replies just ille=
gal?<br>
&gt; &gt;<br>
&gt; <br>
&gt; Good point, I hadn&#39;t thought of that. That handling for mDNS is de=
fined in<br>
&gt; [1]. In the ephemeral query mode that we&#39;d use here, it works the =
same as<br>
&gt; for regular DNS: when you receive a response with the TC bit, retry th=
e<br>
&gt; query with TCP. The slight difference is that you send the TCP to the<=
br>
&gt; address you got the response from (not to the multicast address that y=
ou<br>
&gt; sent the original query to). From looking at the musl code, we&#39;d n=
eed a<br>
&gt; small tweak to __res_msend_rc() to use that address. Luckily that code=
<br>
&gt; already looks at the sender address so we don&#39;t need any additiona=
l calls<br>
&gt; to get it.<br>
<br>
Yes, that&#39;s what I figured you might do. I guess that works reasonably<=
br>
well.<br>
<br>
&gt; &gt; &gt; Reason for that is that that is the most generic way to supp=
ort any<br>
&gt; &gt; &gt; &gt; other name service besides DNS. It avoids the dependenc=
y on dynamic<br>
&gt; &gt; &gt; &gt; loading that something like glibc&#39;s nsswitch would =
create, and would<br>
&gt; &gt; &gt; &gt; avoid having multiple backends in libc. I really don=
9;t think anyone<br>
&gt; &gt; &gt; &gt; wants to open that particular door. Once mDNS is in the=
re, someone will<br>
&gt; &gt; &gt; &gt; add NetBIOS, just you wait.<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; I&#39;m definitely supportive of the slippery slope argument=
, but I think<br>
&gt; &gt; &gt; there&#39;s still a real line between mDNS and NetBIOS. mDNS=
 uses a different<br>
&gt; &gt; &gt; transport but lives inside the DNS namespace, whereas NetBIO=
S is really<br>
&gt; &gt; its<br>
&gt; &gt; &gt; own thing - NetBIOS names aren&#39;t valid DNS hostnames.<br=
>
&gt; &gt; &gt;<br>
&gt; &gt; &gt; Let me know what you think of the above. If you think of mDN=
S as its own<br>
&gt; &gt; &gt; beast then I can see how including it wouldn&#39;t really ma=
ke sense. But if<br>
&gt; &gt; &gt; you see it as an actual part of the DNS, then it might be wo=
rth a small<br>
&gt; &gt; &gt; code change :-)<br>
&gt; &gt;<br>
&gt; &gt; I&#39;m not worried about slippery slopes to NetBIOS. :-P I am co=
ncerned<br>
&gt; &gt; about unwanted network traffic that can&#39;t be suppressed, priv=
acy<br>
&gt; &gt; leaks, inventing new configuration knobs, potentially pulling in =
more<br>
&gt; &gt; code &amp; more fragility, getting stuck supporting something tha=
t turns<br>
&gt; &gt; out to have hidden problems we haven&#39;t thought about, etc.<br=
>
&gt; &gt;<br>
&gt; <br>
&gt; Those are great reasons, and I totally agree with those goals. If we s=
cope<br>
&gt; the problem down with the details higher up in this email, we have a w=
ay to<br>
&gt; turn this off (set the resolver to localhost), we avoid privacy leaks =
in<br>
&gt; cases where the traffic wasn&#39;t going out in the first place, we do=
n&#39;t have<br>
&gt; to add more configuration knobs because we&#39;re reusing an existing =
one, and<br>
<br>
As mentioned above, I don&#39;t think &quot;reusing an existing one&quot; i=
s an<br>
improvement.<br></blockquote><div><br></div><div>Fair,=C2=A0my goal was min=
imizing change size,=C2=A0but that&#39;s not the only goal.</div><div><br><=
/div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bo=
rder-left:1px solid rgb(204,204,204);padding-left:1ex">
&gt; the amount of added code would be quite small. Limiting things to the<=
br>
&gt; default interface isn&#39;t a full multi-network solution, but for tho=
se I<br>
&gt; think it makes more sense to recommend running your own resolver on<br=
>
&gt; loopback (you&#39;d need elevated privileges to make this work fully a=
nyway).<br>
&gt; Coding wise, I think this would be pretty robust. The only breakage I<=
br>
&gt; foresee is cases where someone built a custom resolver that runs on a<=
br>
&gt; different machine and somehow handles .local differently than what the=
 RFCs<br>
&gt; say. That config sounds like a bad idea, and a violation of the RFCs, =
but<br>
&gt; that doesn&#39;t mean there isn&#39;t someone somewhere who&#39;s doin=
g it. So there&#39;s<br>
&gt; a non-zero risk there. But to me that&#39;s manageable risk.<br>
&gt; <br>
&gt; What do you think?<br>
<br>
I think a more reasonable approach might be requiring an explicit knob<br>
to enable mDNS, in the form of an options field like ndots, timeout,<br>
retries, etc. in resolv.conf. This ensures that it doesn&#39;t become<br>
attack surface/change-of-behavior in network environments where peers<br>
are not supposed to be able to define network names.<br></blockquote><div><=
br></div><div>That would work. I&#39;m not sure who maintains the list of o=
ptions though. From a quick search it looks like they came out of 4.3BSD li=
ke many networking features, but it&#39;s unclear if POSIX owns it or just =
no one does (which would be the same, POSIX is not around as a standard bod=
y any more).</div><div><br></div><blockquote class=3D"gmail_quote" style=3D=
"margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-le=
ft:1ex">
One further advantage of such an approach is that it could also solve<br>
the &quot;which interface(s)&quot; problem by letting the answer just be<br=
>
&quot;whichever one(s) the user configured&quot; (with the default list bei=
ng<br>
empty). That way we wouldn&#39;t even need netlink, just if_nametoindex to<=
br>
convert interface name strings to scope ids, or alternatively (does<br>
this work for v6 in the absence of an explicit scope_id?) one or more<br>
local addresses to bind and send from.<br></blockquote><div><br></div><div>=
I definitely would avoid putting local addresses in the config, because it =
would break for any non-static addresses like DHCP or v6 RAs. The interface=
 name would require walking the getifaddrs list to map it to a correspondin=
g source address but it would work if the interface name is stable.</div><d=
iv><br></div><div>I guess we&#39;re looking at two ways to go about this:</=
div><div><br></div><div>(1) the simpler but less clean option - where we ke=
y off of &quot;resolver !=3D 127.0.0.1&quot; - very limited code size chang=
e, but only handles a small subset of scenarios</div><div><br></div><div>(2=
) the cleaner option that involves more work - new config option, need mult=
iple sockets - would be cleaner design-wise, but would change quite a bit m=
ore code</div><div><br></div><div>Another aspect to consider is the fact th=
at in a lot of cases resolv.conf is overwritten by various components like =
NetworkManager, so we&#39;d need to modify them to also understand the opti=
on.</div><div><br></div><div>I&#39;m always in favor of doing the right thi=
ng, unless the right thing ends up being so much effort that it doesn&#39;t=
 happen. Then I&#39;m a fan of doing the easy thing ;-)</div><div><br></div=
><div>David</div></div></div>
</blockquote></div>

--0000000000005f942606131ca5c6--