From mboxrd@z Thu Jan  1 00:00:00 1970
X-Msuck: nntp://news.gmane.org/gmane.linux.lib.musl.general/7327
Path: news.gmane.org!not-for-mail
From: Alexander Monakov <amonakov@ispras.ru>
Newsgroups: gmane.linux.lib.musl.general
Subject: Re: Resuming work on new semaphore
Date: Fri, 3 Apr 2015 00:39:10 +0300 (MSK)
Message-ID: <alpine.LNX.2.11.1504030021400.8195@monopod.intra.ispras.ru>
References: <20150402013006.GA1108@brightrain.aerifal.cx> <alpine.LNX.2.11.1504021036070.31632@monopod.intra.ispras.ru> <20150402152642.GW6817@brightrain.aerifal.cx>
Reply-To: musl@lists.openwall.com
NNTP-Posting-Host: plane.gmane.org
Mime-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
X-Trace: ger.gmane.org 1428010764 19364 80.91.229.3 (2 Apr 2015 21:39:24 GMT)
X-Complaints-To: usenet@ger.gmane.org
NNTP-Posting-Date: Thu, 2 Apr 2015 21:39:24 +0000 (UTC)
To: musl@lists.openwall.com
Original-X-From: musl-return-7340-gllmg-musl=m.gmane.org@lists.openwall.com Thu Apr 02 23:39:23 2015
Return-path: <musl-return-7340-gllmg-musl=m.gmane.org@lists.openwall.com>
Envelope-to: gllmg-musl@m.gmane.org
Original-Received: from mother.openwall.net ([195.42.179.200])
	by plane.gmane.org with smtp (Exim 4.69)
	(envelope-from <musl-return-7340-gllmg-musl=m.gmane.org@lists.openwall.com>)
	id 1Ydmpf-0000oq-5Z
	for gllmg-musl@m.gmane.org; Thu, 02 Apr 2015 23:39:23 +0200
Original-Received: (qmail 17926 invoked by uid 550); 2 Apr 2015 21:39:22 -0000
Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm
Precedence: bulk
List-Post: <mailto:musl@lists.openwall.com>
List-Help: <mailto:musl-help@lists.openwall.com>
List-Unsubscribe: <mailto:musl-unsubscribe@lists.openwall.com>
List-Subscribe: <mailto:musl-subscribe@lists.openwall.com>
Original-Received: (qmail 17908 invoked from network); 2 Apr 2015 21:39:21 -0000
In-Reply-To: <20150402152642.GW6817@brightrain.aerifal.cx>
User-Agent: Alpine 2.11 (LNX 23 2013-08-11)
Xref: news.gmane.org gmane.linux.lib.musl.general:7327
Archived-At: <http://permalink.gmane.org/gmane.linux.lib.musl.general/7327>

On Thu, 2 Apr 2015, Rich Felker wrote:
> > Interesting.  To examine the issue under a different light, consider that from
> > the perspective of semaphore implementation, waiters that were killed,
> > stopped, or pre-empted forever in the middle of sem_wait are
> > indistinguishable.
> 
> Yes, I noticed this too. In that sense, theoretically there should be
> no harm (aside from eventual overflow of pending wake counter) from
> having asynchronously-killed waiters, assuming the implementation is
> bug-free in the absence of async killing of waiters.

Did you mean "presence"?  I'm having trouble understanding your phrase,
especially after "assuming ..."; can you elaborate or rephrase?

That waiters can die breaks an assumption that operations on val[0] and val[1]
do not under/overflow due to their range exceeding the number of
simultaneously live tasks.

> > Thus, subsequent sem_wait succeeds by effectively stealing
> > a post, and to make things consistent you can teach sem_trywait to steal posts
> > too (i.e. try atomic-decrement-if-positive val[1] just before returning
> > EAGAIN, return 0 if that succeeds).
> 
> Hmm, perhaps that is valid. I'll have to think about it again. I was
> thinking of having sem_trywait unconditionally down the value (val[0])
> then immitate the exit path of sem_timedwait, but that's not valid
> because another waiter could race and prevent sem_trywait from ever
> being able to exit. But if it only does the down as a dec-if-positive
> then it seems like it can safely dec-if-positive the wake count before
> reporting failure.

I think my proposition above needs at least the following correction: when
trywait succeeds in stealing a post by dec-if-positive(val[1]), it should also
decrement val[0] before returning.

Are you sure your proposition is invalid?  I don't think so.  How is trywait
different from a timedwait with a timeout that immediately expires?  That is
basically what your scheme should do.

Alexander