Message-ID: <4ECE1B81.7080902@bsd.lv>
Date: Thu, 24 Nov 2011 11:25:05 +0100
From: Kristaps Dzonsons
To: tech@mdocml.bsd.lv
Subject: Re: mandocdb: handle formatted manuals
References: <20111119005649.GA10365@iris.usta.de>
In-Reply-To: <20111119005649.GA10365@iris.usta.de>

On 11/19/11 01:56, Ingo Schwarze wrote:
> Hi,
>
> right, extracting information from formatted manuals is a rather
> dirty business and never going to be that reliable, but there is
> no choice: Sometimes, nothing else is available, and we have to
> deal with it. Of course, on OpenBSD, we could leave that dirty
> work to espie@'s OpenBSD::Makewhatis perl modules, but i'd rather
> have a portable solution, and i'd rather not have makewhatis(8)
> split into two pieces. I still hope that mandocdb(8) can replace
> makewhatis(8) completely (except for the pkg_add(1)/pkg_delete(1)/
> pkg_create(1)-integration of course, which is not going to be
> portable given how different pkg_add and pkgsrc are).
>
> So here is what i did on my train ride from the p2k11 ports hackathon
> in Budapest back to Karlsruhe (including the one hour lockup in
> Hegyeshalom when the locomotive stopped working, grrr):
>
> * Even without -a, walk the cat* dirs in addition to man*.
> * Only use those cats where men^Wmans are not available
>   because mans are just greater than cats.
>
> There is still a lot of room for improvement, several features of
> OpenBSD::Makewhatis are not yet implemented. However, this is
> already working in most respects, and i'd like to put it in for
> in-tree polishing.

Hi Ingo,

There's a slight problem with this: when a file is entered into
mandocdb's databases, there's an implicit assumption that it was
parsed. In other words, mandocdb entries are "safe" for mandoc.

We need to clearly demark which files are "safe" and which are not.
In this way, progs interfacing with mandocdb databases can act
accordingly. The easiest way, of course, is a bit in the index file.

Can you modify this patch, and the mandoc.index format, to do
something like that?

While we're updating the mandoc.index format, is there anything else
that should be going in there?

Thoughts?

Kristaps