From mboxrd@z Thu Jan 1 00:00:00 1970 Message-ID: To: 9fans@cse.psu.edu Subject: Re: [9fans] IDE FS failure From: Geoff Collyer MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="upas-afehsbqdwneviacrbvaichvggg" Date: Wed, 25 Sep 2002 15:40:52 -0700 Topicbox-Message-UUID: f4e30bbc-eaca-11e9-9e20-41e7f4b1d025 This is a multi-part message in MIME format. --upas-afehsbqdwneviacrbvaichvggg Content-Disposition: inline Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit It looks like you're right; my apologies for the bad advice. I remembered that all the startsb block numbers had been corrected (to "2") in 4e but had forgotten about firstsb (and I'd initialised it in my file servers to "2"). firstsb (if non-zero) is primarily an optimisation and it depends on the specific jukebox and on time, so in either /sys/src/fs/emelie/9pcfs.c or /sys/src/fs/sony/9sonyfs.c, conf.firstsb = 13219302; is presumably wrong (I'd bet on 9sonyfs.c being wrong) and could be deleted or changed to conf.firstsb = 0; Also, /sys/src/fs/words should probably be amended to advise zeroing firstsb in the spun-off subtree's 9*fs.c. I noticed just now that conf.wcpsize is not used in 4e; it was used in 3e in /sys/src/fs/port/worm.c in wcpinit(), but worm.c is now gone. So all those conf.wcpsize = 10; lines in /sys/src/fs/*/9*fs.c and port/main.c: conf.wcpsize = 1024*1024; can be deleted. --upas-afehsbqdwneviacrbvaichvggg Content-Type: message/rfc822 Content-Disposition: inline Received: from collyer.net ([10.9.0.4]) by collyer.net; Wed Sep 25 06:15:22 PDT 2002 Received: from mail.cse.psu.edu ([130.203.4.6]) by collyer.net; Wed Sep 25 06:15:22 PDT 2002 Received: from psuvax1.cse.psu.edu (psuvax1.cse.psu.edu [130.203.4.6]) by mail.cse.psu.edu (CSE Mail Server) with ESMTP id D0A9419AB6; Wed, 25 Sep 2002 09:15:10 -0400 (EDT) Delivered-To: 9fans@cse.psu.edu Received: from rwcrmhc51.attbi.com (rwcrmhc51.attbi.com [204.127.198.38]) by mail.cse.psu.edu (CSE Mail Server) with ESMTP id 8FD6619A9F for <9fans@cse.psu.edu>; Wed, 25 Sep 2002 09:14:58 -0400 (EDT) Received: from patton ([12.226.228.159]) by rwcrmhc51.attbi.com (InterMail vM.4.01.03.27 201-229-121-127-20010626) with SMTP id <20020925131455.LBPZ16629.rwcrmhc51.attbi.com@patton> for <9fans@cse.psu.edu>; Wed, 25 Sep 2002 13:14:55 +0000 Message-ID: To: 9fans@cse.psu.edu Subject: Re: [9fans] IDE FS failure From: David Swasey MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit Sender: 9fans-admin@cse.psu.edu Errors-To: 9fans-admin@cse.psu.edu X-BeenThere: 9fans@cse.psu.edu X-Mailman-Version: 2.0.11 Precedence: bulk Reply-To: 9fans@cse.psu.edu List-Id: Fans of the OS Plan 9 from Bell Labs <9fans.cse.psu.edu> List-Archive: Date: Wed, 25 Sep 2002 09:15:11 -0400 If you can compile a new fs kernel using a stand-alone machine, then you can probably get past this particular panic. I believe this happens because of the line conf.firstsb = 13219302; in your file server's localconfinit. (This function is part of the file-server-specific .c file; it may be called 9pcfs.c.) If firstsb is non-zero, then it is the address of the first super block to consult when performing a recovery; otherwise, the value in the startsb array (near the top of 9pcfs.c) is used. I suggest setting firstsb to 0 and recompiling the file server kernel. With the new kernel, "recover main" should actually start the recovery process. -dave > On Tue, Sep 24, 2002 at 04:54:03PM -0700, Geoff Collyer wrote: >> >> recover main >> end >> > I get a different error. I wonder if I didn't configure "archive" > badly and somehow overlapped it onto main or the cache. The error is > now > > panic: fworm: rbounds 13219302 --upas-afehsbqdwneviacrbvaichvggg--