From mboxrd@z Thu Jan 1 00:00:00 1970 From: jrvalverde@cnb.csic.es (Jose R. Valverde) Date: Fri, 14 Oct 2011 17:59:53 +0200 Subject: [TUHS] Mirror with link In-Reply-To: <20111013184337.GA23950@bitmover.com> References: <20111013203727.7f33a67d@cnb.csic.es> <20111013184337.GA23950@bitmover.com> Message-ID: <20111014175953.2b02bde2@cnb.csic.es> That is already included: -m == --mirror But --mirror (or -m) does not include -k (convert links to local after the transfer) nor -np (do not follow links upwards the parent directory), nor an instruction to ignore 'robots.txt'. The magic incantation I submitted will only download down the hierarchy, in spite of robots.txt and fixing links, all three problems reported in the thread. Of course it is not polite to ignore robots.txt, but sometimes it may be justified. j On Thu, 13 Oct 2011 11:43:37 -0700 Larry McVoy wrote: > On Thu, Oct 13, 2011 at 08:37:27PM +0200, Jose R. Valverde wrote: > > Just for the record. > > > > The correct way to mirror a site with links corrected is > > > > wget -c -m -k -np -e robots=off URL > > > > Seems most people have problem remembering this incantation. > > Wouldn't it be nice if it were > > wget --mirror URL > > ? > -- > --- > Larry McVoy lm at bitmover.com http://www.bitkeeper.com -- EMBnet/CNB Scientific Computing Service Solving all your computer needs for Scientific Research. http://bioportal.cnb.csic.es http://www.es.embnet.org