BaseSeed
Matt Zimmerman
mdz at canonical.com
Wed Sep 8 18:34:42 CDT 2004
On Thu, Sep 09, 2004 at 09:28:19AM +1000, James Gregory wrote:
> On Wed, 2004-09-08 at 14:04 -0700, Matt Zimmerman wrote:
> > On Thu, Sep 09, 2004 at 04:54:34AM +0800, John wrote:
> >
> > > How else do you do this?
> > > summer at Dolphin:~$ time lynx -dump http://www.x.com/ | tail
> > > 30. http://www.ebay.com/
> > > 31. http://www.paypal.com/cgi-bin/webscr
> > > 32. http://www.paypal.com/cgi-bin/webscr?cmd=p/gen/fdic-outside
> > > 33. http://www.paypal.com/cgi-bin/webscr?cmd=p/gen/privacy-outside
> > > 34. http://www.bbbonline.org/cks.asp?id=20111061155818568
> > >
> > >
> > > I regularly want a list of URLs for some reason, often to get a list of
> > > files to download with wget or (sometimes) with curl.
> >
> > You don't need a browser at all if you only want to extract URLs.
> >
> > wget -O- http://www.x.com/ | urlview
>
> You can also go to mozilla and click 'page info'. There's a links tab
> there with all the links for the page. But if you want to download
> everything on a page, wget -r will work.
Oh, there's also lftp in base, which is even better if what you want is to
download files:
mizar:[~] lftp http://www.dmoz.org/
cd ok, cwd=/
lftp www.dmoz.org:/> ls
-rw-r--r-- -- img/odphead.gif
-rw-r--r-- -- about.html
-rw-r--r-- -- add.html
-rw-r--r-- -- help/helpmain.html
-rw-r--r-- -- link.html
drwxr-xr-x -- Arts
drwxr-xr-x -- Arts/Movies
drwxr-xr-x -- Arts/Television
drwxr-xr-x -- Arts/Music
drwxr-xr-x -- Business
drwxr-xr-x -- Business/Employment
drwxr-xr-x -- Business/Real_Estate
drwxr-xr-x -- Business/Investing
drwxr-xr-x -- Computers
drwxr-xr-x -- Computers/Internet
drwxr-xr-x -- Computers/Software
drwxr-xr-x -- Computers/Hardware
drwxr-xr-x -- Games
drwxr-xr-x -- Games/Video_Games
drwxr-xr-x -- Games/Roleplaying
drwxr-xr-x -- Games/Gambling
drwxr-xr-x -- Health
drwxr-xr-x -- Health/Fitness
drwxr-xr-x -- Health/Medicine
drwxr-xr-x -- Health/Alternative
drwxr-xr-x -- Home
drwxr-xr-x -- Home/Family
drwxr-xr-x -- Home/Consumer_Information
drwxr-xr-x -- Home/Cooking
[...]
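
If all you want is the plain list of URLs to hand to wget, the extraction
step itself is small enough to script. The following is only a rough sketch
using Python's standard library, not something shipped in base; the names
LinkCollector and extract_links are made up for illustration, and it only
looks at href attributes on <a> tags.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url  # used to turn relative links into absolute ones
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return every <a href> found in html, in document order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


# Example with an inline page; in practice the HTML would come from
# something like the output of "wget -O- URL".
page = '<a href="/about.html">About</a> <a href="http://www.ebay.com/">eBay</a>'
for url in extract_links(page, "http://www.dmoz.org/"):
    print(url)
```

The output is one URL per line, which is the form wget -i expects.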
--
- mdz