Thread: html pages download from website

  1. #1
    Join Date
    Sep 2009
    Beans
    32

    html pages download from website

Does anyone know of a good website copier that runs on Ubuntu and downloads and saves HTML files, so that I can open them without needing the original program?

  2. #2
    Join Date
    Jan 2011
    Beans
    1,151

    Re: html pages download from website

If you just want to download the HTML source, you can do it on the command line:

    Code:
    wget -r [url]
    For example:
    Code:
    wget -r google.com
    Websites are built differently; depending on the website, it might thwart your efforts.

    Any HTML file can be opened by any web browser.
    (\ /)
    (O.o)
    (> <)
    This is Bunny. Copy Bunny into your signature to help him on his way to world domination.

  3. #3
    Join Date
    Sep 2009
    Beans
    32

    Re: html pages download from website

    Quote Originally Posted by idoitprone View Post
If you just want to download the HTML source, you can do it on the command line:

    Code:
    wget -r [url]
    For example:
    Code:
    wget -r google.com
    Websites are built differently; depending on the website, it might thwart your efforts.

    Any HTML file can be opened by any web browser.
I mean I want to download at least a small part of a website, but I only know the homepage URL, so wget alone may not solve my problem.

  4. #4
    Join Date
    Feb 2006
    Location
    uk
    Beans
    Hidden!

    Re: html pages download from website

If you don't know the location of the data, how will your software know it?

wget can follow links recursively, and you can specify the recursion depth, but if there are no links on the front page and no way to figure out where the content is, you won't get very far.
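
    For instance, a minimal sketch of depth-limited recursion (example.com is a placeholder; substitute the site's actual homepage URL):

    ```shell
    # Follow links recursively (-r), but only 2 levels deep from the start page (-l 2)
    # example.com is a stand-in URL, not a real target
    wget -r -l 2 https://example.com/
    ```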

  5. #5
    Join Date
    Sep 2009
    Beans
    32

    Re: html pages download from website

    Quote Originally Posted by aeiah View Post
If you don't know the location of the data, how will your software know it?

    wget can follow links recursively, and you can specify the recursion depth, but if there are no links on the front page and no way to figure out where the content is, you won't get very far.
Could you please tell me how to make wget recursively download HTML pages?

  6. #6
    Join Date
    Sep 2006
    Beans
    8,627
    Distro
    Ubuntu 14.04 Trusty Tahr

    Re: html pages download from website

    Quote Originally Posted by coffeecake View Post
Could you please tell me how to make wget recursively download HTML pages?
It's described above in post #2: it is done with the -r (or --recursive) option. Other options like --no-parent, --convert-links and --page-requisites might be worth looking at, too.
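
    A combined invocation might look like this (a sketch; example.com stands in for the real homepage URL):

    ```shell
    # Mirror part of a site for offline viewing:
    #   -r                 follow links recursively
    #   --no-parent        never ascend above the starting directory
    #   --convert-links    rewrite links so the saved pages work locally
    #   --page-requisites  also fetch the images/CSS needed to display each page
    wget -r --no-parent --convert-links --page-requisites https://example.com/
    ```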
