PDA

View Full Version : How do I rip a website



Martiini
September 4th, 2008, 10:24 AM
I need to download all .pdf files and website structure from southampton.gov.uk ...
is there any way to accomplish this .. (I know there are website ripping programs for windows)

luckyuser
September 4th, 2008, 10:32 AM
FlashGot firefox addon might help...have you tried it?

https://addons.mozilla.org/en-US/firefox/addon/220/#install-52638

edit:
I don't know your specific needs...wget might be the best, but here's another firefox addon that might also help for pdf's https://addons.mozilla.org/en-US/firefox/addon/636

kpkeerthi
September 4th, 2008, 10:37 AM
http://linuxreviews.org/quicktips/wget/

Polygon
September 4th, 2008, 10:59 AM
http://www.httrack.com/ i know has a linux web version

WinterWeaver
September 4th, 2008, 11:03 AM
try Applications >> Add/Remove >> Search for WebHTTrack Website Copier

I played with it very long ago, and seemed to do something like what you are looking for. Takes aages though, depending on the site.

Martiini
September 5th, 2008, 09:59 PM
I acquired what i needed with flashgot extension and
http://www.google.com/search?hl=en&q=%2B+building+filetype%3Apdf+site%3Asouthampton.g ov.uk&btnG=Google+Search&aq=f&oq=

Thanks

Nano Geek
September 5th, 2008, 10:06 PM
For future reference: wget -r <website-name> will also do the trick.

luckyuser
September 5th, 2008, 10:33 PM
I think wget -r would have taken a while in comparison to using flashgot and googleing "+ building filetype:pdf site:southampton.gov.uk." The later might be a little more manual, but you don't always know what you're getting into when using wget...

either way, glad your problem was solved! Cheers!

Masoris
September 5th, 2008, 11:35 PM
http://www.httrack.com/ i know has a linux web version

+1 for httarck.
But Windows version has better GUI.