使用wget砍站(下載整個網站)

使用wget砍站供離線瀏覽

範例:只抓http://www.example.org/manual/install/之下的資料,不抓example.org以外的外部連結,也不抓/manual/install/目錄之外的檔案。

$ wget \
    --recursive \
    --no-clobber \
    --page-requisites \
    --html-extension \
    --convert-links \
    --restrict-file-names=windows \
    --domains example.org \
    --no-parent www.example.org/manual/install/
  • --recursive: download the entire Web site.
  • --domains example.org: don’t follow links outside website.org.
  • --no-parent: don’t follow links outside the directory manual/install/.
  • --page-requisites: get all the elements that compose the page (images, CSS and so on).
  • --html-extension: save files with the .html extension.
  • --convert-links: convert links so that they work locally, off-line.
  • --restrict-file-names=windows: modify filenames so that they will work in Windows as well.
  • --no-clobber: don’t overwrite any existing files (used in case the download is interrupted and resumed).
  • -e robots=off: ignore robots.txt
Unless otherwise stated, the content of this page is licensed under Creative Commons Attribution-ShareAlike 3.0 License