Skip to content

Latest commit

 

History

History
55 lines (45 loc) · 1.96 KB

cix-wget.org

File metadata and controls

55 lines (45 loc) · 1.96 KB

wget

|≣|

AuthorHrvoje Nikšić (all…)
Maintainer(s)
Released1996
Sourcewget.git
Man
Info

GNU Wget is a file retrieval utility which can use either the HTTP or FTP protocols. Wget features include the ability to work in the background while you are logged out, recursive retrieval of directories, file name wildcard matching, remote file timestamp storage and comparison, use of Rest with FTP servers and Range with HTTP servers to retrieve files over slow or unstable connections, support for Proxy servers, and configurability.


Receipts

wget

wgre::options

p
e

wget::examples

wget-161127222725

download an entire website:

   ~ $  wget --random-wait -r -p -e robots=off -U mozilla WEBSITE_URL

wget-161121211338

download a sertain file in a list of files from a server with known structure:

#!/usr/bin/env bash

URL="https://dumps.wikimedia.org/other/pageviews/2016/2016-01/"
FILE_RE="(?<=\<a href\=\").+(?=\"\>)"

declare -a FILE_NAMES
FILE_NAMES=($(curl -s "$URL" | grep -oP "$FILE_RE"))

wget "$URL${FILE_NAMES[2]}"

declare curl (1) grep (1)