cat urls.html | grep -Eo "(http|https)://[a-zA-Z0-9./?=_%:-]*" | sort -u
grep -E
: is the same as egrepgrep -o
: only outputs what has been grepped(http|https)
: is an either / ora-z
: is all lower caseA-Z
: is all upper case.
: is dot/
: is the slash?
: is ?=
: is equal sign_
: is underscore%
: is percentage sign:
: is colon-
: is dash*
: is repeat the […] groupsort -u
: will sort & remove any duplicates
grep -Eo “(http|https)://[a-zA-Z0-9./?=_%:-]*\.(png|gif|jpg|jpeg)”
греп всех картинок