Question

I want to fetch a file without headers. I have tried many things, such as

wget --header="" http://xxxxx.xxxxxx.xx

How can I fetch the file without headers?

Answer 1

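'--header=header-line'

Send header-line along with the rest of the headers in each HTTP request. The supplied header is sent as-is, which means it must contain name and value separated by a colon, and must not contain newlines.

You may define more than one additional header by specifying '--header' more than once.

      wget --header='Accept-Charset: iso-8859-2' \
           --header='Accept-Language: hr'        \
             http://fly.srk.fer.hr/

Specification of an empty string as the header value will clear all previous user-defined headers.

As of Wget 1.10, this option can be used to override headers otherwise generated automatically. This example instructs Wget to connect to localhost, but to specify 'foo.bar' in the Host header:

      wget --header="Host: foo.bar" http://localhost/

In versions of Wget prior to 1.10 such use of '--header' caused sending of duplicated headers.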

http://www.gnu.org/software/wget/manual/html_node/HTTP-Options.html
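The rule quoted from the manual (name and value separated by a colon, no newlines) can be checked before a string is handed to '--header'. The following is only a minimal sketch; `is_valid_header_line` is a hypothetical helper, not something wget provides, and it deliberately rejects the empty string, which the manual treats as a special case that clears user-defined headers.

```shell
# Hypothetical helper (not part of wget): succeed only if the argument
# looks like "Name: value" and contains no newline character.
is_valid_header_line() {
    nl='
'
    case $1 in
        *"$nl"*) return 1 ;;  # embedded newline is not allowed
        :*)      return 1 ;;  # a colon with no name in front of it
        *:*)     return 0 ;;  # name and value separated by a colon
        *)       return 1 ;;  # no colon at all (includes the empty string)
    esac
}
```

For example, `is_valid_header_line 'Accept-Language: hr' && echo ok` prints `ok`, while a string without a colon makes the function return a non-zero status.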

Answer 2

You can assign the output of wget to a string and then process it some other way to strip the headers (or parse the headers out of the text), for example

w1=$(wget --quiet --output-document - www.example.com)
echo $w1 | grep --only-matching "<body>.*</body>"

which gives (line breaks added so that it displays nicely here):

<body> <div> <h1>Example Domain</h1> <p>This domain is established to be used for 
illustrative examples in documents. You may use this domain in examples without 
prior coordination or asking for permission.</p> <p><a href="http://www.iana.org
/domains/example">More information...</a></p> </div> </body>
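If the headers in question are the HTTP response headers, note that wget writes them into the output only when '--save-headers' is given; they then precede the content, separated from it by an empty line. Under that assumption, a rough sketch of the same "post-process the output" idea is to drop everything up to the first blank line. `strip_http_headers` is a hypothetical helper name, and the awk pattern also accepts a trailing carriage return on the separator line.

```shell
# Hypothetical helper: read a raw HTTP response (headers, blank line, body)
# on stdin and print only the body.
strip_http_headers() {
    # Once the first blank line (optionally ending in CR) has been seen,
    # print every following line unchanged.
    awk 'body { print; next } /^\r?$/ { body = 1 }'
}
```

It could be used as `wget --quiet --save-headers --output-document - www.example.com | strip_http_headers`.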

wget without any headers
