Question

我想用shell / bash脚本从字符串中提取url，如果字符串中有多个url，则只返回第一个url。

我在下面提供了输入和输出字符串的一些示例。我猜我需要做一些正则表达式，但我不太熟悉如何在bash / shell中执行此操作？

Input: Take a look at this site: http://www.google.com/ and you'll find your answer
Output: http://www.google.com/


Input: http://www.google.com
Output: http://www.google.com


Input: Check out http://www.bing.com and http://www.google.com
Output: http://www.bing.com


Input: Grettings, visit <http://www.mywebsite.com> today!
Output: http://www.mywebsite.com

Answer 1

试试这个：

grep -Eo 'http://[^ >]+' yourFile|head -1

例如：

kent$  echo "Check out http://www.bing.com and http://www.google.com"|grep -Eo 'http://[^ >]+'|head -1 
http://www.bing.com
kent$  echo "Grettings, visit <http://www.mywebsite.com> today"|grep -Eo 'http://[^ >]+'|head -1 
http://www.mywebsite.com

Answer 2

使用grep命令，例如：

cat yourinput.txt | grep "your_regex_here"

如何从shell中的字符串中检索url？

2 个答案: