如何循环遍历unix shell脚本中与正则表达式匹配的文件

时间:2016-04-20 12:34:08

标签: regex bash shell unix

我希望能够遍历与特定模式匹配的文件列表。我可以让unix使用带有正则表达式的ls和egrep列出这些文件,但我找不到将其转换为迭代过程的方法。我怀疑使用ls不是答案。我们将非常感激地提供任何帮助。

我当前的ls命令如下所示:

ls | egrep -i 'MYFILE[0-9][0-9]([0][1-9]|1[0-2])([0][1-9]|[12][0-9]|[3][01]).dat'

我希望以上内容匹配:

  • MYFILE160418.dat
  • myFILE170312.DAT
  • MyFiLe160416.DaT

但不是:

  • MYOTHERFILE150202.DAT
  • MYFILE.DAT
  • myfile.csv

谢谢,

保罗。

2 个答案:

答案 0 :(得分:6)

您可以将(GNU)find与正则表达式搜索选项一起使用,而不是解析ls

find . -regextype "egrep" \
       -iregex '.*/MYFILE[0-9][0-9]([0][1-9]|1[0-2])([0][1-9]|[12][0-9]|[3][01]).dat' \
       -exec [[whatever you want to do]] {} \;

其中[[whatever you want to do]]是您要对文件名执行的命令。

来自手册页

-regextype type
          Changes  the regular expression syntax understood by -regex and -iregex tests 
          which occur later on the command line.  Currently-implemented types are 
          emacs (this is the default),posix-awk, posix-basic, posix-egrep and 
          posix-extended.

  -regex pattern
          File name matches regular expression pattern.  This is a match on the whole 
          path, not a search.  For example, to match a file named `./fubar3', you can 
          use the regular expression
          `.*bar.' or `.*b.*3', but not `f.*r3'.  The regular expressions understood by 
          find are by default Emacs Regular Expressions, but this can be changed with 
          the -regextype option.

  -iregex pattern
          Like -regex, but the match is case insensitive.

答案 1 :(得分:4)

基于Andy K提供的链接,我已根据我的匹配条件使用以下内容进行循环:

for i in $(ls | egrep -i 'MYFILE[0-9][0-9]([0][1-9]|1[0-2])([0][1-9]|[12][0-9]|[3][01]).dat' ); do             
 echo item: $i;         
done