Question

假设我有一个目录列表，每个目录可能包含也可能不包含我想要考虑的子目录。

我们还说我有一个时间戳列表，列表中的每个目录都有一个（但不是子目录）。这些被称为具有隐式时区的日期和时间，因此可以相当容易地转换为Unix时间戳，如果这样可以更容易比较。

对于列出的每个目录，

如何，确定目录中是否存在比时间戳更新的文件（mtime或ctime，但不是atime）我有相关目录吗？

我并不真正对哪个特定文件比时间戳更新感兴趣，只知道是否存在任何此类文件。

基本上，我想编写一个脚本，当运行时执行特定操作，如果在给定时间点之后任何一个目录中的任何一个文件已被更改，并且需要来用一种方法来检测是否有任何改变。

Answer 1

您的问题可以转化为多个简单的子问题

问：如何递归查看目录中的每个文件？

答：use File::Find。这看起来有点像

use File::Find;

find sub {
  return unless -f;
  if (file_is_newer_than($timestamp)) {
    do something;
  },
}, $top_dir;

问：我如何为多个目录执行此操作？

A：将它包裹在foreach循环中，例如

for my $dir_time (["./foo", 1234567890], ["./bar", 1230987654]) {
  my ($top_dir, $timestamp) = @$dir_time;
  # above code
}

问：如何判断文件是否较新？

答：stat mtime或ctime，然后将结果与时间戳进行比较。 E.g。
```
use File::stat;

say "$_ is new!" if stat($_)->mtime > $timestamp;
```

问：我只对是否存在任何此类文件感兴趣。如何缩短find？

的曲线

答：棘手的一个。我们不能只从return find，因为那只会从我们传递的coderef中退出。相反，我们可以使用例外控制流反模式：

eval {
  find {
    wanted => sub {
      return unless -f;
      die "New file found\n" if stat($_)->mtime > $timestamp;
    },
    no_chdir => 1,
  } $top_dir;
};
if ($@) {
  # I should really use exception objects here…
  if ($@ eq "New file found\n") {
    say "New file in $top_dir found";
  } else {
    die $@;  # rethrow error
  }
}

我设置了no_chdir选项，这样我就不必在异常处理程序中恢复正确的工作目录。

或者我们可以在标记的块上使用循环控制：

DIR: for my $dir_time (...) {
  my ($top_dir, $timestamp) = @$dir_time;
  RECURSION: {   
    find {
      wanted => sub {
        return unless -f;
        last RECURSION if stat($_)->mtime > $timestamp; # exit the RECURSION block
      },
      no_chdir => 1,
    } $top_dir;
    # if we are here, no newer file was found.
    next DIR; # make sure to skip over below code; go to next iteration
  }
  # this code only reached when a newer file was found
  say "New file found";
}

虽然这不会滥用控制流的异常，但这会触发警告：

Exiting subroutine via last

我们可以使用no warnings 'exiting'来解决这个问题。

注意：这里的所有代码都是未经测试的。

找出目录中是否存在比给定日期/时间更新的文件？

1 个答案: