php函数ShellFolder和一个AND链接

时间:2016-12-05 13:33:59

标签: php regex function shell grep

我找到了function ShellFolder($source, $search1, $search2){$shellBefehl...egrep的代码。到目前为止工作得很好。但是现在我遇到了问题,因为搜索到的文本文件通常包含相同的术语。现在我想将两个搜索术语相互链接,这两个术语必须在文本文件中找到。这是我的代码:

<?php
function ShellFolder($source, $search1, $search2){
    $shellBefehl = "egrep -o -w -l -a --directories=recurse '$search1' '$search2' $source";
    exec($shellBefehl, $var);
    return $var;
}

//$source = 'grep.txt';
$source = './ordner/daten';
$search1 = $this->item->title;
$search2 = $extraField->value;
$var = ShellFolder($source, $search1, $search2);

$myResult = print_r($var, true);

$text = $myResult;
$text_ohne = substr($myResult, 19);
$insgesamt = $text_ohne;
$insgesamt_neu=substr($insgesamt,0,-3);
echo ($search1);
echo ($search2);
$lines = file("$insgesamt_neu");

foreach($lines as $line) {
    echo($line);
}

有人可以帮忙吗?

1 个答案:

答案 0 :(得分:1)

grep的

例如,可以通过-E选项将多个模式传递给Grep。但Grep将搜索任何模式。

如果要使用逻辑AND加入搜索模式,则Grep不方便,因为它不支持逻辑AND。如果pattern1.*pattern2应该在pattern1之前,则可以使用pattern2这样的模式模拟AND:

$patterns = ['pattern1', 'pattern2'];
$dir = escapeshellarg($dir);
$pattern = escapeshellarg(implode('.*', $patterns));
$command = "egrep -o -w -l -a -r $pattern $dir";
exec($command, $output, $exit_status);

pattern1.*pattern2|pattern2.*pattern1任何订单。但对于一般情况,模式不是最理想的。换句话说,Grep不适用于通用案例,您应该使用其他工具。

AWK

portable way to search for multiple patterns using AWKawk '/pattern1/ && /pattern2/ && ... ' file。但是,AWK接受单个文件,您必须手动迭代目录并将命令应用于每个文件:

<?php
/**
 * Searches for lines matching all regexp patterns.
 *
 * @param string $dir Path to directory with text files
 * @param array $patterns AWK patterns without regexp markers ('/')
 * @return array Files matching all patterns
 * @throws InvalidArgumentException
 */
function grepDir($dir, array $patterns, callable $callback) {
  if (!$patterns) {
    throw new InvalidArgumentException("Invalid patterns");
  }

  // Build command as awk '/pattern1/ && /pattern2/ && ... path-to-file'
  $awk_script = '/' . implode('/ && /', $patterns) . '/';
  $awk_script = escapeshellarg($awk_script);
  $command_format = "awk $awk_script %s";

  try {
    $it = new RecursiveIteratorIterator(new RecursiveDirectoryIterator($dir));

    $it->rewind();
    while ($it->valid()) {
      if (!$it->isDot()) {
        $file_path = $it->key();
        $command = sprintf($command_format, $file_path);
        $output = null;
        exec($command, $output, $exit_status);

        if ($exit_status) {
          trigger_error("Command failed: $command");
          continue;
        }

        if ($output) {
          $callback($file_path, $output);
        }
      }

      $it->next();
    }
  } catch (Exception $e) {
    trigger_error($e->getMessage());
    return false;
  }
  return true;
}

$dir = '.';
$patterns = [ '456', '123' ];

grepDir($dir, $patterns, function ($file_path, array $output) {
  printf("File: %s\nLines:\n%s\n--------\n",
    $file_path, implode(PHP_EOL, $output));
});

示例输出

File: ./file1
Lines:
123 sdfsf 456
456 & 123
--------
File: ./test/file1
Lines:
456123

PHP

上面的示例可以很容易地转换为纯PHP解决方案(无需调用shell命令)。您可以逐行读取文件,并根据应用程序的逻辑使用preg_match()测试该行是否与模式匹配:

$patterns = ['456', '123'];

$file = 'file1'; // Replace with $it->key() in the example above
if (! $fp = fopen('file1', 'r')) {
  throw new RuntimeException("Failed to open file $file");
}

while ($line = fgets($fp)) {
  $matches = true;
  foreach ($patterns as $pattern) {
    // You might want to quote the pattern, if it isn't supposed to be
    // interpreted as a regular expression:
    // $pattern = preg_quote($pattern, '/');
    if (!preg_match("/{$pattern}/", $line)) {
      $matches = false;
      break;
    }
  }

  if ($matches) {
    echo "Line $line matches all patterns\n";
  }
}

fclose($fp);