php文件路径中的多个搜索值

时间:2013-11-27 22:28:37

标签: php arrays search

我正在开发一个需要使用搜索字段的任意组合搜索一组文件的php网站。

可能的搜索字段是

id,year,building,lastname,firstname,birthdate

文件夹结构和文件名是这样的

/year/building/file.pdf

文件名包含要搜索的数据

id_lastname_firstname_MM_dd_yy.pdf

除了这部分,我在网站上都有工作。最初我只有ID,年份和建筑物,我能够做到,如果要检查组合的每种可能性。现在有更多的组合,所以它更加复杂。

我在考虑嵌套if和in_array等,但必须有更好的方法。我只是学习PHP的方式。

我希望能够使用任何字段组合进行搜索。如果有帮助,我可以更改文件名。

我从这样的事情开始

function search($transcripts, $studentid=null, $year=null, $building=null, $last=null, $first=null, $birthdate=null){
$ext = '.pdf';
date_default_timezone_set('America/Los_Angeles');

$dir_iterator = new RecursiveDirectoryIterator("../transcripts");
$iterator = new RecursiveIteratorIterator($dir_iterator, RecursiveIteratorIterator::SELF_FIRST);

foreach ($iterator as $file) {
    if ($file->isFile()){
        $path = explode('\\',$file->getPath());
        $fname = explode('_', $file->getBasename($ext));
        if($path[1] == $year){
            if($path[2] == $building){
                if(in_array($last, $fname, true)){
                    if((in_array($first, $fname, true)){
                        if((in_array($birthdate

最初,根据提交的字段,我有单独的功能。

function bldStuSearch($building, $studentid, $transcripts){
$ext = '.pdf';
date_default_timezone_set('America/Los_Angeles');

$dir_iterator = new RecursiveDirectoryIterator("../transcripts");
$iterator = new RecursiveIteratorIterator($dir_iterator, RecursiveIteratorIterator::SELF_FIRST);

foreach ($iterator as $file) {
        $results = explode('\\',$file->getPath());
        //var_dump($results);
    if (($file->isFile()) && ($file->getBasename($ext)==$studentid) && ($results[2] == $building)){
        //echo substr($file->getPathname(), 27) . ": " . $file->getSize() . " B; modified " . date("Y-m-d", $file->getMTime()) . "\n";
        $results = explode('\\',$file->getPath());
        //var_dump($results);
        //$building = $results[2];
        $year = $results[1];
        //$studentid = $file->getBasename($ext);
        array_push($transcripts, array($year, $building, $studentid));
        //var_dump($transcripts);
        //$size += $file->getSize();
        //echo '<br>';
    }
}
//echo "\nTotal file size: ", $size, " bytes\n";
if (empty($transcripts))
{
    header('Location: index.php?error=2'); exit();
}

return $transcripts;

}

现在我想尝试一个搜索功能来检查任何组合?任何想法至少会朝着正确的方向发展?

感谢。

1 个答案:

答案 0 :(得分:0)

所以我对做一个评分系统有了一个想法,但后来却驳回了它。我回过头来找到一种方法,使用加权评分系统使其工作。

这使得搜索变得非常灵活并且保持可移植性,不需要数据库用于元数据并且使用文件名作为搜索数据而无需搜索每个PDF。我正在使用A-Pdf拆分器将PDF拆分为单独的文件,并将元数据添加到文件名中。

我希望有一天有人发现这对其他搜索很有用。我对此结果非常满意。

我将在http://github.com/friedcircuits

完成后发布整个代码

我应该改变的一件事是为数组使用命名键。

以下是生成的代码。现在,生日必须以m-d-yyyy的形式输入才能匹配。

function search($transcripts, $studentid=null, $year=null, $building=null, $last=null, $first=null, $birthdate=null){
$ext = '.pdf';
$bldSearch = false;
date_default_timezone_set('America/Los_Angeles');

if (($building == null) AND ($year == null)){ $searchLocation = "../transcripts";}
elseif (($year != null) AND ($building != null)){$searchLocation = "../transcripts/".$year."/".$building;}
elseif ($year != null) {$searchLocation = "../transcripts/".$year;}
elseif ($building != null) {
    $searchLocation = "../transcripts/";
    $bldSearch = true;
}
else{$searchLocation = "../transcripts";}

$dir_iterator = new RecursiveDirectoryIterator($searchLocation);
$iterator = new RecursiveIteratorIterator($dir_iterator, RecursiveIteratorIterator::SELF_FIRST);

$score = 0;

foreach ($iterator as $file) {
    if ($file->isFile()){
        //Fix for slashes changing direction depending on search path
        $path = str_replace('/','\\', $file->getPath());

        $path = explode('\\',$path);
        $fname = explode('_', $file->getBasename($ext));
        //var_dump($path);
        //echo "<br>";
        //var_dump($fname);
        //echo "<br>";

        //fix for different search paths
        if($path[1] == "transcripts"){
            $pYear = $path[2];
            $pbuilding = $path[3];
        }
        else{
            $pYear = $path[1];
            $pbuilding = $path[2];
        }

        if ($bldSearch == true){
            if ($building != $pbuilding) {continue;}
        }


        //$fname[1] = @strtolower($fname[1]);
        //$fname[2] = @strtolower($fname[2]);


        if($fname[0] == $studentid){
            $yearS = $pYear;
            $buildingS = $pbuilding;
            $studentidS = $fname[0];
            $lastS = $fname[1];
            $firstS = $fname[2];
            $birthdateS = $fname[3];
            array_push($transcripts, array($yearS, $buildingS, $studentidS, $lastS, $firstS, $birthdateS));
            continue;
        }
        if($pYear == $year){
            $score += 1;
        }
        if($path[2] == $building){
            $score += 1;
        }
        if(@strpos(@strtolower($fname[1]),$last) !== false){
            $score += 3;
        }
        if(@strpos(strtolower($fname[2]), $first) !== false){
            $score += 3;
        }
        if($fname[3] == $birthdate){
            $score += 3;
        }
        //echo $score." ";
        if ($score > 2) {
            $yearS = $pYear;
            $buildingS = $pbuilding;
            $studentidS = $fname[0];
            $lastS = $fname[1];
            $firstS = $fname[2];
            $birthdateS = $fname[3];
            array_push($transcripts, array($yearS, $buildingS, $studentidS, $lastS, $firstS, $birthdateS)); 
            //var_dump($transcripts);
        }
    }
    $score = 0;
}

if (empty($transcripts))
{
    header('Location: index.php?error=2'); exit();
}


return $transcripts;}