如何从imap_body结果中仅提取HTML

时间:2014-08-25 17:18:15

标签: php html imap

我想从imap_body结果中仅提取HTML内容。 imap_body给出了邮件的逐字副本。

3 个答案:

答案 0 :(得分:8)

我找到了解决方案:

function getBody($uid, $imap)
{
    $body = $this->get_part($imap, $uid, "TEXT/HTML");
    // if HTML body is empty, try getting text body
    if ($body == "") {
        $body = $this->get_part($imap, $uid, "TEXT/PLAIN");
    }
    return $body;
}

function get_part($imap, $uid, $mimetype, $structure = false, $partNumber = false)
{
    if (!$structure) {
        $structure = imap_fetchstructure($imap, $uid, FT_UID);
    }
    if ($structure) {
        if ($mimetype == $this->get_mime_type($structure)) {
            if (!$partNumber) {
                $partNumber = 1;
            }
            $text = imap_fetchbody($imap, $uid, $partNumber, FT_UID);
            switch ($structure->encoding) {
                case 3:
                    return imap_base64($text);
                case 4:
                    return imap_qprint($text);
                default:
                    return $text;
            }
        }

        // multipart
        if ($structure->type == 1) {
            foreach ($structure->parts as $index => $subStruct) {
                $prefix = "";
                if ($partNumber) {
                    $prefix = $partNumber . ".";
                }
                $data = $this->get_part($imap, $uid, $mimetype, $subStruct, $prefix . ($index + 1));
                if ($data) {
                    return $data;
                }
            }
        }
    }
    return false;
}

function get_mime_type($structure)
{
    $primaryMimetype = ["TEXT", "MULTIPART", "MESSAGE", "APPLICATION", "AUDIO", "IMAGE", "VIDEO", "OTHER"];

    if ($structure->subtype) {
        return $primaryMimetype[(int)$structure->type] . "/" . $structure->subtype;
    }
    return "TEXT/PLAIN";
}

答案 1 :(得分:5)

http://php.net/manual/en/function.imap-fetchbody.php

参数3,"部分"如下:

  

部件号。它是一个由句点分隔的整数字符串,根据IMAP4规范索引到正文部分列表中

(empty) - Entire message
0 - Message header
1 - MULTIPART/ALTERNATIVE
1.1 - TEXT/PLAIN
1.2 - TEXT/HTML
2 - file.ext

因此,要获取邮件的HTML部分,您必须使用1.2选项作为第三个参数。像这样:

$message = imap_fetchbody($inbox, $number, 1.2);

答案 2 :(得分:2)

我没有足够的声誉来添加评论,但我只是想在@ GunniH的答案中澄清你对该函数的调用应如下所示:

$message = imap_fetchbody($inbox, $number, '1.2');

而不是

$message = imap_fetchbody($inbox, $number, 1.2);

最终的论点应该是string,而不是int