Question

我正在使用PHP函数imagettftext（）将文本转换为GIF图像。我正在转换的文本包含Unicode字符，包括日语。在我的本地机器（Ubuntu 7.10）上一切正常，但在我的webhost服务器上，日文字符被破坏了。可能导致差异的原因是什么？一切都应编码为UTF-8。

webhost服务器上的破碎图像： http://www.ibeni.net/flashcards/imagetest.php

从本地计算机复制正确的图像： http://www.ibeni.net/flashcards/imagetest.php.gif

我本地机器上的phpinfo（）副本： http://www.ibeni.net/flashcards/phpinfo.php.html

来自我的虚拟主机服务器的phpinfo（）副本： http://example5.nfshost.com/phpinfo

代码：

mb_language('uni');
mb_internal_encoding('UTF-8');

header('Content-type: image/gif');

$text = '日本語';
$font = './Cyberbit.ttf';

// Create the image
$im = imagecreatetruecolor(160, 160);
$white = imagecolorallocate($im, 255, 255, 255);
$black = imagecolorallocate($im, 0, 0, 0);

// Create some colors
imagefilledrectangle($im, 0, 0, 159, 159, $white);

// Add the text
imagettftext($im, 12, 0, 20, 20, $black, $font, $text);
imagegif($im);
imagedestroy($im);

Answer 1

这是最终对我有用的解决方案：

$text = "你好";
// Convert UTF-8 string to HTML entities
$text = mb_convert_encoding($text, 'HTML-ENTITIES',"UTF-8");
// Convert HTML entities into ISO-8859-1
$text = html_entity_decode($text,ENT_NOQUOTES, "ISO-8859-1");
// Convert characters > 127 into their hexidecimal equivalents
$out = "";
for($i = 0; $i < strlen($text); $i++) {
    $letter = $text[$i];
    $num = ord($letter);
    if($num>127) {
      $out .= "&#$num;";
    } else {
      $out .=  $letter;
    }
}

将字符串转换为HTML实体的工作原理除了函数imagettftext（）不接受命名实体。例如，

&#26085;&#26412;&#35486;

没问题，但是

&ccedil;

不是。转换回ISO-8859-1，将命名实体转换回字符，但还有第二个问题。 imagettftext（）不支持值大于＆gt; 127的字符。最终的for循环以十六进制编码这些字符。这个解决方案对我来说正在使用我正在使用的文本（包括日语，中文和葡萄牙语的重音拉丁字符），但我并不是100％肯定它会在所有情况下都有效。

所有这些体操都是必需的，因为imagettftext（）并不真正接受我服务器上的UTF-8字符串。

Answer 2

我遇到了一个问题，它会在图像中渲染文本并输出它。问题是，由于不同的浏览器（或代码耐用性/偏执狂，无论你想怎么想），我都无法知道在$_GET数组中放置了什么编码。

以下是我解决问题的方法。

$item_text = $_GET['text'];

# detect if the string was passed in as unicode
$text_encoding = mb_detect_encoding($item_text, 'UTF-8, ISO-8859-1');
# make sure it's in unicode
if ($text_encoding != 'UTF-8') {
    $item_text = mb_convert_encoding($item_text, 'UTF-8', $text_encoding);
}

# html numerically-escape everything (&#[dec];)
$item_text = mb_encode_numericentity($item_text,
    array (0x0, 0xffff, 0, 0xffff), 'UTF-8');

通过简单地将所有字符（包括多字节Unicode字符）更改为HTML数字字符实体 - “＆amp;＃65;”，解决了imagettftext无法处理＃127以上字符的任何问题。为“A”，“＆amp;＃66;”对于“B”等，manual page声称支持。

Answer 3

我遇到了同样的问题。将字体从otf转换为ttf有帮助。您可以使用FontForge（在标准存储库中提供）进行转换。

Answer 4

我的主要怀疑是你用于渲染的字体。

根据http://fr3.php.net/imagettftext，php使用的不同版本的GD库可以显示不同的行为。

本地计算机上的GD版本： 2.0或更高
您的虚拟主机服务器上的GD版本：捆绑（2.0.34兼容）

编辑： 另一个想法是：您可以验证$text = '日本語';是否真的在您的生产服务器上保存？也许您的脚本存在编码问题。

下一次编辑： BKB已经提出过。所以如果这是原因：他首先得到答案; - ）

Answer 5

生产机器上是否存在特定的字体文件？如果使用FTP上传文件，您使用的是二进制编码吗？

PHP函数imagettftext（）和unicode

5 个答案: