我正在使用带有php wrapper elastica的elasticsearch 1.2.2
索引某些文档时遇到以下异常
PHP Fatal error: Uncaught exception 'Elastica\Exception\Bulk\ResponseException' with message 'Error in one or more bulk request actions:
index: /en_search/en_msg/936 caused MapperParsingException[failed to parse, document is empty]
' in /root/search/vendor/ruflin/elastica/lib/Elastica/Bulk.php:395
Stack trace:
#0 /root/search/vendor/ruflin/elastica/lib/Elastica/Bulk.php(345): Elastica\Bulk->_processResponse(Object(Elastica\Response))
#1 /root/search/vendor/ruflin/elastica/lib/Elastica/Client.php(284): Elastica\Bulk->send()
#2 /root/search/vendor/ruflin/elastica/lib/Elastica/Index.php(147): Elastica\Client->addDocuments(Array)
#3 /root/search/vendor/ruflin/elastica/lib/Elastica/Type.php(187): Elastica\Index->addDocuments(Array)
#4 /root/search/setData.php(36): Elastica\Type->addDocuments(Array)
#5 {main}
thrown in /root/search/vendor/ruflin/elastica/lib/Elastica/Bulk.php on line 395
现在它无法解析的文件就是这个显然不是空的文件。
array(
[id] => 936
[uid] => 3222
[msid] => 211
[login] => user1222
[msg] => Wouldn’t you love a cup right now?
)
文档映射是这样的
$mapping->setProperties(array(
'id' => array('type' => 'integer', 'include_in_all' => true),
'uid' => array('type' => 'integer', 'include_in_all' => true),
'msid' => array('type' => 'integer', 'include_in_all' => true),
'login' => array('type' => 'string', 'include_in_all' => TRUE),
'msg' => array('type' => 'string', 'include_in_all' => true),
));
按照此处的建议在批量模式下进行索引 的 elatica.io: Bulk indexing
我很确定它是由’
字符引起的。不确定是否需要转义。以防它如何逃避这些角色?
答案 0 :(得分:1)
解决了。这是一个编码问题。我的数据来自mysql服务器,我必须在pdo对象初始化期间设置正确的charset选项。
答案 1 :(得分:0)
这是一个由于特殊字符而导致的编码问题,我建议您使用
来转义您要编入索引的文本$text = html_entity_decode(htmlentities($text, ENT_IGNORE, "UTF-8"));