Elasticsearch MapperParsingException [索引期间无法解析,文档为空]

时间:2014-07-12 20:24:50

标签: php elasticsearch elastica

我正在使用带有php wrapper elastica的elasticsearch 1.2.2

索引某些文档时遇到以下异常

PHP Fatal error:  Uncaught exception 'Elastica\Exception\Bulk\ResponseException' with message 'Error in one or more bulk request actions:

index: /en_search/en_msg/936 caused MapperParsingException[failed to parse, document is empty]
' in /root/search/vendor/ruflin/elastica/lib/Elastica/Bulk.php:395
Stack trace:
#0 /root/search/vendor/ruflin/elastica/lib/Elastica/Bulk.php(345): Elastica\Bulk->_processResponse(Object(Elastica\Response))
#1 /root/search/vendor/ruflin/elastica/lib/Elastica/Client.php(284): Elastica\Bulk->send()
#2 /root/search/vendor/ruflin/elastica/lib/Elastica/Index.php(147): Elastica\Client->addDocuments(Array)
#3 /root/search/vendor/ruflin/elastica/lib/Elastica/Type.php(187): Elastica\Index->addDocuments(Array)
#4 /root/search/setData.php(36): Elastica\Type->addDocuments(Array)
#5 {main}
  thrown in /root/search/vendor/ruflin/elastica/lib/Elastica/Bulk.php on line 395

现在它无法解析的文件就是这个显然不是空的文件。

array(
    [id] => 936
    [uid] => 3222
    [msid] => 211
    [login] => user1222
    [msg] => Wouldn’t you love a cup right now?
)

文档映射是这样的

$mapping->setProperties(array(
    'id'  => array('type' => 'integer', 'include_in_all' => true),
    'uid' => array('type' => 'integer', 'include_in_all' => true),
    'msid' => array('type' => 'integer', 'include_in_all' => true),
    'login' => array('type' => 'string', 'include_in_all' => TRUE),
    'msg'  => array('type' => 'string', 'include_in_all' => true), 

));

按照此处的建议在批量模式下进行索引 的 elatica.io: Bulk indexing

我很确定它是由字符引起的。不确定是否需要转义。以防它如何逃避这些角色?

2 个答案:

答案 0 :(得分:1)

解决了。这是一个编码问题。我的数据来自mysql服务器,我必须在pdo对象初始化期间设置正确的charset选项。

答案 1 :(得分:0)

这是一个由于特殊字符而导致的编码问题,我建议您使用

来转义您要编入索引的文本
$text = html_entity_decode(htmlentities($text, ENT_IGNORE, "UTF-8"));