Question

I want to use pyparsing's commaSeparatedList to split a string while ignoring the stuff inside '{' and '}'.

Example:

a = 'xyz,abc{def,123,456}'

After parsing, I want to get ['xyz', 'abc{def,123,456}'].

I wrote this:

 nested_expr = '{' + SkipTo('}') + '}'
 commaSeparatedList.ignore(nested_expr).parseString(a)

Result: (['xyz', 'abc{def', '123', '456}'], {})

Actually, it looks like the ignore does take effect if there is a delimiter before the '{':

a = 'xyz,abc,{def,123,456}'
commaSeparatedList.ignore(nested_expr).parseString(a)

Result: (['xyz', 'abc', ''], {})

Could you take a look at why this happens?

Answer 1

Open the pyparsing.py source file and look at how commaSeparatedList is implemented. There isn't much to it, and it is easy to adapt to your case:

# original
_commasepitem = Combine(OneOrMore(Word(printables, excludeChars=',') +
                                  Optional( Word(" \t") +
                                            ~Literal(",") + ~LineEnd() ) ) ).streamline().setName("commaItem")
commaSeparatedList = delimitedList( Optional( quotedString.copy() | _commasepitem, default="") ).setName("commaSeparatedList")


# modified
_commasepitem = Combine(OneOrMore(QuotedString('{',endQuoteChar='}',unquoteResults=False) | Word(printables, excludeChars=',{}') +
                                  Optional( Word(" \t") +
                                            ~Literal(",") + ~LineEnd() ) ) ).streamline().setName("commaItem")

commaSeparatedList = delimitedList( Optional(_commasepitem, default="") ).setName("commaSeparatedList")

The important part is that the Word in _commasepitem must not be allowed to include the '{' and '}' characters.
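
For reference, here is a self-contained sketch of the modified version. It assumes pyparsing is installed and that the camelCase API used above is available; the name brace_aware_list is made up for this example so the library's own commaSeparatedList is not overwritten. As far as I can tell, the original attempt fails because ignore expressions are only skipped at the start of a token, so a '{' in the middle of a Word match is never checked against them.

```python
from pyparsing import (
    Combine, LineEnd, Literal, OneOrMore, Optional,
    QuotedString, Word, delimitedList, printables,
)

# A "{...}" group is treated like a quoted string: commas inside it are
# consumed as part of the group. unquoteResults=False keeps the braces.
brace_group = QuotedString('{', endQuoteChar='}', unquoteResults=False)

# Plain text must not contain ',' or braces, so that a '{' always ends the
# Word and lets brace_group take over.
plain_text = Word(printables, excludeChars=',{}')

# One list item: any run of brace groups and plain words, joined by Combine.
comma_item = Combine(
    OneOrMore(
        brace_group
        | plain_text + Optional(Word(" \t") + ~Literal(",") + ~LineEnd())
    )
).streamline().setName("commaItem")

# Hypothetical name; behaves like commaSeparatedList but respects braces.
brace_aware_list = delimitedList(
    Optional(comma_item, default="")
).setName("braceAwareList")

print(brace_aware_list.parseString("xyz,abc{def,123,456}").asList())
# ['xyz', 'abc{def,123,456}']
print(brace_aware_list.parseString("xyz,abc,{def,123,456}").asList())
```

Note that this sketch drops the quotedString alternative from the original definition, as the modified code above does, so ordinary quoted items are no longer treated specially.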
