In pyparsing, how do I assign a "no match" key value?

Asked: 2013-10-12 19:06:44

Tags: python parsing pyparsing

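I would like pyparsing to return its parse results as a dictionary, without any post-processing. For that, I need to define my own key strings. The code below is the best I could come up with, and it produces the expected result.

Line to parse: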

%ADD22C,0.35X*%

Code:

import pyparsing as pyp

floatnum = pyp.Regex(r'([\d\.]+)')
comma = pyp.Literal(',').suppress()

cmd_app_def = pyp.Literal('AD').setParseAction(pyp.replaceWith('aperture-definition'))

cmd_app_def_opt_circ = pyp.Group(pyp.Literal('C') +
                                 comma).setParseAction(pyp.replaceWith('circle'))

circular_apperture = pyp.Group(cmd_app_def_opt_circ +
                               pyp.Group(pyp.Empty().setParseAction(pyp.replaceWith('diameter')) + floatnum) +
                               pyp.Literal('X').suppress())

<the grammar for the entire line>

The result is:

['aperture-definition', '20', ['circle', ['diameter', '0.35']]]

The part I consider a hack is:

pyp.Empty().setParseAction(pyp.replaceWith('diameter'))

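This expression always matches and yields no tokens, but I then use the parse action to put my desired key name in its place.

Is this the best approach? Am I abusing pyparsing to do something it was not meant to do?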
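For reference, the pattern being asked about can be reproduced in isolation. The following is only a minimal sketch: it reuses `floatnum` from the post, while `key_label` and `pair` are hypothetical names introduced here for illustration, and the input is reduced to the bare number because the full-line grammar is not shown in the question.

```python
import pyparsing as pyp

floatnum = pyp.Regex(r'([\d\.]+)')

# Empty() always matches and consumes nothing; the parse action then
# substitutes the literal string 'diameter' as if it had been parsed.
# (key_label is a hypothetical name, not part of the original post.)
key_label = pyp.Empty().setParseAction(pyp.replaceWith('diameter'))

# Group the injected key together with the value that follows it.
pair = pyp.Group(key_label + floatnum)

tokens = pair.parseString('0.35').asList()
print(tokens)
```

The result is a nested list of key/value pairs rather than a mapping, so turning it into a dictionary would still take an extra step.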

2 Answers:

Answer 0: (score: 4)

If you want to name your floatnum "diameter", you can use named results:

cmd_app_def_opt_circ = pyp.Group(pyp.Literal('C') +
                                 comma)("circle")


circular_apperture = pyp.Group(cmd_app_def_opt_circ +
                               pyp.Group(floatnum)("diameter") +
                               pyp.Literal('X').suppress())

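This way, every time the parser encounters floatnum in a circular_apperture context, the result is named diameter. As shown above, circle can be named in the same way. Does this work for you?

Answer 1: (score: 3)

See the comments in the posted code.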
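To see how these names can be read back, here is a small runnable sketch. It assumes the `floatnum` and `comma` definitions from the question and parses only the fragment `C,0.35X`; the `flat` expression is a hypothetical variation added for comparison and is not part of this answer.

```python
import pyparsing as pyp

floatnum = pyp.Regex(r'([\d\.]+)')
comma = pyp.Literal(',').suppress()

# Definitions as given in the answer above.
cmd_app_def_opt_circ = pyp.Group(pyp.Literal('C') + comma)("circle")
circular_apperture = pyp.Group(cmd_app_def_opt_circ +
                               pyp.Group(floatnum)("diameter") +
                               pyp.Literal('X').suppress())

res = circular_apperture.parseString('C,0.35X')
inner = res[0]                      # contents of the outer Group
print(inner['circle'].asList())     # the named 'circle' group
print(inner['diameter'][0])         # 'diameter' names a group, so index into it

# Hypothetical variation: name floatnum directly (no inner Group), so the
# name refers to the matched string itself rather than a one-item group.
flat = (pyp.Literal('C').suppress() + comma +
        floatnum("diameter") + pyp.Literal('X').suppress())
flat_res = flat.parseString('C,0.35X')
print(flat_res['diameter'])
print(flat_res.asDict())
```

Whether to wrap the value in a Group depends on whether a nested structure is wanted. Naming the expression directly keeps the result flat.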

import pyparsing as pyp

comma = pyp.Literal(',').suppress()
# use parse actions to do type conversion at parse time, so that results fields
# can immediately be used as ints or floats, without additional int() or float()
# calls
floatnum = pyp.Regex(r'([\d\.]+)').setParseAction(lambda t: float(t[0]))
integer = pyp.Word(pyp.nums).setParseAction(lambda t: int(t[0]))

# define the command keyword - I assume there will be other commands too, they
# should follow this general pattern (define the command keyword, then all the
# options, then define the overall command)
aperture_defn_command_keyword = pyp.Literal('AD')

# define a results name for the matched integer - I don't know what this
# option is, wasn't in your original post
d_option = 'D' + integer.setResultsName('D')

# shortcut for defining a results name is to use the expression as a 
# callable, and pass the results name as the argument (I find this much
# cleaner and keeps the grammar definition from getting messy with lots
# of calls to setResultsName)
circular_aperture_defn = 'C' + comma + floatnum('diameter') + 'X'

# define the overall command
aperture_defn_command = aperture_defn_command_keyword("command") + d_option + pyp.Optional(circular_aperture_defn)

# use searchString to skip over '%'s and '*'s, gives us a ParseResults object
test = "%ADD22C,0.35X*%"
appData = aperture_defn_command.searchString(test)[0]

# ParseResults can be accessed directly just like a dict
print(appData['command'])
print(appData['D'])
print(appData['diameter'])

# or if you prefer attribute-style access to results names
print(appData.command)
print(appData.D)
print(appData.diameter)

# convert ParseResults to an actual Python dict, removes all unnamed tokens
print(appData.asDict())

# dump() prints out the parsed tokens as a list, then all named results
print(appData.dump())

Prints:

AD
22
0.35
AD
22
0.35
{'diameter': 0.34999999999999998, 'command': 'AD', 'D': 22}
['AD', 'D', 22, 'C', 0.34999999999999998, 'X']
- D: 22
- command: AD
- diameter: 0.35
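The same idea extends to input containing several aperture definitions. The sketch below is only an illustration under stated assumptions: the second input line (`%ADD10C,1.5X*%`) is invented for the example, the literal markers `D`, `C` and `X` are suppressed so that only named values remain, and the float pattern is a slightly stricter regex than the one in the answer.

```python
import pyparsing as pyp

comma = pyp.Suppress(',')
# Convert to numeric types at parse time, as in the answer above.
floatnum = pyp.Regex(r'\d+(?:\.\d*)?').setParseAction(lambda t: float(t[0]))
integer = pyp.Word(pyp.nums).setParseAction(lambda t: int(t[0]))

# Literal markers are suppressed; every remaining token has a results name.
aperture_defn_command = (
    pyp.Literal('AD')('command')
    + pyp.Suppress('D') + integer('D')
    + pyp.Optional(pyp.Suppress('C') + comma
                   + floatnum('diameter') + pyp.Suppress('X'))
)

# The second line is made-up sample data, not from the original post.
text = "%ADD22C,0.35X*%\n%ADD10C,1.5X*%"

# searchString skips the '%' and '*' characters and yields one
# ParseResults per match; asDict() turns each into a plain dict.
records = [match.asDict() for match in aperture_defn_command.searchString(text)]
for record in records:
    print(record)
```

Each record is an ordinary dict keyed by the results names, so no post-processing of the token list is needed.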