Python正则表达式匹配特定字符串和结束字符之间的字符串

时间:2015-01-24 05:14:09

标签: python regex match

我正在构建一个文件剥离器来构建一个配置报告,我有一个非常长的字符串作为我的基础数据。以下是它的一小部分,但它至少说明了我正在使用的内容。

摘录示例:DEFAULT_GATEWAY=192.168.88.1&DELVRY_AGGREGATION_INTERVAL0=1&DELVRY_AGGREGATION_INTERVAL1=1&DELVRY_SCHEDULE0=1&DELVRY_SNI0=192.168.88.158&DELVRY_USE_SSL_TLS1=0&

我将如何匹配以下内容:

between "DEFAULT_GATEWAY=" and "&"
between "DELVRY_AGGREGATION_INTERVAL0=" and "&"
between "DELVRY_AGGREGATION_INTERVAL1=" and "&"
between "DELVRY_SCHEDULE=" and "&"
between "DELVRY_SNI0=" and "&"
between "DELVRY_USE_SSL_TLS1=" and "&"

并用它构建一个词典:

{"DEFAULT_GATEWAY":"192.168.88.1",
 "DELVRY_AGGREGATION_INTERVAL0":"1",
 "DELVRY_AGGREGATION_INTERVAL1":"1",
 "DELVRY_SCHEDULE0":"1",
 "DELVRY_SNI0":"0",
 "DELVRY_USE_SSL_TLS1":"0"}

3 个答案:

答案 0 :(得分:3)

这是一种方法。

In [1]: input = 'DEFAULT_GATEWAY=192.168.88.1&DELVRY_AGGREGATION_INTERVAL0=1&DELVRY_AGGREGATION_INTERVAL1=1&DELVRY_SCHEDULE0=1&DELVRY_SNI0=192.168.88.158&DELVRY_USE_SSL_TLS1=0&'

In [2]: input.split('&')
Out[2]: 
['DEFAULT_GATEWAY=192.168.88.1',
 'DELVRY_AGGREGATION_INTERVAL0=1',
 'DELVRY_AGGREGATION_INTERVAL1=1',
 'DELVRY_SCHEDULE0=1',
 'DELVRY_SNI0=192.168.88.158',
 'DELVRY_USE_SSL_TLS1=0',
 '']

In [3]: [keyval.split('=') for keyval in input.split('&') if keyval]
Out[3]: 
[['DEFAULT_GATEWAY', '192.168.88.1'],
 ['DELVRY_AGGREGATION_INTERVAL0', '1'],
 ['DELVRY_AGGREGATION_INTERVAL1', '1'],
 ['DELVRY_SCHEDULE0', '1'],
 ['DELVRY_SNI0', '192.168.88.158'],
 ['DELVRY_USE_SSL_TLS1', '0']]

In [4]: dict(keyval.split('=') for keyval in input.split('&') if keyval)
Out[4]: 
{'DEFAULT_GATEWAY': '192.168.88.1',
 'DELVRY_AGGREGATION_INTERVAL0': '1',
 'DELVRY_AGGREGATION_INTERVAL1': '1',
 'DELVRY_SCHEDULE0': '1',
 'DELVRY_SNI0': '192.168.88.158',
 'DELVRY_USE_SSL_TLS1': '0'}

注释

  1. 这是输入行
  2. &拆分以获取键值对。请注意,最后一个条目为空
  3. 以等号分割每个条目,同时丢弃空条目
  4. 建立字典
  5. 另一种解决方案

    In [8]: import urlparse
    
    In [9]: urlparse.parse_qsl(input)
    Out[9]: 
    [('DEFAULT_GATEWAY', '192.168.88.1'),
     ('DELVRY_AGGREGATION_INTERVAL0', '1'),
     ('DELVRY_AGGREGATION_INTERVAL1', '1'),
     ('DELVRY_SCHEDULE0', '1'),
     ('DELVRY_SNI0', '192.168.88.158'),
     ('DELVRY_USE_SSL_TLS1', '0')]
    
    In [10]: dict(urlparse.parse_qsl(input))
    Out[10]: 
    {'DEFAULT_GATEWAY': '192.168.88.1',
     'DELVRY_AGGREGATION_INTERVAL0': '1',
     'DELVRY_AGGREGATION_INTERVAL1': '1',
     'DELVRY_SCHEDULE0': '1',
     'DELVRY_SNI0': '192.168.88.158',
     'DELVRY_USE_SSL_TLS1': '0'}
    

答案 1 :(得分:0)

先拆分'&'获取字符串列表,然后按' =',如下所示:

d = dict(kv.split('=') for kv in line.split('&'))

答案 2 :(得分:0)

import re

keys = {"DEFAULT_GATEWAY",
    "DELVRY_AGGREGATION_INTERVAL0",
    "DELVRY_AGGREGATION_INTERVAL1",
    "DELVRY_SCHEDULE0",
    "DELVRY_SNI0",
    "DELVRY_USE_SSL_TLS1"}
resdict = {}
for k in keys:
    pat = '{}([^&])&'.format(k)
    mo = re.search(pat, bigstring)
    if mo is None: continue  # no match
    resdict[k] = mo.group(1)
如果resdict是您要搜索的字符串,

会在bigstring中保留您想要的结果。

这假设您事先知道要查找哪些密钥,并将它们保存在集keys中。如果您事先不知道感兴趣的关键字,那当然是一个非常不同的问题。

相关问题