字符串操作/正则表达式-查找并替换

时间:2020-09-03 11:37:59

标签: python regex string dictionary replace

我正在尝试替换以下txt文件中.之后的每个单词:

line1
line2
field: [orders.cancelled,orders.delivered,orders.reached
orders.pickup,orders.time]
some line
some line

我有字典:

   d = {'cancelled':'cancelled_at', 'deliver':'xxx'}

我正在运行以下代码。但是,我得到了部分匹配的结果,即

我看到新文件中包含以下文字

field: [orders.cancelled_at, orders.xxxed ..........

在这里,从交付的单词开始,程序仍然替换前7个单词(交付),并在末尾添加“ ed”。我不确定为什么

with open('list.txt', 'r') as g:
    text = g.read()
    for k in d:
        before = f'.{k}'
        after = f'.{d[k]}
        #print(before)
        #print(after)
        text = text.replace(before, after)
        #print(text)

with open('new_list.txt', 'w') as w:
    w.write(text)

我也尝试过这个,结果也一样

import re

with open('list.txt', 'r') as f:
    text = f.read()
    for k in d:
        before = f'.{k}(?!=\w)'
        print(before)
        after = f'.{d[k]}'
        print(after)
        text = re.sub(before, after, text)

with open('new_list.txt', 'w') as w:
    w.write(text)

1 个答案:

答案 0 :(得分:1)

您可以使用

import re

d = {'cancelled':'cancelled_at', 'deliver':'xxx'}
rx = re.compile(fr"(?<=\.)(?:{'|'.join(d)})\b")

with open('list.txt', 'r') as f:
    print( re.sub(rx, lambda x: d[x.group()], f.read()) )

请参见Python demo

由代码生成的正则表达式看起来像

(?<=\.)(?:cancelled|deliver)\b

请参见regex demo。详细信息:

  • (?<=\.)-向后正向匹配,该位置与文字.前紧跟的位置相匹配
  • (?:cancelled|deliver)-两种选择:cancelleddeliver
  • \b-作为整个单词,\b是单词边界。

lambda x: d[x.group()]替换将匹配的单词替换为相应的字典键值。