Python - 比较文件中的字符串

时间:2013-02-12 19:36:54

标签: python string file

我只是学习Python在Inkscape中进行扩展,而且我在比较从文件加载的字符串时遇到了问题。我要做的是加载我在文本文件中定义的多边形:

    polygon
    r:255
    g:0
    b:0
    50;50
    50;100
    100;50

我的解析方法是这样的:

    def load_file(filepath, parent, log):
        file = open(filepath)
        x = []
        y = []
        r = 0
        g = 0
        b = 0
        index = 0

        for line in file:
            fline = line.lstrip("\xef\xbb\xbf").rstrip("\n")
            log.write("Input string: " + repr(line) + "\n")
            log.write("Formatted: " + repr(fline) + "\n")
            if fline == "":
                continue
            elif fline is "polygon": ## Where the first line should be going
                log.write("\tDetected string as polygon start delimiter\n")
                if index > 0:
                    draw_shape(x, y, r, g, b, "Polygon", parent)
                    del x[0, len(x)]
                    del y[0, len(y)]
                    r = g = b = index = 0
                continue
            elif fline[:2] is "r:":
                log.write("\tDetected string as polygon red value delimiter\n")
                r = int(fline[2:])
                continue
            elif fline[:2] is "g:":
                log.write("\tDetected string as polygon green value delimiter\n")
                g = int(fline[2:])
                continue
            elif fline[:2] is "b:":
                log.write("\tDetected string as polygon blue value delimiter\n")
                b = int(fline[2:])
                continue
            else: ## Where the first line actually is going
                log.write("\tDelimiter failed previous detections; assumed to be polygon cordinates\n")
                spl = fline.split(";")
                x[index] = float(spl[0]) ## Error gets thrown here
                y[index] = float(spl[1])
                index += 1
                continue

        draw_shape(x, y, r, g, b, parent)

这种方法在第一行绊倒。它不断看到“多边形”并前往最后的其他块,在那里它解析坐标。我一直保持的日志文件如下所示:

    Process Started
    Input string: '\xef\xbb\xbfpolygon\n'
    Formatted: 'polygon'
        Delimiter failed previous detections; assumed to be polygon coordinates

我已经在shell中完成了这个过程,并在那里说line is "process"是真的,所以我完全迷失在这里。有什么帮助吗?

2 个答案:

答案 0 :(得分:1)

  1. 比较fline is "polygon"几乎总是假的。请改用fline == "polygon"

  2. 这与您的问题无关,但如果您使用正确的Unicode解码函数,则可以更轻松地处理文本,而不是手动剥离字节顺序标记并将其余部分视为字节。我更喜欢codecs.open(filename, encoding='utf-8-sig')

答案 1 :(得分:1)

成功打开Unicode文件后,我认为这样的事情比你正在做的更容易:

elements='''polygon
r:255
g:0
b:0
50;50
50;100
100;50

polygon
r:155
g:22
b:55
55;60
66;100
120;150
155;167'''       

for element in re.split(r'^\s*\n',elements,flags=re.MULTILINE):
    if element.startswith('polygon'):
        el=element.splitlines()
        poly={k:v for k,v in [s.split(':') for s in el[1:4]]}
        x,y=zip(*[s.split(';') for s in el[4:]])
        poly.update({'x':x, 'y': y})
        print poly

打印:

{'y': ('50', '100', '50'), 'x': ('50', '50', '100'), 'r': '255', 'b': '0', 'g': '0'}
{'y': ('60', '100', '150', '167'), 'x': ('55', '66', '120', '155'), 'r': '155', 'b': '55', 'g': '22'}