我有这个简单的txt文件:
[header]
width=8
height=5
tilewidth=175
tileheight=150
[tilesets]
tileset=../GFX/ts1.png,175,150,0,0
[layer]
type=Tile Layer 1
data=
1,1,1,1,1,1,1,1,
1,0,0,0,0,0,0,1,
1,0,0,0,0,1,1,1,
1,0,0,0,6,0,0,1,
1,1,1,1,4,1,1,1
我想通过“[header]”,“[tilesets]”和“[layers]”分隔文本。问题是,如果我以这种方式拆分它:
m = open(self.fullPath, 'r+')
sliced = m.read().split() # Default = \n
print sliced
它应该分隔每一行,因为read()总是在每一行的末尾留下一个'\ n':
['[header]', 'width=8', 'height=5', 'tilewidth=175', 'tileheight=150', '[tilesets]', 'tileset=../GFX/ts1.png,175,150,0,0', '[layer]', 'type=Tile', 'Layer', '1', 'data=', '1,1,1,1,1,1,1,1,', '1,0,0,0,0,0,0,1,', '1,0,0,0,0,1,1,1,', '1,0,0,0,6,0,0,1,', '1,1,1,1,4,1,1,1']
但是,如果是新行字符,则可以完美地分割,有一个“#”符号或任何分隔每个部分的内容。
然后,我想:“那里有空行,它们是新行字符,所以我只需要测试行是否等于换行字符并用'#'替换它”:
for line in m.readlines():
if line == '\n':
m.write('#')
for line in m.readlines():
print line
完美..除此之外......而不是实现这一点:
[header]
width=8
height=5
tilewidth=175
tileheight=150
#
[tilesets]
tileset=../GFX/ts1.png,175,150,0,0
#
[layer]
type=Tile Layer 1
data=
1,1,1,1,1,1,1,1,
1,0,0,0,0,0,0,1,
1,0,0,0,0,1,1,1,
1,0,0,0,6,0,0,1,
1,1,1,1,4,1,1,1
我明白了:
[header]
width=8
height=5
tilewidth=175
tileheight=150
[tilesets]
tileset=../GFX/ts1.png,175,150,0,0
[layer]
type=Tile Layer 1
data=
1,1,1,1,1,1,1,1,
1,0,0,0,0,0,0,1,
1,0,0,0,0,1,1,1,
1,0,0,0,6,0,0,1,
1,1,1,1,4,1,1,1##õÙÓ Z d Z d d l Z d d l Z d " Z d $ „ Z d - f d „ ƒ Y Z e H d ƒ Z e I j ƒ Æ Çîà õÙÓ ; | j d ƒ } i < g d 6 g d 6 g d 6 } d = d d g } x 0 ·ð? | j ƒ D u tîà õÙÓI À¶ð ) (–à W # "íà õ@ÎÔ €·ðB | j ƒ D ú ú–à õ(Tò `·ð } | C G H q | @·ð Ñ Ñ–à õ@ÎÔ ¨ ¨–à õ@ÎÔ
E G H | F j ƒ –à õ@ÎÔ S V V–à õ@ÎÔž ÿÿÿÿ t | j d ƒ } i g d 6g d 6g d 6} d d d g } x0 | j ƒ D]" } | d k rk | j d ƒ n qI Wx | j ƒ D] } | GHq| Wd
GH| j ƒ d S `:ð> >§à õ@ÎÔÀ:ðà¢îðà:ð ;ð`ßî ;ð@;ð0ð`;ð £îXð@ ï€;ð€ð ;ð`£îÀ;ðà;ð ð2 2›à õ@ÎÔ`<ð€<ðà¤î <ð ?îÀ<ðà<ð =ð =ð@=ðÀ?î ïÐð`=ð¸ï€=ð =ðøðÀ=ðà=ð >ð >ð@>ð`>ð ð€>ð >ðÀ>ðà>ð ?ð@OÑ ?ð@?ð`?ð€?ð ?ðHðpðÀ?ð˜ðÀðà?ð @ðÀ£î@@ð`@ð€@ð @ð PðHPðÀ@ðà@ð
没有意义:)。
答案 0 :(得分:3)
同时读取和写入文件往往会对您获得的输出产生不可预测的影响。
如果您的类别总是以两个换行符分隔,那么只需将其拆分,而不是进行任何奇特的查找/替换操作。
m = open("input.txt", "r+")
sliced = m.read().split("\n\n")
print "data has been split into {} categories.".format(len(sliced))
#print the starting line of each category
for category in sliced:
print category.split("\n")[0]
结果:
data has been split into 3 categories.
[header]
[tilesets]
[layer]