I am trying to exclude the folder names in the tuple below from the src_files list. The output of src_files currently looks like this:
[u'Archive/', u'Archive/Backup_978f124418f1359d7b7e50db7295bbba_300201_TEST.csv', u'Source/', u'Source/978f124418f1359d7b7e50db7295bbba_64106_TEST2.csv', u'Source/978f124418f1359d7b7e50db7295bbba_771618_TEST1.csv', u'Source/978f124418f1359d7b7e50db7295bbba_991406_TEST1.csv', u'TEST.csv', u'Target/', u'Target/Matillion_Report.csv', u'Target/Rejected_978f124418f1359d7b7e50db7295bbba_300201_TEST.csv', u'Target/Report_978f124418f1359d7b7e50db7295bbba_300201_TEST.csv']
Here is the code I am trying:
import boto3

folder_names = ('Source/', 'Archive/', 'Target/')
src_files = []
s3 = boto3.resource('s3')
my_bucket = s3.Bucket('rampdataquality')
# Collect the key of every object in the bucket
for file in my_bucket.objects.all():
    print(file.key)
    src_files.append(file.key)
print(src_files)
The desired output is a src_files list that contains only TEST.csv (or any other .csv file), with everything else removed.
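One possible approach, sketched here under the assumption that "everything else" means every key that begins with one of the prefixes in folder_names: `str.startswith` accepts a tuple, so a single list comprehension can drop both the folder placeholders and the files inside them. The helper name `exclude_folders` and the `sample_keys` list (a shortened copy of the output shown above) are illustrative only and not part of the original code.

```python
folder_names = ('Source/', 'Archive/', 'Target/')


def exclude_folders(keys, prefixes):
    """Return the keys that do not start with any of the given prefixes."""
    # str.startswith accepts a tuple and is True if any one prefix matches.
    return [key for key in keys if not key.startswith(prefixes)]


# Shortened copy of the src_files output from the question (illustrative).
sample_keys = [
    'Archive/',
    'Archive/Backup_978f124418f1359d7b7e50db7295bbba_300201_TEST.csv',
    'Source/',
    'Source/978f124418f1359d7b7e50db7295bbba_64106_TEST2.csv',
    'TEST.csv',
    'Target/',
    'Target/Matillion_Report.csv',
]

print(exclude_folders(sample_keys, folder_names))  # ['TEST.csv']
```

The same test could be applied inside the original loop, appending `file.key` only when `not file.key.startswith(folder_names)`. If the intent is instead to drop just the bare folder entries such as 'Source/' while keeping the files inside them, the condition would be `key not in folder_names`.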