我试图在不返回任何内容的void函数上使用带有多个参数的Python多处理库。这是我的最小工作示例。
import numpy as np
from multiprocessing import Pool
dim1 = 2
dim2 = 2
test1 = np.zeros((dim1,dim2))
test2 = np.zeros((dim1,dim2))
iteration = []
for i in range(0,dim1):
for j in range(0,dim2):
iteration.append((i,j))
def testing(num1,num2):
test1[num1,num2] = 1
test2[num1,num2] = 2
if __name__ == '__main__':
pool = Pool(processes=4)
pool.starmap(testing, iteration)
print(test1)
print(test2)
这里的问题是变量test1和test2在首次初始化时打印零数组。相反,我对test1的含义是test2的1s数组和2s的数组。我想要的代码
if __name__ == '__main__':
pool = Pool(processes=4)
pool.starmap(testing, iteration)
要做的是这样:
testing(0,0)
testing(1,0)
testing(0,1)
testing(1,1)
我看过一些相关的帖子,例如this。这篇文章和我的文章之间的区别在于,我的函数是一个void函数,而不是返回变量,我希望函数只是更改变量的值。
答案 0 :(得分:1)
要使用全局数组 返回多个结果来跨多个进程更新数组:
multiprocessing.Array
类存储数组数据。initializer
参数将数组传递给进程。请注意,Array
是一维的,因此必须重塑形状以进行更新和显示。
尝试以下代码:
import numpy as np
from multiprocessing import Pool, Array
dim1 = 2
dim2 = 2
def init(tt1,tt2): # receive shared arrays
global test1,test2
test1,test2 = tt1,tt2
def testing(num1,num2):
t1 = np.frombuffer(test1.get_obj()).reshape((dim1, dim2)) # need to reshape to 2D array
t2 = np.frombuffer(test2.get_obj()).reshape((dim1, dim2))
t1[num1,num2] = 1
t2[num1,num2] = 2
if __name__ == '__main__':
tt1 = Array('d', dim1*dim2) # 1 dimensional arrays
tt2 = Array('d', dim1*dim2)
iteration = []
for i in range(0,dim1):
for j in range(0,dim2):
iteration.append((i,j))
pool = Pool(processes=4, initializer=init, initargs=(tt1,tt2)) # pass shared arrays to processes
pool.starmap(testing, iteration)
# still have access to the shared arrays
t1final = np.frombuffer(tt1.get_obj()).reshape((dim1, dim2))
t2final = np.frombuffer(tt2.get_obj()).reshape((dim1, dim2))
print(t1final, t2final, sep='\n')
输出
[[1. 1.]
[1. 1.]]
[[2. 2.]
[2. 2.]]