slurm: front end used as a compute node does not respond

Date: 2019-03-08 15:24:49

Tags: slurm

Similar to: slurm: use a control node also for computing

I want to use the front end as a compute node. I added the following to slurm.conf:
NodeName=gisc RealMemory=63000 Sockets=1 CoresPerSocket=8 ThreadsPerCore=2 State=UNKNOWN Weight=2
NodeName=c[0-2] RealMemory=126000 Sockets=1 CoresPerSocket=16 ThreadsPerCore=2 State=UNKNOWN Weight=1
PartitionName=normal Nodes=gisc,c[0-2] Default=YES MaxTime=INFINITE State=UP

I then restarted slurmd and slurmctld. However, the front-end node never responds, and sinfo shows its state with an asterisk:

PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
normal*      up   infinite      1  idle* gisc
normal*      up   infinite      2  alloc c[0-1]
normal*      up   infinite      1   idle c2
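In sinfo output, an asterisk appended to a node state means the node is currently not responding. As a minimal sketch (the function name `unresponsive_nodes` and the `SAMPLE` string are illustrative assumptions, not part of Slurm), the listing above could be scanned for such nodes like this. It assumes the default six-column sinfo layout shown here and only inspects the STATE column, since the asterisk on the partition name merely marks the default partition.

```python
def unresponsive_nodes(sinfo_text):
    """Return NODELIST entries whose STATE ends with '*' (node not responding)."""
    flagged = []
    # Skip the header line; each remaining line is one partition/state group.
    for line in sinfo_text.strip().splitlines()[1:]:
        fields = line.split()
        if len(fields) < 6:
            continue  # not a default-format sinfo row
        state, nodelist = fields[4], fields[5]
        if state.endswith("*"):
            flagged.append(nodelist)
    return flagged


# Sample copied from the output in the question.
SAMPLE = """\
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
normal*      up   infinite      1  idle* gisc
normal*      up   infinite      2  alloc c[0-1]
normal*      up   infinite      1   idle c2
"""

print(unresponsive_nodes(SAMPLE))
```

For the sample above, only gisc is flagged, which matches the symptom described.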

In addition, I cannot start slurmd on the front-end node. The logs are not helpful. Could slurmd and slurmctld be conflicting on the front-end node?

My /etc/hosts looks like this:

192.168.1.1 gisc.localdomain gisc gisc-eth0.localdomain gisc-eth0

### ALL ENTRIES BELOW THIS LINE WILL BE OVERWRITTEN BY WAREWULF ###
#
# See provision.conf for configuration paramaters


# Node Entry for node: c0 (ID=22)
192.168.1.2             c0.localdomain c0 c0-eth0.localdomain c0-eth0

# Node Entry for node: c1 (ID=23)
192.168.1.3             c1.localdomain c1 c1-eth0.localdomain c1-eth0

# Node Entry for node: c2 (ID=24)
192.168.1.4             c2.localdomain c2 c2-eth0.localdomain c2-eth0

1 answer:

Answer 0 (score: 0)

*facepalm* The slurm-client libraries were missing on the front end. Only the slurm-server libraries were installed...
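A quick way to catch this kind of problem might be to check that both daemons are actually present on the node that is meant to act as controller and compute node. The sketch below is only an illustration under assumptions: the helper name `find_daemons` is made up, and it assumes that a missing client-side package shows up as a missing `slurmd` executable on `PATH`. Package names and install locations vary by distribution, and daemons in a directory such as /usr/sbin may not be on a non-root user's `PATH`.

```python
import shutil


def find_daemons(names=("slurmctld", "slurmd"), which=shutil.which):
    """Map each daemon name to its executable path, or None if not found on PATH."""
    return {name: which(name) for name in names}


for name, path in find_daemons().items():
    # A None path suggests the package providing that daemon is not installed
    # (or that its directory is not on PATH for the current user).
    print(f"{name}: {path if path else 'NOT FOUND on PATH'}")
```

On a front end that should also compute, both entries would be expected to resolve to a path.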