Troubleshooting Guide

Common issues and solutions when using slurm-factory.

Installation Issues

LXD Permission Errors

Problem: Permission denied when accessing LXD

Error: You don't have permission to access the LXD daemon

Solution:

# Add user to lxd group
sudo usermod -a -G lxd "$USER"

# Apply the new group in the current shell (or log out and back in)
newgrp lxd
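
To confirm that the group change is active in the current shell, a check along these lines may help. This is a minimal sketch: the helper name has_group is made up for illustration, and it relies only on the standard id -nG command.

```shell
# has_group GROUP [GROUP_LIST]
# Succeeds if GROUP appears in the space-separated GROUP_LIST.
# GROUP_LIST defaults to the groups of the current process (id -nG).
# has_group is a hypothetical helper, not part of slurm-factory or LXD.
has_group() {
    wanted=$1
    groups_list=${2-$(id -nG)}
    for g in $groups_list; do
        if [ "$g" = "$wanted" ]; then
            return 0
        fi
    done
    return 1
}

if has_group lxd; then
    echo "lxd group is active in this shell"
else
    echo "lxd group not active yet; log out and back in, or run: newgrp lxd"
fi
```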

UV Not Found

Problem: uv command not found after installation

Solution:

# Reload shell configuration
source ~/.bashrc

# Or reinstall uv
curl -LsSf https://astral.sh/uv/install.sh | sh
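
If reloading the shell does not help, the install directory may be missing from PATH. The sketch below assumes uv was installed to ~/.local/bin; that location and the helper name path_has are assumptions, so adjust the directory to whatever the installer reported.

```shell
# path_has DIR [PATH_STRING]
# Succeeds if DIR is one of the colon-separated entries of PATH_STRING
# (defaults to $PATH). path_has is a hypothetical helper.
path_has() {
    case ":${2-$PATH}:" in
        *":$1:"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Assumed install location; change this if the installer reported another path.
uv_dir="$HOME/.local/bin"

if command -v uv >/dev/null 2>&1; then
    echo "uv found at: $(command -v uv)"
elif path_has "$uv_dir"; then
    echo "$uv_dir is on PATH but uv is not there; reinstalling may help"
else
    echo "$uv_dir is not on PATH; add it with: export PATH=\"$uv_dir:\$PATH\""
fi
```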

Build Issues

Out of Memory Errors

Problem: Build fails with memory errors

Solutions:

# Increase container memory
lxc config set build-container limits.memory 32GB

# Reduce parallel jobs
export SPACK_BUILD_JOBS=2
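
Rather than guessing a job count, it can be derived from the memory available to the build. The sketch below assumes roughly 2 GB per compile job, which is a rule of thumb and not a slurm-factory setting; pick_jobs is a hypothetical helper.

```shell
# pick_jobs CORES MEM_GB PER_JOB_GB
# Prints the smaller of CORES and MEM_GB / PER_JOB_GB, but never less than 1.
pick_jobs() {
    by_mem=$(( $2 / $3 ))
    n=$1
    if [ "$by_mem" -lt "$n" ]; then
        n=$by_mem
    fi
    if [ "$n" -lt 1 ]; then
        n=1
    fi
    echo "$n"
}

# Example: 16 cores, 8 GB memory limit, assumed 2 GB per job
SPACK_BUILD_JOBS=$(pick_jobs 16 8 2)
export SPACK_BUILD_JOBS
echo "SPACK_BUILD_JOBS=$SPACK_BUILD_JOBS"   # prints SPACK_BUILD_JOBS=4
```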

Network/Download Issues

Problem: Downloads fail during build

Solutions:

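# Test connectivity (ICMP may be blocked even when HTTPS works)
ping -c 3 github.com

# Configure proxy (if needed); most downloads use HTTPS, so set both
export http_proxy=http://proxy.company.com:8080
export https_proxy=http://proxy.company.com:8080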
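
Because build downloads typically go over HTTPS, an HTTPS request is often a more telling test than ping. The following sketch prints the proxy variables in effect and defines an HTTPS check; both function names are made up for illustration, and the 10-second timeout is an arbitrary choice.

```shell
# show_proxy_env: print each common proxy variable and its value, or "(unset)".
show_proxy_env() {
    for name in http_proxy https_proxy no_proxy; do
        eval "value=\${$name-}"
        echo "$name=${value:-(unset)}"
    done
}

# check_https URL: succeed if an HTTPS HEAD request to URL completes within 10 s.
check_https() {
    curl -fsSI --max-time 10 "$1" >/dev/null
}
```

For example, running show_proxy_env followed by check_https https://github.com shows which proxy settings apply and whether GitHub is reachable through them.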

Disk Space Issues

Problem: No space left on device

Solutions:

# Check space usage
df -h ~/.slurm-factory/

# Clean old builds (replace old-version with the build directory to delete)
rm -rf ~/.slurm-factory/builds/old-version/
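
Before deleting anything, it helps to see which builds take the most space. This is a sketch using du and sort; builds_by_size is a hypothetical helper, and the builds directory is the one used in the commands above.

```shell
# builds_by_size DIR
# Prints "SIZE_KB<TAB>PATH" for each entry directly under DIR, smallest first.
builds_by_size() {
    if [ ! -d "$1" ]; then
        echo "no such directory: $1" >&2
        return 1
    fi
    du -sk "$1"/* 2>/dev/null | sort -n
}

builds_dir="$HOME/.slurm-factory/builds"
if [ -d "$builds_dir" ]; then
    builds_by_size "$builds_dir"
fi
```

The largest entries appear at the bottom of the listing, which makes them easy to pick out for removal.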

Runtime Issues

Module Not Found

Problem: Module system can’t find Slurm modules

Solutions:

# Check module path
echo $MODULEPATH

# Add module directory
export MODULEPATH=/opt/modules:$MODULEPATH
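
When the export above goes into a shell startup file, each new shell prepends the directory again. A small guard avoids duplicate entries; prepend_path is a hypothetical helper, shown here as one possible approach.

```shell
# prepend_path DIR LIST
# Prints LIST with DIR at the front, unless DIR is already an entry.
prepend_path() {
    case ":$2:" in
        *":$1:"*) echo "$2" ;;      # already present: leave LIST unchanged
        "::")     echo "$1" ;;      # LIST is empty: result is just DIR
        *)        echo "$1:$2" ;;   # otherwise put DIR first
    esac
}

MODULEPATH=$(prepend_path /opt/modules "${MODULEPATH-}")
export MODULEPATH
```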

Command Not Found

Problem: Slurm commands not found after loading module

Solutions:

# Check module is loaded
module list

# Reload module
module unload slurm/25.05
module load slurm/25.05
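
After reloading, a quick check can show exactly which Slurm client commands are still missing from PATH. The sketch below uses only command -v; missing_cmds is a hypothetical helper, and the command list is a typical set of Slurm clients rather than a definitive one.

```shell
# missing_cmds CMD...
# Prints each CMD that is not found on PATH; succeeds only if none are missing.
missing_cmds() {
    status=0
    for cmd in "$@"; do
        if ! command -v "$cmd" >/dev/null 2>&1; then
            echo "$cmd"
            status=1
        fi
    done
    return $status
}

if missing_cmds sinfo squeue sbatch srun >/dev/null; then
    echo "Slurm client commands are on PATH"
else
    echo "missing: $(missing_cmds sinfo squeue sbatch srun | tr '\n' ' ')"
fi
```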

Library Errors

Problem: Library loading errors

Solutions:

# Check library path (one entry per line, Slurm entries only)
echo "$LD_LIBRARY_PATH" | tr ':' '\n' | grep slurm

# Update library cache
sudo ldconfig
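
To find out which library fails to load, ldd can list the shared libraries a binary needs and flag the ones it cannot resolve. This sketch assumes a Linux system with ldd available and uses sinfo as the example binary; unresolved_libs is a hypothetical helper.

```shell
# unresolved_libs BINARY
# Prints the "not found" lines from ldd for BINARY; prints nothing (and returns
# non-zero, from grep) when ldd reports no unresolved libraries.
unresolved_libs() {
    ldd "$1" 2>/dev/null | grep 'not found'
}

slurm_bin=$(command -v sinfo || true)
if [ -n "$slurm_bin" ]; then
    unresolved_libs "$slurm_bin" || echo "ldd reported no unresolved libraries for $slurm_bin"
else
    echo "sinfo is not on PATH; load the module first"
fi
```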

Getting Help

Bug Reports

Include system information when reporting issues:

# System information
uname -a
lxd --version
python3 --version

# Slurm Factory logs
cat ~/.slurm-factory/logs/latest.log
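
The commands above can be gathered into a single file to attach to a report. This is a sketch, not a slurm-factory feature: collect_report is a hypothetical helper, and limiting the log to its last 200 lines is an arbitrary choice to keep the file small.

```shell
# collect_report OUTFILE
# Writes the output of each diagnostic command to OUTFILE, noting any that fail
# or are not installed instead of stopping.
collect_report() {
    out=$1
    : > "$out"
    for cmd in "uname -a" "lxd --version" "python3 --version"; do
        echo "\$ $cmd" >> "$out"
        $cmd >> "$out" 2>&1 || echo "(command failed or not installed)" >> "$out"
    done
    log="$HOME/.slurm-factory/logs/latest.log"
    echo "\$ tail -n 200 $log" >> "$out"
    tail -n 200 "$log" >> "$out" 2>&1 || echo "(log not found)" >> "$out"
}
```

For example, collect_report /tmp/slurm-factory-report.txt writes everything to one file; review it for sensitive paths or hostnames before sharing.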

Community Support

Still having issues? Contact us through GitHub Issues.