chpox provides transparent checkpointing and
restarting of processes on Linux clusters. It was
originally designed for recovering tasks that have
a long execution time (i.e. numerical simulations)
in case of system crashes, power failures, etc. It
may work with openMosix, is SMP safe, does not
require program recompiling/relinking, and
supports virtual memory, regular open files,
pipes, Unix domain sockets, current directory, and
child processes.