nfsd.service: Order after mxmount.service #217

donald · 2021-11-17T06:59:04Z

Currently there is no order between nfsd.service and mxmount.service. If
mxmount is slow (e.g. when file systems have to process their journals
after a server crash), nfsd startup might execute exportfs -ra while
the filesystems are not yet mounted, thereby exporting the unmounted
mountpoints.

This can result in mount failures on nfs clients or in "stale NFS
handle" errors on clients, which had the filesystems mounted before the
crash.

mxmount.service uses mxmount --noexport so there is no reexport
triggered, after the filesystems are mounted. nfsd.services executes
additional exportfs -ra commands 10, 20 and 30 seconds after the nfsd
startup, but 30 seconds might not be enough time to mount all
filesystems after the crash of a fileserver.

These errors can persist. They are partly resolved by a manual
exportfs -ra after a longer time, making the file systems available for
mounting. However, the "stale NFS handle" problem might still be visible
on clients which picked up the now covered inodes of the mountpoints.

Order nfsd.service after mxmount.service so that we don't export the
mountpoints.

Currently there is no order between nfsd.service and mxmount.service. If mxmount is slow (e.g. when file systems have to process their journals after a server crash), nfsd startup might execute `exportfs -ra` while the filesystems are not yet mounted, thereby exporting the unmounted mountpoints. This can result in mount failures on nfs clients or in "stale NFS handle" errors on clients, which had the filesystems mounted before the crash. mxmount.service uses `mxmount --noexport` so there is no reexport triggered, after the filesystems are mounted. nfsd.services executes additional `exportfs -ra` commands 10, 20 and 30 seconds after the nfsd startup, but 30 seconds might not be enough time to mount all filesystems after the crash of a fileserver. These errors can persist. They are partly resolved by a manual `exportfs -ra` after a longer time, making the file systems available for mounting. However, the "stale NFS handle" problem might still be visible on clients which picked up the now covered inodes of the mountpoints. Order nfsd.service after mxmount.service so that we don't export the mountpoints.

donald · 2021-11-17T07:00:50Z

Tested on dose (with artificial sleep() delay in mxmount)

donald force-pushed the nfsd-after-mxmount branch from 00d49f7 to 2c7ad52 Compare November 17, 2021 06:59

pmenzel merged commit da1c66b into master Nov 17, 2021

nfsd.service: Order after mxmount.service #217

nfsd.service: Order after mxmount.service #217

donald commented Nov 17, 2021 •

edited

Loading

donald commented Nov 17, 2021

nfsd.service: Order after mxmount.service #217

nfsd.service: Order after mxmount.service #217

Conversation

donald commented Nov 17, 2021 • edited Loading

donald commented Nov 17, 2021

donald commented Nov 17, 2021 •

edited

Loading