Errors during yum don't halt update
Posted: 13 Apr 2017 17:52
If yum fails due to lack of RAM ("Cannot allocate memory"), then the EFA updater ignores this and proceeds regardless.
The rest of the update completed (although ^Z to suspend it to check the RAM just froze the entire update process, so I rebooted and tried again) but the kernel update was skipped and now it's impossible to resolve this using the stock configuration. yum list only shows .642 series kernels now, and the .696 headers package that I couldn't remove as some software is already using it.
I resumed the suspended yum transaction for the kernel update, but all that did was remove the old kernel (the one actually in use, .642) without completing installing the new one.
Now the current kernel (.696) isn't actually an available package. I removed the .696 packages (except headers) in an attempt to reinstall them, and this worked, but now I can't put them back. In a copy/paste error I removed the .642 kernel-devel package, but now I can't put that one back either.
It would have been preferable to detect that yum failed originally, and suspend/abort the EFA update, as it would have avoided yum getting itself thoroughly confused by crashing out in the middle of an update.
(I guess EFA needs more RAM that it did in the past — I've increased the VM memory allocation.)
The rest of the update completed (although ^Z to suspend it to check the RAM just froze the entire update process, so I rebooted and tried again) but the kernel update was skipped and now it's impossible to resolve this using the stock configuration. yum list only shows .642 series kernels now, and the .696 headers package that I couldn't remove as some software is already using it.
I resumed the suspended yum transaction for the kernel update, but all that did was remove the old kernel (the one actually in use, .642) without completing installing the new one.
Now the current kernel (.696) isn't actually an available package. I removed the .696 packages (except headers) in an attempt to reinstall them, and this worked, but now I can't put them back. In a copy/paste error I removed the .642 kernel-devel package, but now I can't put that one back either.
It would have been preferable to detect that yum failed originally, and suspend/abort the EFA update, as it would have avoided yum getting itself thoroughly confused by crashing out in the middle of an update.
(I guess EFA needs more RAM that it did in the past — I've increased the VM memory allocation.)