Hi again folks,
Here is a configuration change we have been working on refining. It will be added to the published KB articles relating to the H730 controller today. This configuration change could potentially mitigate several of the symptoms, specifically drives going offline as well as the periodic PSOD where you see completeCmdFusion referenced in the PSOD stack.
esxcfg-advcfg -s 110000 /LSOM/diskIoTimeout
esxcfg-advcfg -s 1 /LSOM/diskIoRetryFactor
These changes become effective immediately and will persist between reboots. At any time you can revert back to the default values as follows:
esxcfg-advcfg -s 20000 /LSOM/diskIoTimeout
esxcfg-advcfg -s 3 /LSOM/diskIoRetryFactor
There is no risk in applying the updated configuration values in terms of performance or data redundancy. This change only adjusts the VSAN IO retry behavior to be more lenient.