Win 11 Cl2 - OS in reboot loop with BSOD, rebuild in progress.

Shubham

Windows Team
Staff member
Win11 Cl2 is being checked as some service seems to be crashing and making server reboot. We are investigating it.
 
Although it is not showing in any logs to be the problem, it seems there is an issue with the RAID and hitting a bad sector then making it reboot. We are checking further now but may have to rebuild this.
 
It looks like I was incorrect about the RAID issue and bad sector data, there was a RAID having fault but it was not one with the CL2-WIN11 server. It is being addressed separately. It does explain why there was no bad sector log during the problems with CL2-WIN11 however.
However it continues to reboot and I've taken it completely offline to copy it across to another node and run some testing if it also happens there or not, it may be an OS level issue which will likely require a complete reinstall-restore operation.

About 40 minutes on the copy for the evaluation on another node, so it is going to be some time before it is live again even in the best case.
 
Good news is data copied without and error, so that was found to not be the issue, so far bad news is, reboot auto continues on new node, however I did see the brief flash of a BSOD on the server. I've disabled the auto reboot on error and trying to see if there is any way to solve this in a timely manner, otherwise it will require a rebuild operation.
 
Nothing helping, reinstalled all drivers via a safe mode installer, and still it is rebooting a few minutes into operating.
 
We are now starting a rebuild process. We won't have to copy/restore data of the sites, but we WILL have to do a recreate operation which takes quite a long time, to make the metabase and users, and then permissions will need to run on the users.
This server was a first edition of 2008 server, and needed an upgrade anyway, but this wasn't the intended or planned way to go about it!
 
It is going to be at least 12 hours for this, the physical creation time to rebuild the users and configs is always quite slow and sometimes drags on entirely too long. I apologize for this, but there is nothing we can do to speed it up.
 
Rebuilding is still ongoing, no major updates as of yet. We did attempt to restore the server back to the last few backups and still it was blue screen error. We had also rebooted the server recently with no issue, even since the last full backup and it rebooted without error, so the sudden error is very odd. The error that is coming indicates a driver or hardware failure(which it wasn't as it was doing the same on multiple nodes and is a virtual machine) and has a hotfix but only for the R2 edition of Windows 2008, not the version that was running.
 
The new Windows installation is acting funny, some updates didn't apply correctly and keep reinstalling, we're working on this issue now.
 
I am really not happy right now, no person made a mistake, but installs went all in wrong orders on windows update, and I'm basically starting over. This day, is a nightmare if I could actually sleep.
 
There is light at the end of the tunnel now, and all should be up in approx 1 hour give or take
 
The server is UP. there WILL be a number of reboots still as we finalize some installations of compenents and others but we wanted to get it online with sites even with reboot/s still pending.
 
Back
Top