What to do if a server fails to boot?
Common Scenarios
Early Boot Failures.
- Server displays grub error.
- Server boots to the grub> command prompt.
- Server boots to a blinking cursor.
- Server is missing /boot due to accidental deletion or filesystem checks.
Interim Boot failures.
- Server displays "Kernel panic - not syncing - Attempted to kill init".
- Server fails to boot due to an "Out of memory:" situation.
- Server fails to boot after updating.
- Server hangs during the boot process.
- Server panicked and now fails to boot on reboot.
- Server stops booting and drops into maintenance/emergency mode.
- Server constantly reboots during the boot process.
- Server does not show anything after the grub menu and displays a blank screen.
- Server is stuck at boot with tracebacks indicating filesystem corruption.
Troubleshooting Steps
To isolate the issue, the boot process is divided into two parts: "Early Boot Failure" and "Interim Boot Failure".
Identify which failure type matches the server's behavior, then follow the corresponding steps below to troubleshoot the issue and gather logs.
Early Boot Failures.
- Server displays grub error.
- Server boots to the grub> command prompt.
- Server boots to a blinking cursor.
- Server has /boot contents missing or accidentally deleted.
Try the relevant knowledge base article:
- Fix can't boot situation due to misconfigured hugepages.
- System boots to grub> prompt after a reboot
- System fails to boot with error "file 'grub2/i386-pc/normal.mod' not found"
- How do I restore the /boot directory after accidentally deleting its contents or corruption.
Interim Boot failures.
- Server displays "Kernel panic - not syncing - Attempted to kill init".
- Server fails to boot due to an "Out of memory:" situation.
- Server fails to boot after updating.
- Server hangs during the boot process.
- Server panicked and now fails to boot on reboot.
- Server stops booting and drops into maintenance/emergency mode.
- Server constantly reboots during the boot process.
- Server does not show anything after the grub menu and displays a blank screen.
- Server is stuck at boot with tracebacks indicating filesystem corruption.
Try the relevant knowledge base article:
- If the server shows a kernel panic with the latest installed kernel, try booting from another kernel. Refer to the following solution:
  How do I boot using an alternate / previous / older kernel?
- If the server fails to boot due to "Out of memory:" messages on the console, refer to the following solution:
  How to fix can't boot issue caused due to misconfigured hugepages resulting in oom during bootup without using ISO?
- Capture a screenshot of the kernel panic to see the exact error it failed with, so that it can be troubleshot further. The following article covers a commonly seen issue caused by a missing initramfs file:
  After update, kernel panic at boot with error: Unable to mount root fs on unknown-block(0,0)
- If the server stops booting and drops to maintenance mode, check which mounts or volume groups have failed. The following article covers troubleshooting maintenance mode issues:
  Why does Red Hat Enterprise Linux is dropping into maintenance mode while booting?
- If the server shows filesystem corruption tracebacks during boot, refer to the following article, which covers the steps to repair a filesystem:
  How to repair filesystem in rescue mode for Red Hat Enterprise Linux
- If the server constantly reboots during the boot process, refer to the following article, which covers a similar issue and its resolution:
  My system shuts down before completing the boot process
- If the system has a GUI and does not boot to the login screen, refer to the following articles to boot in multi-user mode:
  RHEL 6 and below: How to boot into single user mode or runlevel 3 using the GRUB bootloader?
  RHEL 7 and RHEL 8: How to boot into emergency or multi-user mode from GRUB2?
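The split between early and interim failures can be expressed as a simple lookup on the text visible on the console. The following is a minimal sketch, not a diagnostic tool: the marker strings are taken from the symptom lists and article titles above, the function name classify_boot_failure is hypothetical, and real console output may match neither list.

```python
# Marker strings copied from the symptom lists above. The grouping into
# "early" and "interim" follows this article; everything else here is an
# illustrative assumption.
EARLY_MARKERS = (
    "grub>",
    "normal.mod' not found",
)
INTERIM_MARKERS = (
    "Kernel panic - not syncing",
    "Out of memory:",
    "Unable to mount root fs",
    "maintenance mode",
    "emergency mode",
)


def classify_boot_failure(console_text: str) -> str:
    """Return 'early', 'interim', or 'unknown' for a console message."""
    lowered = console_text.lower()
    # Check bootloader-stage markers first, since they appear before
    # the kernel has started.
    if any(marker.lower() in lowered for marker in EARLY_MARKERS):
        return "early"
    if any(marker.lower() in lowered for marker in INTERIM_MARKERS):
        return "interim"
    return "unknown"
```

A message that returns 'unknown' (for example a blinking cursor or a blank screen, which produce no text at all) still has to be matched against the symptom lists by hand.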
Steps to take before creating a support case
Complete the steps below and provide the resulting data when opening a case. Because the boot process involves several subsystems, this information helps align the case with the correct specialty group.
- Steps for capturing the screenshot: If the server still does not boot and remains hung or shows a failed status for services, capture screenshots with more verbose boot messages using How to display more verbose boot-related messages during system startup. First remove the quiet and rhgb parameters if either is present. Once they have been removed, append the parameters below to the kernel line to enable booting with debugging. The output will help support pinpoint where the failure is occurring in the boot process. Capture as many screenshots or pictures of the console as possible while the server is booting. Pictures are worth a thousand words and will help get the case to the appropriate engineer more quickly.
  - For RHEL 6 and below, append:
    debug ignore_loglevel print_fatal_signals=1 printk.time=1 initcall_debug log_buf_len=10M
  - For RHEL 7 and RHEL 8, append:
    rd.debug initcall_debug log_buf_len=10M
- Steps for capturing the serial console logs: If you can capture serial console logs on the server, provide those rather than screenshots or pictures, as they are even more helpful. See How does one set up a serial terminal and/or console in Red Hat Enterprise Linux?
- Capture sosreport from the rescue mode: After capturing screenshots of the boot process with debugging enabled, support will need an sosreport (an sosreport can be captured even if the server is not booting). The following knowledge article walks through how to do this: How to boot Red Hat Enterprise Linux to Rescue Mode for Data Collection (sosreport, vmcore, etc.)
  - Once an sosreport is generated in rescue mode, enable networking in rescue mode, transfer the report to another working system with scp or rsync, and then provide it to support.
  - Alternatively, the rescue mode assistant lab can be used for assistance.
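The kernel-line change described in the screenshot step (remove quiet and rhgb, then append the debug parameters) can be sketched as a small string transformation. This is only an illustration of the edit that is normally made by hand at the boot menu: the function name add_debug_params and the sample kernel line are hypothetical, and the sketch deliberately refuses RHEL versions that this article does not list.

```python
# Parameters to remove and to append, copied from the steps above.
REMOVE = {"quiet", "rhgb"}
DEBUG_RHEL6_AND_BELOW = (
    "debug ignore_loglevel print_fatal_signals=1 "
    "printk.time=1 initcall_debug log_buf_len=10M"
).split()
DEBUG_RHEL7_AND_8 = "rd.debug initcall_debug log_buf_len=10M".split()


def add_debug_params(kernel_line: str, rhel_major: int) -> str:
    """Return kernel_line without quiet/rhgb and with debug parameters added."""
    if rhel_major <= 6:
        extra = DEBUG_RHEL6_AND_BELOW
    elif rhel_major in (7, 8):
        extra = DEBUG_RHEL7_AND_8
    else:
        # This article only lists parameters for RHEL 8 and earlier.
        raise ValueError("no debug parameters listed for this release")
    # Drop the parameters that hide boot messages.
    tokens = [tok for tok in kernel_line.split() if tok not in REMOVE]
    # Append each debug parameter once, preserving the listed order.
    for param in extra:
        if param not in tokens:
            tokens.append(param)
    return " ".join(tokens)


# Hypothetical kernel line, for illustration only.
sample = "linux16 /vmlinuz-3.10.0 root=/dev/mapper/rhel-root ro rhgb quiet"
print(add_debug_params(sample, 7))
```

The function only rewrites a string; applying the result still means editing the kernel line in the boot menu for the affected boot.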