[Bug 2090994] [NEW] Seeing OS hang after running 300+ reboot loops on Turin with MI210 GPUs

Launchpad Bug Tracker 2090994 at bugs.launchpad.net
Wed Dec 4 14:52:47 UTC 2024


You have been subscribed to a public bug:

1. Run a rebooter script on Turin blade with AMD GPU MI210 installed in
Gdale

Result: After 300+ reboots OS (Ubuntu) hangs (see image attached)

Rebooting OS will fix an issue.

Seen on multiple blade setups. No hangs observer with less than 300
loops so far.

Here is a Linux grub workaround provided by AMD (also see image
attached):

Please disable 5-level page tables. Here is how to do it:
1. Open the GRUB configuration file: sudo nano /etc/default/grub
2. Add no5lvl to the list of parameters: GRUB_CMDLINE_LINUX="... no5lvl"
3. Add add iommu=pt to GRUB_CMDLINE_LINUX_DEFAULT
4. Update GRUB: sudo update-grub
5. Reboot the system.

** Affects: grub2 (Ubuntu)
     Importance: Undecided
         Status: New

-- 
Seeing OS hang after running 300+ reboot loops on Turin with MI210 GPUs
https://bugs.launchpad.net/bugs/2090994
You received this bug notification because you are a member of Ubuntu Foundations Bugs, which is subscribed to grub2 in Ubuntu.



More information about the foundations-bugs mailing list