[Bug 2090994] Re: Seeing OS hang after running 300+ reboot loops on Turin with MI210 GPUs

Brahamaprakash Vardhaman 2090994 at bugs.launchpad.net
Thu Dec 5 04:37:25 UTC 2024


** Description changed:

  1. Run a rebooter script on Turin blade with AMD GPU MI210 installed in
  Gdale
  
  Result: After 300+ reboots OS (Ubuntu) hangs (see image attached)
  
  Rebooting OS will fix an issue.
  
  Seen on multiple blade setups. No hangs observer with less than 300
  loops so far.
  
  Here is a Linux grub workaround provided by AMD (also see image
  attached):
  
  Please disable 5-level page tables. Here is how to do it:
  1. Open the GRUB configuration file: sudo nano /etc/default/grub
  2. Add no5lvl to the list of parameters: GRUB_CMDLINE_LINUX="... no5lvl"
  3. Add add iommu=pt to GRUB_CMDLINE_LINUX_DEFAULT
  4. Update GRUB: sudo update-grub
  5. Reboot the system.
+ 
+ Ubuntu OS version used here is 22.04

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to grub2 in Ubuntu.
https://bugs.launchpad.net/bugs/2090994

Title:
  Seeing OS hang after running 300+ reboot loops on Turin with MI210
  GPUs

Status in grub2 package in Ubuntu:
  New

Bug description:
  1. Run a rebooter script on Turin blade with AMD GPU MI210 installed
  in Gdale

  Result: After 300+ reboots OS (Ubuntu) hangs (see image attached)

  Rebooting OS will fix an issue.

  Seen on multiple blade setups. No hangs observer with less than 300
  loops so far.

  Here is a Linux grub workaround provided by AMD (also see image
  attached):

  Please disable 5-level page tables. Here is how to do it:
  1. Open the GRUB configuration file: sudo nano /etc/default/grub
  2. Add no5lvl to the list of parameters: GRUB_CMDLINE_LINUX="... no5lvl"
  3. Add add iommu=pt to GRUB_CMDLINE_LINUX_DEFAULT
  4. Update GRUB: sudo update-grub
  5. Reboot the system.

  Ubuntu OS version used here is 22.04

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/grub2/+bug/2090994/+subscriptions




More information about the foundations-bugs mailing list