[Bug 1103187] Re: automatic updates tend to reboot and die into grub rescue

Péter Prőhle 1103187 at bugs.launchpad.net
Thu Jan 24 22:35:34 UTC 2013


Thanks for the http://sourceforge.net/projects/bootinfoscript/
suggestion.

Today I made quite few experiments, and the result in brief is:

no reboot problem after "dpkg-reconfigure grub-pc" and neither after the
other two boot-tampering, in the 3 cases below:

        /boot/ tree is in a separated ext4 filesystem, root is xfs

        /boot/ tree is in a separated xfs filesystem, root is xfs

        /boot/ tree is in the root filesystem, which is ext4

deterministically appearing reboot problem in the original case:

        /boot/ tree is in the root filesystem, which is xfs

However I can not predict the prospective reboot error, today I saw
additional 2 new kind of error messeges:

        "error: attempt to read or write outside of partition."

        "error: file `/boot/grub/i386-pc/normal.mod' not found."

At grub rescue prompt the command "ls /boot/grub/i386-pc/normal.mod"
tend to give various error messages, like

        "error: not a correct XFS node."

sometimes even the /boot/grub appears to be empty, or one can see the
entry i386-pc in it, but it can't be listed by ls /boot/grub/i386-pc,
and so on.

While at the earlier missing initrd.img problem I could boot by hand,
this reboot error gives no opportunity to boot by hand.

However if I boot by SYSRESCCD, and I do nothing else (or work for a
while) and I reboot, than this second reboot is guaranted to be
successful.

SYSRESCCD I think does not tamper the boot system, at least the
bootinfoscript has an identical output before the unsuccessful boot, and
after the successful boot.

Now I try to attach the 4 results.txt files, corresponding the 4 cases,
according to whether the /boot/ tree is in separate partition, and
whether the partition containing the /boot/ tree is xfs or ext4.

Comment: for testing the case of when /boot is in the root partition of
type ext4, I installed a new Ubuntu on an other drive replaced into my
linux box.


** Attachment added: "tar bzip2 of the 4 RESULTS.txt"
   https://bugs.launchpad.net/ubuntu/+source/grub2/+bug/1103187/+attachment/3499254/+files/results.tbz

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to grub2 in Ubuntu.
https://bugs.launchpad.net/bugs/1103187

Title:
  automatic updates tend to reboot and die into grub rescue

Status in “grub2” package in Ubuntu:
  Incomplete

Bug description:
  
  On 3 essentially different Ubuntu 12.10 installations

          bios or uefi boot,
          linux is alone or other system is along,
          legacy or guid partiton table

  it is a regularly appearing issue since 12.10, that if the automatic
  update touches a package which has some impact on the boot,

  then the next reboot get stock at either the grub rescue prompt, or
  booting the new kernel hangs at the missing initial ram disk, the
  latter is typical after kernel update, even just after the virgin
  installation of a fresh Ubuntu.

  Grub rescue prompt is the much harder situation, I either type in the
  correct grub commands to boot the previous kernel, or I use a "boot an
  existing linux from a partition" menu of a SYSRESC pen drive.

  Never really clear, what was wrong and what really helped.

  Here is a list of my manual struggling and accidental solution
  methods:

  (1) Sometimes I do nothing, except for booting once more the system by
  hand or by SYSRESC pen, and surprisingly the next time the system
  boots, asif there was no kind of problem before.   THIS happens more
  frequently on the combination below, and less frequently on the other
  2 combinations:

          bios boot + other system along + legacy partition table

  (2) More frequent in general is that remove/reinstall grub2, and/or
  remove/reinstall new kernel, and/or simply grub-install and update-
  grub helps, but usually NOT IN ONE STEP, however even after a logical
  and defensive

          grub-install /dev/sda

  the situation likes to become worse, usually changes between the two
  possibilities below:

          "error: invalid arch-independant ELF magic."

          "error: ELF header smaller than expected."

  and naturally I have the grub rescue prompt.   This time, just before
  this error report, the solution appeared to be the

          removal of memtest86+ and reinstall of it,

  as first there was a normal grub menu, but memtest was missing from it
  and the boot was unsuccessful, and after the usual kernel and grub
  tampering I got the invalid arch-independant ELF magic, furter usual
  tampering bought the ELF header smaller than expected, and finally my
  desperate trial was the removal of the memtest against it's
  dependencies, ... and it helped this time.

  This kind of struggling is more frequent on the 2 combination below:

          bios boot + linux alone + legacy partition table

          uefi boot + linux alone + either legacy or guid partition
  table (the both yields the error)

  The latter uefi shows the problem most stable, even if I reinstall the
  12.10 back to bios, from scratch.

  I work with computers since 1972, and with linux 1994, but I still has
  no firm idea which package is buggy.

  My guess is that not the grub itself is buggy, but the other packages
  have a buggy configuration relation to grub.

  I suggest to rate this bug serious, if we take serious the #1 main
  bug, see this site.

  I spread linux among my students at the university since the 90's,
  however if their system can become unavailable due to an automatic
  update, then some of then will give up learning linux.

  I understand that a usual other bug can be serious and hard.  But if I
  suggest the someone to switch to linux, and he/she can lost even the
  booting opportunity, then it is scary for most of the average users,
  and usually they have no enough skills to tackle the situtation, even
  to rescue their own personal files back to a proprietary system.

  That's why I suggest to rate this bug serious.  Serious in the
  consequencies in public relations.

  The version of the all the packages I use are the most up to date due
  to my policy to update as frequent as possible.

  ProblemType: Bug
  DistroRelease: Ubuntu 12.10
  Package: grub2 (not installed)
  Uname: Linux 3.2.34-std312-amd64 x86_64
  NonfreeKernelModules: raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear radeon r8169 mii usb_storage ttm drm_kms_helper drm i2c_algo_bit i2c_core wmi
  ApportVersion: 2.6.1-0ubuntu10
  Architecture: amd64
  Date: Tue Jan 22 21:28:32 2013
  InstallationDate: Installed on 2012-10-19 (95 days ago)
  InstallationMedia: Ubuntu 12.10 "Quantal Quetzal" - Release amd64 (20121017.5)
  MarkForUpload: True
  ProcEnviron:
   TERM=xterm
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: grub2
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/grub2/+bug/1103187/+subscriptions




More information about the foundations-bugs mailing list