[Bug 1103187] Re: automatic updates tend to reboot and die into grub rescue

Péter Prőhle 1103187 at bugs.launchpad.net
Fri Jan 25 18:12:25 UTC 2013


New progression in confirming, that "grub-install /dev/sda" writes block information into boot area, while the content of these block are refreshed much later, latest at the next mounting of the XFS root filesystem.

The "dpkg-reconfigure grub-pc" contains first "grub-install /dev/sda"
and second "update-grub".

My separated experinces shows, that the second, "update-grub" makes no
reboot problem, while "grub-install /dev/sda" is guaranteed to cause
reboot fail.

I found an interesting new error, occuring wery late during the boot
process, namely I could already see the graphical works space, when a
small window popped up informing me, that

        Sorry, Ubuntu 12.10 experienced an internal error.

There was a facility to automatically send a report of it, I sent it.

Now the real progress is, that I found that the OTHER PROBLEMS described
in the very first post

        "booting the new kernel hangs at the missing initial ram disk"

now knowing much more about the incoherency problem, so now it is clear,
that the initial ram disk FAILS TO BE MISSING, instead, due to the
incoherency problems, the boot process can't see the new initial ram
disk of the new kernel!

Just after the previous post, I received the new kernel 3.5.0-23, this
is the 6. kernel update since the release of 12.10.  As a surprise, this
time there was NO reboot problem.  Hence I decided to remove the
previous kernel, the 3.5.0-22, and immediately I got reboot problem, but
slighly different from the previous ones:

namely I got NORMAL grub prompt, but in turn of it, the linux.mod was
not loadable in spite of the fact, that it was listable,

and the most instructive fact was, that I could reach the DELETED
kernel, ramdisk etc!, while the new ramdisk was missing, while at the
last boot I could boot it!

This means, that when dpkg purged the previous kernel, then the file
system journaled what to do, but did not eventually do it on the block
level before the reboot.

Now I see, that the diverse kind of problems in the last 2 1/2 months
were NOT diverse, but only this incoherency problem.

As a result, sometimes I got normal grub, but initrd was missing, or it
was not missing, but the grub menu was missing, etc.  Other times the
grub stoped at it's rescue mode due to the apparent inaccessibility of
normal.mod, etc.  These caused the impression of having diverse problems
instead of a single one.

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to grub2 in Ubuntu.
https://bugs.launchpad.net/bugs/1103187

Title:
  xfs left inconcistent after reboot, causing grub to fail

Status in “grub2” package in Ubuntu:
  Incomplete

Bug description:
  
  On 3 essentially different Ubuntu 12.10 installations

          bios or uefi boot,
          linux is alone or other system is along,
          legacy or guid partiton table

  it is a regularly appearing issue since 12.10, that if the automatic
  update touches a package which has some impact on the boot,

  then the next reboot get stock at either the grub rescue prompt, or
  booting the new kernel hangs at the missing initial ram disk, the
  latter is typical after kernel update, even just after the virgin
  installation of a fresh Ubuntu.

  Grub rescue prompt is the much harder situation, I either type in the
  correct grub commands to boot the previous kernel, or I use a "boot an
  existing linux from a partition" menu of a SYSRESC pen drive.

  Never really clear, what was wrong and what really helped.

  Here is a list of my manual struggling and accidental solution
  methods:

  (1) Sometimes I do nothing, except for booting once more the system by
  hand or by SYSRESC pen, and surprisingly the next time the system
  boots, asif there was no kind of problem before.   THIS happens more
  frequently on the combination below, and less frequently on the other
  2 combinations:

          bios boot + other system along + legacy partition table

  (2) More frequent in general is that remove/reinstall grub2, and/or
  remove/reinstall new kernel, and/or simply grub-install and update-
  grub helps, but usually NOT IN ONE STEP, however even after a logical
  and defensive

          grub-install /dev/sda

  the situation likes to become worse, usually changes between the two
  possibilities below:

          "error: invalid arch-independant ELF magic."

          "error: ELF header smaller than expected."

  and naturally I have the grub rescue prompt.   This time, just before
  this error report, the solution appeared to be the

          removal of memtest86+ and reinstall of it,

  as first there was a normal grub menu, but memtest was missing from it
  and the boot was unsuccessful, and after the usual kernel and grub
  tampering I got the invalid arch-independant ELF magic, furter usual
  tampering bought the ELF header smaller than expected, and finally my
  desperate trial was the removal of the memtest against it's
  dependencies, ... and it helped this time.

  This kind of struggling is more frequent on the 2 combination below:

          bios boot + linux alone + legacy partition table

          uefi boot + linux alone + either legacy or guid partition
  table (the both yields the error)

  The latter uefi shows the problem most stable, even if I reinstall the
  12.10 back to bios, from scratch.

  I work with computers since 1972, and with linux 1994, but I still has
  no firm idea which package is buggy.

  My guess is that not the grub itself is buggy, but the other packages
  have a buggy configuration relation to grub.

  I suggest to rate this bug serious, if we take serious the #1 main
  bug, see this site.

  I spread linux among my students at the university since the 90's,
  however if their system can become unavailable due to an automatic
  update, then some of then will give up learning linux.

  I understand that a usual other bug can be serious and hard.  But if I
  suggest the someone to switch to linux, and he/she can lost even the
  booting opportunity, then it is scary for most of the average users,
  and usually they have no enough skills to tackle the situtation, even
  to rescue their own personal files back to a proprietary system.

  That's why I suggest to rate this bug serious.  Serious in the
  consequencies in public relations.

  The version of the all the packages I use are the most up to date due
  to my policy to update as frequent as possible.

  ProblemType: Bug
  DistroRelease: Ubuntu 12.10
  Package: grub2 (not installed)
  Uname: Linux 3.2.34-std312-amd64 x86_64
  NonfreeKernelModules: raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear radeon r8169 mii usb_storage ttm drm_kms_helper drm i2c_algo_bit i2c_core wmi
  ApportVersion: 2.6.1-0ubuntu10
  Architecture: amd64
  Date: Tue Jan 22 21:28:32 2013
  InstallationDate: Installed on 2012-10-19 (95 days ago)
  InstallationMedia: Ubuntu 12.10 "Quantal Quetzal" - Release amd64 (20121017.5)
  MarkForUpload: True
  ProcEnviron:
   TERM=xterm
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  SourcePackage: grub2
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/grub2/+bug/1103187/+subscriptions




More information about the foundations-bugs mailing list