[Bug 1103187] Re: automatic updates tend to reboot and die into grub rescue
Péter Prőhle
1103187 at bugs.launchpad.net
Thu Jan 24 22:35:34 UTC 2013
Thanks for the http://sourceforge.net/projects/bootinfoscript/
suggestion.
Today I made quite few experiments, and the result in brief is:
no reboot problem after "dpkg-reconfigure grub-pc" and neither after the
other two boot-tampering, in the 3 cases below:
/boot/ tree is in a separated ext4 filesystem, root is xfs
/boot/ tree is in a separated xfs filesystem, root is xfs
/boot/ tree is in the root filesystem, which is ext4
deterministically appearing reboot problem in the original case:
/boot/ tree is in the root filesystem, which is xfs
However I can not predict the prospective reboot error, today I saw
additional 2 new kind of error messeges:
"error: attempt to read or write outside of partition."
"error: file `/boot/grub/i386-pc/normal.mod' not found."
At grub rescue prompt the command "ls /boot/grub/i386-pc/normal.mod"
tend to give various error messages, like
"error: not a correct XFS node."
sometimes even the /boot/grub appears to be empty, or one can see the
entry i386-pc in it, but it can't be listed by ls /boot/grub/i386-pc,
and so on.
While at the earlier missing initrd.img problem I could boot by hand,
this reboot error gives no opportunity to boot by hand.
However if I boot by SYSRESCCD, and I do nothing else (or work for a
while) and I reboot, than this second reboot is guaranted to be
successful.
SYSRESCCD I think does not tamper the boot system, at least the
bootinfoscript has an identical output before the unsuccessful boot, and
after the successful boot.
Now I try to attach the 4 results.txt files, corresponding the 4 cases,
according to whether the /boot/ tree is in separate partition, and
whether the partition containing the /boot/ tree is xfs or ext4.
Comment: for testing the case of when /boot is in the root partition of
type ext4, I installed a new Ubuntu on an other drive replaced into my
linux box.
** Attachment added: "tar bzip2 of the 4 RESULTS.txt"
https://bugs.launchpad.net/ubuntu/+source/grub2/+bug/1103187/+attachment/3499254/+files/results.tbz
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to grub2 in Ubuntu.
https://bugs.launchpad.net/bugs/1103187
Title:
automatic updates tend to reboot and die into grub rescue
Status in “grub2” package in Ubuntu:
Incomplete
Bug description:
On 3 essentially different Ubuntu 12.10 installations
bios or uefi boot,
linux is alone or other system is along,
legacy or guid partiton table
it is a regularly appearing issue since 12.10, that if the automatic
update touches a package which has some impact on the boot,
then the next reboot get stock at either the grub rescue prompt, or
booting the new kernel hangs at the missing initial ram disk, the
latter is typical after kernel update, even just after the virgin
installation of a fresh Ubuntu.
Grub rescue prompt is the much harder situation, I either type in the
correct grub commands to boot the previous kernel, or I use a "boot an
existing linux from a partition" menu of a SYSRESC pen drive.
Never really clear, what was wrong and what really helped.
Here is a list of my manual struggling and accidental solution
methods:
(1) Sometimes I do nothing, except for booting once more the system by
hand or by SYSRESC pen, and surprisingly the next time the system
boots, asif there was no kind of problem before. THIS happens more
frequently on the combination below, and less frequently on the other
2 combinations:
bios boot + other system along + legacy partition table
(2) More frequent in general is that remove/reinstall grub2, and/or
remove/reinstall new kernel, and/or simply grub-install and update-
grub helps, but usually NOT IN ONE STEP, however even after a logical
and defensive
grub-install /dev/sda
the situation likes to become worse, usually changes between the two
possibilities below:
"error: invalid arch-independant ELF magic."
"error: ELF header smaller than expected."
and naturally I have the grub rescue prompt. This time, just before
this error report, the solution appeared to be the
removal of memtest86+ and reinstall of it,
as first there was a normal grub menu, but memtest was missing from it
and the boot was unsuccessful, and after the usual kernel and grub
tampering I got the invalid arch-independant ELF magic, furter usual
tampering bought the ELF header smaller than expected, and finally my
desperate trial was the removal of the memtest against it's
dependencies, ... and it helped this time.
This kind of struggling is more frequent on the 2 combination below:
bios boot + linux alone + legacy partition table
uefi boot + linux alone + either legacy or guid partition
table (the both yields the error)
The latter uefi shows the problem most stable, even if I reinstall the
12.10 back to bios, from scratch.
I work with computers since 1972, and with linux 1994, but I still has
no firm idea which package is buggy.
My guess is that not the grub itself is buggy, but the other packages
have a buggy configuration relation to grub.
I suggest to rate this bug serious, if we take serious the #1 main
bug, see this site.
I spread linux among my students at the university since the 90's,
however if their system can become unavailable due to an automatic
update, then some of then will give up learning linux.
I understand that a usual other bug can be serious and hard. But if I
suggest the someone to switch to linux, and he/she can lost even the
booting opportunity, then it is scary for most of the average users,
and usually they have no enough skills to tackle the situtation, even
to rescue their own personal files back to a proprietary system.
That's why I suggest to rate this bug serious. Serious in the
consequencies in public relations.
The version of the all the packages I use are the most up to date due
to my policy to update as frequent as possible.
ProblemType: Bug
DistroRelease: Ubuntu 12.10
Package: grub2 (not installed)
Uname: Linux 3.2.34-std312-amd64 x86_64
NonfreeKernelModules: raid10 raid456 async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx raid1 raid0 multipath linear radeon r8169 mii usb_storage ttm drm_kms_helper drm i2c_algo_bit i2c_core wmi
ApportVersion: 2.6.1-0ubuntu10
Architecture: amd64
Date: Tue Jan 22 21:28:32 2013
InstallationDate: Installed on 2012-10-19 (95 days ago)
InstallationMedia: Ubuntu 12.10 "Quantal Quetzal" - Release amd64 (20121017.5)
MarkForUpload: True
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
SourcePackage: grub2
UpgradeStatus: No upgrade log present (probably fresh install)
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/grub2/+bug/1103187/+subscriptions
More information about the foundations-bugs
mailing list