[Bug 574462] Re: udisks-probe-ata-smart causes HSM violations
Martin Pitt
martin.pitt at ubuntu.com
Mon May 3 13:29:22 UTC 2010
At this point I'd appreciate some input from the kernel team on what this
message actually means, and what the likely cause could be. It does not
really seem specific to either hdparm or libatasmart; all they do is
things like
https://bugs.launchpad.net/ubuntu/karmic/+source/libatasmart/+bug/445852/comments/202
(i.e. using the normal SCSI ioctls).
One possible workaround might be to not run hdparm on SSD devices. This
might only fix the symptom, but it might be an appropriate SRU for
lucid.
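
A minimal sketch of how such a check could look, assuming the kernel
exposes /sys/block/<dev>/queue/rotational and reports 0 for the SSD
(older SSDs that do not identify themselves as non-rotating may still
read 1 there). The function name is_ssd and its optional sysfs-root
argument are made up for illustration; this is not what
85-hdparm.rules currently does.

```shell
#!/bin/sh
# is_ssd DEV [SYSFS_ROOT]
# Returns success if the block device DEV (e.g. "sda") is flagged as
# non-rotational by the kernel. SYSFS_ROOT defaults to /sys and only
# exists so the function can be tried against a fake tree.
is_ssd() {
    dev=$1
    sysfs=${2:-/sys}
    flag="$sysfs/block/$dev/queue/rotational"
    # A missing or unreadable attribute is treated as "not known to be an SSD".
    [ -r "$flag" ] && [ "$(cat "$flag")" = "0" ]
}

# Hypothetical use in a wrapper around the hdparm call:
#   is_ssd sda || hdparm -B254 /dev/sda
```

If the flag cannot be trusted on the affected hardware, matching on
the drive model string would be the fallback, at the cost of
maintaining a list.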
** Description changed:
- Will be added later.
+ This is related to bug 445852; it causes the same effects, but under
+ different circumstances.
+
+ During boot on an SSD system, and when logging in and starting something
+ rather I/O-intensive like firefox, the system freezes for 30 seconds, and
+ afterwards dmesg shows an error like
+
+ ata2: lost interrupt (Status 0x58)
+ ata2: drained 16384 bytes to clear DRQ.
+ ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
+ ata2.00: BMDMA stat 0x4
+ ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
+ res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
+ ata2.00: status: { DRDY DRQ }
+ ata2: soft resetting link
+ ata2.00: configured for UDMA/66
+ ata2: EH complete
+
+ The main cause of bug 445828 has now been fixed, but there are still some
+ users who get those HSM violations/30 second hangs during boot.
+
+ Martin Pitt got ssh access to jarikvh's machine which is still affected
+ (he's willing to provide access to other people for debugging).
+
+ A lot of different commands were tried to reproduce this at runtime,
+ like
+
+ # for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done
+ # udevadm trigger --action=change --sysname-match=sda # (also in a loop)
+ # (/lib/udev/udisks-probe-ata-smart /dev/sda &); /lib/udev/udisks-probe-ata-smart /dev/sda
+
+ and so on, but unfortunately it seems impossible to reproduce at
+ runtime. I also tried those commands while a "grep -r . /usr" was
+ running in the background to induce I/O and disk reading activity.
+
+ The interesting thing is that the bug goes away if you either disable
+ /lib/udev/rules.d/85-hdparm.rules, or udisks-probe-ata-smart in
+ /lib/udev/rules.d/80-udisks.rules. So this seems to happen in situations
+ where there is something reading a lot of files from the disk, and
+ hdparm or libatasmart send their ioctls to the drive.
+
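
As a reading aid for the dmesg excerpt above: the "Status 0x58" value
can be split into the individual bits of the ATA status register. A
small sketch; the function name decode_ata_status is hypothetical, and
the names used for the lower bits (DSC, CORR, IDX) are the legacy ones.

```shell
#!/bin/sh
# decode_ata_status VALUE
# Prints the names of the bits set in an ATA status register byte.
# VALUE may be decimal or hex (e.g. 0x58).
decode_ata_status() {
    status=$(( $1 ))
    out=""
    # bit value : flag name, highest bit first
    for pair in 128:BSY 64:DRDY 32:DF 16:DSC 8:DRQ 4:CORR 2:IDX 1:ERR; do
        bit=${pair%%:*}
        name=${pair#*:}
        if [ $(( status & bit )) -ne 0 ]; then
            # append with a single space separator
            out="$out${out:+ }$name"
        fi
    done
    printf '%s\n' "$out"
}

decode_ata_status 0x58   # prints: DRDY DSC DRQ
```

This agrees with the kernel's "status: { DRDY DRQ }" line apart from
DSC; the kernel summary seems to list only a subset of the bits. DRQ
still being set would fit the "drained 16384 bytes to clear DRQ"
message, but it does not by itself say why the interrupt was lost.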
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-2.6.32-21-generic 2.6.32-21.32
Regression: No
Reproducible: Yes
ProcVersionSignature: Ubuntu 2.6.32-21.32-generic 2.6.32.11+drm33.2
Uname: Linux 2.6.32-21-generic i686
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
AplayDevices:
- **** List of PLAYBACK Hardware Devices ****
- card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
- Subdevices: 1/1
- Subdevice #0: subdevice #0
+ **** List of PLAYBACK Hardware Devices ****
+ card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
+ Subdevices: 1/1
+ Subdevice #0: subdevice #0
Architecture: i386
ArecordDevices:
- **** List of CAPTURE Hardware Devices ****
- card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
- Subdevices: 1/1
- Subdevice #0: subdevice #0
+ **** List of CAPTURE Hardware Devices ****
+ card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
+ Subdevices: 1/1
+ Subdevice #0: subdevice #0
AudioDevicesInUse:
- USER PID ACCESS COMMAND
- /dev/snd/controlC0: jarik 1395 F.... pulseaudio
+ USER PID ACCESS COMMAND
+ /dev/snd/controlC0: jarik 1395 F.... pulseaudio
Card0.Amixer.info:
- Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16'
- Mixer name : 'Realtek ALC268'
- Components : 'HDA:10ec0268,1025015b,00100101'
- Controls : 8
- Simple ctrls : 5
+ Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16'
+ Mixer name : 'Realtek ALC268'
+ Components : 'HDA:10ec0268,1025015b,00100101'
+ Controls : 8
+ Simple ctrls : 5
Date: Mon May 3 15:21:38 2010
InstallationMedia: Ubuntu-Netbook-Remix 9.10 "Karmic Koala" - Release i386 (20091028.4)
MachineType: Acer AOA110
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-21-generic root=UUID=994b1074-44a7-4871-9553-fe61b94182cf ro quiet splash pciehp.pciehp_force=1 elevator=noop
ProcEnviron:
- LANG=en_US.utf8
- SHELL=/bin/bash
+ LANG=en_US.utf8
+ SHELL=/bin/bash
RelatedPackageVersions: linux-firmware 1.34
RfKill:
- 0: phy0: Wireless LAN
- Soft blocked: no
- Hard blocked: no
+ 0: phy0: Wireless LAN
+ Soft blocked: no
+ Hard blocked: no
SourcePackage: linux
dmi.bios.date: 05/09/2008
dmi.bios.vendor: INSYDE
dmi.bios.version: v0.3109
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: Base Board Product Name
dmi.board.vendor: Intel Corp.
dmi.board.version: Base Board Version
dmi.chassis.type: 1
dmi.chassis.vendor: Chassis Manufacturer
dmi.chassis.version: Chassis Version
dmi.modalias: dmi:bvnINSYDE:bvrv0.3109:bd05/09/2008:svnAcer:pnAOA110:pvr1:rvnIntelCorp.:rnBaseBoardProductName:rvrBaseBoardVersion:cvnChassisManufacturer:ct1:cvrChassisVersion:
dmi.product.name: AOA110
dmi.product.version: 1
dmi.sys.vendor: Acer
** Changed in: linux (Ubuntu)
Status: New => Confirmed
** Changed in: linux (Ubuntu)
Importance: Undecided => High
** Changed in: linux (Ubuntu)
Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team)
** Description changed:
This is related to bug 445852; it causes the same effects, but under
different circumstances.
During boot on an SSD system, and when logging in and starting something
rather I/O-intensive like firefox, the system freezes for 30 seconds, and
afterwards dmesg shows an error like
- ata2: lost interrupt (Status 0x58)
- ata2: drained 16384 bytes to clear DRQ.
- ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
- ata2.00: BMDMA stat 0x4
- ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
- res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
- ata2.00: status: { DRDY DRQ }
- ata2: soft resetting link
- ata2.00: configured for UDMA/66
- ata2: EH complete
+ ata2: lost interrupt (Status 0x58)
+ ata2: drained 16384 bytes to clear DRQ.
+ ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
+ ata2.00: BMDMA stat 0x4
+ ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
+ res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
+ ata2.00: status: { DRDY DRQ }
+ ata2: soft resetting link
+ ata2.00: configured for UDMA/66
+ ata2: EH complete
The main cause of bug 445828 has now been fixed, but there are still some
users who get those HSM violations/30 second hangs during boot.
- Martin Pitt got ssh access to jarikvh's machine which is still affected
+ Martin Pitt got ssh access to Jarige's machine which is still affected
(he's willing to provide access to other people for debugging).
A lot of different commands were tried to reproduce this at runtime,
like
- # for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done
- # udevadm trigger --action=change --sysname-match=sda # (also in a loop)
- # (/lib/udev/udisks-probe-ata-smart /dev/sda &); /lib/udev/udisks-probe-ata-smart /dev/sda
+ # for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done
+ # udevadm trigger --action=change --sysname-match=sda # (also in a loop)
+ # (/lib/udev/udisks-probe-ata-smart /dev/sda &); /lib/udev/udisks-probe-ata-smart /dev/sda
and so on, but unfortunately it seems impossible to reproduce at
runtime. I also tried those commands while a "grep -r . /usr" was
running in the background to induce I/O and disk reading activity.
The interesting thing is that the bug goes away if you either disable
/lib/udev/rules.d/85-hdparm.rules, or udisks-probe-ata-smart in
/lib/udev/rules.d/80-udisks.rules. So this seems to happen in situations
where there is something reading a lot of files from the disk, and
hdparm or libatasmart send their ioctls to the drive.
-
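
For anyone who wants to repeat the "disable 85-hdparm.rules" test
without touching the packaged file: a sketch that masks the rule from
/etc/udev/rules.d. It assumes that udev lets a file in
/etc/udev/rules.d take precedence over a file of the same name in
/lib/udev/rules.d, and that an empty file is skipped; the function
name mask_udev_rule is made up. Disabling only udisks-probe-ata-smart
would instead need an edited copy of 80-udisks.rules, which is not
shown here.

```shell
#!/bin/sh
# mask_udev_rule NAME [ETC_RULES_DIR]
# Creates an empty rules file NAME in ETC_RULES_DIR (default
# /etc/udev/rules.d) so that the packaged rule of the same name is
# overridden. Refuses to touch an existing file.
mask_udev_rule() {
    name=$1
    etc_dir=${2:-/etc/udev/rules.d}
    target="$etc_dir/$name"
    if [ -e "$target" ] || [ -L "$target" ]; then
        echo "refusing to overwrite $target" >&2
        return 1
    fi
    : > "$target"
}

# Hypothetical use (as root), then reboot to test:
#   mask_udev_rule 85-hdparm.rules
# Undo by removing /etc/udev/rules.d/85-hdparm.rules again.
```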
** Also affects: hdparm (Ubuntu)
Importance: Undecided
Status: New
--
udisks-probe-ata-smart causes HSM violations
https://bugs.launchpad.net/bugs/574462