[Bug 445852] Re: devkit-disks-probe-ata-smart causes HSM Violations on SSD, and potential hardware death

Martin Pitt martin.pitt at ubuntu.com
Fri Mar 26 09:06:48 UTC 2010


For Karmic we can backport
http://git.0pointer.de/?p=libatasmart.git;a=commitdiff;h=a223a4f6277a9f006b722b13671d5292dc6339bb
to fix this properly. If we do this, we should also apply
http://git.0pointer.de/?p=libatasmart.git;a=commitdiff;h=54f846c2115e7addf5468a9c10ecf9ba844b946e
on top, to avoid exporting this as a new symbol.

The change just moves some initialization code into a new function and
calls it lazily. It does not change any API/ABI. It has been tested for
a long time in Lucid and should be fairly safe.

However, I'd like to keep the current workaround in devicekit-disks in
karmic-proposed for now (please test that it properly disables SMART
probing). Before re-enabling SMART probing and applying this patch in
Karmic again, I'd like to hear some more confirmations from affected
people here that things indeed work fine with Lucid beta-1 on a variety
of hardware platforms.
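
For anyone testing, here is a minimal sketch of how one might check whether
the SMART probe rule is still active, and apply the manual workaround from
the bug description non-interactively. It assumes GNU sed and the rules
file path quoted in the description; the function names are made up for
illustration. It only inspects the rules file, so it does not confirm that
the karmic-proposed package disables probing the same way.

```shell
#!/bin/sh
# Hypothetical helpers (names are not from any package).
# Default path is the one given in the bug description.
RULES_FILE="${RULES_FILE:-/lib/udev/rules.d/95-devkit-disks.rules}"

# Exit status 0 if any non-comment line still runs the probe helper.
smart_probe_active() {
    grep -v '^[[:space:]]*#' "$1" | grep -q 'devkit-disks-probe-ata-smart'
}

# Comment out every active line that runs the probe helper.
# GNU sed's -i.orig edits in place and keeps a backup as FILE.orig.
disable_smart_probe() {
    sed -i.orig \
        '/^[[:space:]]*#/!s/^\(.*devkit-disks-probe-ata-smart.*\)$/#\1/' "$1"
}
```

Usage would be something like `smart_probe_active "$RULES_FILE" && echo
"SMART probing is still enabled"`, and, as root, `disable_smart_probe
"$RULES_FILE"` followed by a reboot as in step 4 of the workaround.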

Thank you, and sorry for the trouble that this caused!

** Changed in: libatasmart (Ubuntu Karmic)
   Importance: Undecided => Medium

** Changed in: libatasmart (Ubuntu Karmic)
     Assignee: (unassigned) => Martin Pitt (pitti)

** Description changed:

- TEMPORARY WORK AROUND FOR THIS PROBLEM:
+ TEMPORARY WORK AROUND FOR THIS PROBLEM IN KARMIC: (This is now also in
+ karmic-proposed and needs testing feedback):
  
  1. sudo gedit /lib/udev/rules.d/95-devkit-disks.rules
  
  2. locate the following lines (about 1/3 the way into the file; search
  for "smart")
  
  # ATA disks driven by libata
  KERNEL=="sd*[!0-9]", ATTR{removable}=="0", ENV{ID_BUS}=="ata", ENV{DEVTYPE}=="disk", IMPORT{program}="devkit-disks-probe-ata-smart $tempnode"
  
  3. comment out the second line by adding a # in front, so you should
  have
  
  # ATA disks driven by libata
  #KERNEL=="sd*[!0-9]", ATTR{removable}=="0", ENV{ID_BUS}=="ata", ENV{DEVTYPE}=="disk", IMPORT{program}="devkit-disks-probe-ata-smart $tempnode"
  
  4. save the file and reboot
  
+ TECHNICAL ANALYSIS: https://bugs.launchpad.net/ubuntu/karmic/+source/libatasmart/+bug/445852/comments/202
+ LUCID STATUS: https://bugs.launchpad.net/ubuntu/karmic/+source/libatasmart/+bug/445852/comments/203
+ KARMIC SOLUTION: https://bugs.launchpad.net/ubuntu/karmic/+source/libatasmart/+bug/445852/comments/204
  
  BUG DESCRIPTION FOLLOWS:
  
  In the Karmic beta I experience SSD stalls during the boot process.  It
  happens almost every time before xsplash loads and happens again
  frequently between logging into gdm and the desktop loading.  When it
  happens during login, I think it makes GNOME time out while loading
  panel items, as I get errors about many panel items failing to
  load.  If I log out and back in again when the SSD isn't stalled, the
  panel items load fine.
  
  When it happens, the following messages appear before xsplash (or in
  dmesg when it happens after gdm):
  
  ata2: lost interrupt (Status 0x58)
  ata2: drained 16384 bytes to clear DRQ.
  ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
  ata2.00: BMDMA stat 0x4
  ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
  res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
  ata2.00: status: { DRDY DRQ }
  ata2: soft resetting link
  ata2.00: configured for UDMA/66
  ata2: EH complete
  
  I did not have this issue in Jaunty with this hardware, and I don't
  think it has happened once the system is fully loaded.  I am running
  Karmic UNR on an Acer Aspire One netbook.
  
  ProblemType: Bug
  AplayDevices:
   **** List of PLAYBACK Hardware Devices ****
   card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
     Subdevices: 1/1
     Subdevice #0: subdevice #0
  Architecture: i386
  ArecordDevices:
   **** List of CAPTURE Hardware Devices ****
   card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
     Subdevices: 1/1
     Subdevice #0: subdevice #0
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC0:  luke       1990 F.... pulseaudio
  CRDA: Error: [Errno 2] No such file or directory
  Card0.Amixer.info:
   Card hw:0 'Intel'/'HDA Intel at 0x58540000 irq 16'
     Mixer name	: 'Realtek ALC268'
     Components	: 'HDA:10ec0268,1025015b,00100101'
     Controls      : 9
     Simple ctrls  : 6
  CheckboxSubmission: 12ef539f3788bfbc46bc56b5c28128a6
  CheckboxSystem: c69722ecac764861be52925fa50b4dcc
  Date: Wed Oct  7 17:54:56 2009
  DistroRelease: Ubuntu 9.10
  HibernationDevice: RESUME=UUID=8d44b89b-2edb-4c02-a4be-94bd25b65081
  MachineType: Acer AOA110
  Package: linux-image-2.6.31-12-generic 2.6.31-12.40
  ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.31-12-generic root=UUID=039a096e-3486-4898-9eeb-44a705f8b7fd ro quiet splash elevator=noop usbcore.autosuspend=1
  ProcEnviron:
   LANG=en_CA.UTF-8
   SHELL=/bin/bash
  ProcVersionSignature: Ubuntu 2.6.31-12.40-generic
  RelatedPackageVersions: linux-firmware 1.21
  RfKill:
   0: phy0: Wireless LAN
    Soft blocked: no
    Hard blocked: no
  SourcePackage: linux
  Tags:  ubuntu-unr
  Uname: Linux 2.6.31-12-generic i686
  XsessionErrors:
   (gnome-settings-daemon:2006): GLib-CRITICAL **: g_propagate_error: assertion `src != NULL' failed
   (gnome-settings-daemon:2006): GLib-CRITICAL **: g_propagate_error: assertion `src != NULL' failed
   (nautilus:2092): Eel-CRITICAL **: eel_preferences_get_boolean: assertion `preferences_is_initialized ()' failed
   (polkit-gnome-authentication-agent-1:2118): GLib-CRITICAL **: g_once_init_leave: assertion `initialization_value != 0' failed
   (gnome-panel:2048): Gdk-WARNING **: /build/buildd/gtk+2.0-2.18.2/gdk/x11/gdkdrawable-x11.c:952 drawable is not a pixmap or window
  dmi.bios.date: 10/06/2008
  dmi.bios.vendor: Acer
  dmi.bios.version: v0.3309
  dmi.board.asset.tag: Base Board Asset Tag
  dmi.board.vendor: Acer
  dmi.board.version: Base Board Version
  dmi.chassis.type: 1
  dmi.chassis.vendor: Chassis Manufacturer
  dmi.chassis.version: Chassis Version
  dmi.modalias: dmi:bvnAcer:bvrv0.3309:bd10/06/2008:svnAcer:pnAOA110:pvr1:rvnAcer:rn:rvrBaseBoardVersion:cvnChassisManufacturer:ct1:cvrChassisVersion:
  dmi.product.name: AOA110
  dmi.product.version: 1
  dmi.sys.vendor: Acer

-- 
devkit-disks-probe-ata-smart causes HSM Violations on SSD, and potential hardware death
https://bugs.launchpad.net/bugs/445852
You received this bug notification because you are a member of Kernel
Bugs, which is subscribed to Linux.



