[Bug 602979] Re: Marvel 88E8053 ethernet controller reports hw csum errors and stops responding

Patrick McNamara 602979 at bugs.launchpad.net
Mon Jul 12 00:08:27 UTC 2010


It took awhile, but the problem re-occured with an upstream kernel --

administrator at stein:/var/log$ uname -a
Linux stein 2.6.35-999-generic #201007081234 SMP Thu Jul 8 11:37:16 UTC 2010 x86_64 GNU/Linux                      
administrator at stein:/var/log$                                                                                      

This is the kernel log from failure to back online:

Jul 11 18:54:43 stein kernel: [255608.593487] <unknown>: hw csum failure.
Jul 11 18:54:43 stein kernel: [255608.640399] Pid: 1151, comm: nfsd Not tainted 2.6.35-999-generic #201007081234
Jul 11 18:54:43 stein kernel: [255608.640404] Call Trace:
Jul 11 18:54:43 stein kernel: [255608.640419]  [<ffffffff814889db>] netdev_rx_csum_fault+0x3b/0x50
Jul 11 18:54:43 stein kernel: [255608.640428]  [<ffffffff81483453>] __skb_checksum_complete_head+0x63/0x70
Jul 11 18:54:43 stein kernel: [255608.640439]  [<ffffffff814e4ba5>] udp_recvmsg+0x1e5/0x2d0
Jul 11 18:54:43 stein kernel: [255608.640447]  [<ffffffff814ec9c3>] inet_recvmsg+0x53/0x90
Jul 11 18:54:43 stein kernel: [255608.640455]  [<ffffffff8147838c>] sock_recvmsg+0x11c/0x140
Jul 11 18:54:43 stein kernel: [255608.640463]  [<ffffffff81049165>] ? account_entity_dequeue+0x85/0xb0
Jul 11 18:54:43 stein kernel: [255608.640471]  [<ffffffff810525b8>] ? hrtick_update+0x38/0x40
Jul 11 18:54:43 stein kernel: [255608.640479]  [<ffffffff810560a6>] ? dequeue_task_fair+0x56/0x60
Jul 11 18:54:43 stein kernel: [255608.640488]  [<ffffffff8100985b>] ? __switch_to+0xbb/0x2e0
Jul 11 18:54:43 stein kernel: [255608.640496]  [<ffffffff8104cf0e>] ? put_prev_entity+0x2e/0x70
Jul 11 18:54:43 stein kernel: [255608.640504]  [<ffffffff81036ec9>] ? default_spin_lock_flags+0x9/0x10
Jul 11 18:54:43 stein kernel: [255608.640513]  [<ffffffff8106f9eb>] ? lock_timer_base+0x3b/0x70
Jul 11 18:54:43 stein kernel: [255608.640520]  [<ffffffff81478574>] kernel_recvmsg+0x44/0x60
Jul 11 18:54:43 stein kernel: [255608.640558]  [<ffffffffa014ff03>] svc_udp_recvfrom+0x93/0x340 [sunrpc]
Jul 11 18:54:43 stein kernel: [255608.640566]  [<ffffffff815759da>] ? schedule_timeout+0x15a/0x2e0
Jul 11 18:54:43 stein kernel: [255608.640574]  [<ffffffff81036ec9>] ? default_spin_lock_flags+0x9/0x10
Jul 11 18:54:43 stein kernel: [255608.640581]  [<ffffffff81577544>] ? _raw_spin_lock_irqsave+0x34/0x50
Jul 11 18:54:43 stein kernel: [255608.640609]  [<ffffffffa015c78d>] svc_recv+0x44d/0x670 [sunrpc]
Jul 11 18:54:43 stein kernel: [255608.640618]  [<ffffffff8105be40>] ? default_wake_function+0x0/0x20
Jul 11 18:54:43 stein kernel: [255608.640640]  [<ffffffffa0254a60>] ? nfsd+0x0/0x150 [nfsd]
Jul 11 18:54:43 stein kernel: [255608.640655]  [<ffffffffa0254afe>] nfsd+0x9e/0x150 [nfsd]
Jul 11 18:54:43 stein kernel: [255608.640662]  [<ffffffff8107f8a6>] kthread+0x96/0xa0
Jul 11 18:54:43 stein kernel: [255608.640670]  [<ffffffff8100be64>] kernel_thread_helper+0x4/0x10
Jul 11 18:54:43 stein kernel: [255608.640677]  [<ffffffff8107f810>] ? kthread+0x0/0xa0
Jul 11 18:54:43 stein kernel: [255608.640683]  [<ffffffff8100be60>] ? kernel_thread_helper+0x0/0x10
Jul 11 18:55:03 stein kernel: [255627.984629] sky2 0000:07:00.0: eth0: disabling interface
Jul 11 18:55:03 stein kernel: [255628.300050] sky2 0000:07:00.0: PCI INT A disabled
Jul 11 18:55:03 stein kernel: [255628.451103] sky2: driver version 1.28
Jul 11 18:55:03 stein kernel: [255628.451240] sky2 0000:07:00.0: PCI INT A -> GSI 36 (level, low) -> IRQ 36
Jul 11 18:55:03 stein kernel: [255628.451259] sky2 0000:07:00.0: setting latency timer to 64
Jul 11 18:55:03 stein kernel: [255628.451266] sky2 0000:07:00.0: PCI: Disallowing DAC for device
Jul 11 18:55:03 stein kernel: [255628.451314] sky2 0000:07:00.0: Yukon-2 EC chip revision 1
Jul 11 18:55:03 stein kernel: [255628.451413] sky2 0000:07:00.0: irq 69 for MSI/MSI-X
Jul 11 18:55:03 stein kernel: [255628.550056] sky2 0000:07:00.0: No interrupt generated using MSI, switching to INTx mode.
Jul 11 18:55:03 stein kernel: [255628.552632] sky2 0000:07:00.0: eth0: addr 00:0f:ea:f1:86:37
Jul 11 18:55:03 stein kernel: [255628.559520] sky2 0000:07:00.0: eth0: enabling interface
Jul 11 18:55:03 stein kernel: [255628.560842] ADDRCONF(NETDEV_UP): eth0: link is not ready
Jul 11 18:55:06 stein kernel: [255631.155020] sky2 0000:07:00.0: eth0: Link is up at 1000 Mbps, full duplex, flow control both
Jul 11 18:55:06 stein kernel: [255631.155488] ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
Jul 11 18:54:18 stein kernel: [255641.270014] eth0: no IPv6 routers present


The same sequence, ifconfig down, rmmod, modprobe, ifconfig up, was used to recover.

** Tags removed: needs-upstream-testing

** Description changed:

  At seemingly random times, a "hw csum error" is reported on the system
  console.  When this happens, the NIC will stop responding.  The sequence
  "ifconfig down, rmmod sky2; modprobe sky2; ifconfig up" will bring the
  NIC back to a normal state.  There does not seem to be any correlation
  between this error and network traffic.  The system this is occuring on
  is a NFS server and can server many GB of data with no problem.
  
  ProblemType: Bug
  DistroRelease: Ubuntu 10.04
  Package: linux-image-generic 2.6.32.22.23
  Regression: No
  Reproducible: No
  ProcVersionSignature: Ubuntu 2.6.32-22.36-server 2.6.32.11+drm33.2
  Uname: Linux 2.6.32-22-server x86_64
  AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
  AplayDevices: Error: [Errno 2] No such file or directory
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory
  Date: Wed Jul  7 21:51:55 2010
  Frequency: Once every few days.
  InstallationMedia: Kubuntu 10.04 LTS "Lucid Lynx" - Release amd64 (20100427.1)
  Lsusb:
-  Bus 005 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
-  Bus 004 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
-  Bus 003 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
-  Bus 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
-  Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
+  Bus 005 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
+  Bus 004 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
+  Bus 003 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
+  Bus 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
+  Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  PciMultimedia:
-  
+ 
  ProcCmdLine: BOOT_IMAGE=/vmlinuz-2.6.32-22-server root=UUID=058136fc-97a2-4de0-b15b-91a39acb7150 ro console=ttyS0,9600n8
  ProcEnviron:
-  LANG=en_US.UTF-8
-  SHELL=/bin/bash
+  LANG=en_US.UTF-8
+  SHELL=/bin/bash
  SourcePackage: linux
  dmi.bios.date: 04/27/2005
  dmi.bios.vendor: Award Software International, Inc.
  dmi.bios.version: F1
  dmi.board.name: K8T890-8237
  dmi.board.vendor: Gigabyte Technology Co., Ltd.
  dmi.board.version: x.x
  dmi.chassis.type: 3
  dmi.modalias: dmi:bvnAwardSoftwareInternational,Inc.:bvrF1:bd04/27/2005:svn:pn:pvr:rvnGigabyteTechnologyCo.,Ltd.:rnK8T890-8237:rvrx.x:cvn:ct3:cvr:

-- 
Marvel 88E8053 ethernet controller reports hw csum errors and stops responding
https://bugs.launchpad.net/bugs/602979
You received this bug notification because you are a member of Kernel
Bugs, which is subscribed to linux in ubuntu.




More information about the kernel-bugs mailing list