[Bug 1379591] Re: Moonshot nodes with Mellanox interfaces fail to deploy in maas 1.7
dann frazier
dann.frazier at canonical.com
Tue Nov 4 16:20:26 UTC 2014
Fix verified:
ubuntu at ms01a:~$ sudo dpkg -i lshw_02.16-2ubuntu1.1_arm64.deb
dpkg: warning: downgrading lshw from 02.16-2ubuntu1.2 to 02.16-2ubuntu1.1
(Reading database ... 80719 files and directories currently installed.)
Preparing to unpack lshw_02.16-2ubuntu1.1_arm64.deb ...
Unpacking lshw (02.16-2ubuntu1.1) over (02.16-2ubuntu1.2) ...
Setting up lshw (02.16-2ubuntu1.1) ...
Processing triggers for man-db (2.6.7.1-1) ...
ubuntu at ms01a:~$ sudo lshw -class net > pre-fix
ubuntu at ms01a:~$ sudo dpkg -i lshw_02.16-2ubuntu1.2_arm64.deb
(Reading database ... 80719 files and directories currently installed.)
Preparing to unpack lshw_02.16-2ubuntu1.2_arm64.deb ...
Unpacking lshw (02.16-2ubuntu1.2) over (02.16-2ubuntu1.1) ...
Setting up lshw (02.16-2ubuntu1.2) ...
Processing triggers for man-db (2.6.7.1-1) ...
ubuntu at ms01a:~$ sudo lshw -class net > post-fix
ubuntu at ms01a:~$ diff -u pre-fix post-fix
--- pre-fix 2014-11-04 11:23:20.564933999 -0500
+++ post-fix 2014-11-04 11:23:36.674933999 -0500
@@ -1,4 +1,4 @@
- *-network
+ *-network DISABLED
description: Ethernet interface
product: MT27500 Family [ConnectX-3]
vendor: Mellanox Technologies
@@ -11,7 +11,7 @@
width: 64 bits
clock: 33MHz
capabilities: pm vpd msix pciexpress bus_master cap_list ethernet physical tp
- configuration: autonegotiation=off broadcast=yes driver=mlx4_en driverversion=2.2-1 (Feb 2014) duplex=full firmware=2.30.3000 ip=10.229.41.200 latency=0 link=yes multicast=yes port=twisted pair speed=1Gbit/s
+ configuration: autonegotiation=off broadcast=yes driver=mlx4_en driverversion=2.2-1 (Feb 2014) duplex=full firmware=2.30.3000 latency=0 link=no multicast=yes port=twisted pair speed=1Gbit/s
resources: irq:244 memory:a010800000-a0108fffff memory:a010000000-a0107fffff
*-network:0
description: Ethernet interface
@@ -32,14 +32,23 @@
*-network:2
description: Ethernet interface
physical id: 3
+ bus info: pci at 0000:01:00.0
+ logical name: eth0
+ serial: 2c:59:e5:36:9a:b2
+ size: 1Gbit/s
+ capabilities: ethernet physical tp
+ configuration: autonegotiation=off broadcast=yes driver=mlx4_en driverversion=2.2-1 (Feb 2014) duplex=full firmware=2.30.3000 ip=10.229.41.200 link=yes multicast=yes port=twisted pair speed=1Gbit/s
+ *-network:3
+ description: Ethernet interface
+ physical id: 4
logical name: vnet1
serial: fe:16:3e:fe:b1:a1
size: 10Mbit/s
capabilities: ethernet physical
configuration: autonegotiation=off broadcast=yes driver=tun driverversion=1.6 duplex=full link=yes multicast=yes port=twisted pair speed=10Mbit/s
- *-network:3
+ *-network:4
description: Ethernet interface
- physical id: 4
+ physical id: 5
logical name: novatap
serial: 46:df:52:39:2e:34
size: 10Mbit/s
** Tags removed: verification-needed
** Tags added: verification-done
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to lshw in Ubuntu.
https://bugs.launchpad.net/bugs/1379591
Title:
Moonshot nodes with Mellanox interfaces fail to deploy in maas 1.7
Status in lshw - Hardware Lister:
Unknown
Status in MAAS:
Won't Fix
Status in “lshw” package in Ubuntu:
Fix Released
Status in “lshw” source package in Trusty:
Fix Committed
Status in “lshw” source package in Utopic:
Fix Committed
Status in “lshw” source package in Vivid:
Fix Released
Status in “lshw” package in Debian:
New
Bug description:
[Impact]
On systems with multiple NICs on a single PCI function, lshw will fail to show all of the NICs, and might associate the wrong MAC with an interface. This is known to cause problems with MAAS functioning on such systems.
[Test Case]
Run lshw on a system with >1 NIC on a PCI function and observe the output.
[Regression Risk]
The cause of this issue is some deduplication code in lshw that checks to see if the NIC it is scanning has already been registered. The included solution is to also compare the MACs before assuming it is the same NIC. So, regression risk could be that this code is broken (e.g. segfaults) or that there is a real world case where multiple NICs may have the same MAC (hardware bridge?).
To manage notifications about this bug go to:
https://bugs.launchpad.net/lshw/+bug/1379591/+subscriptions
More information about the foundations-bugs
mailing list