NACK/Cmnt: [SRU][N/O/unstable][PATCH 0/2] Enable ASPM for nvme controller when working in RAID on mode
Hui Wang
hui.wang at canonical.com
Sat Aug 17 02:10:30 UTC 2024
On 8/17/24 03:58, Timo Aaltonen wrote:
> Stefan Bader kirjoitti 16.8.2024 klo 14.36:
>> On 16.08.24 12:16, Hui Wang wrote:
>>>
>>> On 8/15/24 20:38, Stefan Bader wrote:
>>>> On 15.08.24 05:22, Hui Wang wrote:
>>>>> BugLink: https://bugs.launchpad.net/bugs/2072679
>>>>>
>>>>> [Impact]
>>>>> The NVME controller works in RAID on mode by default on some Dell
>>>>> machines, and in this case, the PCIE ASPM couldn't be enabled, and
>>>>> as a result the system idle can't enter deep idle states. This issue
>>>>> not only impacts ubuntu users but also impacts our Dell OEM projects.
>>>>>
>>>>>
>>>>> [Fix]
>>>>> pick 2 commits from linux-pci mailist
>>>>>
>>>>> [Test]
>>>>> After running the patched kernel, we could run 'sudo lspci -nnvv'
>>>>> and check "Non-Volatile memory controller":
>>>>> LnkCtl: ASPM L1 Enabled;
>>>>>
>>>>> And check idle states, we could see the system could enter deep idle:
>>>>> $ sudo cat /sys/kernel/debug/pmc_core/package_cstate_show
>>>>> Package C2 : 55740989
>>>>> Package C3 : 4656373
>>>>> Package C6 : 43325041
>>>>> Package C7 : 6687655
>>>>> Package C8 : 44948950
>>>>> Package C9 : 1693
>>>>> Package C10 : 92865596
>>>>>
>>>>> [Where problems could occur]
>>>>> Because the patchset is not accepted by upstream yet, it is a bit
>>>>> risky to merge the patchset to ubuntu kernel. And the patch only
>>>>> impacts vmd driver, hence if there is regression, it could only be
>>>>> in the nvme driver with RAID on mode. The regression possibility is
>>>>> very low because we already tested the patch on many Dell, lenovo
>>>>> machines, there is no regression so far.
>>>>>
>>>>>
>>>>> Kai-Heng Feng (2):
>>>>> UBUNTU: SAUCE: PCI: ASPM: Allow OS to configure ASPM where BIOS is
>>>>> incapable of
>>>>> UBUNTU: SAUCE: PCI: vmd: Let OS control ASPM for devices under VMD
>>>>> domain
>>>>>
>>>>> drivers/pci/controller/vmd.c | 2 ++
>>>>> drivers/pci/pcie/aspm.c | 8 ++++++--
>>>>> include/linux/pci.h | 1 +
>>>>> 3 files changed, 9 insertions(+), 2 deletions(-)
>>>>>
>>>>
>>>> Rejected for the following reasons:
>>>> - Patches which are not at least in linux-next are not material for
>>>> SRU
>>>>
>>>> -Stefan
>>>
>>> Hi Stefan,
>>>
>>> Understand, that is the general rule for SRU.
>>>
>>> In this case, it impacts our Dell oem project. This issue is a
>>> regression of hwe-6.8 kernel, we don't have this issue with the
>>> oem-6.5 kernel, but oem-6.5 is EOL. Hence we need to merge this
>>> patchset to -generic kernel ASAP.
>>>
>>> Kai-Heng will continue working on this patchset and make sure it
>>> will be accepted by PCI maintainers. So could we get an exception in
>>> this case?
>>>
>>> Thanks,
>>>
>>> Hui.
>>>
>>
>> If you re-submit and put the reasoning about why this is needed now
>> and the plan going forward. If this does not get accepted as is we
>> need to make sure things get updated to the actual solution.
>
> FTR, this bug was originally
>
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2034504
>
> and it got applied to the mantic kernel but not unstable (even though
> the bug suggests that it was), which is the reason for the regression..
>
> would've been nice if the new bug had a link to the old one, to save a
> lot of time chasing where things went wrong
Thanks for sharing this. I will add the link as you pointed out.
Thanks,
Hui.
>
>
More information about the kernel-team
mailing list