[SRU][N:linux-bluefield][PATCH v1 1/4] net/mlx5: Avoid report two health errors on same syndrome
Stav Aviram
saviram at nvidia.com
Mon Oct 6 09:40:45 UTC 2025
From: Moshe Shemesh <moshe at nvidia.com>
BugLink: https://bugs.launchpad.net/bugs/2126876
In case health counter has not increased for few polling intervals, miss
counter will reach max misses threshold and health report will be
triggered for FW health reporter. In case syndrome found on same health
poll another health report will be triggered.
Avoid two health reports on same syndrome by marking this syndrome as
already known.
Signed-off-by: Moshe Shemesh <moshe at nvidia.com>
Reviewed-by: Shahar Shitrit <shshitrit at nvidia.com>
Signed-off-by: Tariq Toukan <tariqt at nvidia.com>
Reviewed-by: Kalesh AP <kalesh-anakkur.purayil at broadcom.com>
Signed-off-by: David S. Miller <davem at davemloft.net>
(cherry picked from commit b5d7b2f04ebcff740f44ef4d295b3401aeb029f4)
Signed-off-by: Stav Aviram <saviram at nvidia.com>
---
drivers/net/ethernet/mellanox/mlx5/core/health.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/drivers/net/ethernet/mellanox/mlx5/core/health.c b/drivers/net/ethernet/mellanox/mlx5/core/health.c
index 4ad88be1ecb5..206011101614 100644
--- a/drivers/net/ethernet/mellanox/mlx5/core/health.c
+++ b/drivers/net/ethernet/mellanox/mlx5/core/health.c
@@ -784,6 +784,7 @@ static void poll_health(struct timer_list *t)
health->prev = count;
if (health->miss_counter == MAX_MISSES) {
mlx5_core_err(dev, "device's health compromised - reached miss count\n");
+ health->synd = ioread8(&h->synd);
print_health_info(dev);
queue_work(health->wq, &health->report_work);
}
--
2.38.1
More information about the kernel-team
mailing list