Add Linux 5.15.88-444 or later fixing Broadcom/LSI logging and other things #2829
The commit message needs to be added, and it needs to be tested on the affected file servers.
They all have 12 TB drives. The file server lyssa with 8 TB drives is not affected. The test file server wayofthedodo has 12 TB drives, but does not exhibit the problem – probably due to missing data and therefore not enough load. |
The system with 8TB drives is 'lyssa' |
Switch back to SRCURL (and fix the version there)? You've sneaked in "firmware: coreboot: Check size of table entry and use flex-array". Was this on purpose? |
I think there are some open questions about the nature of the issue. |
Good questions. I doubt we are competent enough to answer them. The warnings are removed from
All call sites of Should we change that to WARN_ONCE, so that we still get an indication of which systems this happens on without filling the disk with repeated log entries? If I/O can be lost under this condition, it might be good to have a reminder in the log file when we stumble on a frozen system later. Btw: Today, I/O on "gone" froze for a second time since we have a 5.15 kernel. Again it started a few minutes after mdcheck was paused in the morning. This time it is the other md device. It blocks and shows I/Os in flight. |
Christoph Hellwig explicitly asked to drop the message:
|
Otherwise, the referenced patch by John Pittman rate-limits the message. |
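For readers weighing the options discussed here (drop the message, warn once, or rate-limit it), the following is a minimal userspace sketch of the difference between a one-shot warning and an interval/burst rate limit. It is a model, not kernel code: the class names `RateLimit` and `WarnOnce` are hypothetical, and the defaults of 5 seconds and 10 messages mirror the kernel's default printk rate limit, which is an assumption here since the thread does not state what the referenced patch uses.

```python
import time


class RateLimit:
    """Userspace model of an interval/burst limiter (hypothetical, not kernel code)."""

    def __init__(self, interval=5.0, burst=10, clock=time.monotonic):
        self.interval = interval  # window length in seconds (assumed default)
        self.burst = burst        # messages allowed per window (assumed default)
        self.clock = clock        # injectable clock, so the model can be tested
        self.begin = None         # start time of the current window
        self.printed = 0          # messages emitted in the current window
        self.missed = 0           # messages suppressed so far (cumulative)

    def allow(self):
        now = self.clock()
        # Open a new window on first use or once the interval has elapsed.
        if self.begin is None or now - self.begin >= self.interval:
            self.begin = now
            self.printed = 0
        if self.printed < self.burst:
            self.printed += 1
            return True
        self.missed += 1
        return False


class WarnOnce:
    """Model of a one-shot warning: the first call passes, all later ones do not."""

    def __init__(self):
        self.fired = False

    def allow(self):
        if self.fired:
            return False
        self.fired = True
        return True
```

With the rate limit, a burst of identical warnings still leaves a bounded trace in every window in which the condition recurs; with the one-shot variant, only the first occurrence since boot is recorded.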
I must confess that Christoph Hellwig might have a little more insight... And "double completion" doesn't sound that alarming. So leave it as it is without any messages. We can just try it out. |
Concerning the
As for the function:
And as for "double completion", to me it looks like this describes the error happening on the logging site? |
Can we trigger the error? If so, we could try to bisect, although this might take ages. |
Yes, will do. Just wanted to get the test Linux kernel build. (Nvidia drivers are also missing.)
Yes, it was, as we have two coreboot machines. (It got picked for the stable series already, so I picked it too.) |
Hold it, I might like to add another kernel patch for the "request-key: Cannot find command to construct key" messages in /var/log/messages with sec=mariux |
3739f8d
to
3de9ba1
Correct. Hmm. What to do about this? |
Ignore it? If I see it right, it has nothing to do with the error messages generated by |
@thomas: Are you able to test whether the rate limit of the warnings works (for linux-5.15.94-447.x86_64)? |
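One way to check a rate limit from the outside is to count how many matching messages land in any single time window of the kernel log. The sketch below does that for lines carrying `dmesg`-style `[seconds.micros]` timestamps. The function name `max_in_window` is hypothetical, the 5-second window is an assumption, and the search string is taken from the warning text quoted in this PR's commit message; lines from /var/log/messages use a different timestamp format and would need their own parser.

```python
import re

# Matches the leading "[  123.456789]" timestamp that dmesg prints by default.
TIMESTAMP = re.compile(r"^\[\s*(\d+\.\d+)\]")


def max_in_window(lines, needle="scsi_dma_map failed", window=5.0):
    """Return the largest number of matching messages seen within `window` seconds."""
    times = []
    for line in lines:
        if needle not in line:
            continue
        match = TIMESTAMP.match(line)
        if match:
            times.append(float(match.group(1)))
    times.sort()

    best = 0
    start = 0
    # Sliding window over the sorted timestamps: keep only entries that are
    # strictly less than `window` seconds older than the current one.
    for end, current in enumerate(times):
        while current - times[start] >= window:
            start += 1
        best = max(best, end - start + 1)
    return best
```

If the limit works, the returned count should stay at or below the configured burst size no matter how often the condition fires; a much larger number suggests the messages are not being limited.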
e9c8eac
to
35957cc
Also cherry-pick a cbmem driver fix.
Add version 5.15.89 and remove version 5.15.88. The kernel mariux-5.15.89-445.tar.gz contains another patch which should avoid the "request-key: Cannot find command to construct key" messages in the syslog with a sec=mariux NFS client.
Add version 5.15.94 and remove version 5.15.89. The kernel mariux-5.15.94-447 contains patches to rate-limit the "scsi_dma_map failed: request for %d bytes" warning messages.
Build version 510.60.02 for Linux 5.15.94-447 and remove for Linux 5.15.89-445A.
35957cc
to
8ee0ab6
We no longer have the SAS controllers, so we cannot test the rate limit of the strange warning. Basic function was tested on sigusr2 (with nvidia) and, accidentally, on done. I am merging this now, so that the unwanted "request-key: Cannot find command to construct key" log messages no longer appear on the NFS clients. This is fixed by mariux64/linux@5665b3517ce3a |
Tested on maleficent.