0

For a couple months now, I have had an issue with my Ubuntu server where every few days, the machine locks up and is completely unresponsive. The only thing I see in the tty is the following message over and over, usually between 2 of few processes (PLEX media server, SSHd, rtorrent, tmux, etc...)

Mar 31 22:11:43 yggdrasil kernel: watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [Plex DLNA Serve:23621]

I see some other information sometimes, but I can never find it in any log, journalctl only ever has one instance of the log line, but there are dozens in the tty when I restart.

I have replaced the motherboard, the GPU, and the power supply and the problem persists.

Specs are as follows:

  • CPU: AMD Ryzen 5 1600X
  • Mobo: ASUS ROG STRIX X370-F Gaming
  • GPU: nVidia GT 210

Is there any other steps I can get to the bottom of this? Should I try to panic and get a kernel memory dump when this occurs? How would I do that?

Update, caught the crash earlier and saw a calltrace and some more info: https://i.stack.imgur.com/pxTwc.jpg

Evan C
  • 9
  • 3
  • From what I can tell, this is either related to the nouveau drivers, or a flaw with some Ryzen processors as described in this kernel bug: https://bugzilla.kernel.org/show_bug.cgi?id=196683 For now I have switched to an AMD dGPU and will wait for the issue to come up again before blaming the CPU. – Evan C Apr 09 '19 at 14:03
  • 2
    Does this answer your question? [NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! \[plymouthd:305\]](https://askubuntu.com/questions/875173/nmi-watchdog-bug-soft-lockup-cpu2-stuck-for-23s-plymouthd305) – karel Aug 07 '20 at 12:49

0 Answers0