GitHub messages for voidlinux
* [ISSUE] amdgpu system freeze
@ 2025-01-01 14:42 narodnik
  2025-01-01 22:03 ` TeusLollo
                   ` (9 more replies)
  0 siblings, 10 replies; 17+ messages in thread
From: narodnik @ 2025-01-01 14:42 UTC (permalink / raw)
  To: ml


New issue by narodnik on void-packages repository

https://github.com/void-linux/void-packages/issues/53787

Description:
### Is this a new report?

Yes

### System Info

Void 6.9.12_1 x86_64 AuthenticAMD notuptodate rDDF

### Package(s) Affected

linux-firmware-amd-20241210_1.x86_64

### Does a report exist for this bug with the project's home (upstream) and/or another distro?

https://gitlab.freedesktop.org/drm/amd/-/issues/3863

See also: https://github.com/void-linux/void-packages/issues/53434#issuecomment-2564815160

### Expected behaviour

I'm using Wayland with a minimalistic window manager (DWL). After about 30 minutes of use, I get a full system freeze.

### Actual behaviour

The system fully freezes. I can still hear music playing, and I can reset using REISUB. But there is no response to input from the WM itself.

I've managed to fix it by doing these steps:

1. `xdowngrade mesa-24.2.3_2.x86_64.xbps libglapi-24.2.3_2.x86_64.xbps libOSMesa-24.2.3_2.x86_64.xbps mesa-libgallium-24.2.3_2.x86_64.xbps libgbm-24.2.3_2.x86_64.xbps libgbm-devel-24.2.3_2.x86_64.xbps MesaLib-devel-24.2.3_2.x86_64.xbps`
2. `xdowngrade linux-firmware-amd-20241110_1.x86_64.xbps`
3. Using Linux 6.9
4. Making a completely new user.

The last step is very unusual and makes me think it's due to a stale Mesa cache somewhere. I tried clearing out my normal user's home directory and logging in as that user, but I still get the crash.

However, with a completely new user, the system is completely stable. This is the configuration I've been using so far. When installing linux6.9, I got these messages:

```
File descriptor 21 (/home/myuser/.cache/mesa_shader_cache_db/part0/mesa_cache.db) leaked on lvs invocation. Parent PID 76833: /bin/sh
```

That made me think it's that cache, so I removed `~/.cache/` completely, but it didn't fix the issue. Only a completely new user does!
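
For anyone repeating this check, here is a minimal read-only sketch that lists Mesa shader cache directories and their sizes, so you can see whether a cache was recreated after being removed. It assumes the cache sits under `$XDG_CACHE_HOME` or `~/.cache` in a directory whose name starts with `mesa_shader_cache`, as in the path from the message above. The helper names `mesa_cache_dirs` and `dir_size` are made up for this example, and nothing is deleted.

```python
import os
from pathlib import Path


def mesa_cache_dirs(cache_root):
    """Return directories under cache_root whose name starts with 'mesa_shader_cache'."""
    root = Path(cache_root)
    if not root.is_dir():
        return []
    return sorted(
        p for p in root.iterdir()
        if p.is_dir() and p.name.startswith("mesa_shader_cache")
    )


def dir_size(path):
    """Total size in bytes of the regular files below path."""
    return sum(f.stat().st_size for f in Path(path).rglob("*") if f.is_file())


if __name__ == "__main__":
    # Assumption: the cache lives under $XDG_CACHE_HOME, falling back to ~/.cache.
    root = os.environ.get("XDG_CACHE_HOME") or os.path.expanduser("~/.cache")
    for d in mesa_cache_dirs(root):
        print(d, dir_size(d), "bytes")
```

Running it as both the affected user and the fresh user would show whether the two accounts actually differ in cache state. A cache relocated through an environment variable would not be found by this sketch.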

Please advise me on the steps to triage this bug and help get it fixed. I have no idea whether it's an issue with Linux, amdgpu or Mesa.

### Steps to reproduce

Just use my computer. Strangely, if I only use the foot terminal and don't play videos or use the browser, it's fine.

* [ISSUE] amdgpu system freeze
@ 2025-01-20 18:23 CaioFrancisco
  2025-01-22 12:22 ` Laitinlok
                   ` (5 more replies)
  0 siblings, 6 replies; 17+ messages in thread
From: CaioFrancisco @ 2025-01-20 18:23 UTC (permalink / raw)
  To: ml


New issue by CaioFrancisco on void-packages repository

https://github.com/void-linux/void-packages/issues/54052

Description:
### Is this a new report?

Yes

### System Info

Void 6.12.9_1 x86_64 AuthenticAMD notuptodate rrrmFFFFFF

### Package(s) Affected

linux-firmware-amd-20250109_1

### Does a report exist for this bug with the project's home (upstream) and/or another distro?

I'm unsure.

### Expected behaviour

This issue is a dupe of #53787. I'm still suffering from the freezes even though the OP said they stopped having crashes. I asked them to re-open the issue, but it's been a week and I'm unsure if they'll do it. At any rate, there have been no updates since my first post.

If this isn't resolved, I'll be switching distros soon enough and I'm probably not coming back. What's the use of a computer that constantly crashes on me?

### Actual behaviour

Quoting myself from my previous post,

> currently running XFCE on X11, i also ran kde plasma X11 some time ago, which also crashed just the same way. weirdly enough, it might just be luck, but kde plasma wayland never crashed on me.
> 
> my system specs are ryzen 5 2400g and nvidia GTX 1650 GPU, and the crash can happen randomly. i can sometimes do video intensive tasks for hours without a single hiccup, but sometimes i can crash 10 minutes after booting up while using my browser.
> 
> as a last note, the dmesg logs have some errors when the system "freezes" (i can still ssh my way in with my phone). they usually are pretty quiet up until something like this happens:
> 
> ```
> [  422.608105] amdgpu 0000:08:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
> [  422.876854] [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:73:crtc-0] hw_done or flip_done timed out
> [  433.117233] [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:77:crtc-1] hw_done or flip_done timed out
> [  434.689642] amdgpu 0000:08:00.0: amdgpu: failed to write reg 1a6f4 wait reg 1a706
> [  454.621453] amdgpu 0000:08:00.0: amdgpu: Dumping IP State
> ```
> 
> followed by the watchdog freaking out at the CPU threads getting stuck until i reisub.
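
For triage, a minimal sketch of pulling lines like these out of a saved `dmesg` dump so the timestamps can be compared across freezes. It assumes the default `[ seconds.micros] message` prefix seen in the log above. The function name `amdgpu_errors` and the `SAMPLE` constant are made up for this example, and the sample lines are copied from the quoted log.

```python
import re

# Matches the "[ seconds.micros] message" prefix that dmesg prints by default.
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")


def amdgpu_errors(log_text):
    """Return (timestamp_seconds, message) pairs for lines that mention amdgpu."""
    hits = []
    for raw in log_text.splitlines():
        # Tolerate the "> " quoting used when logs are pasted into replies.
        line = raw.lstrip("> ").strip()
        m = LINE_RE.match(line)
        if m and "amdgpu" in m.group(2):
            hits.append((float(m.group(1)), m.group(2)))
    return hits


SAMPLE = """\
[  422.608105] amdgpu 0000:08:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
[  422.876854] [drm:amdgpu_dm_atomic_check [amdgpu]] *ERROR* [CRTC:73:crtc-0] hw_done or flip_done timed out
[  454.621453] amdgpu 0000:08:00.0: amdgpu: Dumping IP State
"""

if __name__ == "__main__":
    for ts, msg in amdgpu_errors(SAMPLE):
        print(f"{ts:10.3f}  {msg}")
```

The filter is a plain substring match on `amdgpu`, so it also catches the `[drm:amdgpu_dm_atomic_check [amdgpu]]` lines. It says nothing about the cause of the freeze and only narrows the log down to the relevant lines.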

### Steps to reproduce

1. Use your device for a random amount of time.
2. Freeze.


end of thread, other threads:[~2025-01-22 23:51 UTC | newest]

Thread overview: 17+ messages
2025-01-01 14:42 [ISSUE] amdgpu system freeze narodnik
2025-01-01 22:03 ` TeusLollo
2025-01-01 22:04 ` TeusLollo
2025-01-01 22:13 ` TeusLollo
2025-01-02  1:16 ` CaioFrancisco
2025-01-02  7:47 ` narodnik
2025-01-02 16:31 ` TeusLollo
2025-01-02 16:32 ` TeusLollo
2025-01-08 18:40 ` narodnik
2025-01-08 18:40 ` [ISSUE] [CLOSED] " narodnik
2025-01-13 17:46 ` CaioFrancisco
2025-01-20 18:23 [ISSUE] " CaioFrancisco
2025-01-22 12:22 ` Laitinlok
2025-01-22 16:01 ` CaioFrancisco
2025-01-22 17:15 ` CaioFrancisco
2025-01-22 19:07 ` CaioFrancisco
2025-01-22 23:44 ` ACR-Jeff
2025-01-22 23:51 ` Laitinlok
