Bluefin-dx: Nvidia woes

akarypid · April 26, 2024, 7:59pm

Hello,

I have a rather new Yoga Pro 9i (2024 model, 16IMH9) that has a 4060 alongside its Intel Arc and I am running the latest Bluefin-DX. My desktop environment is Gnome on Wayland.

Everything works fine when I disable the GPU in BIOS, but when I have it enabled I have quite a few issues:

Battery life is terrible
System log is spammed (probably relates to 2 above) with failure to set power limit messages
When my laptop goes to sleep, it sometimes will not wake up (it’s hit and miss)

Apr 26 20:50:40 myhost /usr/bin/nvidia-powerd[6008]: error setting power limit
Apr 26 20:50:40 myhost /usr/bin/nvidia-powerd[6008]: Error setting GPU limit: 55000.
Apr 26 20:50:40 myhost /usr/bin/nvidia-powerd[6008]: error setting power limit
Apr 26 20:50:40 myhost /usr/bin/nvidia-powerd[6008]: Error setting GPU limit: 55000.
...

Is this typical of Nvidia due to its closed-source drivers? Is there any hope of getting things to work in a stable way?

j0rge · April 26, 2024, 8:49pm

Can you post the output from a rpm-ostree status?

akarypid · April 26, 2024, 9:02pm

Of course, here is my status:

❯ rpm-ostree status
State: idle
AutomaticUpdates: stage; rpm-ostreed-automatic.timer: no runs since boot
Deployments:
● ostree-image-signed:docker://ghcr.io/ublue-os/bluefin-dx-nvidia:latest
                   Digest: sha256:54804a33df44857592b9639a2da8c6d3a9553bb96d78fa7ff6ea21677efecca5
                  Version: 40.20240423.0 (2024-04-24T14:54:46Z)
          LayeredPackages: adcli lm_sensors oddjob-mkhomedir samba-common-tools sssd-ad
                           touchegg

  ostree-image-signed:docker://ghcr.io/ublue-os/bluefin-dx-nvidia:latest
                   Digest: sha256:54804a33df44857592b9639a2da8c6d3a9553bb96d78fa7ff6ea21677efecca5
                  Version: 40.20240423.0 (2024-04-24T14:54:46Z)
          LayeredPackages: adcli lm_sensors oddjob-mkhomedir samba-common-tools sssd-ad
                           touchegg

Here is the lspci -k info for the adapters:

00:02.0 VGA compatible controller: Intel Corporation Meteor Lake-P [Intel Arc Graphics] (rev 08)
	Subsystem: Lenovo Device 3e47
	Kernel driver in use: i915
	Kernel modules: i915, xe
...
01:00.0 VGA compatible controller: NVIDIA Corporation AD107M [GeForce RTX 4060 Max-Q / Mobile] (rev a1)
	Subsystem: Lenovo Device 3e47
	Kernel driver in use: nvidia
	Kernel modules: nouveau, nvidia_drm, nvidia

Note that nouveau is not used due to the kernel parameters pre-configured by the bluefin-dx-nvidia image:

❯ journalctl -b | grep Command
Apr 26 21:54:44 fedora kernel: Command line: BOOT_IMAGE=(hd0,gpt2)/ostree/default-1a8fdd4d36ffc5ae1fe73ef692f3d5fa06ce6e5c5d16715f3299fa9ab32081f8/vmlinuz-6.8.7-300.fc40.x86_64 rd.luks.uuid=luks-0f45e4b2-02d1-4a30-9462-a67ed1db53bd rhgb quiet root=UUID=72ba393e-d809-4349-aec7-0b761a41a98e rootflags=subvol=root rw ostree=/ostree/boot.1/default/1a8fdd4d36ffc5ae1fe73ef692f3d5fa06ce6e5c5d16715f3299fa9ab32081f8/0 rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 rd.luks.options=discard initcall_blacklist=simpledrm_platform_driver_init
Apr 26 21:54:44 fedora kernel: Command line: BOOT_IMAGE=(hd0,gpt2)/ostree/default-1a8fdd4d36ffc5ae1fe73ef692f3d5fa06ce6e5c5d16715f3299fa9ab32081f8/vmlinuz-6.8.7-300.fc40.x86_64 rd.luks.uuid=luks-0f45e4b2-02d1-4a30-9462-a67ed1db53bd rhgb quiet root=UUID=72ba393e-d809-4349-aec7-0b761a41a98e rootflags=subvol=root rw ostree=/ostree/boot.1/default/1a8fdd4d36ffc5ae1fe73ef692f3d5fa06ce6e5c5d16715f3299fa9ab32081f8/0 rd.driver.blacklist=nouveau modprobe.blacklist=nouveau nvidia-drm.modeset=1 rd.luks.options=discard initcall_blacklist=simpledrm_platform_driver_init

Here is what is actually loaded:

❯ lsmod | grep -E "nouveau|nvidia"
nvidia_wmi_ec_backlight    12288  0
nvidia_drm            122880  3
nvidia_modeset       1605632  2 nvidia_drm
video                  77824  5 nvidia_wmi_ec_backlight,ideapad_laptop,xe,i915,nvidia_modeset
wmi                    36864  4 video,nvidia_wmi_ec_backlight,wmi_bmof,ideapad_laptop
nvidia_uvm           6656000  0
nvidia              60497920  42 nvidia_uvm,nvidia_modeset

akarypid · April 26, 2024, 9:23pm

Just adding that I found the docs, for nvidia-powerd and they seem to have “XFree86” in the URL. Could it be that this is simply not supported for Wayland?

Anyway, running the command from the documentation seems to indicate that dynamic boost is “not supported”, as I get:

❯ nvidia-settings -q DynamicBoostSupport

ERROR: Error resolving target specification '' (No targets match target specification), specified in query 'DynamicBoostSupport'.

EDIT: I have stopped the service (systemctl stop nvidia-powerd) to stop the log spamming.

akarypid · April 26, 2024, 9:29pm

I have logged into Gnome using Xorg and now:

❯ nvidia-settings -q DynamicBoostSupport

  Attribute 'DynamicBoostSupport' (aurelius.ad.home.lan:0[gpu:0]): 1.
    'DynamicBoostSupport' is a boolean attribute; valid values are: 1 (on/true) and 0
    (off/false).
    'DynamicBoostSupport' is a read-only attribute.
    'DynamicBoostSupport' can use the following target types: GPU.

So it would seem like nvidia-powerd is a Xorg-only service? But even so, after restarting the service, I get the same 2 lines of messages every 2 seconds:

Apr 26 22:27:53 myhost /usr/bin/nvidia-powerd[29417]: error setting power limit
Apr 26 22:27:53 myhost /usr/bin/nvidia-powerd[29417]: Error setting GPU limit: 55000.
Apr 26 22:27:55 myhost /usr/bin/nvidia-powerd[29417]: error setting power limit
Apr 26 22:27:55 myhost /usr/bin/nvidia-powerd[29417]: Error setting GPU limit: 55000.
...

akarypid · April 26, 2024, 9:35pm

I tried to suspend/resume from Xorg and it failed. Here are the last messages from the kernel log:

Apr 26 22:30:27 myhost /usr/bin/nvidia-powerd[29417]: error setting power limit
Apr 26 22:30:27 myhost /usr/bin/nvidia-powerd[29417]: Error setting GPU limit: 55000.
Apr 26 22:30:29 myhost /usr/bin/nvidia-powerd[29417]: error setting power limit
Apr 26 22:30:29 myhost /usr/bin/nvidia-powerd[29417]: Error setting GPU limit: 55000.
Apr 26 22:30:31 myhost /usr/bin/nvidia-powerd[29417]: error setting power limit
Apr 26 22:30:31 myhost /usr/bin/nvidia-powerd[29417]: Error setting GPU limit: 55000.
Apr 26 22:30:31 myhost /usr/libexec/gdm-x-session[23845]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x0000072c, 0x00000d2c)
Apr 26 22:30:31 myhost /usr/libexec/gdm-x-session[23845]: (II) NVIDIA(G0): Setting mode "NULL"
Apr 26 22:30:31 myhost kernel: BUG: kernel NULL pointer dereference, address: 0000000000000008
Apr 26 22:30:31 myhost kernel: #PF: supervisor read access in kernel mode
Apr 26 22:30:31 myhost kernel: #PF: error_code(0x0000) - not-present page
Apr 26 22:30:31 myhost kernel: PGD 0 P4D 0 
Apr 26 22:30:31 myhost kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Apr 26 22:30:31 myhost kernel: CPU: 4 PID: 23845 Comm: Xorg Tainted: P           OE      6.8.7-300.fc40.x86_64 #1
Apr 26 22:30:31 myhost kernel: Hardware name: LENOVO 83DN/LNVNB161216, BIOS NKCN25WW 02/05/2024
Apr 26 22:30:31 myhost kernel: RIP: 0010:_nv002475kms+0x29/0xb0 [nvidia_modeset]
Apr 26 22:30:31 myhost kernel: Code: 00 f3 0f 1e fa 55 41 b8 14 00 00 00 48 89 e5 41 56 49 89 ce 41 55 48 8d 4d cc 41 89 d5 41 54 49 89 fc 53 48 89 f3 48 83 ec 20 <48>
Apr 26 22:30:31 myhost kernel: RSP: 0018:ffffadb2647e38e8 EFLAGS: 00010286
Apr 26 22:30:31 myhost kernel: RAX: ffffffffc438b170 RBX: 0000000000000000 RCX: ffffadb2647e38f4
Apr 26 22:30:31 myhost kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffadb2401a9008
Apr 26 22:30:31 myhost kernel: RBP: ffffadb2647e3928 R08: 0000000000000014 R09: ffffadb243251008

akarypid · April 27, 2024, 10:21am

Well, this is definitely an Nvidia driver issue. I was worried about this, haven’t owned an Nvidia device in years because of it…

Ironically the card works perfectly fine with PCI-passthrough into a Windows VM!

I’ve reverted to the non-nvidia image of Bluefin-DX (ostree-image-signed:docker://ghcr.io/ublue-os/bluefin-dx:latest)
I created a new Windows 11 virtual machine using Virt-Manager
I added a PCI device selecting the Nvidia card

Sure enough, I’m back to a stable host-system (battery life is reasonable, suspend/resume works, etc). Meanwhile the Nvidia card works fine inside the Windows VM! I was even able to suspend/resume with the Windows VM running!

Somewhat regretting my Yoga Pro 9i purchase now… I wonder how long before there is a stable kernel/nvidia-driver combination for my machine. Here is the nvidia module info, seems to be version 550.76:

Apr 26 22:43:18 fedora kernel: nvidia: module license 'NVIDIA' taints kernel.
Apr 26 22:43:18 fedora kernel: Disabling lock debugging due to kernel taint
Apr 26 22:43:18 fedora kernel: nvidia: module license taints kernel.
Apr 26 22:43:18 fedora kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 236
Apr 26 22:43:18 fedora kernel: 
Apr 26 22:43:18 fedora kernel: nvidia 0000:01:00.0: enabling device (0000 -> 0003)
Apr 26 22:43:18 fedora kernel: nvidia 0000:01:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=none:owns=none
Apr 26 22:43:18 fedora kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module  550.76  Wed Apr 10 20:41:20 UTC 2024
Apr 26 22:43:18 fedora kernel: nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
Apr 26 22:43:18 fedora kernel: nvidia-uvm: Loaded the UVM driver, major device number 234.
Apr 26 22:43:18 fedora kernel: nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms  550.76  Wed Apr 10 20:05:49 UTC 2024
Apr 26 22:43:18 fedora kernel: [drm] [nvidia-drm] [GPU ID 0x00000100] Loading driver
Apr 26 22:43:19 fedora kernel: ACPI Warning: \_SB.NPCF._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20230628/nsarguments-61)
Apr 26 22:43:19 fedora kernel: ACPI Warning: \_SB.PC00.RP12.PXSX._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20230628/nsarguments-61)
Apr 26 22:43:20 fedora kernel: nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device DP-0
Apr 26 22:43:20 fedora kernel: nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device DP-0
Apr 26 22:43:20 fedora kernel: [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:01:00.0 on minor 4
Apr 26 22:43:20 fedora kernel: nvidia 0000:01:00.0: [drm] Cannot find any crtc or sizes

And of course the latest Bluefin-DX kernel:

❯ uname -a
Linux myhost 6.8.7-300.fc40.x86_64 #1 SMP PREEMPT_DYNAMIC Wed Apr 17 19:21:08 UTC 2024 x86_64 GNU/Linux

I was actually able to add the PCI card to a VM running Fedora 40 as well, but for some reason the QXL adapter (the VM has multiple GPUs, one software QXL and one being the physical Nvidia) doesn’t take control of the login screen. The VM comes up, I can ssh into it, but I can’t login from the GDM login screen as it is not visible. Seems like it renders on the Nvidia screen and needs some physical monitor attached? I’ll look into it more, as it may be a good way to try newer kernels/drivers to see if it stabilizes…

I will update this post if I come across anything new, in case anyone else is using the same laptop…

akarypid · May 4, 2024, 10:13pm

The new kernel and drivers just arrived today and the situation is greatly improved:

suspend/resume works without issue so far
battery life is back to normal#
there are no errors logged from nvidia-powerd
Unigine Superposition runs at ~110 FPS average on Nvidia (as opposed to ~50 for Arc)

Just a few hours on this new kernel, but things are looking great.

EDIT: just for reference, the packages are

❯ rpm -qa | grep kmod-nvidia
kmod-nvidia-6.8.8-300.fc40.x86_64-550.78-1.fc40.x86_64

~ 
❯ rpm -qa | grep kernel
kernel-modules-core-6.8.8-300.fc40.x86_64
kernel-core-6.8.8-300.fc40.x86_64
kernel-modules-6.8.8-300.fc40.x86_64
kernel-6.8.8-300.fc40.x86_64
kernel-modules-extra-6.8.8-300.fc40.x86_64
kernel-tools-libs-6.8.8-300.fc40.x86_64
kernel-tools-6.8.8-300.fc40.x86_64
kernel-headers-6.8.3-300.fc40.x86_64

akarypid · August 23, 2024, 10:00am

I think I finally have a specific bug report for my Nvidia suspend/resume woes:

I am not even sure if Bluefin is using the open source modules, or if it is using the proprietary drivers?

Anyway, now that I have a bit more experience and have looked into the logs more thoroughly, I seem to get almost the exact same kernel dump when the system hangs after a suspend/resume. It looks like this:

Aug 22 04:23:21 host kernel: NVRM: failed to allocate vmap() page descriptor table!
Aug 22 04:23:22 host tailscaled[6306]: health(warnable=network-status): error: Tailscale cannot connect because the network is down. Check your Internet connection.
Aug 22 04:23:27 host kernel: NVRM: GPU at PCI:0000:01:00: GPU-10c63f06-fd37-85fa-8cc3-c04ed62b1782
Aug 22 04:23:27 host kernel: NVRM: Xid (PCI:0000:01:00): 119, pid=287264, name=nvidia-sleep.sh, Timeout after 6s of waiting for RPC response from GPU0 GSP! Expected function 76 (GSP_RM_CONTROL) (0x2080205b 0x4).
Aug 22 04:23:27 host kernel: NVRM: GPU0 GSP RPC buffer contains function 76 (GSP_RM_CONTROL) and data 0x000000002080205b 0x0000000000000004.
Aug 22 04:23:27 host kernel: NVRM: GPU0 RPC history (CPU -> GSP):
Aug 22 04:23:27 host kernel: NVRM:     entry function                   data0              data1              ts_start           ts_end             duration actively_polling
Aug 22 04:23:27 host kernel: NVRM:      0    76   GSP_RM_CONTROL        0x000000002080205b 0x0000000000000004 0x0006203d2e448e17 0x0000000000000000          y
Aug 22 04:23:27 host kernel: NVRM:     -1    47   UNLOADING_GUEST_DRIVE 0x0000000000000000 0x0000000000000000 0x0006203d2e304393 0x0006203d2e33069f 181004us  
Aug 22 04:23:27 host kernel: NVRM:     -2    10   FREE                  0x00000000c1e0006a 0x0000000000000000 0x0006203d2e304107 0x0006203d2e304327    544us  
Aug 22 04:23:27 host kernel: NVRM:     -3    10   FREE                  0x000000000000000a 0x0000000000000000 0x0006203d2e303db6 0x0006203d2e304104    846us  
Aug 22 04:23:27 host kernel: NVRM:     -4    10   FREE                  0x000000000000000b 0x0000000000000000 0x0006203d2e3038b7 0x0006203d2e303a1d    358us  
Aug 22 04:23:27 host kernel: NVRM:     -5    10   FREE                  0x0000000000000006 0x0000000000000000 0x0006203d2e303435 0x0006203d2e30389f   1130us  
Aug 22 04:23:27 host kernel: NVRM:     -6    10   FREE                  0x0000000000000002 0x0000000000000000 0x0006203d2e3026f6 0x0006203d2e303396   3232us  
Aug 22 04:23:27 host kernel: NVRM:     -7    10   FREE                  0x0000000000000005 0x0000000000000000 0x0006203d2e302001 0x0006203d2e3026e5   1764us  
Aug 22 04:23:27 host kernel: NVRM: GPU0 RPC event history (CPU <- GSP):
Aug 22 04:23:27 host kernel: NVRM:     entry function                   data0              data1              ts_start           ts_end             duration during_incomplete_rpc
Aug 22 04:23:27 host kernel: NVRM:      0    4108 UCODE_LIBOS_PRINT     0x0000000000000000 0x0000000000000000 0x0006203d2e30f929 0x0006203d2e30f92a      1us  
Aug 22 04:23:27 host kernel: NVRM:     -1    4128 GSP_POST_NOCAT_RECORD 0x0000000000000002 0x0000000000000028 0x0006203d2e307db6 0x0006203d2e307dba      4us  
Aug 22 04:23:27 host kernel: NVRM:     -2    4111 PERF_BRIDGELESS_INFO_ 0x0000000000000000 0x0000000000000000 0x0006203d2e2845bf 0x0006203d2e2845c0      1us  
Aug 22 04:23:27 host kernel: NVRM:     -3    4099 POST_EVENT            0x0000000000000021 0x0000000000000100 0x0006203c70050062 0x0006203c70050077     21us  
Aug 22 04:23:27 host kernel: NVRM:     -4    4128 GSP_POST_NOCAT_RECORD 0x0000000000000005 0x00000000016515d8 0x0006203c6f658112 0x0006203c6f658117      5us  
Aug 22 04:23:27 host kernel: NVRM:     -5    4099 POST_EVENT            0x0000000000000021 0x0000000000000008 0x0006203c6f613959 0x0006203c6f613970     23us  
Aug 22 04:23:27 host kernel: NVRM:     -6    4099 POST_EVENT            0x0000000000000021 0x0000000000000001 0x0006203c6f5f1169 0x0006203c6f5f1178     15us  
Aug 22 04:23:27 host kernel: NVRM:     -7    4108 UCODE_LIBOS_PRINT     0x0000000000000000 0x0000000000000000 0x0006203c6f5d2a1b 0x0006203c6f5d2a1c      1us  
Aug 22 04:23:27 host kernel: CPU: 3 PID: 287264 Comm: nvidia-sleep.sh Tainted: P        W  OE      6.10.4-201.fsync.fc40.x86_64 #1
Aug 22 04:23:27 host kernel: Hardware name: LENOVO 83DN/INVALID, BIOS NKCN24WW 01/16/2024
Aug 22 04:23:27 host kernel: Call Trace:
Aug 22 04:23:27 host kernel:  <TASK>
Aug 22 04:23:27 host kernel:  dump_stack_lvl+0x5d/0x80
Aug 22 04:23:27 host kernel:  _nv012948rm+0x4ee/0x590 [nvidia]
Aug 22 04:23:27 host kernel:  _nv012865rm+0x74/0x330 [nvidia]
Aug 22 04:23:27 host kernel:  _nv048625rm+0x49f/0x7f0 [nvidia]
Aug 22 04:23:27 host kernel:  _nv051989rm+0xa1/0x150 [nvidia]
Aug 22 04:23:27 host kernel:  _nv047906rm+0x19e/0x1b0 [nvidia]
Aug 22 04:23:27 host kernel:  _nv049930rm+0x3fc/0x500 [nvidia]
Aug 22 04:23:27 host kernel:  _nv014741rm+0x42e/0x690 [nvidia]
Aug 22 04:23:27 host kernel:  _nv048043rm+0x26/0x30 [nvidia]
Aug 22 04:23:27 host kernel:  _nv000762rm+0x55/0x70 [nvidia]
Aug 22 04:23:27 host kernel:  _nv000761rm+0x21b/0x220 [nvidia]
Aug 22 04:23:27 host kernel:  _nv000713rm+0x2ad/0x300 [nvidia]
Aug 22 04:23:27 host kernel:  rm_power_management+0x22c/0x260 [nvidia]
Aug 22 04:23:27 host kernel:  nv_power_management+0x92/0x170 [nvidia]
Aug 22 04:23:27 host kernel:  nvidia_suspend+0x6c/0x100 [nvidia]
Aug 22 04:23:27 host kernel:  nv_set_system_power_state+0x407/0x470 [nvidia]
Aug 22 04:23:27 host kernel:  nv_procfs_write_suspend+0x105/0x1b0 [nvidia]
Aug 22 04:23:27 host kernel:  proc_reg_write+0x5a/0xa0
Aug 22 04:23:27 host kernel:  vfs_write+0xf5/0x460
Aug 22 04:23:27 host kernel:  ? syscall_exit_to_user_mode+0x72/0x220
Aug 22 04:23:27 host kernel:  ? do_syscall_64+0x8e/0x160
Aug 22 04:23:27 host kernel:  ksys_write+0x6d/0xf0
Aug 22 04:23:27 host kernel:  do_syscall_64+0x82/0x160
Aug 22 04:23:27 host kernel:  ? do_user_addr_fault+0x36c/0x620
Aug 22 04:23:27 host kernel:  ? exc_page_fault+0x7e/0x180
Aug 22 04:23:27 host kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
Aug 22 04:23:27 host kernel: RIP: 0033:0x7f2a0417e984
Aug 22 04:23:27 host kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d c5 06 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
Aug 22 04:23:27 host kernel: RSP: 002b:00007fff605bd948 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 22 04:23:27 host kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f2a0417e984
Aug 22 04:23:27 host kernel: RDX: 0000000000000008 RSI: 00005614b49f0160 RDI: 0000000000000001
Aug 22 04:23:27 host kernel: RBP: 00007fff605bd970 R08: 0000000000000410 R09: 0000000000000001
Aug 22 04:23:27 host kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 22 04:23:27 host kernel: R13: 00005614b49f0160 R14: 00007f2a042585c0 R15: 00007f2a04255f00
Aug 22 04:23:27 host kernel:  </TASK>
Aug 22 04:23:28 host systemd[1]: NetworkManager-dispatcher.service: Deactivated successfully.
Aug 22 04:23:28 host audit[1]: SERVICE_STOP pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=NetworkManager-dispatcher comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Aug 22 04:23:33 host kernel: NVRM: Xid (PCI:0000:01:00): 119, pid=287264, name=nvidia-sleep.sh, Timeout after 6s of waiting for RPC response from GPU0 GSP! Expected function 76 (GSP_RM_CONTROL) (0x20800a81 0x4).
Aug 22 04:23:33 host kernel: ------------[ cut here ]------------
Aug 22 04:23:33 host kernel: WARNING: CPU: 12 PID: 287264 at /tmp/akmodsbuild.Jpake0km/BUILD/nvidia-kmod-560.31.02-x86_64/_kmod_build_6.10.4-201.fsync.fc40.x86_64/kernel/nvidia/nv.c:4598 nv_set_system_power_state+0x40d/0x470 [nvidia]
Aug 22 04:23:33 host kernel: Modules linked in: uas usb_storage uinput xt_addrtype xt_nat xt_mark xt_conntrack xt_comment xt_MASQUERADE nft_compat veth bridge stp llc overlay snd_seq_dummy snd_hrtimer rfcomm tun nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables qrtr bnep snd_ctl_led snd_soc_skl_hda_dsp snd_soc_hdac_hdmi snd_soc_intel_hda_dsp_common snd_sof_probes snd_hda_codec_hdmi spi_nor snd_hda_codec_realtek snd_soc_dmic snd_hda_codec_generic mtd snd_hda_scodec_component iTCO_wdt mei_gsc_proxy intel_pmc_bxt iTCO_vendor_support intel_rapl_msr intel_uncore_frequency snd_sof_pci_intel_mtl intel_uncore_frequency_common x86_pkg_temp_thermal snd_sof_intel_hda_generic intel_powerclamp coretemp soundwire_intel soundwire_cadence snd_sof_intel_hda_common kvm_intel binfmt_misc snd_sof_intel_hda_mlink snd_sof_intel_hda kvm vfat fat snd_sof_pci rapl
Aug 22 04:23:33 host kernel:  snd_sof_xtensa_dsp snd_sof snd_sof_utils intel_cstate snd_soc_hdac_hda uvcvideo btusb snd_hda_ext_core btrtl uvc videobuf2_vmalloc btintel videobuf2_memops snd_soc_acpi_intel_match btbcm intel_uncore videobuf2_v4l2 soundwire_generic_allocation btmtk snd_soc_acpi videobuf2_common iwlmvm soundwire_bus bluetooth wl(POE) pcspkr snd_hda_scodec_tas2781_i2c snd_hda_intel mac80211 snd_soc_tas2781_fmwlib nvidia_wmi_ec_backlight snd_intel_dspcfg wmi_bmof snd_soc_tas2781_comlib snd_intel_sdw_acpi crc8 libarc4 snd_hda_codec processor_thermal_device_pci processor_thermal_device spi_intel_pci snd_soc_core snd_hda_core mei_me processor_thermal_wt_hint snd_hwdep spi_intel processor_thermal_rfim i2c_i801 processor_thermal_rapl mei i2c_smbus idma64 snd_compress intel_rapl_common ac97_bus iwlwifi snd_pcm_dmaengine thunderbolt processor_thermal_wt_req intel_vpu snd_seq processor_thermal_power_floor processor_thermal_mbox snd_seq_device igen6_edac ideapad_laptop snd_pcm int3403_thermal int340x_thermal_zone intel_pmc_core
Aug 22 04:23:33 host kernel:  snd_timer legion_laptop snd platform_profile soundcore int3400_thermal intel_vsec acpi_thermal_rel intel_hid pmt_telemetry sparse_keymap pmt_class acpi_pad acpi_tad joydev brcmfmac brcmutil cfg80211 rfkill auth_rpcgss scsi_dh_rdac scsi_dh_emc sunrpc scsi_dh_alua kvmfr(OE) loop dm_multipath nfnetlink zram dm_crypt xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper drm_ttm_helper hid_sensor_hub intel_ishtp_hid i915 drm_buddy i2c_algo_bit crct10dif_pclmul crc32_pclmul drm_display_helper crc32c_intel rtsx_pci_sdmmc nvme polyval_clmulni polyval_generic nvme_core mmc_core intel_ish_ipc cec hid_multitouch ucsi_acpi ghash_clmulni_intel sha512_ssse3 typec_ucsi sha256_ssse3 rtsx_pci sha1_ssse3 intel_ishtp ttm typec nvme_auth i2c_hid_acpi i2c_hid pinctrl_meteorlake serio_raw nvidia_drm(POE) nvidia_modeset(POE) video wmi nvidia_uvm(POE) nvidia(POE) v4l2loopback(OE) videodev mc ip6_tables ip_tables fuse i2c_dev
Aug 22 04:23:33 host kernel: Unloaded tainted modules: nvidia_peermem(POE):1
Aug 22 04:23:33 host kernel: CPU: 12 PID: 287264 Comm: nvidia-sleep.sh Tainted: P        W  OE      6.10.4-201.fsync.fc40.x86_64 #1
Aug 22 04:23:33 host kernel: Hardware name: LENOVO 83DN/INVALID, BIOS NKCN24WW 01/16/2024
Aug 22 04:23:33 host kernel: RIP: 0010:nv_set_system_power_state+0x40d/0x470 [nvidia]
Aug 22 04:23:33 host kernel: Code: 0f eb 40 4d 8b a4 24 70 06 00 00 4d 85 e4 74 33 49 8b bc 24 d0 02 00 00 ba 01 00 00 00 89 de e8 79 c7 ff ff 89 c5 85 c0 74 d9 <0f> 0b 48 c7 c7 00 0d 40 c1 41 bd 01 00 00 00 e8 cf 0b b8 d0 e9 f9
Aug 22 04:23:33 host kernel: RSP: 0018:ffffab05ab52fd40 EFLAGS: 00010206
Aug 22 04:23:33 host kernel: RAX: 000000000000ffff RBX: 0000000000000001 RCX: 0000000080020001
Aug 22 04:23:33 host kernel: RDX: ffffa005839df650 RSI: 0000000000000286 RDI: ffffa005839df648
Aug 22 04:23:33 host kernel: RBP: 000000000000ffff R08: ffffa00591393000 R09: 0000000080020001
Aug 22 04:23:33 host kernel: R10: 0000000080020001 R11: 0000000000000100 R12: ffffa005839df000
Aug 22 04:23:33 host kernel: R13: ffffa005839df648 R14: ffffa00591393000 R15: ffffa00591393000
Aug 22 04:23:33 host kernel: FS:  00007f2a0406d740(0000) GS:ffffa00cdf400000(0000) knlGS:0000000000000000
Aug 22 04:23:33 host kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 22 04:23:33 host kernel: CR2: 00000a18ef2be000 CR3: 000000013ec1e006 CR4: 0000000000f70ef0
Aug 22 04:23:33 host kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 22 04:23:33 host kernel: DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400
Aug 22 04:23:33 host kernel: PKRU: 55555554
Aug 22 04:23:33 host kernel: Call Trace:
Aug 22 04:23:33 host kernel:  <TASK>
Aug 22 04:23:33 host kernel:  ? nv_set_system_power_state+0x40d/0x470 [nvidia]
Aug 22 04:23:33 host kernel:  ? __warn.cold+0x8e/0xe8
Aug 22 04:23:33 host kernel:  ? nv_set_system_power_state+0x40d/0x470 [nvidia]
Aug 22 04:23:33 host kernel:  ? report_bug+0xff/0x140
Aug 22 04:23:33 host kernel:  ? handle_bug+0x3c/0x80
Aug 22 04:23:33 host kernel:  ? exc_invalid_op+0x17/0x70
Aug 22 04:23:33 host kernel:  ? asm_exc_invalid_op+0x1a/0x20
Aug 22 04:23:33 host kernel:  ? nv_set_system_power_state+0x40d/0x470 [nvidia]
Aug 22 04:23:33 host kernel:  nv_procfs_write_suspend+0x105/0x1b0 [nvidia]
Aug 22 04:23:33 host kernel:  proc_reg_write+0x5a/0xa0
Aug 22 04:23:33 host kernel:  vfs_write+0xf5/0x460
Aug 22 04:23:33 host kernel:  ? syscall_exit_to_user_mode+0x72/0x220
Aug 22 04:23:33 host kernel:  ? do_syscall_64+0x8e/0x160
Aug 22 04:23:33 host kernel:  ksys_write+0x6d/0xf0
Aug 22 04:23:33 host kernel:  do_syscall_64+0x82/0x160
Aug 22 04:23:33 host kernel:  ? do_user_addr_fault+0x36c/0x620
Aug 22 04:23:33 host kernel:  ? exc_page_fault+0x7e/0x180
Aug 22 04:23:33 host kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
Aug 22 04:23:33 host kernel: RIP: 0033:0x7f2a0417e984
Aug 22 04:23:33 host kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d c5 06 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
Aug 22 04:23:33 host kernel: RSP: 002b:00007fff605bd948 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 22 04:23:33 host kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f2a0417e984
Aug 22 04:23:33 host kernel: RDX: 0000000000000008 RSI: 00005614b49f0160 RDI: 0000000000000001
Aug 22 04:23:33 host kernel: RBP: 00007fff605bd970 R08: 0000000000000410 R09: 0000000000000001
Aug 22 04:23:33 host kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 22 04:23:33 host kernel: R13: 00005614b49f0160 R14: 00007f2a042585c0 R15: 00007f2a04255f00
Aug 22 04:23:33 host kernel:  </TASK>
Aug 22 04:23:33 host kernel: ---[ end trace 0000000000000000 ]---
Aug 22 04:23:39 host kernel: NVRM: Xid (PCI:0000:01:00): 119, pid=287264, name=nvidia-sleep.sh, Timeout after 6s of waiting for RPC response from GPU0 GSP! Expected function 76 (GSP_RM_CONTROL) (0x20801117 0x1).
Aug 22 04:23:39 host kernel: ------------[ cut here ]------------
Aug 22 04:23:39 host kernel: WARNING: CPU: 11 PID: 287264 at /tmp/akmodsbuild.Jpake0km/BUILD/nvidia-kmod-560.31.02-x86_64/_kmod_build_6.10.4-201.fsync.fc40.x86_64/kernel/nvidia/nv.c:4310 nv_restore_user_channels+0x4e/0x1d0 [nvidia]
Aug 22 04:23:39 host kernel: Modules linked in: uas usb_storage uinput xt_addrtype xt_nat xt_mark xt_conntrack xt_comment xt_MASQUERADE nft_compat veth bridge stp llc overlay snd_seq_dummy snd_hrtimer rfcomm tun nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables qrtr bnep snd_ctl_led snd_soc_skl_hda_dsp snd_soc_hdac_hdmi snd_soc_intel_hda_dsp_common snd_sof_probes snd_hda_codec_hdmi spi_nor snd_hda_codec_realtek snd_soc_dmic snd_hda_codec_generic mtd snd_hda_scodec_component iTCO_wdt mei_gsc_proxy intel_pmc_bxt iTCO_vendor_support intel_rapl_msr intel_uncore_frequency snd_sof_pci_intel_mtl intel_uncore_frequency_common x86_pkg_temp_thermal snd_sof_intel_hda_generic intel_powerclamp coretemp soundwire_intel soundwire_cadence snd_sof_intel_hda_common kvm_intel binfmt_misc snd_sof_intel_hda_mlink snd_sof_intel_hda kvm vfat fat snd_sof_pci rapl
Aug 22 04:23:39 host kernel:  snd_sof_xtensa_dsp snd_sof snd_sof_utils intel_cstate snd_soc_hdac_hda uvcvideo btusb snd_hda_ext_core btrtl uvc videobuf2_vmalloc btintel videobuf2_memops snd_soc_acpi_intel_match btbcm intel_uncore videobuf2_v4l2 soundwire_generic_allocation btmtk snd_soc_acpi videobuf2_common iwlmvm soundwire_bus bluetooth wl(POE) pcspkr snd_hda_scodec_tas2781_i2c snd_hda_intel mac80211 snd_soc_tas2781_fmwlib nvidia_wmi_ec_backlight snd_intel_dspcfg wmi_bmof snd_soc_tas2781_comlib snd_intel_sdw_acpi crc8 libarc4 snd_hda_codec processor_thermal_device_pci processor_thermal_device spi_intel_pci snd_soc_core snd_hda_core mei_me processor_thermal_wt_hint snd_hwdep spi_intel processor_thermal_rfim i2c_i801 processor_thermal_rapl mei i2c_smbus idma64 snd_compress intel_rapl_common ac97_bus iwlwifi snd_pcm_dmaengine thunderbolt processor_thermal_wt_req intel_vpu snd_seq processor_thermal_power_floor processor_thermal_mbox snd_seq_device igen6_edac ideapad_laptop snd_pcm int3403_thermal int340x_thermal_zone intel_pmc_core
Aug 22 04:23:39 host kernel:  snd_timer legion_laptop snd platform_profile soundcore int3400_thermal intel_vsec acpi_thermal_rel intel_hid pmt_telemetry sparse_keymap pmt_class acpi_pad acpi_tad joydev brcmfmac brcmutil cfg80211 rfkill auth_rpcgss scsi_dh_rdac scsi_dh_emc sunrpc scsi_dh_alua kvmfr(OE) loop dm_multipath nfnetlink zram dm_crypt xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper drm_ttm_helper hid_sensor_hub intel_ishtp_hid i915 drm_buddy i2c_algo_bit crct10dif_pclmul crc32_pclmul drm_display_helper crc32c_intel rtsx_pci_sdmmc nvme polyval_clmulni polyval_generic nvme_core mmc_core intel_ish_ipc cec hid_multitouch ucsi_acpi ghash_clmulni_intel sha512_ssse3 typec_ucsi sha256_ssse3 rtsx_pci sha1_ssse3 intel_ishtp ttm typec nvme_auth i2c_hid_acpi i2c_hid pinctrl_meteorlake serio_raw nvidia_drm(POE) nvidia_modeset(POE) video wmi nvidia_uvm(POE) nvidia(POE) v4l2loopback(OE) videodev mc ip6_tables ip_tables fuse i2c_dev
Aug 22 04:23:39 host kernel: Unloaded tainted modules: nvidia_peermem(POE):1
Aug 22 04:23:39 host kernel: CPU: 11 PID: 287264 Comm: nvidia-sleep.sh Tainted: P        W  OE      6.10.4-201.fsync.fc40.x86_64 #1
Aug 22 04:23:39 host kernel: Hardware name: LENOVO 83DN/INVALID, BIOS NKCN24WW 01/16/2024
Aug 22 04:23:39 host kernel: RIP: 0010:nv_restore_user_channels+0x4e/0x1d0 [nvidia]
Aug 22 04:23:39 host kernel: Code: 24 48 06 00 00 4c 89 ef e8 7f 4a b8 d0 f6 43 10 01 74 73 48 89 de 31 ff e8 af a0 bc 00 41 89 c6 85 c0 0f 84 3a 01 00 00 31 ed <0f> 0b 49 81 c4 70 07 00 00 4c 89 e7 e8 51 4a b8 d0 be 01 00 00 00
Aug 22 04:23:39 host kernel: RSP: 0018:ffffab05ab52fd10 EFLAGS: 00010206
Aug 22 04:23:39 host kernel: RAX: 0000000000000065 RBX: ffffa005839df000 RCX: ffffab05ab52fc90
Aug 22 04:23:39 host kernel: RDX: 0000000000000000 RSI: 0000000000000246 RDI: ffffab05ab52fc40
Aug 22 04:23:39 host kernel: RBP: ffffa009036ab000 R08: 0000000000000000 R09: 00000000802a0005
Aug 22 04:23:39 host kernel: R10: 00000000802a0005 R11: 0000000000000000 R12: ffffa005839df000
Aug 22 04:23:39 host kernel: R13: ffffa005839df648 R14: 0000000000000065 R15: ffffa00591393000
Aug 22 04:23:39 host kernel: FS:  00007f2a0406d740(0000) GS:ffffa00cdf380000(0000) knlGS:0000000000000000
Aug 22 04:23:39 host kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 22 04:23:39 host kernel: CR2: 00007f9d7b441000 CR3: 000000013ec1e005 CR4: 0000000000f70ef0
Aug 22 04:23:39 host kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug 22 04:23:39 host kernel: DR3: 0000000000000000 DR6: 00000000ffff07f0 DR7: 0000000000000400
Aug 22 04:23:39 host kernel: PKRU: 55555554
Aug 22 04:23:39 host kernel: Call Trace:
Aug 22 04:23:39 host kernel:  <TASK>
Aug 22 04:23:39 host kernel:  ? nv_restore_user_channels+0x4e/0x1d0 [nvidia]
Aug 22 04:23:39 host kernel:  ? __warn.cold+0x8e/0xe8
Aug 22 04:23:39 host kernel:  ? nv_restore_user_channels+0x4e/0x1d0 [nvidia]
Aug 22 04:23:39 host kernel:  ? report_bug+0xff/0x140
Aug 22 04:23:39 host kernel:  ? handle_bug+0x3c/0x80
Aug 22 04:23:39 host kernel:  ? exc_invalid_op+0x17/0x70
Aug 22 04:23:39 host kernel:  ? asm_exc_invalid_op+0x1a/0x20
Aug 22 04:23:39 host kernel:  ? nv_restore_user_channels+0x4e/0x1d0 [nvidia]
Aug 22 04:23:39 host kernel:  nv_set_system_power_state+0x38d/0x470 [nvidia]
Aug 22 04:23:39 host kernel:  nv_procfs_write_suspend+0x105/0x1b0 [nvidia]
Aug 22 04:23:39 host kernel:  proc_reg_write+0x5a/0xa0
Aug 22 04:23:39 host kernel:  vfs_write+0xf5/0x460
Aug 22 04:23:39 host kernel:  ? syscall_exit_to_user_mode+0x72/0x220
Aug 22 04:23:39 host kernel:  ? do_syscall_64+0x8e/0x160
Aug 22 04:23:39 host kernel:  ksys_write+0x6d/0xf0
Aug 22 04:23:39 host kernel:  do_syscall_64+0x82/0x160
Aug 22 04:23:39 host kernel:  ? do_user_addr_fault+0x36c/0x620
Aug 22 04:23:39 host kernel:  ? exc_page_fault+0x7e/0x180
Aug 22 04:23:39 host kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
Aug 22 04:23:39 host kernel: RIP: 0033:0x7f2a0417e984
Aug 22 04:23:39 host kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d c5 06 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 55 48 89 e5 48 83 ec 20 48 89
Aug 22 04:23:39 host kernel: RSP: 002b:00007fff605bd948 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Aug 22 04:23:39 host kernel: RAX: ffffffffffffffda RBX: 0000000000000008 RCX: 00007f2a0417e984
Aug 22 04:23:39 host kernel: RDX: 0000000000000008 RSI: 00005614b49f0160 RDI: 0000000000000001
Aug 22 04:23:39 host kernel: RBP: 00007fff605bd970 R08: 0000000000000410 R09: 0000000000000001
Aug 22 04:23:39 host kernel: R10: 0000000000000004 R11: 0000000000000202 R12: 0000000000000008
Aug 22 04:23:39 host kernel: R13: 00005614b49f0160 R14: 00007f2a042585c0 R15: 00007f2a04255f00
Aug 22 04:23:39 host kernel:  </TASK>
Aug 22 04:23:39 host kernel: ---[ end trace 0000000000000000 ]---
Aug 22 04:23:39 host nvidia-sleep.sh[287264]: /usr/bin/nvidia-sleep.sh: line 20: echo: write error: Input/output error
Aug 22 04:23:39 host systemd[1]: nvidia-suspend.service: Main process exited, code=exited, status=1/FAILURE
Aug 22 04:23:39 host audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=nvidia-suspend comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=failed'
Aug 22 04:23:39 host systemd[1]: nvidia-suspend.service: Failed with result 'exit-code'.
Aug 22 04:23:39 host systemd[1]: Failed to start nvidia-suspend.service - NVIDIA system suspend actions.
Aug 22 04:23:39 host systemd[1]: nvidia-suspend.service: Consumed 19.881s CPU time.
Aug 22 04:23:39 host systemd[1]: Starting systemd-suspend.service - System Suspend...
Aug 22 04:23:39 host systemd-sleep[287394]: Performing sleep operation 'suspend'...
Aug 22 04:23:39 host kernel: PM: suspend entry (s2idle)
Aug 22 04:23:45 host /usr/bin/nvidia-powerd[5837]: Failed to get prerequisites for nvidia-powerd
Aug 22 04:23:45 host kernel: NVRM: Rate limiting GSP RPC error prints for GPU at PCI:0000:01:00 (printing 1 of every 30).  The GPU likely needs to be reset.
Aug 22 04:23:47 host kernel: Filesystems sync: 7.436 seconds
Aug 22 04:23:48 host kernel: Freezing user space processes
Aug 22 04:23:48 host kernel: Freezing user space processes completed (elapsed 0.004 seconds)
Aug 22 04:23:48 host kernel: OOM killer disabled.
Aug 22 04:23:48 host kernel: Freezing remaining freezable tasks
Aug 22 04:23:48 host kernel: Freezing remaining freezable tasks completed (elapsed 0.002 seconds)
Aug 22 04:23:48 host kernel: printk: Suspending console(s) (use no_console_suspend to debug)
Aug 22 04:23:48 host kernel: NVRM: GPU 0000:01:00.0: PreserveVideoMemoryAllocations module parameter is set. System Power Management attempted without driver procfs suspend interface. Please refer to the 'Configuring Power Management Support' section in the driver README.
Aug 22 04:23:48 host kernel: nvidia 0000:01:00.0: PM: pci_pm_suspend(): nv_pmops_suspend [nvidia] returns -5
Aug 22 04:23:48 host kernel: nvidia 0000:01:00.0: PM: dpm_run_callback(): pci_pm_suspend returns -5
Aug 22 04:23:48 host kernel: nvidia 0000:01:00.0: PM: failed to suspend async: error -5
Aug 22 04:23:48 host kernel: PM: Some devices failed to suspend, or early wake event detected

Does anyone else have issues with suspend resume? Would be interested in knowing if you get a similar stack trace. I have subscribed to the github bug report, though I am not 100% certain this is the same issue as described there…

Really hoping they finally fix this… It’s pretty annoying that I can’t suspend/resume my laptop properly. It’s either disable Nvidia GPU in the BIOS for a stable hassle-free experience, or enable it to use it (but turn off suspend to avoid freezes)…

NoYards · August 25, 2024, 3:31am

Pretty sure it’s using the proprietary drivers.

As for suspension problems I have noticed my system has failed a few times coming out of suspend. But I was using my GPU’s HDMI for my monitor’s output, but I recently enabled my integrated video on my CPU and have started using that HDMI (save a few bits of VRAM memory for AI stuff, and I don’t game.)

So far I have had no issues with the video output and I haven’t noticed any failed resume from suspends lately.

akarypid · August 26, 2024, 11:11am

On my Lenovo Yoga Pro 9i, the problem occurs without any external monitors connected…

I am simply forced to go into the BIOS and choose UMA graphics (i.e. CPU graphics) which totally disable the Nvidia.

Every time I notice some Nvidia update, I try using it again, but within 24 hours I’ll come across the suspend/resume issue…

Topic		Replies	Views
Nvidia runtime D3 status disabled by default Bluefin	2	95	March 15, 2025
Best branch for Nvidia hybrid laptop? General	14	527	April 26, 2025
Nvidia issues with bluefin-dx-asus-nvidia Bluefin	3	662	April 28, 2024
PSA: Try disabling S0ix with Nvidia 40xx laptops General	0	98	March 29, 2025
Making the most out of Nvidia dGPU on laptop General	0	152	November 26, 2024

Bluefin-dx: Nvidia woes

Related topics