Re: Hardware problem ...

Author: Olivier Allard-Jacquin
Date:  
To: guilde
Subject: Re: Hardware problem ...

    Good evening,

On 23/09/2022 at 20:39, Patrice Karatchentzeff wrote:

> Hi Jérôme,
>
> Boot into memtest and validate your RAM. Then, if your CPU has an
> integrated GPU, remove your card and validate the motherboard. After
> that, you can throw away your video card :)


    The kernel reports "HP-Pavilion KQ517AA-ABF", so it is presumably a 
laptop, which makes removing the video card difficult.

    I would lean toward a temperature problem, so opening the machine 
and giving it a thorough dusting may help.
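
    To check the temperature hypothesis before opening the case, something like the sketch below could be used. It assumes the kernel exposes sensors under /sys/class/thermal (values in millidegrees Celsius); the function name is only illustrative, and the GPU sensor may well live elsewhere (for example under hwmon), so an empty or partial result proves nothing.

```python
from pathlib import Path


def read_thermal_zones(base="/sys/class/thermal"):
    """Return {zone_name: temperature in degrees Celsius} for readable zones.

    Assumption: each thermal_zone*/temp file holds an integer in
    millidegrees Celsius, as the Linux thermal sysfs interface does.
    """
    readings = {}
    for zone in sorted(Path(base).glob("thermal_zone*")):
        try:
            millideg = int((zone / "temp").read_text().strip())
        except (OSError, ValueError):
            # Zone missing, unreadable, or not reporting a number: skip it.
            continue
        readings[zone.name] = millideg / 1000.0
    return readings


if __name__ == "__main__":
    for name, celsius in read_thermal_zones().items():
        print(f"{name}: {celsius:.1f} C")
```

    Running it a few times while the machine is under graphical load would show whether the readings climb just before the errors appear.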

    Otherwise, a search turns up quite a few results:
https://www.google.com/search?q=ttm+buffer+eviction+failed

Among other things, they suggest that the proprietary drivers may help 
(in particular, they may (or may not) manage temperature better than "nouveau").
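
    To see how often these messages come back over time, the log can be summarized with a small script. This is only a sketch: the patterns are taken from the messages quoted further down in this thread, and the names (PATTERNS, summarize) are made up for the example.

```python
import re
from collections import Counter

# Patterns copied from the messages in the quoted dmesg output.
PATTERNS = {
    "ttm_eviction_failed": re.compile(r"\[TTM\] Buffer eviction failed"),
    "nouveau_trapped_read": re.compile(r"nouveau .*fb: trapped read"),
    "kernel_warning": re.compile(r"WARNING: CPU: \d+ PID: \d+"),
}


def summarize(lines):
    """Count how many log lines match each known pattern."""
    counts = Counter()
    for line in lines:
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                counts[name] += 1
    return counts


if __name__ == "__main__":
    # Three lines from the log in this thread, used as a demonstration.
    sample = [
        "[ 1100.123544] [TTM] Buffer eviction failed",
        "[ 1163.610356] [TTM] Buffer eviction failed",
        "[ 1443.021702] WARNING: CPU: 0 PID: 1399 at mm/vmalloc.c:2245 __vunmap+0x267/0x290",
    ]
    for name, count in sorted(summarize(sample).items()):
        print(f"{name}: {count}")
```

    On a real machine, the lines would come from a saved copy of the dmesg output rather than from the sample list.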

    Finally, you seem to be running a particularly old kernel: 
5.10.0-18-amd64. Is there a reason for that?

    On a very recent machine, I saw (just this morning!) repeated 
crashes involving video (a nouveau module was among the culprits) with 
a 5.14 kernel, whereas a 5.19 kernel caused no problems on the same 
machine.
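
    For comparing the running kernel against a given release, a minimal sketch follows. It assumes the release string starts with "major.minor" (as in 5.10.0-18-amd64) and compares only those two numbers; the helper names are made up, and 5.19 is used as the reference only because it is the version that behaved well in my case.

```python
import platform
import re


def kernel_version_tuple(release):
    """Extract (major, minor) from a release string such as '5.10.0-18-amd64'."""
    match = re.match(r"(\d+)\.(\d+)", release)
    if match is None:
        raise ValueError(f"unrecognized kernel release: {release!r}")
    return int(match.group(1)), int(match.group(2))


def is_older_than(release, reference):
    """True if `release` has a lower major.minor than `reference`."""
    return kernel_version_tuple(release) < kernel_version_tuple(reference)


if __name__ == "__main__":
    running = platform.release()  # e.g. '5.10.0-18-amd64' on Debian
    try:
        print(running, "older than 5.19:", is_older_than(running, "5.19"))
    except ValueError as err:
        print(err)
```

    Comparing tuples of integers avoids the trap of comparing strings, where "5.9" would sort after "5.10".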

    Regards,
                            Olivier


> On Fri, 23 Sep 2022 at 18:02, Jérôme Kieffer
> <jerome.kieffer@???> wrote:
>>
>> Hello,
>>
>> One of my PCs, not a very recent one, is starting to show signs of weakness...
>> I would like confirmation of whether this is more likely a RAM problem or
>> whether I should look at the nouveau driver instead (NVIDIA G98M GeForce 9300M GS card).
>> Here are the machine's logs (dmesg):
>>
>>
>> [  617.484198] usbcore: registered new interface driver snd-usb-audio
>> [  657.537381] nouveau 0000:05:00.0: fb: trapped read at 0000705148 on channel -1 [0fee0000 unknown] engine 06 [BAR] client 08 [PFIFO_READ] subclient 00 [FB] reason 00000002 [PAGE_NOT_PRESENT]
>> [  760.295581] perf: interrupt took too long (4986 > 4985), lowering kernel.perf_event_max_sample_rate to 40000
>> [ 1067.636255] nouveau 0000:05:00.0: fb: trapped read at 0000702b00 on channel -1 [0fee0000 unknown] engine 06 [BAR] client 08 [PFIFO_READ] subclient 00 [FB] reason 00000002 [PAGE_NOT_PRESENT]
>> [ 1100.123544] [TTM] Buffer eviction failed
>> [ 1163.610356] [TTM] Buffer eviction failed
>> [ 1380.438734] [TTM] Buffer eviction failed
>> [ 1443.021668] ------------[ cut here ]------------
>> [ 1443.021677] Trying to vfree() bad address (000000007dacf4eb)
>> [ 1443.021702] WARNING: CPU: 0 PID: 1399 at mm/vmalloc.c:2245 __vunmap+0x267/0x290
>> [ 1443.021704] Modules linked in: snd_usb_audio snd_usbmidi_lib cm109 snd_rawmidi snd_seq_device rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace nfs_ssc fscache snd_hda_codec_analog snd_hda_codec_generic ledtrig_audio btusb btrtl btbcm btintel bluetooth snd_hda_intel snd_intel_dspcfg soundwire_intel jitterentropy_rng ctr soundwire_generic_allocation snd_soc_core drbg uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_compress videobuf2_common soundwire_cadence snd_hda_codec videodev snd_hda_core aes_generic ir_rc6_decoder snd_hwdep rc_rc6_mce crypto_simd soundwire_bus iTCO_wdt intel_pmc_bxt cryptd iTCO_vendor_support mc glue_helper at24 watchdog snd_pcm mceusb joydev serio_raw pcspkr ansi_cprng rc_core snd_timer sg snd ecdh_generic rfkill ecc libaes soundcore evdev acpi_cpufreq binfmt_misc udlfb coretemp parport_pc ppdev lp parport sunrpc fuse configfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic sd_mod t10_pi crc_t10dif crct10dif_generic
>> [ 1443.021885]  hid_generic sr_mod cdrom crct10dif_common usbhid uas usb_storage hid nouveau mxm_wmi wmi i2c_algo_bit ttm drm_kms_helper ata_generic cec ahci libahci ata_piix drm r8169 realtek mdio_devres uhci_hcd ehci_pci libata ehci_hcd firewire_ohci libphy psmouse scsi_mod i2c_i801 i2c_smbus lpc_ich firewire_core crc_itu_t usbcore usb_common video button
>> [ 1443.021966] CPU: 0 PID: 1399 Comm: Renderer Not tainted 5.10.0-18-amd64 #1 Debian 5.10.140-1
>> [ 1443.021970] Hardware name: HP-Pavilion KQ517AA-ABF IQ500.fr/EVE, BIOS 5.10    01/16/2009
>> [ 1443.021976] RIP: 0010:__vunmap+0x267/0x290
>> [ 1443.021982] Code: 01 00 74 9e e8 3a e8 67 00 31 d2 31 f6 48 c7 c7 ff ff ff ff e8 9a cc ff ff eb 87 48 89 fe 48 c7 c7 f8 4f 6f 84 e8 d1 bf 63 00 <0f> 0b 5b 5d 41 5c 41 5d 41 5e c3 cc cc cc cc 4c 89 e6 48 c7 c7 20
>> [ 1443.021986] RSP: 0018:ffffadb141cbf918 EFLAGS: 00010286
>> [ 1443.021991] RAX: 0000000000000000 RBX: ffff963ab1593480 RCX: ffff963babc1ca08
>> [ 1443.021995] RDX: 00000000ffffffd8 RSI: 0000000000000027 RDI: ffff963babc1ca00
>> [ 1443.021998] RBP: 00000000000004e0 R08: 0000000000000000 R09: ffffadb141cbf738
>> [ 1443.022001] R10: ffffadb141cbf730 R11: ffffffff84ccb448 R12: ffff963b82b0c4e0
>> [ 1443.022004] R13: 0000000000000080 R14: ffffadb141cbf9b0 R15: 0000000000000000
>> [ 1443.022009] FS:  00007f00bda7b700(0000) GS:ffff963babc00000(0000) knlGS:0000000000000000
>> [ 1443.022013] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>> [ 1443.022017] CR2: 00007f00a7513008 CR3: 0000000105604000 CR4: 00000000000006f0
>> [ 1443.022020] Call Trace:
>> [ 1443.022151]  nvkm_umem_unmap+0x4d/0x70 [nouveau]
>> [ 1443.022225]  nvkm_ioctl+0xde/0x180 [nouveau]
>> [ 1443.022297]  nvif_object_unmap_handle+0x6b/0x90 [nouveau]
>> [ 1443.022409]  nouveau_ttm_io_mem_free+0x47/0x70 [nouveau]
>> [ 1443.022426]  ttm_mem_io_free+0x2c/0x50 [ttm]
>> [ 1443.022438]  ttm_bo_handle_move_mem+0x60/0x470 [ttm]
>> [ 1443.022450]  ttm_bo_evict+0x124/0x170 [ttm]
>> [ 1443.022465]  ttm_mem_evict_first+0x113/0x3f0 [ttm]
>> [ 1443.022477]  ttm_bo_mem_space+0x259/0x280 [ttm]
>> [ 1443.022489]  ttm_bo_validate+0x129/0x170 [ttm]
>> [ 1443.022501]  ttm_bo_init_reserved+0x2ac/0x330 [ttm]
>> [ 1443.022513]  ttm_bo_init+0x6d/0xf0 [ttm]
>> [ 1443.022624]  ? nouveau_bo_move+0x5a0/0x5a0 [nouveau]
>> [ 1443.022736]  nouveau_bo_init+0xaf/0xd0 [nouveau]
>> [ 1443.022847]  ? nouveau_bo_move+0x5a0/0x5a0 [nouveau]
>> [ 1443.022958]  nouveau_gem_new+0x75/0xe0 [nouveau]
>> [ 1443.023069]  ? nouveau_gem_new+0xe0/0xe0 [nouveau]
>> [ 1443.023179]  nouveau_gem_ioctl_new+0x53/0x100 [nouveau]
>> [ 1443.023290]  ? nouveau_gem_new+0xe0/0xe0 [nouveau]
>> [ 1443.023341]  drm_ioctl_kernel+0xae/0x100 [drm]
>> [ 1443.023378]  drm_ioctl+0x224/0x3c0 [drm]
>> [ 1443.023489]  ? nouveau_gem_new+0xe0/0xe0 [nouveau]
>> [ 1443.023650]  nouveau_drm_ioctl+0x55/0xb0 [nouveau]
>> [ 1443.023662]  __x64_sys_ioctl+0x8b/0xc0
>> [ 1443.023671]  do_syscall_64+0x33/0x80
>> [ 1443.023679]  entry_SYSCALL_64_after_hwframe+0x61/0xc6
>> [ 1443.023685] RIP: 0033:0x7f00db5966b7
>> [ 1443.023692] Code: 00 00 00 48 8b 05 d9 c7 0d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a9 c7 0d 00 f7 d8 64 89 01 48
>> [ 1443.023698] RSP: 002b:00007f00bda78c98 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
>> [ 1443.023709] RAX: ffffffffffffffda RBX: 00007f00bda78cf0 RCX: 00007f00db5966b7
>> [ 1443.023715] RDX: 00007f00bda78cf0 RSI: 00000000c0306480 RDI: 0000000000000022
>> [ 1443.023722] RBP: 00000000c0306480 R08: 0000000000000000 R09: 00007f00db200c00
>> [ 1443.023727] R10: 000067dc3745a301 R11: 0000000000000246 R12: 00007f00bda78cf0
>> [ 1443.023733] R13: 0000000000000022 R14: 00007f00caacb190 R15: 0000000000001000
>> [ 1443.023740] ---[ end trace 164135f517e7fc75 ]---
>> [ 1458.259682] [TTM] Buffer eviction failed
>> [ 1739.593692] [TTM] Buffer eviction failed
>> [ 1845.317716] [TTM] Buffer eviction failed
>> [ 1895.751970] perf: interrupt took too long (6236 > 6232), lowering kernel.perf_event_max_sample_rate to 32000
>>
> 
> 


-- 
~~~~~~~  _____/\_____  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Phoenix /   _ \/ _   \    Olivier Allard-Jacquin
        /   / \  / \   \   Web:  http://olivieraj.free.fr/
       /___/  /  \  \___\  Mail: olivieraj@???
~~~~ /////  ///\\\  \\\\\ ~~~~~~~~~~~~~~~~~~~~~~~ Linux Powered !!