public inbox for developer@lists.illumos.org (since 2011-08)
* Kernel crash & reboot
From: Jesus Cea @ 2024-03-15 17:56 UTC (permalink / raw)
  To: illumos-dev

(email originally sent to the SmartOS mailing list)

This seems related to https://www.illumos.org/issues/14679 .


"""
I had a crash & reboot condition here a couple of months ago and I
didn't realize that a system dump was available. The "vmdump" file is
2.2 GB, expanded to ~6 GB via savecore. I could provide the dump to
somebody trustworthy at MNX if requested.

The machine crashed but rebooted immediately.

I ran some "mdb" magic on the dump:

[root@xXx /var/crash/volatile]# mdb vmcore.3
Loading modules: [ unix genunix specfs dtrace mac cpu.generic uppc apix 
scsi_vhci ufs ip hook neti sockfs arp usba smbios mm fctl stmf_sbd stmf 
zfs lofs sata sd idm crypto fcp random cpc logindmux ptm kvm sppp nsmb 
smbsrv klmmod nfs ]
 > ::stack
vpanic()
cmi_mca_panic+0x1c()
cmi_mca_trap+0x145(fffffe2cea153f10)
mcetrap+0x155()
bcopy_ck_size+0xea()
abd_copy_from_buf_off_cb+0x2d(fffffe21e28b5000, 1000, fffffe003d111700)
abd_iterate_func+0x84(fffffe2e52e03e40, 0, 20000, fffffffff3d360a0, 
fffffe003d111700)
abd_copy_from_buf_off+0x42(fffffe2e52e03e40, fffffe2e88ac1000, 0, 20000)
abd_return_buf_copy+0x42(fffffe2e52e03e40, fffffe2e88ac1000, 20000)
vdev_disk_io_intr+0x8d(fffffe2d3d220d00)
biodone+0x25(fffffe2d3d220d00)
sd_buf_iodone+0x3a(3, fffffe2cf1630980, fffffe2d3d220d00)
sd_mapblockaddr_iodone+0x48(4, fffffe2cf1630980, fffffe2d3d220d00)
sd_return_command+0x122(fffffe2cf1630980, fffffe2d3d220d00)
sdintr+0x18c(fffffe2d1b419180)
scsi_hba_pkt_comp+0x7d(fffffe2d1b419180)
sata_txlt_rw_completion+0x140(fffffe2e1bb6ad60)
ahci_flush_doneq+0x6b(fffffe2cf0e40000)
ahci_intr_ncq_events+0x234(fffffe2cf10d0200, fffffe2cf0e40000, 
fffffe003d111a85)
ahci_intr_set_device_bits+0x72(fffffe2cf10d0200, fffffe2cf0e40000, 0)
ahci_port_intr+0x266(fffffe2cf10d0200, fffffe2cf0e40000, 0)
ahci_intr+0xa3(fffffe2cf10d0200, 0)
apix_dispatch_by_vector+0x8c(21)
apix_dispatch_lowlevel+0x29(21, 0)
switch_sp_and_call+0x15()
apix_do_interrupt+0xf3(fffffe003d0c9ac0, 0)
_interrupt+0xc3()
i86_mwait+0x12()
cpu_idle_mwait+0x14b()
idle+0xa8()
thread_start+0xb()



 > ::msgbuf
MESSAGE
SmartOS Version joyent_20230921T034751Z 64-bit
Copyright 2022-2023 MNX Cloud, Inc.
x86_feature: lgpg
x86_feature: tsc
x86_feature: msr
x86_feature: mtrr
x86_feature: pge
x86_feature: de
x86_feature: cmov
x86_feature: mmx
x86_feature: mca
x86_feature: pae
x86_feature: cv8
x86_feature: pat
x86_feature: sep
x86_feature: sse
x86_feature: sse2
x86_feature: htt
x86_feature: asysc
x86_feature: nx
x86_feature: sse3
x86_feature: cx16
x86_feature: cmp
x86_feature: tscp
x86_feature: mwait
x86_feature: cpuid
x86_feature: ssse3
x86_feature: sse4_1
x86_feature: sse4_2
x86_feature: clfsh
x86_feature: 64
x86_feature: aes
x86_feature: pclmulqdq
x86_feature: xsave
x86_feature: avx
x86_feature: vmx
x86_feature: f16c
x86_feature: rdrand
x86_feature: x2apic
x86_feature: smep
x86_feature: xsaveopt
x86_feature: pcid
x86_feature: ibrs
x86_feature: ibpb
x86_feature: stibp
x86_feature: ssbd
x86_feature: flush_cmd
x86_feature: fsgsbase
x86_feature: md_clear
x86_feature: core_thermal
x86_feature: pkg_thermal
x86_feature: lfence_serializing
mem = 33462308K (0x7fa609000)
TSC calibrated using PIT; freq is 3399 MHz
ACPI: RSDP 0x00000000000F0490 000024 (v02 ALASKA)
ACPI: XSDT 0x00000000DAA40078 00006C (v01 ALASKA A M I    01072009 AMI 
00010013)
ACPI: FACP 0x00000000DAA4B980 00010C (v05 ALASKA A M I    01072009 AMI 
00010013)
ACPI: DSDT 0x00000000DAA40180 00B7FD (v02 ALASKA A M I    00000022 INTL 
20051117)
ACPI: FACS 0x00000000DAB74080 000040
ACPI: APIC 0x00000000DAA4BA90 000092 (v03 ALASKA A M I    01072009 AMI 
00010013)
ACPI: FPDT 0x00000000DAA4BB28 000044 (v01 ALASKA A M I    01072009 AMI 
00010013)
ACPI: MCFG 0x00000000DAA4BB70 00003C (v01 ALASKA A M I    01072009 MSFT 
00000097)
ACPI: HPET 0x00000000DAA4BBB0 000038 (v01 ALASKA A M I    01072009 AMI. 
00000005)
ACPI: SSDT 0x00000000DAA4BBE8 00036D (v01 SataRe SataTabl 00001000 INTL 
20091112)
ACPI: DMAR 0x00000000DAA4D3F8 0000B0 (v01 INTEL  SNB      00000001 INTL 
00000001)
ACPI: SSDT 0x00000000DAA4BFB0 0009AA (v01 PmRef  Cpu0Ist  00003000 INTL 
20051117)
ACPI: SSDT 0x00000000DAA4C960 000A92 (v01 PmRef  CpuPm    00003000 INTL 
20051117)
ACPI: 4 ACPI AML tables successfully acquired and loaded
ACPI: Enabled 4 GPEs in block 00 to 3F
SMBIOS v2.7 loaded (3942 bytes)
Skipping psm: xpv_psm
root nexus = i86pc
NOTICE: iommulib_nexus_register: rootnex-1: Succesfully registered NEXUS 
i86pc nexops=fffffffffbd7fe40
pseudo0 at root
pseudo0 is /pseudo
scsi_vhci0 at root
scsi_vhci0 is /scsi_vhci
NOTICE: reprogram io-range on ppb[0/1/0]: 0x2000 ~ 0x2fff
NOTICE: reprogram mem-range on ppb[0/1/0]: 0xdf300000 ~ 0xdf3fffff
NOTICE: reprogram pmem-range on ppb[0/1/0]: 0x100000 ~ 0x1fffff
NOTICE: reprogram io-range on ppb[0/1c/0]: 0x3000 ~ 0x3fff
NOTICE: reprogram mem-range on ppb[0/1c/0]: 0xdf400000 ~ 0xdf4fffff
NOTICE: reprogram pmem-range on ppb[0/1c/0]: 0x200000 ~ 0x2fffff
Reading Intel IOMMU boot options
npe0 at root: space 0 offset 0
npe0 is /pci@0,0
PCI Express-device: isa@1f, isa0
NOTICE: apic: local nmi: 255 0x5 1
NOTICE: apic: Using ACPI (MADT) for SMP configuration
NOTICE: apic: Using APIC interrupt routing mode
NOTICE: amd_iommu: No AMD IOMMU ACPI IVRS table
pseudo-device: acpippm0
acpippm0 is /pseudo/acpippm@0
pseudo-device: ppm0
ppm0 is /pseudo/ppm@0
ramdisk0 at root
ramdisk0 is /ramdisk
root on /ramdisk:a fstype ufs
ACPI: Dynamic OEM Table Load:
ACPI: SSDT 0xFFFFFE2CE9712B08 00083B (v01 PmRef  Cpu0Cst  00003001 INTL 
20051117)
acpinex0 at root
acpinex0 is /fw
acpinex: cpu@1, cpudrv0
/fw/cpu@1 (cpudrv0) online
pseudo-device: dld0
dld0 is /pseudo/dld@0
PCI Express-device: pci1043,84ca@1a, ehci0
ehci0 is /pci@0,0/pci1043,84ca@1a
PCI Express-device: pci1043,84ca@1d, ehci1
ehci1 is /pci@0,0/pci1043,84ca@1d
cpu0: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu0: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
KPTI enabled (PCID in use, INVPCID not supported)
ACPI: Dynamic OEM Table Load:
ACPI: SSDT 0xFFFFFE2CDFBD4748 000303 (v01 PmRef  ApIst    00003000 INTL 
20051117)
ACPI: Dynamic OEM Table Load:
ACPI: SSDT 0xFFFFFE2CE9A58C88 000119 (v01 PmRef  ApCst    00003000 INTL 
20051117)
cpu1: microcode has been updated from version 0x20 to 0x21
acpinex: cpu@2, cpudrv1
/fw/cpu@2 (cpudrv1) online
cpu1: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu1: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu1 initialization complete - online
cpu2: microcode has been updated from version 0x20 to 0x21
acpinex: cpu@3, cpudrv2
/fw/cpu@3 (cpudrv2) online
cpu2: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu2: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu2 initialization complete - online
cpu3: microcode has been updated from version 0x20 to 0x21
acpinex: cpu@4, cpudrv3
/fw/cpu@4 (cpudrv3) online
cpu3: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu3: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu3 initialization complete - online
acpinex: cpu@5, cpudrv4
/fw/cpu@5 (cpudrv4) online
cpu4: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu4: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu4 initialization complete - online
acpinex: cpu@6, cpudrv5
/fw/cpu@6 (cpudrv5) online
cpu5: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu5: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu5 initialization complete - online
acpinex: cpu@7, cpudrv6
/fw/cpu@7 (cpudrv6) online
cpu6: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu6: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu6 initialization complete - online
acpinex: cpu@8, cpudrv7
/fw/cpu@8 (cpudrv7) online
cpu7: x86 (chipid 0x0 GenuineIntel 306A9 family 6 model 58 step 9 clock 
3400 MHz)
cpu7: Intel(r) Core(tm) i7-3770 CPU @ 3.40GHz
cpu7 initialization complete - online
NOTICE: SMT enabled

PCI Express-device: pci8086,1e18@1c,4, pcieb0
pcieb0 is /pci@0,0/pci8086,1e18@1c,4
PCI Express-device: pci8086,1e1a@1c,5, pcieb1
pcieb1 is /pci@0,0/pci8086,1e1a@1c,5
PCI Express-device: pci8086,1e1c@1c,6, pcieb2
pcieb2 is /pci@0,0/pci8086,1e1c@1c,6
pseudo-device: tzmon0
tzmon0 is /pseudo/tzmon@0
pseudo-device: audio0
audio0 is /pseudo/audio@0
USB 2.0 device (usb8087,24) operating at hi speed (USB 2.x) on USB 2.0 
root hub: hub@1, hubd0 at bus address 2
hubd0 is /pci@0,0/pci1043,84ca@1a/hub@1
/pci@0,0/pci1043,84ca@1a/hub@1 (hubd0) online
USB 2.0 device (usb8087,24) operating at hi speed (USB 2.x) on USB 2.0 
root hub: hub@1, hubd1 at bus address 2
hubd1 is /pci@0,0/pci1043,84ca@1d/hub@1
/pci@0,0/pci1043,84ca@1d/hub@1 (hubd1) online
USB 1.10 device (usb4d9,1400) operating at low speed (USB 1.x) on USB 
2.0 external hub: device@8, usb_mid0 at bus address 3
usb_mid0 is /pci@0,0/pci1043,84ca@1d/hub@1/device@8
/pci@0,0/pci1043,84ca@1d/hub@1/device@8 (usb_mid0) online
USB 1.10 interface (usbif4d9,1400.config1.0) operating at low speed (USB 
1.x) on USB 2.0 external hub: keyboard@0, hid0 at bus address 3
hid0 is /pci@0,0/pci1043,84ca@1d/hub@1/device@8/keyboard@0
/pci@0,0/pci1043,84ca@1d/hub@1/device@8/keyboard@0 (hid0) online
USB 1.10 interface (usbif4d9,1400.config1.1) operating at low speed (USB 
1.x) on USB 2.0 external hub: mouse@1, hid1 at bus address 3
hid1 is /pci@0,0/pci1043,84ca@1d/hub@1/device@8/mouse@1
/pci@0,0/pci1043,84ca@1d/hub@1/device@8/mouse@1 (hid1) online
pseudo-device: lofi0
lofi0 is /pseudo/lofi@0
pseudo-device: lofi1
lofi1 is /pseudo/lofi@1
/pseudo/lofi@1 (lofi1) online
pseudo-device: stmf_sbd0
stmf_sbd0 is /pseudo/stmf_sbd@0
NOTICE: ahci0: hba AHCI version = 1.30
pseudo-device: dtrace0
dtrace0 is /pseudo/dtrace@0
/pci@0,0/pci1043,84ca@1f,2 :
         SATA disk device at port 0
         model TOSHIBA DT01ACA300
         firmware MX6OABB0
         serial number            43JN9S9GS
         supported features:
          48-bit LBA, DMA, Native Command Queueing, SMART, SMART self-test
         SATA Gen3 signaling speed (6.0Gbps)
         Supported queue depth 32
         capacity = 5860533168 sectors
sd0 at ahci0: target 0 lun 0
sd0 is /pci@0,0/pci1043,84ca@1f,2/disk@0,0
/pci@0,0/pci1043,84ca@1f,2/disk@0,0 (sd0) online
/pci@0,0/pci1043,84ca@1f,2 :
         SATA disk device at port 1
         model TOSHIBA DT01ACA300
         firmware MX6OABB0
         serial number            43JN9S0GS
         supported features:
          48-bit LBA, DMA, Native Command Queueing, SMART, SMART self-test
         SATA Gen3 signaling speed (6.0Gbps)
         Supported queue depth 32
         capacity = 5860533168 sectors
sd1 at ahci0: target 1 lun 0
sd1 is /pci@0,0/pci1043,84ca@1f,2/disk@1,0
/pci@0,0/pci1043,84ca@1f,2/disk@1,0 (sd1) online
pseudo-device: ucode0
ucode0 is /pseudo/ucode@0
pseudo-device: devinfo0
devinfo0 is /pseudo/devinfo@0
iscsi0 at root
iscsi0 is /iscsi
pseudo-device: pseudo1
pseudo1 is /pseudo/zconsnex@1
pseudo-device: pseudo2
pseudo2 is /pseudo/zfdnex@2
acpinex: sb@0, acpinex1
acpinex1 is /fw/sb@0
acpinex: usbroothub@EHC1, acpinex2
acpinex2 is /fw/sb@0/usbroothub@EHC1
NOTICE: ahci1: hba AHCI version = 1.0
acpinex: usbroothub@EHC2, acpinex3
acpinex3 is /fw/sb@0/usbroothub@EHC2
ISA-device: pit_beep0
pit_beep0 is /pci@0,0/isa@1f/pit_beep
acpinex: port@1, acpinex5
acpinex5 is /fw/sb@0/usbroothub@EHC2/port@1
acpinex: port@1, acpinex4
acpinex4 is /fw/sb@0/usbroothub@EHC1/port@1
acpinex: port@1, acpinex6
acpinex6 is /fw/sb@0/usbroothub@EHC2/port@1/port@1
acpinex: port@2, acpinex7
acpinex7 is /fw/sb@0/usbroothub@EHC2/port@1/port@2
acpinex: port@1, acpinex12
acpinex: port@3, acpinex8
acpinex12 is /fw/sb@0/usbroothub@EHC1/port@1/port@1
acpinex8 is /fw/sb@0/usbroothub@EHC2/port@1/port@3
acpinex: port@2, acpinex13
acpinex: port@4, acpinex9
acpinex13 is /fw/sb@0/usbroothub@EHC1/port@1/port@2
acpinex9 is /fw/sb@0/usbroothub@EHC2/port@1/port@4
acpinex: port@3, acpinex14
acpinex14 is /fw/sb@0/usbroothub@EHC1/port@1/port@3
acpinex: port@5, acpinex10
acpinex10 is /fw/sb@0/usbroothub@EHC2/port@1/port@5
acpinex: port@4, acpinex15
acpinex15 is /fw/sb@0/usbroothub@EHC1/port@1/port@4
acpinex: port@6, acpinex11
acpinex11 is /fw/sb@0/usbroothub@EHC2/port@1/port@6
acpinex: port@5, acpinex16
acpinex16 is /fw/sb@0/usbroothub@EHC1/port@1/port@5
acpinex: port@6, acpinex17
acpinex17 is /fw/sb@0/usbroothub@EHC1/port@1/port@6
acpinex: port@7, acpinex18
acpinex18 is /fw/sb@0/usbroothub@EHC1/port@1/port@7
acpinex: port@8, acpinex19
acpinex19 is /fw/sb@0/usbroothub@EHC1/port@1/port@8
NOTICE: rge0: Using MSI interrupt type

pseudo-device: llc10
llc10 is /pseudo/llc1@0
NOTICE: rge0 registered
pseudo-device: power0
power0 is /pseudo/power@0
pseudo-device: ramdisk1024
ramdisk1024 is /pseudo/ramdisk@1024
pseudo-device: zfs0
zfs0 is /pseudo/zfs@0
pseudo-device: srn0
srn0 is /pseudo/srn@0
pseudo-device: dcpc0
dcpc0 is /pseudo/dcpc@0
pseudo-device: fasttrap0
fasttrap0 is /pseudo/fasttrap@0
pseudo-device: fbt0
fbt0 is /pseudo/fbt@0
pseudo-device: profile0
profile0 is /pseudo/profile@0
pseudo-device: lockstat0
lockstat0 is /pseudo/lockstat@0
pseudo-device: sdt0
sdt0 is /pseudo/sdt@0
pseudo-device: systrace0
systrace0 is /pseudo/systrace@0
pseudo-device: fcp0
fcp0 is /pseudo/fcp@0
pseudo-device: fcsm0
fcsm0 is /pseudo/fcsm@0
pseudo-device: ipd0
ipd0 is /pseudo/ipd@0
pseudo-device: stmf0
stmf0 is /pseudo/stmf@0
pseudo-device: fssnap0
fssnap0 is /pseudo/fssnap@0
pseudo-device: kvm0
kvm0 is /pseudo/kvm@0
pseudo-device: pool0
pool0 is /pseudo/pool@0
IP Filter: v4.1.9, running.
pseudo-device: bpf0
bpf0 is /pseudo/bpf@0
pseudo-device: pm0
pm0 is /pseudo/pm@0
pseudo-device: nsmb0
nsmb0 is /pseudo/nsmb@0
NOTICE: e1000g0 registered
Universal TUN/TAP device driver ver 1.3.0 09/21/2023 (C) 1999-2000 Maxim 
Krasnyansky
pseudo-device: tap0
tap0 is /pseudo/tap@0
Universal TUN/TAP device driver ver 1.3.0 09/21/2023 (C) 1999-2000 Maxim 
Krasnyansky
pseudo-device: tun0
tun0 is /pseudo/tun@0
pseudo-device: lx_systrace0
lx_systrace0 is /pseudo/lx_systrace@0
pseudo-device: inotify0
inotify0 is /pseudo/inotify@0
pseudo-device: eventfd0
eventfd0 is /pseudo/eventfd@0
pseudo-device: timerfd0
timerfd0 is /pseudo/timerfd@0
pseudo-device: signalfd0
signalfd0 is /pseudo/signalfd@0
pseudo-device: vmm0
vmm0 is /pseudo/vmm@0
pseudo-device: viona0
viona0 is /pseudo/viona@0
dump on /dev/zvol/dsk/zones/dump size 15360 MB
NOTICE: vnic1000 registered
NOTICE: vnic1000 link up, 0 Mbps, unknown duplex
NOTICE: e1000g0 link up, 1000 Mbps, full duplex
NOTICE: vnic1009 registered
NOTICE: vnic1009 link up, 0 Mbps, unknown duplex
Creating /etc/devices/devid_cache
Creating /etc/devices/pci_unitaddr_persistent
/pseudo/zconsnex@1/zcons@0 (zcons0) online
/pseudo/zconsnex@1/zcons@1 (zcons1) online
/pseudo/zconsnex@1/zcons@2 (zcons2) online
/pseudo/zconsnex@1/zcons@3 (zcons3) online
/pseudo/zconsnex@1/zcons@4 (zcons4) online
/pseudo/zconsnex@1/zcons@5 (zcons5) online
/pseudo/zconsnex@1/zcons@6 (zcons6) online
/pseudo/zconsnex@1/zcons@7 (zcons7) online
/pseudo/zconsnex@1/zcons@8 (zcons8) online
/pseudo/zconsnex@1/zcons@9 (zcons9) online
/pseudo/zconsnex@1/zcons@10 (zcons10) online
/pseudo/zconsnex@1/zcons@11 (zcons11) online
/pseudo/zconsnex@1/zcons@12 (zcons12) online
NOTICE: vnic1015 registered
NOTICE: vnic1015 link up, 0 Mbps, unknown duplex
NOTICE: vnic1016 registered
NOTICE: vnic1016 link up, 0 Mbps, unknown duplex
NOTICE: vnic1017 registered
NOTICE: vnic1017 link up, 0 Mbps, unknown duplex
NOTICE: vnic1019 registered
NOTICE: vnic1019 link up, 0 Mbps, unknown duplex
NOTICE: vnic1020 registered
NOTICE: vnic1020 link up, 0 Mbps, unknown duplex
NOTICE: vnic1021 registered
NOTICE: vnic1021 link up, 0 Mbps, unknown duplex
NOTICE: vnic1022 registered
NOTICE: vnic1022 link up, 0 Mbps, unknown duplex
NOTICE: vnic1023 registered
NOTICE: vnic1023 link up, 0 Mbps, unknown duplex
NOTICE: vnic1024 registered
NOTICE: vnic1024 link up, 0 Mbps, unknown duplex
NOTICE: vnic1025 registered
NOTICE: vnic1025 link up, 0 Mbps, unknown duplex
NOTICE: vnic1026 registered
NOTICE: vnic1026 link up, 0 Mbps, unknown duplex
NOTICE: vnic1027 registered
NOTICE: vnic1027 link up, 0 Mbps, unknown duplex
NOTICE: vnic1028 registered
NOTICE: vnic1028 link up, 0 Mbps, unknown duplex
pseudo-device: devinfo0
devinfo0 is /pseudo/devinfo@0
pseudo-device: llc10
llc10 is /pseudo/llc1@0
pseudo-device: ramdisk1024
ramdisk1024 is /pseudo/ramdisk@1024
pseudo-device: ucode0
ucode0 is /pseudo/ucode@0
pseudo-device: dcpc0
dcpc0 is /pseudo/dcpc@0
pseudo-device: fbt0
fbt0 is /pseudo/fbt@0
pseudo-device: profile0
profile0 is /pseudo/profile@0
pseudo-device: lockstat0
lockstat0 is /pseudo/lockstat@0
pseudo-device: sdt0
sdt0 is /pseudo/sdt@0
pseudo-device: systrace0
systrace0 is /pseudo/systrace@0
pseudo-device: fcp0
fcp0 is /pseudo/fcp@0
pseudo-device: fcsm0
fcsm0 is /pseudo/fcsm@0
pseudo-device: ipd0
ipd0 is /pseudo/ipd@0
pseudo-device: stmf0
stmf0 is /pseudo/stmf@0
pseudo-device: fssnap0
fssnap0 is /pseudo/fssnap@0
pseudo-device: kvm0
kvm0 is /pseudo/kvm@0
pseudo-device: bpf0
bpf0 is /pseudo/bpf@0
pseudo-device: pm0
pm0 is /pseudo/pm@0
pseudo-device: nsmb0
nsmb0 is /pseudo/nsmb@0
pseudo-device: lx_systrace0
lx_systrace0 is /pseudo/lx_systrace@0
pseudo-device: inotify0
inotify0 is /pseudo/inotify@0
pseudo-device: eventfd0
eventfd0 is /pseudo/eventfd@0
pseudo-device: timerfd0
timerfd0 is /pseudo/timerfd@0
pseudo-device: signalfd0
signalfd0 is /pseudo/signalfd@0
pseudo-device: viona0
viona0 is /pseudo/viona@0
Creating /etc/devices/devname_cache
NOTICE: vnic1023 unregistered
NOTICE: vnic1067 registered
NOTICE: vnic1067 link up, 0 Mbps, unknown duplex
NOTICE: vnic1067 unregistered
NOTICE: vnic1070 registered
NOTICE: vnic1070 link up, 0 Mbps, unknown duplex
NOTICE: vnic1070 unregistered
NOTICE: vnic1075 registered
NOTICE: vnic1075 link up, 0 Mbps, unknown duplex
NOTICE: vnic1016 unregistered
NOTICE: vnic1077 registered
NOTICE: vnic1077 link up, 0 Mbps, unknown duplex
ipfs: Cannot find /lib64/ld-linux-x86-64.so.2
ipfs: Cannot find /lib64/ld-linux-x86-64.so.2
NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major


panic[cpu1]/thread=fffffe003d111c20:
Unrecoverable Machine-Check Exception


fffffe2cea153ec0 unix:cmi_mca_panic+1c ()
fffffe2cea153f00 unix:cmi_mca_trap+145 ()
fffffe2cea153f10 unix:mcetrap+155 ()
fffffe003d111620 unix:bcopy_altentry+55a ()
fffffe003d111660 zfs:abd_copy_from_buf_off_cb+2d ()
fffffe003d1116f0 zfs:abd_iterate_func+84 ()
fffffe003d111730 zfs:abd_copy_from_buf_off+42 ()
fffffe003d111780 zfs:abd_return_buf_copy+42 ()
fffffe003d1117b0 zfs:vdev_disk_io_intr+8d ()
fffffe003d1117e0 genunix:biodone+25 ()
fffffe003d111820 sd:sd_buf_iodone+3a ()
fffffe003d111880 sd:sd_mapblockaddr_iodone+48 ()
fffffe003d1118e0 sd:sd_return_command+122 ()
fffffe003d111940 sd:sdintr+18c ()
fffffe003d111970 scsi:scsi_hba_pkt_comp+7d ()
fffffe003d1119c0 sata:sata_txlt_rw_completion+140 ()
fffffe003d1119f0 ahci:ahci_flush_doneq+6b ()
fffffe003d111a70 ahci:ahci_intr_ncq_events+234 ()
fffffe003d111ad0 ahci:ahci_intr_set_device_bits+72 ()
fffffe003d111b40 ahci:ahci_port_intr+266 ()
fffffe003d111b80 ahci:ahci_intr+a3 ()
fffffe003d111bd0 apix:apix_dispatch_by_vector+8c ()
fffffe003d111c00 apix:apix_dispatch_lowlevel+29 ()
fffffe003d0c9a50 unix:switch_sp_and_call+15 ()
fffffe003d0c9ab0 apix:apix_do_interrupt+f3 ()
fffffe003d0c9ac0 unix:_interrupt+c3 ()
fffffe003d0c9bb0 unix:i86_mwait+12 ()
fffffe003d0c9be0 unix:cpu_idle_mwait+14b ()
fffffe003d0c9c00 unix:idle+a8 ()
fffffe003d0c9c10 unix:thread_start+b ()

dumping to /dev/zvol/dsk/zones/dump, offset 65536, content: kernel
NOTICE: ahci0: ahci_tran_reset_dport port 0 reset port
NOTICE: ahci0: ahci_tran_reset_dport port 1 reset port




 > ::panicinfo
              cpu                1
           thread fffffe003d111c20
          message Unrecoverable Machine-Check Exception
              rdi fffffffffb93d610
              rsi fffffe2cea153e40
              rdx                1
              rcx fffffe2cea153bec
               r8                1
               r9 fffffe2ce7a8a578
              rax fffffe2cea153e60
              rbx fffffe2cea153f10
              rbp fffffe2cea153eb0
              r10 fffffffffb879020
              r11 fffffe2ce9a72600
              r12 fffffffffb93d610
              r13                5
              r14                0
              r15 fffffe2d29c0b980
           fsbase                0
           gsbase fffffe2ce9779000
               ds               4b
               es               4b
               fs                0
               gs              1c3
           trapno                0
              err                0
              rip fffffffffb885db0
               cs               30
           rflags               46
              rsp fffffe2cea153e38
               ss               38
           gdt_hi                0
           gdt_lo         600001ef
           idt_hi                0
           idt_lo         50000fff
              ldt                0
             task               70
              cr0         8005003b
              cr2          80b81e0
              cr3         34800000
              cr4           1626f8
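
The ::stack output and the panic message in ::msgbuf name the frame
directly under mcetrap differently (bcopy_ck_size+0xea versus
bcopy_altentry+55a). A minimal Python sketch for lining the two listings
up and flagging such frames follows. It is not part of the original
report; the names parse_mdb, parse_panic and mismatches are made up for
illustration, and the regular expressions only assume the two line
formats visible above.

```python
import re

# Line formats as they appear in the dump output above (assumed stable).
MDB_FRAME = re.compile(r"^(?P<func>[A-Za-z_][\w.]*)\+0x(?P<off>[0-9a-f]+)\(")
PANIC_FRAME = re.compile(
    r"^[0-9a-f]{16} (?P<mod>\w+):(?P<func>[A-Za-z_][\w.]*)\+(?P<off>[0-9a-f]+) \(\)"
)


def parse_mdb(lines):
    """Return (function, offset) pairs from mdb ::stack lines.

    Lines without an offset (e.g. "vpanic()") and wrapped argument
    continuation lines do not match and are skipped.
    """
    frames = []
    for line in lines:
        m = MDB_FRAME.match(line.strip())
        if m:
            frames.append((m["func"], int(m["off"], 16)))
    return frames


def parse_panic(lines):
    """Return (module, function, offset) triples from panic stack lines."""
    frames = []
    for line in lines:
        m = PANIC_FRAME.match(line.strip())
        if m:
            frames.append((m["mod"], m["func"], int(m["off"], 16)))
    return frames


def mismatches(mdb_frames, panic_frames):
    """Pair frames by position and keep those whose symbol+offset differ."""
    return [
        (a, b)
        for a, b in zip(mdb_frames, panic_frames)
        if a != (b[1], b[2])
    ]


# Three frames copied from each listing above.
mdb_sample = [
    "mcetrap+0x155()",
    "bcopy_ck_size+0xea()",
    "abd_copy_from_buf_off_cb+0x2d(fffffe21e28b5000, 1000, fffffe003d111700)",
]
panic_sample = [
    "fffffe2cea153f10 unix:mcetrap+155 ()",
    "fffffe003d111620 unix:bcopy_altentry+55a ()",
    "fffffe003d111660 zfs:abd_copy_from_buf_off_cb+2d ()",
]

if __name__ == "__main__":
    for mdb_f, panic_f in mismatches(parse_mdb(mdb_sample), parse_panic(panic_sample)):
        print(mdb_f, "vs", panic_f)
```

On the three sample frames, only the bcopy frame is reported; the
mcetrap and abd_copy_from_buf_off_cb frames agree in both listings. The
sketch only compares text and says nothing about why the two tools
resolve that frame to different symbols.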

-- 
Jesús Cea Avión                         _/_/      _/_/_/        _/_/_/
jcea@jcea.es - https://www.jcea.es/    _/_/    _/_/  _/_/    _/_/  _/_/
Twitter: @jcea                        _/_/    _/_/          _/_/_/_/_/
jabber / xmpp:jcea@jabber.org  _/_/  _/_/    _/_/          _/_/  _/_/
"Things are not so easy"      _/_/  _/_/    _/_/  _/_/    _/_/  _/_/
"My name is Dump, Core Dump"   _/_/_/        _/_/_/      _/_/  _/_/
"El amor es poner tu felicidad en la felicidad de otro" - Leibniz


* Re: [developer] Kernel crash & reboot
From: Dan McDonald @ 2024-03-15 18:10 UTC (permalink / raw)
  To: illumos-developer

Thanks for pushing it over here. More ZFS knowledge is on this list.

I'll note that this dump is from SmartOS 20230921T034751Z, so there's a chance something fixed it between then and now. The only abd change I see is a change to the marking of buffers (illumos#16020), but that doesn't seem to fit your problem. There are some other changes between then and now as well, but again, nothing seems immediately relevant.

If that stack is familiar to anyone, please speak up here. I see you (Jesus) pasted this in the bug report for 14679, which is good. I take it this was a spontaneous failure and you've not seen anything since then?

Thanks!
Dan

