NetBSD-Users archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Strange behaviour on PCEngines APU2



Hi!

I recently got a PCEngines APU2 (not sure of the exact model) to replace my failing Soekris gateway and some strange behaviour appeared after I took it in production.

After the system has been running for a few hours, it seems to stop being able to send packets on the internal wired network interface (possibly also the external, I can't tell) on a per-process basis, and seems to mostly affect IPv4. ICMP and UDP seems more prone to failure than TCP retransmission?).

For instance, if I ping a host on my network from the gateway, only a few icmp requests go out (checked with tcpdump), sometimes one, sometimes ten but then it just sits there. The process seems to be stuck in select, if top is to believed.

Attaching a debugger yields;

(gdb) bt
#0  0x000070e3f803e28a in poll () from /lib/libc.so.12
#1  0x000000002f003a6f in main ()

Once I quit the debugger, sometimes a few packets get sent (and received) again.

Pressing ctrl-c stops the ping process properly, and it says it sent and received 8/8 packets or whatever.

Disabling pf did nothing.

Packet forwarding seems to work just fine.

I also have a small daemon that I wrote that listens to pflog devices that decodes the log and sends the messages to syslog. These also seem to stop in the same maner as ping, but in read() in pcap_loop().

Once the system is in this state, it can't reboot itself either, presumably waiting something somewhere.

The apu2 is flashed with the latest firmware available, and that made no difference.

Since this is a new system, I don't know if it's faulty or if netbsd is doing the strange stuff.

Advice? I will probably try to roll back my sources to this summer sometime and see if an older kernel works, the kernel that was optimized for my NET6501 appeared to not have the same problem, but I am not sure.

current kernel config:

include "arch/amd64/conf/GENERIC"

# no need for this (no X11)
no options INSECURE
# not a mac
no options DKWEDGE_METHOD_APPLE

# is a gateway
options GATEWAY

# I prefer pf
no options IPFILTER_LOG
no options IPFILTER_LOOKUP
no options IPFILTER_COMPAT

dmesg:

Copyright (c) 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005,
    2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017,
    2018, 2019 The NetBSD Foundation, Inc.  All rights reserved.
Copyright (c) 1982, 1986, 1989, 1991, 1993
    The Regents of the University of California.  All rights reserved.

NetBSD 8.1_STABLE (APU2) #1: Thu Nov 14 06:56:15 EET 2019
        duck%power.shangtai.net@localhost:/usr/build/src/sys/arch/amd64/compile/obj/APU2
total memory = 1919 MB
avail memory = 1840 MB
rnd: seeded with 256 bits
timecounter: Timecounters tick every 10.000 msec
Kernelized RAIDframe activated
running cgd selftest aes-xts-256 aes-xts-512 done
[455] uftdi* disabled
[185] ehci* disabled
[186] ehci* disabled
timecounter: Timecounter "i8254" frequency 1193182 Hz quality 100
PC Engines apu2 (1.0)
mainbus0 (root)
ACPI: RSDP 0x00000000000F3AF0 000024 (v02 COREv4)
ACPI: XSDT 0x000000007EEA70E0 000074 (v01 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: FACP 0x000000007EEA8DF0 0000F4 (v04 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: DSDT 0x000000007EEA7280 001B6E (v02 COREv4 COREBOOT 00010001 INTL 20180531)
ACPI: FACS 0x000000007EEA7240 000040
ACPI: SSDT 0x000000007EEA8EF0 0001EA (v02 COREv4 COREBOOT 0000002A CORE 20180531)
ACPI: MCFG 0x000000007EEA90E0 00003C (v01 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: TPM2 0x000000007EEA9120 00004C (v04 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: APIC 0x000000007EEA9170 00007E (v02 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: HEST 0x000000007EEA91F0 0001D0 (v01 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: IVRS 0x000000007EEA93C0 000108 (v01 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: SSDT 0x000000007EEA94D0 0048A6 (v02 AMD    AGESA    00000002 MSFT 04000000)
ACPI: SSDT 0x000000007EEADD80 0007C8 (v01 AMD    AGESA    00000001 AMD  00000001)
ACPI: HPET 0x000000007EEAE550 000038 (v01 COREv4 COREBOOT 00000000 CORE 20180531)
ACPI: Executed 2 blocks of module-level executable AML code
ACPI: 4 ACPI AML tables successfully acquired and loaded
ioapic0 at mainbus0 apid 4: pa 0xfec00000, version 0x21, 24 pins
ioapic1 at mainbus0 apid 5: pa 0xfec20000, version 0x21, 32 pins
cpu0 at mainbus0 apid 0
cpu0: AMD GX-412TC SOC                               , id 0x730f01
cpu0: package 0, core 0, smt 0
cpu1 at mainbus0 apid 1
cpu1: AMD GX-412TC SOC                               , id 0x730f01
cpu1: package 0, core 1, smt 0
cpu2 at mainbus0 apid 2
cpu2: AMD GX-412TC SOC                               , id 0x730f01
cpu2: package 0, core 2, smt 0
cpu3 at mainbus0 apid 3
cpu3: AMD GX-412TC SOC                               , id 0x730f01
cpu3: package 0, core 3, smt 0
acpi0 at mainbus0: Intel ACPICA 20170303
acpi0: X/RSDT: OemId <COREv4,COREBOOT,00000000>, AslId <CORE,20180531>
acpi0: MCFG: segment 0, bus 0-63, address 0x00000000f8000000
acpi0: SCI interrupting at int 9
timecounter: Timecounter "ACPI-Safe" frequency 3579545 Hz quality 900
hpet0 at acpi0: high precision event timer (mem 0xfed00000-0xfed00400)
timecounter: Timecounter "hpet0" frequency 14318180 Hz quality 2000
acpibut0 at acpi0 (PWRB, PNP0C0C-170): ACPI Power Button
LDRC (PNP0C02) at acpi0 not configured
attimer1 at acpi0 (TMR, PNP0100): io 0x40-0x43 irq 0
pcppi1 at acpi0 (SPKR, PNP0800): io 0x61
spkr0 at pcppi1: PC Speaker
midi0 at pcppi1: PC speaker
sysbeep0 at pcppi1
COM1 (PNP0501) at acpi0 not configured
COM2 (PNP0501) at acpi0 not configured
GPIO (AMD0030) at acpi0 not configured
BTNS (PRP0001) at acpi0 not configured
BTN1 (PRP0001) at acpi0 not configured
LEDS (PRP0001) at acpi0 not configured
LED1 (PRP0001) at acpi0 not configured
LED2 (PRP0001) at acpi0 not configured
LED3 (PRP0001) at acpi0 not configured
AWR0 (PNP0C02) at acpi0 not configured
ABR0 (PNP0C02) at acpi0 not configured
ABR1 (PNP0C02) at acpi0 not configured
ABR2 (PNP0C02) at acpi0 not configured
ABR3 (PNP0C02) at acpi0 not configured
ABR4 (PNP0C02) at acpi0 not configured
CTBL (BOOT0000) at acpi0 not configured
ACPI: Enabled 4 GPEs in block 00 to 1F
attimer1: attached to pcppi1
pci0 at mainbus0 bus 0: configuration mode 1
pci0: i/o space, memory space enabled, rd/line, rd/mult, wr/inv ok
pchb0 at pci0 dev 0 function 0: vendor 1022 product 1566 (rev. 0x00)
vendor 1022 product 1567 (IOMMU system) at pci0 dev 0 function 2 not configured
pchb1 at pci0 dev 2 function 0: vendor 1022 product 156b (rev. 0x00)
ppb0 at pci0 dev 2 function 2: vendor 1022 product 1439 (rev. 0x00)
ppb0: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 5.0GT/s
ppb0: link is x1 @ 2.5GT/s
pci1 at ppb0 bus 1
pci1: i/o space, memory space enabled, rd/line, wr/inv ok
wm0 at pci1 dev 0 function 0: I211 Ethernet (COPPER) (rev. 0x03)
wm0: for TX and RX interrupting at msix0 vec 0 affinity to 1
wm0: for TX and RX interrupting at msix0 vec 1 affinity to 2
wm0: for LINK interrupting at msix0 vec 2
wm0: PCI-Express bus
wm0: 64 words iNVM, version 0.6
wm0: Ethernet address 00:0d:b9:53:23:44
wm0: Copper
wm0: 0xc214420<INVM,IOH_VALID,PCIE,NEWQUEUE,WOL,PLLWA,CLSEMWA>
makphy0 at wm0 phy 1: I210 10/100/1000 media interface, rev. 0
makphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ppb1 at pci0 dev 2 function 3: vendor 1022 product 1439 (rev. 0x00)
ppb1: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 5.0GT/s
ppb1: link is x1 @ 2.5GT/s
pci2 at ppb1 bus 2
pci2: i/o space, memory space enabled, rd/line, wr/inv ok
wm1 at pci2 dev 0 function 0: I211 Ethernet (COPPER) (rev. 0x03)
wm1: for TX and RX interrupting at msix1 vec 0 affinity to 1
wm1: for TX and RX interrupting at msix1 vec 1 affinity to 2
wm1: for LINK interrupting at msix1 vec 2
wm1: PCI-Express bus
wm1: 64 words iNVM, version 0.6
wm1: Ethernet address 00:0d:b9:53:23:45
wm1: Copper
wm1: 0xc214420<INVM,IOH_VALID,PCIE,NEWQUEUE,WOL,PLLWA,CLSEMWA>
makphy1 at wm1 phy 1: I210 10/100/1000 media interface, rev. 0
makphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto
ppb2 at pci0 dev 2 function 4: vendor 1022 product 1439 (rev. 0x00)
ppb2: PCI Express capability version 2 <Root Port of PCI-E Root Complex> x1 @ 5.0GT/s
ppb2: link is x1 @ 2.5GT/s
pci3 at ppb2 bus 3
pci3: i/o space, memory space enabled, rd/line, wr/inv ok
wm2 at pci3 dev 0 function 0: I211 Ethernet (COPPER) (rev. 0x03)
wm2: for TX and RX interrupting at msix2 vec 0 affinity to 1
wm2: for TX and RX interrupting at msix2 vec 1 affinity to 2
wm2: for LINK interrupting at msix2 vec 2
wm2: PCI-Express bus
wm2: 64 words iNVM, version 0.6
wm2: Ethernet address 00:0d:b9:53:23:46
wm2: Copper
wm2: 0xc214420<INVM,IOH_VALID,PCIE,NEWQUEUE,WOL,PLLWA,CLSEMWA>
makphy2 at wm2 phy 1: I210 10/100/1000 media interface, rev. 0
makphy2: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseT, 1000baseT-FDX, auto vendor 1022 product 1537 (miscellaneous crypto) at pci0 dev 8 function 0 not configured
xhci0 at pci0 dev 16 function 0: vendor 1022 product 7814 (rev. 0x11)
xhci0: interrupting at msi3 vec 0
xhci0: xHCI version 1.0
usb0 at xhci0: USB revision 3.0
usb1 at xhci0: USB revision 2.0
ahcisata0 at pci0 dev 17 function 0: vendor 1022 product 7800 (rev. 0x39)
ahcisata0: interrupting at ioapic0 pin 19
ahcisata0: 64-bit DMA
ahcisata0: AHCI revision 1.30, 2 ports, 32 slots, CAP 0xf733ff81<CCCS,PSC,SSC,PMD,FBSS,SPM,ISS=0x3=Gen3,SCLO,SAL,SALP,SMPS,SSNTF,SNCQ,S64A>
atabus0 at ahcisata0 channel 0
atabus1 at ahcisata0 channel 1
vendor 1022 product 7808 (USB serial bus, EHCI, revision 0x39) at pci0 dev 19 function 0 not configured
piixpm0 at pci0 dev 20 function 0: vendor 1022 product 780b (rev. 0x42)
piixpm0: polling
iic0 at piixpm0 port 0: I2C bus
iic1 at piixpm0 port 1: I2C bus
iic2 at piixpm0 port 2: I2C bus
iic3 at piixpm0 port 3: I2C bus
pcib0 at pci0 dev 20 function 3: vendor 1022 product 780e (rev. 0x11)
sdhc0 at pci0 dev 20 function 7: vendor 1022 product 7813 (rev. 0x01)
sdhc0: interrupting at ioapic0 pin 16
sdhc0: SDHC 2.0, rev 16, SDMA, 50000 kHz, HS 3.3V, 2048 byte blocks
sdmmc0 at sdhc0 slot 0
pchb2 at pci0 dev 24 function 0: vendor 1022 product 1580 (rev. 0x00)
pchb3 at pci0 dev 24 function 1: vendor 1022 product 1581 (rev. 0x00)
pchb4 at pci0 dev 24 function 2: vendor 1022 product 1582 (rev. 0x00)
amdnb_misc0 at pci0 dev 24 function 3: AMD NB Misc Configuration
amdtemp0 at amdnb_misc0: AMD CPU Temperature Sensors (Family16h)
pchb5 at pci0 dev 24 function 4: vendor 1022 product 1584 (rev. 0x00)
pchb6 at pci0 dev 24 function 5: vendor 1022 product 1585 (rev. 0x00)
isa0 at pcib0
lpt0 at isa0 port 0x378-0x37b irq 7
com0 at isa0 port 0x3f8-0x3ff irq 4: ns16550a, working fifo
com0: console
com1 at isa0 port 0x2f8-0x2ff irq 3: ns16550a, working fifo
acpicpu0 at cpu0: ACPI CPU
ACPI Error: \_PR.P000: Evaluation of object type [Device] is not supported (20170303/nseval-215)
acpicpu0: failed to obtain CPU object
acpicpu0: C1: HLT, lat   0 us, pow     0 mW
acpicpu0: C2: I/O, lat 400 us, pow     0 mW
acpicpu0: P0: FFH, lat   4 us, pow   980 mW, 1000 MHz
acpicpu0: P1: FFH, lat   4 us, pow   807 mW,  800 MHz
acpicpu0: P2: FFH, lat   4 us, pow   609 mW,  600 MHz
acpicpu1 at cpu1: ACPI CPU
ACPI Error: \_PR.P001: Evaluation of object type [Device] is not supported (20170303/nseval-215)
acpicpu1: failed to obtain CPU object
acpicpu2 at cpu2: ACPI CPU
ACPI Error: \_PR.P002: Evaluation of object type [Device] is not supported (20170303/nseval-215)
acpicpu2: failed to obtain CPU object
acpicpu3 at cpu3: ACPI CPU
ACPI Error: \_PR.P003: Evaluation of object type [Device] is not supported (20170303/nseval-215)
acpicpu3: failed to obtain CPU object
timecounter: Timecounter "clockinterrupt" frequency 100 Hz quality 0
timecounter: Timecounter "TSC" frequency 998194130 Hz quality 3000
uhub0 at usb0: vendor 1022 (0x1022) xHCI Root Hub (0000), class 9/0, rev 1.00/1.00, addr 0
uhub0: 2 ports with 2 removable, self powered
uhub1 at usb1: vendor 1022 (0x1022) xHCI Root Hub (0000), class 9/0, rev 2.00/1.00, addr 0
uhub1: 2 ports with 2 removable, self powered
IPsec: Initialized Security Association Processing.
ahcisata0 port 0: device present, speed: 6.0Gb/s
wd0 at atabus0 drive 0
wd0: <SanDisk SSD U100 16GB>
wd0: drive supports 1-sector PIO transfers, LBA48 addressing
wd0: 15272 MB, 31029 cyl, 16 head, 63 sec, 512 bytes/sect x 31277232 sectors
wd0: drive supports PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133)
wd0(ahcisata0:0:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 6 (Ultra/133) (using DMA)
ugen0 at uhub1 port 1
ugen0: Telldus (0x1781) TellStick Duo (0xc31), rev 2.00/6.00, addr 1
pad0: outputs: 44100Hz, 16-bit, stereo
audio0 at pad0: half duplex, playback, capture, mmap
pad0: Virtual format configured - Format SLINEAR, precision 16, channels 2, frequency 44100
pad0: Latency: 139 milliseconds
spkr1 at audio0: PC Speaker (synthesized)
boot device: wd0
root on wd0a dumps on wd0b
/: replaying log to memory
root file system type: ffs
kern.module.path=/stand/amd64/8.1/modules
warning: no /dev/console
/: replaying log to disk


Home | Main Index | Thread Index | Old Index