NetBSD-Bugs archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: port-xen/47057: Xen NetBSD DomU file system trash under Linux Dom0
On Sat, Oct 20, 2012 at 01:31:37PM +0200, Roger Pau Monné wrote:
> More info on this subject, I was able to get to ddb after the system
> freezed (using +++++), here is the output:
>
> fatal breakpoint trap in supervisor mode
> trap type 1 code 0 rip ffffffff80130bf5 cs e030 rflags 202 cr2
> 7f7ff780c390 cpl 8 rsp ffffa0002e4848a8
> Stopped in pid 0.26 (system) at netbsd:breakpoint+0x5: leave
> breakpoint() at netbsd:breakpoint+0x5
> xencons_tty_input() at netbsd:xencons_tty_input+0xc9
> xencons_handler() at netbsd:xencons_handler+0x79
> intr_biglock_wrapper() at netbsd:intr_biglock_wrapper+0x1d
> evtchn_do_event() at netbsd:evtchn_do_event+0x15a
> call_evtchn_do_event() at netbsd:call_evtchn_do_event+0xd
> hypervisor_callback() at netbsd:hypervisor_callback+0x9e
> xenbus_thread() at netbsd:xenbus_thread+0xf5
> ds ca00
> es 4a58
> fs b9e8
> gs 6640
> rdi 1
> rsi ffffffff80a7b303
> rbp ffffa0002e4848a8
> rbx ffffffff80a7b303
> rdx 2b
> rcx 2b
> rax 7f
> r8 ffffffff80a8f800
> r9 0
> r10 ffffffff80a8fa00
> r11 246
> r12 ffffa00002365090
> r13 ffffffff80a7b303
> r14 ffffa00002367340
> r15 1
> rip ffffffff80130bf5 breakpoint+0x5
> cs e030
> rflags 202
> rsp ffffa0002e4848a8
> ss e02b
>
> And the ps:
>
> db{0}> ps
> PID LID S CPU FLAGS STRUCT LWP * NAME WAIT
> 26620 1 2 0 0 ffffa00008045b00 cc1
> 28013 1 3 0 80 ffffa00002fe61a0 x86_64--netbsd-g wait
> 23074 1 3 0 80 ffffa000080456e0 nbmkdep wait
> 16970 1 3 0 80 ffffa00002628180 sh wait
> 14476 1 2 0 0 ffffa00002b6f600 nbmkdep
> 24751 1 2 0 0 ffffa0000404a4c0 as
> 23651 1 3 0 80 ffffa000034d95a0 sh wait
> 16446 1 2 0 40000 ffffa00002fe65c0 cc1
> 3398 1 3 0 80 ffffa00002fd95a0 x86_64--netbsd-g wait
> 22279 1 3 0 80 ffffa0000387a980 sh wait
> 16566 1 3 0 40080 ffffa000076672c0 nbmake select
> 22489 1 3 0 40080 ffffa0000236e520 sh wait
> 16321 1 3 0 80 ffffa00002aa7660 nbmake select
> 21974 1 3 0 80 ffffa00002573960 sh wait
> 5656 1 3 0 80 ffffa0000236b8e0 x86_64--netbsd-g wait
> 6264 1 3 0 80 ffffa000075881e0 sh wait
> 870 1 3 0 80 ffffa0000404a0a0 nbmake select
> 28228 1 3 0 80 ffffa00002aa7a80 sh wait
> 19537 1 3 0 80 ffffa000076676e0 nbmake select
> 19740 1 3 0 80 ffffa00007588600 sh wait
> 8737 1 3 0 80 ffffa00002f8e220 nbmake select
> 29418 1 3 0 80 ffffa00002fa5aa0 sh wait
> 4427 1 3 0 80 ffffa0000263a5c0 nbmake select
> 28916 1 3 0 80 ffffa000080452c0 sh wait
> 13113 1 3 0 80 ffffa00002fe69e0 nbmake select
> 19412 1 3 0 80 ffffa000075691c0 sh wait
> 583 1 3 0 80 ffffa0000387a140 nbmake select
> 5923 1 3 0 80 ffffa000085da860 sh wait
> 21434 1 3 0 80 ffffa0000404a8e0 nbmake select
> 22103 1 3 0 80 ffffa00002f8ea60 sh wait
> 22077 1 3 0 80 ffffa00002b6f1e0 nbmake select
> 6976 1 3 0 80 ffffa000085da020 sh wait
> 18463 1 3 0 80 ffffa000075695e0 nbmake select
> 19784 1 3 0 80 ffffa00002f8e640 sh wait
> 8975 1 3 0 80 ffffa00007eb0420 nbmake select
> 6597 1 3 0 80 ffffa000026289c0 sh wait
> 18499 1 3 0 80 ffffa00002aa7240 nbmake select
> 649 1 3 0 80 ffffa0000263a1a0 sh wait
> 23152 1 3 0 80 ffffa0000387a560 nbmake select
> 11133 1 3 0 80 ffffa00007588a20 sh wait
> 11482 1 2 0 0 ffffa00007569a00 getty
> 15288 1 3 0 80 ffffa00007667b00 sh wait
> 6588 1 3 0 80 ffffa000034d99c0 screen-4.0.3 select
> 541 1 3 0 80 ffffa000026285a0 getty nanoslp
> 479 1 3 0 80 ffffa00002573540 getty nanoslp
> 539 1 3 0 80 ffffa0000236e100 getty nanoslp
> 532 1 3 0 80 ffffa000025fd580 cron nanoslp
> 535 1 3 0 80 ffffa0000263a9e0 inetd kqueue
> 333 1 3 0 80 ffffa000025fd9a0 sshd select
> 463 1 3 0 80 ffffa00002590980 powerd kqueue
> 307 1 2 0 0 ffffa000025fd160 syslogd
> 249 1 3 0 80 ffffa00002590560 dhcpcd select
> 1 1 3 0 80 ffffa0000236c0c0 init wait
> 0 36 3 0 200 ffffa00002590140 physiod physiod
> 0 35 3 0 200 ffffa0000236b4c0 aiodoned aiodoned
> 0 34 3 0 200 ffffa0000236c900 ioflush syncer
> 0 33 3 0 200 ffffa0000236b0a0 pgdaemon pgdaemon
> 0 30 3 0 200 ffffa0000235e080 cryptoret crypto_w
> 0 29 3 0 200 ffffa0000236c4e0 xen_balloon xen_balloon
> 0 28 3 0 200 ffffa0000236d920 unpgc unpgc
> 0 27 3 0 200 ffffa0000236d500 vmem_rehash vmem_rehash
> 0 > 26 7 0 200 ffffa0000236e940 xenbus
> 0 25 3 0 200 ffffa0000236d0e0 xenwatch evtsq
> 0 15 3 0 200 ffffa0000235e4a0 pmfsuspend pmfsuspend
> 0 14 3 0 200 ffffa0000235e8c0 pmfevent pmfevent
> 0 13 3 0 200 ffffa00001ee4060 sopendfree sopendfr
> 0 12 3 0 200 ffffa00001ee4480 nfssilly nfssilly
> 0 11 3 0 200 ffffa00001ee48a0 cachegc cachegc
> 0 10 3 0 200 ffffa00001ee3040 vrele vrele
> 0 9 3 0 200 ffffa00001ee3460 vdrain vdrain
> 0 8 3 0 200 ffffa00001ee3880 modunload mod_unld
> 0 7 3 0 200 ffffa00001ed9020 xcall/0 xcall
> 0 6 1 0 200 ffffa00001ed9440 softser/0
> 0 5 1 0 200 ffffa00001ed9860 softclk/0
> 0 4 1 0 200 ffffa00001ed6000 softbio/0
> 0 3 1 0 200 ffffa00001ed6420 softnet/0
> 0 2 1 0 201 ffffa00001ed6840 idle/0
> 0 1 3 0 200 ffffffff805b4c80 swapper uvm
>
> I will try to create a patch that shows the value of the ring indexes,
> since I'm pretty sure they are screwed up, and the system was blocked in
> xenbus_thread because of that before the callback came in.
What would be interesting here is a
tr/a ffffa0000236e940
(the lwp pointer of the xenbus thread). And alst what xenbus_thread+0xf5
points to in sources.
You can also try to type 'continue' and enter ddb again to see
if things changes (especially where in xenbus_thread it is
interrupted).
--
Manuel Bouyer <bouyer%antioche.eu.org@localhost>
NetBSD: 26 ans d'experience feront toujours la difference
--
Home |
Main Index |
Thread Index |
Old Index