NetBSD-Bugs archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: Re: bin/56728: iscsi redundancy doesn't work
The following reply was made to PR bin/56728; it has been noted by GNATS.
From: 6bone%6bone.informatik.uni-leipzig.de@localhost
To: gnats-bugs%netbsd.org@localhost
Cc: gnats-admin%netbsd.org@localhost, netbsd-bugs%netbsd.org@localhost
Subject: Re: Re: bin/56728: iscsi redundancy doesn't work
Date: Wed, 23 Feb 2022 09:34:03 +0100 (CET)
 Hello,
 
 iscsictl add_send_target -a 172.18.86.130
 iscsictl add_send_target -a 172.18.86.131
 iscsictl refresh_targets
 OK
 
 iscsictl list_targets
       1: iqn.1992-08.com.netapp:naclaug
          2: 172.18.86.130:3260,1026
          3: 172.18.86.131:3260,1027
 
 iscsictl login -P 2
 Created Session 3, Connection 1
 
 iscsictl list_sessions
 Session 3: Target iqn.1992-08.com.netapp:naclaug
 
 iscsictl add_connection -I 3
 iscsictl: add_connection: The login failed
 -> crash (no drive is muted yet!)
 
 
 [  1326.174188] scsibus4 at iscsi0: 1 target, 16 luns per target
 [  1326.174188] sd3 at scsibus4 target 0 lun 11: <NETAPP, LUN C-Mode, 
 9700> disk fixed
 [  1326.174188] sd3: 10240 GB, 65129 cyl, 16 head, 20607 sec, 512 
 bytes/sect x 21474836480 sectors
 [  1326.174188] sd3: GPT GUID: d644c65b-110e-4dd0-9500-0cfc70900463
 [  1326.174188] dk0 at sd3: "NetApp01", 21474836413 blocks at 34, type: 
 ffs
 [  1326.174188] sd3: async, 8-bit transfers, tagged queueing
 [  1355.946583] S3C2: Login failed (rc 4)
 [  1355.946583] S3C2: *** Connection Error, status=18, logout=2, state=6
 [  1356.950793] dk0 at sd3 (NetApp01) deleted
 [  1356.950793] sd3: detached
 [  1356.950793] scsibus4: detached
 [  1358.296346] uvm_fault(0xffffffff819014c0, 0xffffd68067902000, 2) -> e
 [  1358.296346] fatal page fault in supervisor mode
 [  1358.306388] trap type 6 code 0x2 rip 0xffffffff8022d80c cs 0x8 rflags 
 0x10246 cr2 0xffffd680679021c0 ilevel 0 rsp 0xffffd68396627f08
 [  1358.306388] curlwp 0xfffff80d32d67a00 pid 0.554 lowest kstack 
 0xffffd683966232c0
 [  1358.306388] panic: trap
 [  1358.306388] cpu1: Begin traceback...
 [  1358.306388] vpanic() at netbsd:vpanic+0x156
 [  1358.306388] panic() at netbsd:panic+0x3c
 [  1358.306388] trap() at netbsd:trap+0xb27
 [  1358.306388] --- trap (number 6) ---
 [  1358.306388] mutex_enter() at netbsd:mutex_enter+0xc
 [  1358.306388] send_nop_out() at iscsi:send_nop_out+0x133
 [  1358.306388] connection_timeout() at iscsi:connection_timeout+0x4d
 [  1358.306388] iscsi_cleanup_thread() at iscsi:iscsi_cleanup_thread+0x7b2
 [  1358.306388] cpu1: End traceback...
 
 [  1358.306388] dumping to dev 4,1 (offset=22227071, size=12581616):
 [  1358.306388] dump <4>mfi0: workqueue busy: updates stopped
 [  1390.048733] coretemp0: workqueue busy: updates stopped
 [  1390.048733] coretemp1: workqueue busy: updates stopped
 [  1390.048733] coretemp2: workqueue busy: updates stopped
 [  1390.048733] coretemp3: workqueue busy: updates stopped
 ipmi0: workqueue busy: updates stopped
 
 (gdb) target kvm netbsd.31.core
 0xffffffff80226145 in cpu_reboot (howto=howto@entry=260,
      bootstr=bootstr@entry=0x0)
      at 
 /mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/arch/amd64/amd64/machdep.c:720
 720                     dumpsys();
 
 (gdb) bt
 #0  0xffffffff80226145 in cpu_reboot (howto=howto@entry=260,
      bootstr=bootstr@entry=0x0)
      at 
 /mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/arch/amd64/amd64/machdep.c:720
 #1  0xffffffff80d37917 in kern_reboot (howto=howto@entry=260,
      bootstr=bootstr@entry=0x0)
      at /mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/kern/kern_reboot.c:73
 #2  0xffffffff80d7afe2 in vpanic (fmt=fmt@entry=0xffffffff81390116 "trap",
      ap=ap@entry=0xffffd68396627cc8)
      at /mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/kern/subr_prf.c:290
 #3  0xffffffff80d7b0a7 in panic (fmt=fmt@entry=0xffffffff81390116 "trap")
      at /mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/kern/subr_prf.c:209
 #4  0xffffffff80228f67 in trap (frame=0xffffd68396627e10)
      at 
 /mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/arch/amd64/amd64/trap.c:326
 #5  0xffffffff80221023 in alltraps ()
 #6  0xffffd680679021c0 in ?? ()
 #7  0x0000000000000000 in ?? ()
 
 list *(0xffffffff80226145)
 0xffffffff80226145 is in cpu_reboot 
 (/mnt/iscsi_iqn.1992-08.com.netapp/usr/src/sys/arch/amd64/amd64/machdep.c:720).
 715             /* Disable interrupts. */
 716             s = splhigh();
 717
 718             /* Do a dump if requested. */
 719             if ((howto & (RB_DUMP | RB_HALT)) == RB_DUMP)
 720                     dumpsys();
 721
 722     haltsys:
 723             doshutdownhooks();
 724
 
 
 Thank you for your efforts
 
 Regards
 Uwe
 
 
 
 On Wed, 23 Feb 2022, Michael van Elst wrote:
 
 > Date: Wed, 23 Feb 2022 04:00:02 +0000 (UTC)
 > From: Michael van Elst <mlelstv%serpens.de@localhost>
 > Reply-To: gnats-bugs%netbsd.org@localhost
 > To: gnats-admin%netbsd.org@localhost, netbsd-bugs%netbsd.org@localhost,
 >     6bone%6bone.informatik.uni-leipzig.de@localhost
 > Subject: [Extern] Re: bin/56728: iscsi redundancy doesn't work
 > 
 > The following reply was made to PR bin/56728; it has been noted by GNATS.
 >
 > From: mlelstv%serpens.de@localhost (Michael van Elst)
 > To: gnats-bugs%netbsd.org@localhost
 > Cc:
 > Subject: Re: bin/56728: iscsi redundancy doesn't work
 > Date: Wed, 23 Feb 2022 03:58:57 -0000 (UTC)
 >
 > 6bone%6bone.informatik.uni-leipzig.de@localhost writes:
 >
 > >On Tue, 22 Feb 2022, Michael van Elst wrote:
 >
 > >>
 > >> Do you have any messages from that crash? A Backtrace ?
 > >>
 >
 > >Does that help?
 >
 >
 > >[ 98560.538286] sd3d: error reading fsbn 7415496930 of
 > >7415496930-7415496993 (sd3 bn 7415496930; cn 22490 tn 13 sn 6159)
 > >[ 98591.063909] uvm_fault(0xffffffff81901bc0, 0xffff860067907000, 1) -> e
 > >[ 98591.063909] fatal page fault in supervisor mode
 > >[ 98591.063909] trap type 6 code 0 rip 0xffffffff8025381b cs 0x8 rflags
 > >0x10282 cr2 0xffff860067907070 ilevel 0 rsp 0xffff860396d57c20
 > >[ 98591.063909] curlwp 0xffff8186f0a7c940 pid 0.390 lowest kstack
 >
 >
 > Looks like sd_diskstart is running with a NULL periph pointer.
 > I don't see how that happens yet, but it is probably the result of
 > detaching the sd device while in use. The detach message is
 > probably not yet printed.
 >
 > So that's one problem.
 >
 > Detaching the sd device happens when no connection to the iscsi
 > server exists and no connection cannot be re-established
 > either.
 >
 > For multiple connections to an iscsi server you need to do something
 > like:
 >
 > add_send_target    -> add target to list
 > refresh_targets    -> get portals
 > login              -> establish session (creates sd)
 > add_connection     -> add redundant connection to session
 >
 > the man page doesn't look correct.
 >
 > # iscsictl add_send_target -a x.x.x.x
 > Added Send Target 1
 > # iscsictl refresh_targets
 > OK
 > # iscsictl list_targets
 >      1: iqn.2007-09.jp.ne.peach.istgt:pbulk1
 >         2: x.x.x.x:3260,1
 >      3: iqn.2007-09.jp.ne.peach.istgt:test
 >         4: x.x.x.x:3260,1
 > # iscsictl login -P 4
 > Created Session 2, Connection 1
 > # iscsictl list_sessions
 > Session 2: Target iqn.2007-09.jp.ne.peach.istgt:test
 > # iscsictl add_connection -I 2
 > Added Connection 2
 >
 > tcp        0      0  y.y.y.y.65330       x.x.x.x.3260         ESTABLISHED
 > tcp        0      0  y.y.y.y.65331       x.x.x.x.3260         ESTABLISHED
 >
 > # tcpdrop y.y.y.y 65530 x.x.x.x 3260
 >
 > [ 793856.693477] S2C2: *** Connection Error, status=18, logout=2, state=3
 > [ 793856.693477] S2C2: Write failed sock 0xffff8524621c5480 (ret: 32, req: 48, resid: 48)
 > [ 793856.693477] S2C2: *** Connection Error, status=18, logout=-1, state=5
 > [ 793858.693531] S2C2: Connection ReCreated successfully - status 0
 >
 >
 > tcp        0      0  y.y.y.y.65329       x.x.x.x.3260         ESTABLISHED
 > tcp        0      0  y.y.y.y.65331       x.x.x.x.3260         ESTABLISHED
 >
 > Not exactly the same (only a single target IP), but it shows how the
 > connection gets re-established.
 >
 >
 
Home |
Main Index |
Thread Index |
Old Index