Subject: Re: VM problems?
To: der Mouse <mouse@Rodents.Montreal.QC.CA>
From: None <rmcm@compsoft.com.au>
List: port-sparc
Date: 10/01/1998 15:45:05
Also having some weird memory-like problems (emacs{+/-X,19.34,20.2},
Xsun, fvwm2), with SS1 and SS1+ running 1.3.2 (as distributed, +/-
GENERIC kernel). gdb shows a variety of traces (listed
below). Swapping memory had no effect - but after netbooting sunos4,
the SS1+ has been running for 3+ weeks without problem. 1.3.2 appears
to be the only consistant factor associated with these problems. Any
suggestions?

      - Core was generated by `emacs'.
        Program terminated with signal 4, Illegal instruction.
        #0  0x1036df28 in bcopy ()
        #0  0x1036df28 in bcopy ()
        ----------------------------------------
      - Core was generated by `emacs'.
        Program terminated with signal 10, Bus error.
        #0  fatal_error_signal (sig=3D10)
            at /usr/local/src/gnu/emacs-19.34/src/emacs.c:181
        181=09}
        #0  fatal_error_signal (sig=3D10)
            at /usr/local/src/gnu/emacs-19.34/src/emacs.c:181
        #1  0xefffff74 in ?? ()
        #2  0x1022cb8c in XDrawImageString ()
        #3  0x100000c in ?? ()
        #4  0xc in ?? ()
        #5  0x1018e228 in ?? ()
        Cannot access memory at address 0x9210004a.
        ----------------------------------------
      - Core was generated by `emacs'.
        Program terminated with signal 11, Segmentation fault.
        #0  fatal_error_signal (sig=3D11)
            at /usr/local/src/gnu/emacs-19.34/src/emacs.c:181
        181=09}
        #0  fatal_error_signal (sig=3D11)
            at /usr/local/src/gnu/emacs-19.34/src/emacs.c:181
        #1  0xefffff74 in ?? ()
        #2  0xf00062fc in ?? ()
        #3  0xa in ?? ()
        #4  0x779e8 in re_search_2 (bufp=3D0x0, string1=3D0x19 "\003\22=
30=EF=FF=D7\212",=20
            size1=3D2170304, string2=3D0x0, size2=3D-268443848, startpo=
s=3D22, range=3D2196736,=20
            regs=3D0x211b80, stop=3D1) at /usr/local/src/gnu/emacs-19.3=
4/src/regex.c:3375
        #5  0x3e in ?? ()
        Cannot access memory at address 0x4020f038.
        ----------------------------------------

      - Core was generated by `emacs'.
        Program terminated with signal 6, Abort trap.
        ----------------------------------------

      - Core was generated by `Xsun'.
        Program terminated with signal 10, Bus error.
        ----------------------------------------

------------------------------------------------------------
Rex McMaster                            rmcm@compsoft.com.au=20
                                   rex@mcmaster.wattle.id.au
     PGP Public key: http://www.compsoft.com.au/~rmcm/pgp-pk


der Mouse writes:
 > I've got a weird problem with NetBSD/sparc, and I suspect something
 > weird in the VM subsystem.
 >=20
 > I (co-)run a mud on one of my home machines.  Over the last couple o=
f
 > days, I started trying to move this to sparkle, a SS1+ running
 > NetBSD/sparc.
 >=20
 > I started getting weird crashes: according to gdb, pointers were tak=
ing
 > on impossible values, and never quite the same crash twice.  My firs=
t
 > thought was, flaky RAM: I'd just recently pulled 4x1M and replaced i=
t
 > with new 4x4M.  Moving the RAM to another machine (a 386 that has
 > memory tests capable of identifying both of my known-bad SIMMs) turn=
ed
 > up no problems.
 >=20
 > Then I tried it on another SS1+ running the same OS (built from the
 > very same source tree), whose hardware hasn't been touched in ages.
 > Very similar crash.
 >=20
 > Then I tried it on a SS2 running NetBSD 1.2 (still at 1.2 because, a=
s I
 > mentioned in another thread, more modern kernels can't seem to play =
CDs
 > in its CD drive).  Worked like a charm.
 >=20
 > Thus, I conjecture there's something softwary wrong, probably with t=
he
 > VM subsystem.  The source tree does have a number of patches, but no=
ne
 > to UVM.  (I have two patches in vm/, but - I just checked - nothing =
in
 > there is getting used; I am using uvm/.)
 >=20
 > Any way I can help debug this?  If necessary I can probably even set=
 up
 > serial-console access to one of the affected machines, if pk or some=
one
 > wants to ssh in to one of my machines for it, set it up with a
 > crash-and-burn disk that I don't mind wiping later....
 >=20
 > =09=09=09=09=09der Mouse
 >=20
 > =09=09=09       mouse@rodents.montreal.qc.ca
 > =09=09     7D C8 61 52 5D E7 2D 39  4E F1 31 3E E8 B3 27 4B

--=20