Re: OMAP2/ARM1136 cache aliasing issue

To: Imre Deak <imre.deak%teleca.com@localhost>
Subject: Re: OMAP2/ARM1136 cache aliasing issue
From: Matt Thomas <matt%3am-software.com@localhost>
Date: Fri, 18 Jul 2008 10:09:07 -0700


On Jul 18, 2008, at 8:03 AM, Imre Deak wrote:

Hi all,

I have an H4/OMAP2420/ARM1136r0p2 board. The data cache is 32k/4way,
cache line size is 32 bytes. I bumped into a memory corruption bug, where
a 4 byte location of the kernel code segment got corrupted. This can't be
the result of a stray pointer store since pages containing code are
readonly leading to an exception when a store is attempted to such an
address. (I also set a hardware memory write breakpoint at the given address
but it never triggered).

The bug is easily reproducible, happens always at the same place with
the same value and I narrowed it down to 5 instructions. I can't trigger
at the first instruction to see as the bug happens, I can only trigger at
the last one when I detect the corruption. I read the data cache tags and
TLB content with CP15 instructions both before and after the 5 instructions.
There is (of course) no store to the code segment location which is corrupted,
still I see the relevant cache line getting dirtied as the result of these
5 instructions. Storing occurs only to a different address but the value
stored matches with the value at the corrupted location. Also the two virtual
addresses (the one being stored to and the one getting corrupted) have the
same cache index, but the physical addresses are different. The mapping for
the address being stored to is 4k, the mapping for the corrupted address is
64k big.

Following are the 5 instructions leading to the corruption. There are no
interrupts or exceptions during its execution:

r0: 82df050c
sp: 82e17e14
ip: 82e17e14

8045fdc4:       e59f302c        ldr     r3, [pc, #44]   ; 8045fdfc
8045fdc8:       e92dd800        stmdb   sp!, {fp, ip, lr, pc}
8045fdcc:       e24cb004        sub     fp, ip, #4
8045fdd0:       e24dd010        sub     sp, sp, #16
8045fdd4:       e50b0014        str     r0, [fp, #-20]  ; 82e17dfc

..
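
The store addresses produced by the listed instructions can be checked by hand from the register values given above. The following is only a minimal sketch in Python, not part of the original report; it models nothing but the address arithmetic of the four instructions that change sp/fp or store, and the helper name trace_stores is hypothetical.

```python
MASK32 = 0xFFFFFFFF  # keep all arithmetic within 32 bits


def trace_stores(sp, ip):
    """Return (store addresses, final sp, fp) for the listed sequence.

    The leading 'ldr r3, [pc, #44]' is a load and performs no store,
    so it is not modelled here.
    """
    stores = []

    # stmdb sp!, {fp, ip, lr, pc}: four words, decrement-before, writeback.
    sp = (sp - 4 * 4) & MASK32
    stores += [(sp + 4 * i) & MASK32 for i in range(4)]

    # sub fp, ip, #4
    fp = (ip - 4) & MASK32

    # sub sp, sp, #16
    sp = (sp - 16) & MASK32

    # str r0, [fp, #-20]
    stores.append((fp - 20) & MASK32)

    return stores, sp, fp


# Register values as given in the report (sp and ip are equal on entry).
stores, sp, fp = trace_stores(sp=0x82E17E14, ip=0x82E17E14)
print([hex(a) for a in stores])
print(hex(sp), hex(fp))
```

Under this model every store lands in the stack page at 82e17xxx, and the last one matches the 82e17dfc annotation in the listing.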


Relevant mappings:
VA:82e17000 -> PA:83cda000 4k, outer write-back, no allocate on write,
                                   supervisor read/write
VA:80450000 -> PA:80450000 64k, outer write-back, no allocate on write,
                                   non-accessible
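
The claim that the two virtual addresses share a cache index while the physical addresses differ can be reproduced with a little arithmetic. The following is a hedged sketch in Python, not taken from the thread. It assumes the set index is simply the line number modulo the number of sets, which for a 32k, 4-way cache with 32-byte lines gives 256 sets (address bits 12:5). The report does not state the corrupted address explicitly, so 0x8045fdfc, the address annotated in the listing, is used here purely as a stand-in. The helper names set_index and translate are hypothetical.

```python
CACHE_SIZE = 32 * 1024   # 32k data cache, from the report
WAYS = 4                 # 4-way set associative
LINE_SIZE = 32           # bytes per cache line
NUM_SETS = CACHE_SIZE // (WAYS * LINE_SIZE)


def set_index(addr):
    """Set index, assuming plain line-number-modulo-sets indexing."""
    return (addr // LINE_SIZE) % NUM_SETS


def translate(va, va_base, pa_base, size):
    """Apply one of the linear mappings listed in the report."""
    assert va_base <= va < va_base + size
    return pa_base + (va - va_base)


STORE_VA = 0x82E17DFC    # target of 'str r0, [fp, #-20]'
CODE_VA = 0x8045FDFC     # assumption: stand-in for the corrupted location

store_pa = translate(STORE_VA, 0x82E17000, 0x83CDA000, 4 * 1024)
code_pa = translate(CODE_VA, 0x80450000, 0x80450000, 64 * 1024)

print("sets:", NUM_SETS)
print("VA index:", hex(set_index(STORE_VA)), hex(set_index(CODE_VA)))
print("PA:", hex(store_pa), hex(code_pa))
print("PA index:", hex(set_index(store_pa)), hex(set_index(code_pa)))
```

With these numbers both virtual addresses fall into set 0xef, while the physical address of the store would index set 0x6f. The two differ only in address bit 12, which lies above the 4k page offset.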

Can I suggest changing pmap_map_chunk to not use large pages and see if the
corruption still happens?  Just #if 0 the if at ~5578 in pmap.c.

Of course that corruption shouldn't have happened and it may very well be
a silicon bug.

