|
|
|
|
|
|
|
|
|
|
xen-devel
buggy linear page table handling Re: [Xen-devel] xm pause causing lockup
I went through a few quick iterations to test page table reference
counting. In short, if I L2 pin a zeroed page that I've re-mapped
read-only the pin succeeds. If the page has a self-referential mapping
before it is remapped read-only the pin never returns. It is probably
safe to conclude that the type count is not correctly changed when the
page is re-mapped if there is a self-referential entry. This used to
work, thus it is also safe to say that this is a regression introduced
some time between 3/22 and 4/11. Test code from pmap_pinit below.
-Kip
/* ***** TEMP \/ ********** */
ma = xpmap_ptom(VM_PAGE_TO_PHYS(ptdpg[0]));
#if 0
/* works */
pmap_qremove((vm_offset_t)pmap->pm_pdir, NPGPTD);
#elif 0
/* works */
PT_SET_MA(pmap->pm_pdir, 0);
#elif 0
/* works */
PT_SET_MA(pmap->pm_pdir, ma | PG_V | PG_A);
#else
/* causes lockup on pin */
pmap->pm_pdir[PTDPTDI + i] = ma | PG_V | PG_A | PG_M;
PT_SET_MA(pmap->pm_pdir, ma | PG_V | PG_A);
#endif
printk("pinning %p - pass 0\n", ma);
xen_pgd_pin(xpmap_ptom(VM_PAGE_TO_PHYS(ptdpg[0])));
printk("pinned %p - pass 0\n", ma);
/* ***** TEMP ^ ********** */
On 4/15/05, Kip Macy <kip.macy@xxxxxxxxx> wrote:
> > Does this happen if you boot with 'nosmp'? I don't really believe it's a
> > race, but might be worth checking.
>
> Yes, it still happens. It would have found it quite astonishing if it
> were a race.
> (XEN) EIP: 0808:[<fc52d5a3>]
> (gdb) x/i 0xfc52d5a3
> 0xfc52d5a3 <get_page_type+265>: mov 0x14(%eax),%eax
> (gdb) info line *0xfc52d5a3
> Line 1236 of "mm.c" starts at address 0xfc52d5a0 <get_page_type+262>
> and ends at 0xfc52d5b0 <get_page_type+278>.
> (gdb)
>
> Line 1236-1240 of local mm.c:
> while ( (y = page->u.inuse.type_info) == x )
> cpu_relax();
> counter++;
> printk("page was not validated");
> goto again;
>
> > Also, it's worth adding a printk into this loop just to check that that
> > is where you're getting caught.
>
> Obviously wasn't thinking and stuck it in the wrong place.
> Nonetheless, even without the printk I think I've proven my point.
>
>
> >
> > /* Someone else is updating validation of this page. Wait...
> > */
> > while ( (y = page->u.inuse.type_info) == x )
> > cpu_relax();
> > goto again;
>
> Yep.
>
> >
> > We need to figure out how the type count managed to get to one without
> > the page being validated. I presume you're doing a debug=y build of Xen?
>
> Correct. Nothing comes out on the console apart from debug output from
> FreeBSD.
>
> > Do you get any warnings about illegal mmu_update attempts when you boot
> > FreeBSD?
>
> No, I don't. This is the offending code snippet from pmap_pinit:
>
> /* install self-referential address mapping entry(s) */
> for (i = 0; i < NPGPTD; i++) {
> ma = xpmap_ptom(VM_PAGE_TO_PHYS(ptdpg[i]));
> pmap->pm_pdir[PTDPTDI + i] = ma | PG_V | PG_A | PG_M;
> #ifdef PAE
> pmap->pm_pdpt[i] = ma | PG_V;
> #endif
> /* re-map page directory read-only */
> PT_SET_MA(pmap->pm_pdir, *vtopte((vm_offset_t)pmap->pm_pdir)
> & ~PG_RW);
> xen_pgd_pin(ma);
> }
>
> PT_SET_MA is just a wrapper for update_va_mapping. Have there been any
> recent changes to the page typing code that would cause it to get
> confused by a self-referential mapping?
>
> -Kip
>
_______________________________________________
Xen-devel mailing list
Xen-devel@xxxxxxxxxxxxxxxxxxx
http://lists.xensource.com/xen-devel
|
|
|
|
|