xen-devel

Re: [Xen-devel] Nouveau on dom0

from [Konrad Rzeszutek Wilk]

To:	Arvind R <arvino55@xxxxxxxxx>
Subject:	Re: [Xen-devel] Nouveau on dom0
From:	Konrad Rzeszutek Wilk <konrad.wilk@xxxxxxxxxx>
Date:	Wed, 3 Mar 2010 13:13:03 -0500
Cc:	xen-devel@xxxxxxxxxxxxxxxxxxx
In-reply-to:	<d799c4761003030911l51013f07mc1bf4aa15df519c4@xxxxxxxxxxxxxx>
References:	<d799c4761002250046j4fc14785ue17db46d6e3e71ce@xxxxxxxxxxxxxx> <20100225125552.GC9040@xxxxxxxxxxxxxxxxxxx> <d799c4761002250901g6029a69et21fcf1d8556f047@xxxxxxxxxxxxxx> <20100225174411.GA13270@xxxxxxxxxxxxxxxxxxx> <d799c4761002260734j13bc01e3vfd7788b6196230ea@xxxxxxxxxxxxxx> <20100301160130.GB7881@xxxxxxxxxxxxxxxxxxx> <d799c4761003021334t58815ed3p96dc343635b2da2c@xxxxxxxxxxxxxx> <d799c4761003030911l51013f07mc1bf4aa15df519c4@xxxxxxxxxxxxxx>
Sender:	xen-devel-bounces@xxxxxxxxxxxxxxxxxxx
User-agent:	Mutt/1.5.19 (2009-01-05)

> >> page-table directory so that when the GPU accesses the addresses, it
> >> gets the real bus address. I wonder if it fails at that thought -
> >> meaning that the addresses that are written to the page table are
> >> actually the guest page numbers (gpfn) instead of the machine page
> >> numbers (mfn).
> >
> > No, I don't think thats how it works. The user-space write triggers an
> > aio-write -
> 
> which triggers do_page_fault, handle_mm_fault, do_linear_fault, __do_fault
> and finally ttm_bo_vm_fault.
> ttm_bo_vm_fault returns VM_FAULT_NOPAGE

VM_FAULT_NOPAGE means the handler has installed the PTE itself and the
fault should be retried. In other words: "I've fixed the PTE to point to
the right PFN."
> 
>  - but xen-boot keeps on re-triggering the same fault.

Which probably means that something is not OK with the PTE. What is the
vma->vm_page_prot value before the vm_insert_mixed? (and maybe even
after)

Try also reading the true value of the PTE and seeing what it shows
before and after the vm_insert_mixed.

I've attached a simple patch I wrote some time ago to get the real MFNs
and their page protections. I think you can adapt it (the print_data
function, to be exact) to peek at the PTE and its protection values.

There is an extra flag that the PTE can have when running under Xen:
_PAGE_IOMAP. This signifies that the PFN is actually the MFN. In this
case, though, it shouldn't be enabled because the memory is actually
obtained from alloc_page. But if it is set, it might be the culprit.
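To make the flag check concrete, here is a small user-space sketch (not
kernel code, and not the attached patch) that decodes a raw PTE value once
you have printed it. It assumes the 64-bit x86 PTE layout. The low flag bits
are architectural; the position of _PAGE_IOMAP (bit 10 here) is an
assumption and should be checked against pgtable_types.h in the tree you are
building. The names PTE_* and pte_describe are made up for this example.

```c
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

/* Architectural x86 PTE flag bits. */
#define PTE_PRESENT  (1ULL << 0)
#define PTE_RW       (1ULL << 1)
#define PTE_USER     (1ULL << 2)
#define PTE_ACCESSED (1ULL << 5)
#define PTE_DIRTY    (1ULL << 6)
#define PTE_NX       (1ULL << 63)

/* ASSUMPTION: software-available bit used as _PAGE_IOMAP by the Xen
 * pvops kernel. Verify the bit number in your own tree. */
#define PTE_IOMAP    (1ULL << 10)

/* Bits 12..51 hold the frame number in a 64-bit PTE. */
#define PTE_FRAME_MASK 0x000ffffffffff000ULL

/* Extract the frame number (PFN or MFN, depending on context). */
static uint64_t pte_frame(uint64_t pte)
{
    return (pte & PTE_FRAME_MASK) >> 12;
}

/* Format the frame number and the flags that are set into buf. */
static int pte_describe(uint64_t pte, char *buf, size_t len)
{
    return snprintf(buf, len, "frame=0x%" PRIx64 "%s%s%s%s%s%s%s",
                    pte_frame(pte),
                    (pte & PTE_PRESENT)  ? " PRESENT"  : "",
                    (pte & PTE_RW)       ? " RW"       : "",
                    (pte & PTE_USER)     ? " USER"     : "",
                    (pte & PTE_ACCESSED) ? " ACCESSED" : "",
                    (pte & PTE_DIRTY)    ? " DIRTY"    : "",
                    (pte & PTE_IOMAP)    ? " IOMAP"    : "",
                    (pte & PTE_NX)       ? " NX"       : "");
}
```

Feed it the raw PTE value printed before and after the insert. If IOMAP
shows up on a page that came from alloc_page, or PRESENT is missing after
the insert, that would be worth chasing.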


> When vm_fault calls ttm_tt_get_page, the page is already there, and
> the handler does another vm_insert_page (I changed vm_insert_mixed to
> vm_insert_page/pfn based on io_mem, now the only patch, and it works on
> bare machine) on and on and on.
> 
> What can possibly cause the fault-handler to repeat endlessly?

VM_FAULT_NOPAGE short-circuits most of the fault handler and makes it
return early. The application is resumed and retries the operation that
caused the fault - in this case an attempt to write to an address that
was not present. The second attempt at writing to the address should
then succeed without problems.
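As a toy illustration of why that retry can loop forever, here is a
user-space model (purely hypothetical, nothing here is kernel API; all toy_*
names are invented). The "CPU" retries the write as long as the PTE is not
usable. A handler that returns the NOPAGE-style code but leaves the PTE
unusable gets called again and again. This is only one way to get the
symptom, not a diagnosis of what is happening in your case.

```c
#include <stdbool.h>
#include <stdint.h>

#define TOY_PTE_PRESENT  0x1u
#define TOY_FAULT_NOPAGE 1      /* stand-in for VM_FAULT_NOPAGE */

struct toy_mm {
    uint64_t pte;       /* the single PTE backing the faulting address */
    unsigned faults;    /* how many times the handler has been called */
};

typedef int (*toy_fault_fn)(struct toy_mm *mm, uint64_t pfn);

/* Handler that really installs a usable PTE before asking for a retry. */
static int toy_fault_good(struct toy_mm *mm, uint64_t pfn)
{
    mm->pte = (pfn << 12) | TOY_PTE_PRESENT;
    return TOY_FAULT_NOPAGE;
}

/* Handler that asks for a retry but leaves the PTE not present. */
static int toy_fault_broken(struct toy_mm *mm, uint64_t pfn)
{
    mm->pte = pfn << 12;        /* present bit never set */
    return TOY_FAULT_NOPAGE;
}

/* Retry the write until the PTE is usable or max_faults is reached.
 * Returns true if the write would have landed. */
static bool toy_write(struct toy_mm *mm, toy_fault_fn handler,
                      uint64_t pfn, unsigned max_faults)
{
    while (!(mm->pte & TOY_PTE_PRESENT)) {
        if (mm->faults == max_faults)
            return false;       /* real hardware would just keep faulting */
        mm->faults++;
        handler(mm, pfn);
    }
    return true;
}
```

With toy_fault_good the write lands after one fault; with toy_fault_broken
the loop only stops because of the max_faults cap, which matches the "on
and on and on" behaviour described above.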

> If a wrong page is backed at the user-address, it should create bad_access or
> some other subsequent events - but the system is running fine minus all local
> consoles! If the insertion is to a wrong place, this can happen; but the
> top-level trap is the only provider of the address - and the fault address
> and vma address match, and the same code works fine on bare-boot.

So you see this fault handler being called endlessly while the machine
is still running and other pieces of code work just fine, right?

> 
> ttm_tt_get_page calls alloc in a loop - so it may allocate multiple pages from
> start/end depending on Highmem memory or not - implying asynchronous
> allocation and mapping.

I thought it had some logic to figure out that it had already handled this
page and would return the already allocated page?

> 
> All I want now is *ptr = (uint32_t)data to work as of now!

You are doing a great job at this head-spinning detective work. Much
appreciated!

debug-print-pte.patch
Description: Text document

