Andi Kleen wrote:
> Yes. All inline assembly tells gcc what registers are clobbered
> and it fills in the tables. Hand clobbering in inline assembly cannot
> be expressed with the current toolchain, so we moved all those
> out of line.
> But again I'm not sure it will work anyways. For once you would
> need large padding around the calls anyways for inline replacement --
> how would you generate that? I expect you would need to put the calls
> into asm() again and with that a custom annotiation format looks reasonable.
Inlining is most important for very small code: sti, cli, pushf;pop eax,
etc (in many cases, no-ops). We'd have at least 5 bytes to work in, and
maybe more if there are surrounding push/pops to be consumed.
For example, say we wanted to put a general call for sti into entry.S,
where its expected it won't touch any registers. In that case, we'd
have a sequence like:
If we parse the relocs, then we'd find the reference to paravirt_cli.
If we look at the byte before and see 0xe8, then we can see if its a
call. If we then work out in each direction and see matched push/pops,
then we know what registers can be trashed in the call. This also
allows us to determine the callsite size, and therefore how much space
we need for inlining.
So in this case, we see that there are 5 bytes for the call and a
further 6 bytes of push/pops available for inlining.
Of course this is hand-written code anyway, so there's no particular
burden to having some extra metadata stashed away in another section.
For compiler-generated code, we know that it's already expecting
standard C ABI calling conventions. The downside, of course, is that
only the 5 byte call space is available for inline patching.
Xen-devel mailing list