[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH RFC] x86/lld: fix symbol map generation


  • To: Roger Pau Monne <roger.pau@xxxxxxxxxx>
  • From: Jan Beulich <jbeulich@xxxxxxxx>
  • Date: Tue, 3 May 2022 10:17:44 +0200
  • Arc-authentication-results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=suse.com; dmarc=pass action=none header.from=suse.com; dkim=pass header.d=suse.com; arc=none
  • Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=QnZqch3/2Z5D44xGyAUPkoZg7cZBGL2XMAuJFVXCT3A=; b=UFOy+f7/kNO9MV8NhnyRpAyWgQAVjUKZTcRngsKh+jAItq1d5r+M1Zl3dqHiBVOp+A7aNQZzN6SmRqpqqu6azxAVcP4mzY2AdYf283fC6LHnEKM7MxNBjz9UJH6iqzCvZ+zVWT5anxHbEzo9oqBfzXS8TS/N4oKzxE8jerCK7NESS4cywvpVwwcl7Wz4X/biwh3MF8nrsyidn8q55pwfC3WxtLz9RiDAAOEYQciMhiVwgb/xrxMulLHQW8y+iNS6eve1mh6yHu0eRfEVt8Wm2XlUf/di/RjSrwsvLjIneWoTzZjIwK8jD5TSBPaiamaOaixO58CG5yvFUqY8e4y1KA==
  • Arc-seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=NwApJTCcwxcxbrnAqL2neX8lA0Ulh3kNbo89fg0154X8BV2/r8fHGEqb8/Tk7iaxY4FoAAL5afEibSQ+qIrNkB4X2veMfs2TrPqQaL7g+KUkZRiqSfTWz3CawTJu2OOGcurIto/7qIDhuuKevpb5IRDsSMv9VrpHom+TYdCZOp6rg+gVrki8pdrjHvsvMC9VxooYFBr97G33MZkirzUXqkX4ONY0mVY3EpZ8DH2xCLpnWZW9xiVQHvpG2L4iFy1Nk6YP2to5rg69N3o3fuOjbzbnKdUb5VSk+dA5n7IN5Ne6S2RoitU8Ak7G+GMvQzBlOyq6NC1safC1WNrzUIB6Ig==
  • Authentication-results: dkim=none (message not signed) header.d=none;dmarc=none action=none header.from=suse.com;
  • Cc: Andrew Cooper <andrew.cooper3@xxxxxxxxxx>, Wei Liu <wl@xxxxxxx>, xen-devel@xxxxxxxxxxxxxxxxxxxx
  • Delivery-date: Tue, 03 May 2022 08:17:59 +0000
  • List-id: Xen developer discussion <xen-devel.lists.xenproject.org>

On 02.05.2022 17:20, Roger Pau Monne wrote:
> The symbol map generation (and thus the debug info attached to Xen) is
> partially broken when using LLVM LD.  That's due to LLD converting
> almost all symbols from global to local in the last linking step, and

I'm puzzled by "almost" - is there a pattern of which ones aren't
converted?

Also "last linking step" is ambiguous, as we link three binaries and
aiui the issue is present on every of these passes. May I suggest
"... when linking actual executables" or (still somewhat ambiguous)
"... when linking final binaries"?

> thus confusing tools/symbols into adding a file prefix to all text
> symbols, the results looks like:
> 
> Xen call trace:
>    [<ffff82d040449fe8>] R xxhash64.c#__start_xen+0x3938/0x39c0
>    [<ffff82d040203734>] F __high_start+0x94/0xa0
> 
> In order to workaround this create a list of global symbols prior to
> the linking step, and use objcopy to convert the symbols in the final
> binary back to global before processing with tools/symbols.
> 
> Signed-off-by: Roger Pau Monné <roger.pau@xxxxxxxxxx>
> ---
> I haven't found a way to prevent LLD from converting the symbols, so
> I've come up with this rather crappy workaround.

Perhaps a map file (like we use for shared libraries in tools/) would
allow doing so? But of course this would want to be machine-generated,
not manually maintained.

Have you gained any insight into _why_ they are doing what they do?

> Not applied to EFI, partially because I don't have an environment with
> LLD capable of generating the EFI binary.
> 
> Obtaining the global symbol list could likely be a target on itself,
> if it is to be shared between the ELF and the EFI binary generation.

If, as the last paragraph of the description is worded, you did this
just once (as a prereq), I could see this working. Otherwise (as you
have it now, with it done 3 times) it would first require splitting
the linking rules into many separate ones (which has been the plan
anyway, but so far I didn't get to it).

> --- a/xen/arch/x86/Makefile
> +++ b/xen/arch/x86/Makefile
> @@ -134,24 +134,34 @@ $(TARGET): $(TARGET)-syms $(efi-y) $(obj)/boot/mkelf32
>  CFLAGS-$(XEN_BUILD_EFI) += -DXEN_BUILD_EFI
>  
>  $(TARGET)-syms: $(objtree)/prelink.o $(obj)/xen.lds
> +     # Dump global text symbols before the linking step
> +     $(NM) -pa --format=bsd $< | awk '{ if($$2 == "T") print $$3}' \
> +         > $(@D)/.$(@F).global-syms
>       $(LD) $(XEN_LDFLAGS) -T $(obj)/xen.lds -N $< $(build_id_linker) \
> -         $(objtree)/common/symbols-dummy.o -o $(@D)/.$(@F).0
> +         $(objtree)/common/symbols-dummy.o -o $(@D)/.$(@F).0.tmp
> +     # LLVM LD has converted global symbols into local ones as part of the
> +     # linking step, convert those back to global before using tools/symbols.
> +     $(OBJCOPY) --globalize-symbols=$(@D)/.$(@F).global-syms \
> +         $(@D)/.$(@F).0.tmp $(@D)/.$(@F).0
>       $(NM) -pa --format=sysv $(@D)/.$(@F).0 \
>               | $(objtree)/tools/symbols $(all_symbols) --sysv --sort \
>               >$(@D)/.$(@F).0.S
>       $(MAKE) $(build)=$(@D) $(@D)/.$(@F).0.o
>       $(LD) $(XEN_LDFLAGS) -T $(obj)/xen.lds -N $< $(build_id_linker) \
> -         $(@D)/.$(@F).0.o -o $(@D)/.$(@F).1
> +         $(@D)/.$(@F).0.o -o $(@D)/.$(@F).1.tmp
> +     $(OBJCOPY) --globalize-symbols=$(@D)/.$(@F).global-syms \
> +         $(@D)/.$(@F).1.tmp $(@D)/.$(@F).1
>       $(NM) -pa --format=sysv $(@D)/.$(@F).1 \
>               | $(objtree)/tools/symbols $(all_symbols) --sysv --sort 
> $(syms-warn-dup-y) \
>               >$(@D)/.$(@F).1.S
>       $(MAKE) $(build)=$(@D) $(@D)/.$(@F).1.o
>       $(LD) $(XEN_LDFLAGS) -T $(obj)/xen.lds -N $< $(build_id_linker) \
> -         $(orphan-handling-y) $(@D)/.$(@F).1.o -o $@
> +         $(orphan-handling-y) $(@D)/.$(@F).1.o -o $@.tmp
> +     $(OBJCOPY) --globalize-symbols=$(@D)/.$(@F).global-syms $@.tmp $@

Is this very useful? It only affects ...

>       $(NM) -pa --format=sysv $(@D)/$(@F) \
>               | $(objtree)/tools/symbols --all-symbols --xensyms --sysv 
> --sort \
>               >$(@D)/$(@F).map

... the actual map file; what's in the binary and in this map file doesn't
depend on local vs global anymore (and you limit this to text symbols
anyway; I wonder in how far livepatching might also be affected by the
same issue with data symbols).

In any event I would like to ask that the objcopy invocations be tied to
lld being in use. No matter that it shouldn't, objcopy can alter binaries
even if no actual change is being made (I've just recently observed this
with xen.efi, see the thread rooted at "EFI: strip xen.efi when putting it
on the EFI partition", and recall that at least for GNU binutils objcopy
and strip are effectively [almost] the same binary).

Jan




 


Rackspace

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.