[compiler-rt][RISCV] Implement __init_riscv_feature_bits #85790

BeMg · 2024-03-19T14:02:53Z

Base on riscv-non-isa/riscv-c-api-doc#74, this patch defines the __riscv_feature_bits and __riscv_vendor_feature_bits structures to store the enabled feature bits at runtime.

It also introduces the __init_riscv_features_bit function to update these structures based on the platform query mechanism.

Additionally, the groupid/bitmask definitions from riscv-non-isa/riscv-c-api-doc#74 are declared and used to update the __riscv_feature_bits and __riscv_vendor_feature_bits structures.

BeMg · 2024-03-19T14:03:56Z

This patch make #85786 could run some real test.

compiler-rt/lib/builtins/riscv/ifunc_select.c

BeMg · 2024-03-30T12:13:37Z

Align with latest sys_riscv_hwprobe
Update __riscv_ifunc_select, from __riscv_ifunc_select(char *) to __riscv_ifunc_select(unsigned long long, unsigned long long).
Remove the cpuinfo relate code and string process relate code
Use the bitset method to determine whether a set of extension is available for current environment.

compiler-rt/lib/builtins/riscv/ifunc_select.c

github-actions · 2024-04-01T02:34:26Z

✅ With the latest revision this PR passed the C/C++ code formatter.

wangpc-pp · 2024-04-01T02:52:21Z

Are there any processes for GNU/GCC implementation? If we want to port glibc, I think it should be required.

BeMg · 2024-04-08T04:47:25Z

Let the caller to manage and construct the necessary key/value pairs for hwprobe, eliminating the need for the runtime site to sync with the hwprobe key table.
Modify __riscv_ifunc_select to accept a pointer to riscv_hwprobe and its length, so the prototype does not need to be updated when the hwprobe keys increase or change.

compiler-rt/lib/builtins/riscv/ifunc_select.c

BeMg · 2024-04-22T04:30:54Z

Since this resolver function is expected to be available and interchangeable for both libgcc and compiler-rt, a formal specification for the resolver function interface is necessary.

I've create one for this PR riscv-non-isa/riscv-c-api-doc#74 and provide the three different candidate approach to achieve the same purpose.

…ature_bits/__init_riscv_features_bit Base on riscv-non-isa/riscv-c-api-doc#74, this patch defines the __riscv_feature_bits and __riscv_vendor_feature_bits structures to store the enabled feature bits at runtime. It also introduces the __init_riscv_features_bit function to update these structures based on the platform query mechanism. Additionally, the groupid/bitmask definitions from riscv-non-isa/riscv-c-api-doc#74 are declared and used to update the __riscv_feature_bits and __riscv_vendor_feature_bits structures.

kito-cheng

LGTM from my end, also I has implemented libgcc version, and posted into mailing list: https://patchwork.sourceware.org/project/gcc/patch/[email protected]/

kito-cheng

Few comment to improving multi-threading issue.

compiler-rt/lib/builtins/riscv/feature_bits.c

preames · 2024-07-19T15:52:50Z

Following up to conversation from the RISCV sync up call yesterday.

LGTM to the approach. I'm deferring to Kito on the implementation details of compiler-rt. This LGTM is subject to the requirement that this patch is reverted from the release branch if for any reason the dependent compiler default ifunc resolver change doesn't make it into the release. (Edit: After going and taking a detail look at the dependent compiler changes - yeah, those are likely not getting in. As such, this LGTM will not mean much.)

As broader context (as much for my future self as anything else). We have three major options on the default resolver approach.

We could just use hwcaps. This is pretty universally rejected as the bits are ambiguous in several known cases, and only cover a handful of extensions.
We could use the libc entry point to hwprobe provided to the resolver in the second argument register. This is only available in glibc 2.40 and later, before that a nullptr is passed (args are nullptr terminated.) 2.40 is unreleased, and we don't want to dependent on an unpublished ABI. As such, this option would require we delay this feature until 20.x.
This approach. The downsides of this approach are that a) most users use libgcc not compiler-rt, and b) we have an extra dependency layer which may slow pickup of future extensions. The benefit is that a clang toolchain using compiler-rt picks up this feature at least 6 months sooner. There's also some discussion of backporting the corresponding libgcc change.

The later two both have pros and cons. I personally would mildly prefer the second option, but am deferring to the folks who've worked on this as the third choice (this one) is at least reasonable. Worth noting is that even if we land this, if we later decide the versioning upgrade thing is a major problem, there's nothing preventing a future compiler version from switching to the glibc entry if available.

preames · 2024-07-19T15:39:55Z

compiler-rt/lib/builtins/riscv/feature_bits.c

+
+  // Init vendor extension
+  __riscv_vendor_feature_bits.length = 0;
+  __riscv_vendor_feature_bits.vendorID = Hwprobes[2].value;


Maybe worth a note in the code...

On first glance it looks like there's missing error handling here. The code is actually okay, but that's slightly non-obvious.

You may be on a kernel version which supports hwprobe, but doesn't recognize a given key. In that situation, the documentation says that the syscall will return success, but the key field will be set to -1. This code is relying on the fact that the value field will also be 0 in this case. This happens to work out to having all the bits unset.

jrtc27 · 2024-07-19T15:56:50Z

Following up to conversation from the RISCV sync up call yesterday.

LGTM to the approach. I'm deferring to Kito on the implementation details of compiler-rt. This LGTM is subject to the requirement that this patch is reverted from the release branch if for any reason the dependent compiler default ifunc resolver change doesn't make it into the release.

As broader context (as much for my future self as anything else). We have three major options on the default resolver approach.

We could just use hwcaps. This is pretty universally rejected as the bits are ambiguous in several known cases, and only cover a handful of extensions.

We could use the libc entry point to hwprobe provided to the resolver in the second argument register. This is only available in glibc 2.40 and later, before that a nullptr is passed (args are nullptr terminated.) 2.40 is unreleased, and we don't want to dependent on an unpublished ABI. As such, this option would require we delay this feature until 20.x.

This approach. The downsides of this approach are that a) most users use libgcc not compiler-rt, and b) we have an extra dependency layer which may slow pickup of future extensions. The benefit is that a clang toolchain using compiler-rt picks up this feature at least 6 months sooner. There's also some discussion of backporting the corresponding libgcc change.

The later two both have pros and cons. I personally would mildly prefer the second option, but am deferring to the folks who've worked on this as the third choice (this one) is at least reasonable. Worth noting is that even if we land this, if we later decide the versioning upgrade thing is a major problem, there's nothing preventing a future compiler version from switching to the glibc entry if available.

One of the points of having an abstraction, whether like this or otherwise, is that you don't need to have per-OS code in the compiler to handle multi-versioning. This format is simple enough that it's not tied to one OS, and the extensions specified by a body other than a specific OS, unlike hwprobe which is defined by Linux and has an interface tied to it (e.g. the use of its notion of CPU sets). This is something FreeBSD can realistically implement.

preames · 2024-07-19T16:48:04Z

One of the points of having an abstraction, whether like this or otherwise, is that you don't need to have per-OS code in the compiler to handle multi-versioning. This format is simple enough that it's not tied to one OS, and the extensions specified by a body other than a specific OS, unlike hwprobe which is defined by Linux and has an interface tied to it (e.g. the use of its notion of CPU sets). This is something FreeBSD can realistically implement.

@jrtc27 Is there an interface provided by e.g. FreeBSD that we should be looking at here? If not, this seems like somewhat of a moot argument.

As a second point, asking from ignorance here as I honestly don't know, don't we generally know the target OS from the triple? Generating code which has to work on any OS versus some specific OS seems like a generally harder problem. The dependent patches already have e.g.:

  if (getContext().getTargetInfo().getTriple().getOS() !=
      llvm::Triple::OSType::Linux) {
    CGM.getDiags().Report(diag::err_os_unsupport_riscv_target_clones);
    return;
  }

Is your argument that while we can generate OS specific code, we should prefer not to? If so, that seems like a reasonable code quality point, but I don't see how it's in anyway blocking. We can ship a version of the compiler with the OS specific enable, and then generalize once we have a second example, and sink common APIs into compiler runtimes if useful. It also seems like a concern which deserves to be balanced with e.g. the timeline to expose a new extension as opposed to a hard and fast rule.

jrtc27 · 2024-07-19T16:54:27Z

One of the points of having an abstraction, whether like this or otherwise, is that you don't need to have per-OS code in the compiler to handle multi-versioning. This format is simple enough that it's not tied to one OS, and the extensions specified by a body other than a specific OS, unlike hwprobe which is defined by Linux and has an interface tied to it (e.g. the use of its notion of CPU sets). This is something FreeBSD can realistically implement.

@jrtc27 Is there an interface provided by e.g. FreeBSD that we should be looking at here? If not, this seems like somewhat of a moot argument.

Not yet, because I was waiting to see what happened with function multiversioning.

As a second point, asking from ignorance here as I honestly don't know, don't we generally know the target OS from the triple?

We do, but the less conditionality the better; easier to maintain, and less to test.

Generating code which has to work on any OS versus some specific OS seems like a generally harder problem. The dependent patches already have e.g.:
  if (getContext().getTargetInfo().getTriple().getOS() !=
      llvm::Triple::OSType::Linux) {
    CGM.getDiags().Report(diag::err_os_unsupport_riscv_target_clones);
    return;
  }
Is your argument that while we can generate OS specific code, we should prefer not to? If so, that seems like a reasonable code quality point

Yeah, exactly.

but I don't see how it's in anyway blocking. We can ship a version of the compiler with the OS specific enable, and then generalize once we have a second example,

Eh, you can go either way on the compiler. It's an interface that FreeBSD will implement at some point, so you could argue that it's better to get it in the compiler sooner rather than later so you can use an older compiler on a newer system that provides it (especially given it has to have run-time detection of the interface's availability anyway). But you could also argue that it's known to be useless so making it look like it works is unhelpful.

and sink common APIs into compiler runtimes if useful. It also seems like a concern which deserves to be balanced with e.g. the timeline to expose a new extension as opposed to a hard and fast rule.

jrtc27 · 2024-07-19T16:55:51Z

And to be clear, I'm not saying that anything should be blocked on getting FreeBSD supported. I'm just saying that one of the benefits of this interface is that it can be reused on other OSes in future PRs, all that's needed is implementing it for that OS in compiler-rt, which is completely doable for any OS.

This implements the __builtin_cpu_init and __builtin_cpu_supports builtin routines based on the compiler runtime changes in llvm#85790. This is inspired by llvm#85786. Major changes are a) a restriction in scope to only the builtins (which have a much narrower user interface), and the avoidance of false generality. This change deliberately only handles group 0 extensions (which happen to be all defined ones today), and avoids the tblgen changes from that review. This is still a WIP. It is posted for initial feedback on whether this makes sense to try to get into 19.x release. Major items left undone: * Updating clang tests to exercise this logic. * Actually running it at all. I did not build compiler-rt, and thus all my checking was of generated asm/IR. * Investigate claims from gcc docs that __builtin_cpu_init is called early in process lifetime with high priority constructor. I did not find this with some quick searching.

preames · 2024-07-22T15:53:29Z

compiler-rt/lib/builtins/riscv/feature_bits.c

+
+static int FeaturesBitCached = 0;
+
+void __init_riscv_feature_bits() {


I think there's a missing piece here. The corresponding bit of X86 code (in compiler-rt/lib/builtins/cpu_model/x86.c), uses CONSTRUCTOR_ATTRIBUTE to ensure that the initialization is called early in process lifetime even if an ifunc which explicitly depends invokes the initialization isn't called. I believe we need to do the same thing here. The slightly confusing bit is that aarch64 appears not to do this.

@BeMg

This implements the __builtin_cpu_init and __builtin_cpu_supports builtin routines based on the compiler runtime changes in #85790. This is inspired by #85786. Major changes are a) a restriction in scope to only the builtins (which have a much narrower user interface), and the avoidance of false generality. This change deliberately only handles group 0 extensions (which happen to be all defined ones today), and avoids the tblgen changes from that review. I don't have an environment in which I can actually test this, but @BeMg has been kind enough to report that this appears to work as expected. Before this can make it into a release, we need a change such as #99958. The gcc docs claim that cpu_support can be called by "normal" code without calling the cpu_init routine because the init routine will have been called by a high priority constructor. Our current compiler-rt mechanism does not do this.

Base on riscv-non-isa/riscv-c-api-doc#74, this patch defines the `__riscv_feature_bits` and `__riscv_vendor_feature_bits` structures to store the enabled feature bits at runtime. It also introduces the `__init_riscv_feature_bits` function to update these structures based on the platform query mechanism. Additionally, the groupid/bitmask definitions from riscv-non-isa/riscv-c-api-doc#74 are declared and used to update the `__riscv_feature_bits` and `__riscv_vendor_feature_bits` structures. --------- Co-authored-by: Kito Cheng <[email protected]>

@BeMg

This implements the __builtin_cpu_init and __builtin_cpu_supports builtin routines based on the compiler runtime changes in #85790. This is inspired by #85786. Major changes are a) a restriction in scope to only the builtins (which have a much narrower user interface), and the avoidance of false generality. This change deliberately only handles group 0 extensions (which happen to be all defined ones today), and avoids the tblgen changes from that review. I don't have an environment in which I can actually test this, but @BeMg has been kind enough to report that this appears to work as expected. Before this can make it into a release, we need a change such as #99958. The gcc docs claim that cpu_support can be called by "normal" code without calling the cpu_init routine because the init routine will have been called by a high priority constructor. Our current compiler-rt mechanism does not do this.

asb · 2024-07-30T09:33:48Z

For what it's worth, I left a comment on the C API PR querying whether we should better define the interface for failure (e.g. if __init_riscv_features doesn't do anything useful for the target platform). See here.

…m#85790)" This reverts commit a41a4ac.

BeMg mentioned this pull request Mar 21, 2024

Function multi-version proposal riscv-non-isa/riscv-c-api-doc#48

Merged

lukel97 mentioned this pull request Mar 21, 2024

[RISCV][FMV] Support target_clones #85786

Merged

wangpc-pp reviewed Mar 21, 2024

View reviewed changes

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

BeMg marked this pull request as ready for review March 30, 2024 12:13

BeMg requested a review from kito-cheng March 30, 2024 12:14

llvmbot added compiler-rt compiler-rt:builtins labels Mar 30, 2024

BeMg requested review from topperc, lukel97 and preames March 30, 2024 12:14

lukel97 reviewed Apr 1, 2024

View reviewed changes

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

BeMg requested review from lukel97 and wangpc-pp April 1, 2024 02:35

topperc reviewed Apr 9, 2024

View reviewed changes

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

wangpc-pp reviewed Apr 9, 2024

View reviewed changes

compiler-rt/lib/builtins/riscv/ifunc_select.c Outdated Show resolved Hide resolved

BeMg mentioned this pull request Apr 9, 2024

[RFC] Function multiversion resolver function implementation riscv-non-isa/riscv-c-api-doc#72

Open

BeMg requested review from wangpc-pp and topperc April 15, 2024 06:51

BeMg mentioned this pull request Apr 19, 2024

[FMV] Runtime Resolver Function riscv-non-isa/riscv-c-api-doc#74

Merged

BeMg marked this pull request as draft May 23, 2024 05:34

wangpc-pp mentioned this pull request Jun 4, 2024

[RISCV] Add support for getHostCPUFeatures using hwprobe #94352

Merged

BeMg force-pushed the IFUNC/riscv_ifunc_select-impl branch from 9b06f1b to 628f3e8 Compare June 11, 2024 04:38

BeMg marked this pull request as ready for review June 11, 2024 14:31

Move FeaturesBitCached = 1 after __riscv_feature_bits be inited.

406db36

kito-cheng approved these changes Jul 17, 2024

View reviewed changes

kito-cheng reviewed Jul 17, 2024

View reviewed changes

compiler-rt/lib/builtins/riscv/feature_bits.c Show resolved Hide resolved

compiler-rt/lib/builtins/riscv/feature_bits.c Outdated Show resolved Hide resolved

compiler-rt/lib/builtins/riscv/feature_bits.c Outdated Show resolved Hide resolved

Only store the global object

25b29be

dtcxzyw requested a review from MaskRay July 17, 2024 11:48

BeMg added 2 commits July 17, 2024 04:54

Init local features

e159608

Fixup format

31c7b0d

preames reviewed Jul 19, 2024

View reviewed changes

preames mentioned this pull request Jul 19, 2024

[RISCV] Support __builtin_cpu_init and __builtin_cpu_supports #99700

Merged

BeMg added 2 commits July 20, 2024 05:27

Add comment when hwprobe key is unknown

c3b5d15

fixup format

a809208

BeMg merged commit a41a4ac into llvm:main Jul 21, 2024
6 checks passed

preames reviewed Jul 22, 2024

View reviewed changes

BeMg added a commit to BeMg/llvm-project that referenced this pull request Jul 31, 2024

Revert "[compiler-rt][RISCV] Implement __init_riscv_feature_bits (llv…

e99bdcc

…m#85790)" This reverts commit a41a4ac.

tru pushed a commit to BeMg/llvm-project that referenced this pull request Aug 1, 2024

Revert "[compiler-rt][RISCV] Implement __init_riscv_feature_bits (llv…

b148019

…m#85790)" This reverts commit a41a4ac.

philnik777 mentioned this pull request Aug 2, 2024

[Clang] Add a release note deprecating __is_nullptr #101638

Merged

mgabka mentioned this pull request Aug 22, 2024

Add release note about ABI mgabka/llvm-project#6

Open

thewtex mentioned this pull request Feb 10, 2025

llvmorg 19.1.5 libcxxabi pthread lib name #126605

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[compiler-rt][RISCV] Implement __init_riscv_feature_bits #85790

[compiler-rt][RISCV] Implement __init_riscv_feature_bits #85790

BeMg commented Mar 19, 2024 •

edited

Loading

BeMg commented Mar 19, 2024

BeMg commented Mar 30, 2024

github-actions bot commented Apr 1, 2024 •

edited

Loading

wangpc-pp commented Apr 1, 2024

BeMg commented Apr 8, 2024

BeMg commented Apr 22, 2024

kito-cheng left a comment

kito-cheng left a comment

preames commented Jul 19, 2024 •

edited

Loading

preames Jul 19, 2024

jrtc27 commented Jul 19, 2024

preames commented Jul 19, 2024

jrtc27 commented Jul 19, 2024

jrtc27 commented Jul 19, 2024

preames Jul 22, 2024

asb commented Jul 30, 2024


		static int FeaturesBitCached = 0;

		void __init_riscv_feature_bits() {

[compiler-rt][RISCV] Implement __init_riscv_feature_bits #85790

[compiler-rt][RISCV] Implement __init_riscv_feature_bits #85790

Conversation

BeMg commented Mar 19, 2024 • edited Loading

BeMg commented Mar 19, 2024

BeMg commented Mar 30, 2024

github-actions bot commented Apr 1, 2024 • edited Loading

wangpc-pp commented Apr 1, 2024

BeMg commented Apr 8, 2024

BeMg commented Apr 22, 2024

kito-cheng left a comment

Choose a reason for hiding this comment

kito-cheng left a comment

Choose a reason for hiding this comment

preames commented Jul 19, 2024 • edited Loading

preames Jul 19, 2024

Choose a reason for hiding this comment

jrtc27 commented Jul 19, 2024

preames commented Jul 19, 2024

jrtc27 commented Jul 19, 2024

jrtc27 commented Jul 19, 2024

preames Jul 22, 2024

Choose a reason for hiding this comment

asb commented Jul 30, 2024

BeMg commented Mar 19, 2024 •

edited

Loading

github-actions bot commented Apr 1, 2024 •

edited

Loading

preames commented Jul 19, 2024 •

edited

Loading