[BUG] neon_intrinsics: some fp16 convert intrinsics have type mismatches #212

rmcclure-nv · 2022-07-26T22:39:53Z

The intrinsics for scalar converts between fp16 and integer/fixed-point values are currently specified to always use 16-bit -> 16-bit convert instructions, regardless of the size of the integer/fixed-point value. For example:

acle/tools/intrinsic_db/advsimd.csv

Line 3795 in 6eb8516

float16_t vcvth_f16_s32(int32_t a) a -> Hn SCVTF Hd,Hn Hd -> result A32/A64

Using the instruction listed causes certain input values to be converted incorrectly. For the above example, an int32 input of 65504 (0xFFE0) produces an fp16 value of -32.0 instead of the expected 65504.0, because only the low 16 bits are read and interpreted as a signed 16-bit integer.

Instead, the above intrinsic should use the SCVTF Hd,Wn instruction, which better matches the input type.
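The -32.0 result for an input of 65504 can be reproduced with plain integer arithmetic. The sketch below is only a portable model of the truncation, not the actual instruction or intrinsic: the helper names `low16_as_signed` and `model_scvtf_hd_hn` are hypothetical, and the model assumes the 16-bit form reads just the low 16 bits of the argument. It also ignores rounding to fp16 precision, which does not matter for this particular value.

```c
#include <assert.h>
#include <stdint.h>

/* Reinterpret the low 16 bits of a 32-bit value as a signed 16-bit integer.
 * Done with unsigned arithmetic to avoid implementation-defined narrowing. */
static int32_t low16_as_signed(int32_t a)
{
    uint32_t low = (uint32_t)a & 0xFFFFu;
    int32_t result = (low >= 0x8000u) ? (int32_t)low - 0x10000 : (int32_t)low;
    assert(result >= INT16_MIN && result <= INT16_MAX);
    return result;
}

/* Hypothetical model of applying a 16-bit -> 16-bit signed convert to an
 * int32_t argument. A float stands in for fp16 here, so rounding to fp16
 * precision is NOT modelled; only the truncation of the input is. */
static float model_scvtf_hd_hn(int32_t a)
{
    return (float)low16_as_signed(a);
}
```

Under this model, `model_scvtf_hd_hn(65504)` evaluates to -32.0f, since 0xFFE0 read as a signed 16-bit integer is -32. The value 65504 itself is the largest finite fp16 value, so a convert that reads the full 32-bit input can represent it exactly.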

This applies to all scalar converts between fp16 and 32-bit/64-bit integer/fixed-point values:

float16_t vcvth_f16_s32
float16_t vcvth_f16_s64
float16_t vcvth_f16_u32
float16_t vcvth_f16_u64
int32_t vcvth_s32_f16
int64_t vcvth_s64_f16
uint32_t vcvth_u32_f16
uint64_t vcvth_u64_f16
float16_t vcvth_n_f16_s32
float16_t vcvth_n_f16_s64
float16_t vcvth_n_f16_u32
float16_t vcvth_n_f16_u64
int32_t vcvth_n_s32_f16
int64_t vcvth_n_s64_f16
uint32_t vcvth_n_u32_f16
uint64_t vcvth_n_u64_f16

Testing with two mainstream compilers (GCC and Clang/LLVM) shows that these intrinsics often already generate the proposed instructions, rather than the instructions listed in the ACLE. In particular:
GCC (tested with 9.2.0) generates the proposed instructions for all of the intrinsics.
Clang/LLVM (tested with 14.0.0) generates the proposed instructions for the integer converts, but generates the ACLE instructions for the fixed-point converts.


vhscampos · 2023-02-20T14:25:09Z

Hi, thanks for your issue report and apologies for the delay.

If possible, we encourage you to contribute with a Pull Request that addresses this issue. We will be happy to review it.

rmcclure-nv added the bug label Jul 26, 2022

rmcclure-nv assigned fpetrogalli Jul 26, 2022

fpetrogalli removed their assignment Sep 5, 2022
