1.0.4
- Add PPC8..10, SSE2, AVX3_ZEN4, NEON_WITHOUT_AES targets
- Add Expand, LoadExpand, integer AbsDiff, SumsOf8AbsDiff
- Improved Half/Twice support, codegen for Shift*Same
- Support Wasm in Godbolt
- Faster KV128 sorting
- Fix armv7 build config, CMake config mode
- Update RVV intrinsics for 1.0-draft