Implement `Debug` for `EncodeWide` #140153

thaliaarchi · 2025-04-22T08:29:11Z

Since std::os::windows::ffi::EncodeWide is reexported from std::sys_common::wtf8::EncodeWide, which has #![allow(missing_debug_implementations)] in the parent module, it does not implement Debug.

Implement it like core::str::Chars. Format the WTF-16 code units as char literals and escape them when they're surrogate halves. Although this is not valid Rust syntax, this is fine for Debug.

This becomes insta-stable.

r? libs-api

bjorn3 · 2025-04-23T09:00:26Z

We can't format it like char, because \u escape sequences for surrogate halves are invalid syntax in Rust.

Even if rust doesn't accept \u for surrogate halves, we can still format them as \u. Debug isn't guaranteed to be valid rust source code anyway.

thaliaarchi · 2025-04-23T11:33:32Z

Here are the options, as I see them. I think formatting them as chars when not surrogates is the most readable for most texts. Rust style seems to lean towards uppercase hex.

EncodeWide([97, 233, 32, 55357, 55357, 56489]): decimal
EncodeWide([0x61, 0xE9, 0x20, 0xD83D, 0xD83D, 0xDCA9]): hex, zero-pad min 2, upper
EncodeWide([0x61, 0xe9, 0x20, 0xd83d, 0xd83d, 0xdca9]): hex, zero-pad min 2, lower
EncodeWide([0x0061, 0x00E9, 0x0020, 0xD83D, 0xD83D, 0xDCA9]): hex, zero-pad 4, upper
EncodeWide([0x0061, 0x00e9, 0x0020, 0xd83d, 0xd83d, 0xdca9]): hex, zero-pad 4, lower
EncodeWide(['a', 'é', ' ', '\u{D83D}', '\u{D83D}', '\u{DCA9}']): pseudo-char, escaped surrogates, upper
EncodeWide(['a', 'é', ' ', '\u{d83d}', '\u{d83d}', '\u{dca9}']): pseudo-char, escaped surrogates, lower

thaliaarchi · 2025-06-26T23:39:26Z

I've updated formatting to be 4-wide zero-padded hex (style 4)

For another opinion, @ChrisDenton wrote:

I think EncodeWide, semantically, produces a string like a str but with a different encoding. That said, EncodeWide itself is an iterator more akin to a Bytes iterator of &str (except u16 instead of bytes). But this is all made more complicated by the fact that std doesn't have a WideStr type so people typically collect into a Vec<u16>, which doesn't know it's a wide string. Whereas the iterator does know. So I am sympathetic to the char like representation but for debug I do think we really have to go with a hex repr. And my gut preference is for the 4 zero pad, which is more suggestive of unicode without introducing illegal rust syntax. But I reserve the right to change my mind on that 😆

I think the char-like repr is a non-starter in any case because it looks like rust code but may be illegal (i.e. for surrogates).

(in favor of style 4)

Since `std::os::windows::ffi::EncodeWide` is reexported from `std::sys_common::wtf8::EncodeWide`, which has `#![allow(missing_debug_implementations)]` in the parent module, it does not implement `Debug`.

thaliaarchi · 2025-06-27T00:01:07Z

What's needed next to move this along?

ChrisDenton · 2025-06-27T00:09:25Z

I've nominated for libs-api. I don't know if it needs a full libs API discussion but I believe at least someone from the team will need to sign off on it.

rustbot assigned joshtriplett Apr 22, 2025

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Apr 22, 2025

thaliaarchi force-pushed the encode-wide-debug branch from d211853 to 827bb27 Compare June 26, 2025 23:36

Implement Debug for EncodeWide

a339db4

Since `std::os::windows::ffi::EncodeWide` is reexported from `std::sys_common::wtf8::EncodeWide`, which has `#![allow(missing_debug_implementations)]` in the parent module, it does not implement `Debug`.

thaliaarchi force-pushed the encode-wide-debug branch from 827bb27 to a339db4 Compare June 26, 2025 23:58

ChrisDenton added the I-libs-api-nominated Nominated for discussion during a libs-api team meeting. label Jun 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement `Debug` for `EncodeWide` #140153

Implement `Debug` for `EncodeWide` #140153

thaliaarchi commented Apr 22, 2025 •

edited

Loading

Uh oh!

bjorn3 commented Apr 23, 2025

Uh oh!

thaliaarchi commented Apr 23, 2025 •

edited

Loading

Uh oh!

thaliaarchi commented Jun 26, 2025 •

edited

Loading

Uh oh!

thaliaarchi commented Jun 27, 2025

Uh oh!

ChrisDenton commented Jun 27, 2025

Uh oh!

Uh oh!

Implement Debug for EncodeWide #140153

Are you sure you want to change the base?

Implement Debug for EncodeWide #140153

Conversation

thaliaarchi commented Apr 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bjorn3 commented Apr 23, 2025

Uh oh!

thaliaarchi commented Apr 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thaliaarchi commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thaliaarchi commented Jun 27, 2025

Uh oh!

ChrisDenton commented Jun 27, 2025

Uh oh!

Uh oh!

Implement `Debug` for `EncodeWide` #140153

Implement `Debug` for `EncodeWide` #140153

thaliaarchi commented Apr 22, 2025 •

edited

Loading

thaliaarchi commented Apr 23, 2025 •

edited

Loading

thaliaarchi commented Jun 26, 2025 •

edited

Loading