Ambrose Li edited this page Aug 5, 2020 · 38 revisions

UTF-8–patched Threaded Read News (trn)

This is still a work in progress, but I’m already using it to read some mailing lists (UTF-8; some in HTML) and some fr.* groups (mostly UTF-8, some ISO-8859-15, a little Windows-1252 – very little if any ISO-8859-1 that I can see).

This version produces only UTF-8 output, but it can read posts in UTF-8, ISO-8859-1, ISO-8859-15 and Windows-1252. The original “character set” conversions (TeX, ISO-8859-1-to-ASCII) assume that most displays are ASCII (or, for example, DBCS ¹ or some other character set that isn’t Latin-1); they were disabled in the process of retrofitting trn with UTF-8 support.
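To make the “several input charsets, one output charset” idea concrete, here is a minimal sketch in Python. It is not trn’s code (trn is written in C and does its own conversion); the function name `article_to_utf8` and the `SUPPORTED` table are made up for illustration, and the sketch assumes the article’s declared charset is already known, e.g. from its headers.

```python
# Illustrative sketch only; not how trn implements it.
# Charsets this page lists as accepted on input, mapped to Python codec names.
SUPPORTED = {
    "utf-8": "utf-8",
    "iso-8859-1": "iso-8859-1",
    "iso-8859-15": "iso-8859-15",
    "windows-1252": "cp1252",
}


def article_to_utf8(raw: bytes, declared_charset: str) -> bytes:
    """Re-encode an article body from its declared charset to UTF-8.

    `article_to_utf8` is a hypothetical helper, not a trn function.
    """
    codec = SUPPORTED.get(declared_charset.strip().lower())
    if codec is None:
        raise ValueError(f"unsupported charset: {declared_charset!r}")
    # errors="replace" turns undecodable bytes into U+FFFD instead of failing.
    return raw.decode(codec, errors="replace").encode("utf-8")


if __name__ == "__main__":
    # The same byte 0xA4 is a different character in Latin-1 and Latin-9.
    print(article_to_utf8(b"\xa4", "ISO-8859-1"))    # currency sign, U+00A4
    print(article_to_utf8(b"\xa4", "ISO-8859-15"))   # euro sign, U+20AC
    print(article_to_utf8(b"\x80", "Windows-1252"))  # euro sign, U+20AC
```

The byte 0xA4 shows why the declared charset matters: it is “¤” in ISO-8859-1 but “€” in ISO-8859-15, while Windows-1252 puts “€” at 0x80.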

In theory, on a modern system such as Linux it would not be difficult to add support for other character sets once UTF-8 works; unfortunately, trn tries hard to run on ancient systems. It doesn’t even use curses, just bare termcap/terminfo ²; it refers to termcap and terminfo by a name I’ve never heard of; and the code is still written in K&R C, with no enums.

Article selector from a local newsgroup gated from a number of mailing lists

Plain-text UTF-8 article from W3C’s public-html-ig-zh mailing list

Plain-text ISO-8859-15 article from fr.lettres.langue.francaise


  1. The likes of Big5, EUC-JP, Shift_JIS, etc.
  2. We could argue that bare termcap/terminfo is more accessible (in the sense that it plays nicer with accessibility software), though.