Hello,
On Fri, Sep 27, 2024 at 07:55:58 CEST, Johannes Schlatow wrote:
I noticed that your .txt file is UTF-8 encoded, which apparently breaks gosh's title and summary detection. Converting the file to ASCII solves the issue.
From my experience, the UTF-8 support in gosh is quite good. The issue with Pirmins text file stems from the use of a BOM [1] in combination with UTF-8, which is not required as UTF-8 is not affected by byte order and also not recommended by the Unicode standard.
Nevertheless, we may improve gosh to ignore ef bb bf at the beginning of a text file. AFAIK the byte sequence produces garbage in Latin1 and Windows-1252.
[1] https://en.wikipedia.org/wiki/Byte_order_mark
Regards