??? File corrupted for "Extended (Unicode) Characters"

Report issues, odd behaviors or submit a detailed bug report.
Post Reply
User avatar
rjbill
Posts: 832
Joined: 13 Jun 2011 06:36

??? File corrupted for "Extended (Unicode) Characters"

Post by rjbill »

I have a file that is UTF-8 w/o BOM that has some "extended" (Unicode?) characters,
like fancy quotes and a middot, and they have all been corrupted and
changed to � ($FFFD-65533).

In Options:
The default new and open file encoding is Unicode (UTF-8).
Detect Unicode without a signature (BOM) is checked.
Detect all (encoding and code page) is NOT checked.

It happened sometime in the last few (?) version updates.

I found an older version of the file in my backup edit files from 1/10/2020 and it is correct.

For example, this is a list of Character Entities I use for reference when I need a code.

Code: Select all

HTML/XHTML Character Entities

Non-printing:              ‌   ‍   ‎   ‏

�    
�   €
�   ¢
�   £
�   ¥
�   ¤
�   ©
�   ®
�   ™
�   •
�   ⋅
'   ′
?   ″
?   ⟨
?   ⟩
�   ‹
�   ›
�   «
�   »
"   "
&   &
�   µ
?   ∇
?   ∫
?   ∑
?   ∏

�   –
�   —
�   …
�   §
�   ¶
�   †
�   ‡
�   ¡
�   ¿
�   ‰
?   ◊
�   ·
�   ‘
�   ’
�   ‚
�   “
�   ”
�   „
�   ƒ
�   º
�   ª
?   ∴
*   ∗
�   ¦
-   ­
�   ¯
This is what it is supposed to look like from my backup file:

Code: Select all

HTML/XHTML Character Entities

Non-printing:              ‌   ‍   ‎   ‏

     
€   €
¢   ¢
£   £
¥   ¥
¤   ¤
©   ©
®   ®
™   ™
•   •
⋅   ⋅
′   ′
″   ″
⟨   ⟨
⟩   ⟩
‹   ‹
›   ›
«   «
»   »
"   "
&   &
µ   µ
∇   ∇
∫   ∫
∑   ∑
∏   ∏

–   –
—   —
…   …
§   §
¶   ¶
†   †
‡   ‡
¡   ¡
¿   ¿
‰   ‰
◊   ◊
·   ·
‘   ‘
’   ’
‚   ‚
“   “
”   ”
„   „
ƒ   ƒ
º   º
ª   ª
∴   ∴
∗   ∗
¦   ¦
-   ­
¯   ¯
RJTE version 14.64 (actual) - 64-bit
Win 10 Pro 64-bit 8 GB RAM Intel Core i7-6700 3.40 GHz SCSI Hard Drive 1 TB

Note: The signature is dynamic, not static,
so it may not show the correct version above
that was in use at the time of the post.

User avatar
Rickard Johansson
Site Admin
Posts: 6001
Joined: 19 Jul 2006 14:29

Re: File corrupted for "Extended (Unicode) Characters"

Post by Rickard Johansson »

I've tried to replicate this but without luck. If you ever figure out exactly how to reproduce this - let me know.

Post Reply