
Lin Files - French comments - UTF8 re-encoded



I noticed a problem with French comments in the *.lin files.

French special characters (é è à ô ç etc.) are UTF-8 encoded twice instead of once.

 

The cause: the comment is encoded in UTF-8 once. Good.

Then, further down the process, the UTF-8 bytes are taken as single-byte text (one character per byte, Latin-1 style rather than true 7-bit ASCII) and encoded in UTF-8 again!

 

This double-encoded UTF8 is, of course, unreadable.

 

A sample:

é in UTF-8 is $C3A9

Instead, the .lin file has: $C383C2A9

which is exactly the UTF-8 encoding of $C3 followed by the UTF-8 encoding of $A9.

(same problem with ü, ö etc... in other languages).
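To make the byte arithmetic easy to check, here is a minimal Python sketch of the suspected double encoding. The function name `double_encode` is made up for illustration, and using Latin-1 for the "misread" step is an assumption about what the faulty code does; it is simply the mapping that reproduces the bytes quoted in this post.

```python
def double_encode(text: str) -> bytes:
    """Encode to UTF-8, misread the bytes as single-byte text, encode again."""
    once = text.encode("utf-8")
    # Assumption: the faulty step treats every byte as one character.
    # Latin-1 maps byte 0xNN to code point U+00NN, which matches the sample.
    misread = once.decode("latin-1")
    return misread.encode("utf-8")


if __name__ == "__main__":
    for ch in ("é", "’"):
        correct = ch.encode("utf-8").hex().upper()
        doubled = double_encode(ch).hex().upper()
        print(f"{ch}: correct ${correct}, double-encoded ${doubled}")
```

For é this yields $C3A9 and $C383C2A9, the same values as in the sample above.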

 

Another one:

’ = UTF-8 $E28099

The .lin file has: $C3A2C280C299 !!!

Look in BB-QF1-FRANCE-CHINA 2017 at:

choisissent l’équipe qu’ils

which is stored double-encoded in the file (each ’ as $C3A2C280C299, each é as $C383C2A9) instead of as plain UTF-8.

 

And a wink to Walddk2: in BB-QF1-SWEDEN-USA2.lin,

his reference to his Häxen has been converted into Häxan...


Thanks, we'll have our techs check.

 

Fine, diana_eva.

 

In the meantime I wrote a program to correct all the 2017 WBTC files that have the problem. Just for fun, and for the pleasure of reading French comments cleared of things like

Peut-être en raison dâun mauvais souvenir en 2016 à Wroclaw a-t-on préféré éviter la Nouvelle Zélande.|pg||

Le commentaire de Bas, notre efficace opérateurnsuggère que Thomas voulait une revenche|pg||

 

:-)
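For anyone who wants to try the same kind of cleanup, here is a minimal Python sketch. It is not the program mentioned above, only an illustration: the function names `repair_double_utf8` and `repair_lines` are hypothetical, and it assumes the extra encoding step behaved like a Latin-1 misread. It works line by line because a file may mix correct and double-encoded text.

```python
def repair_double_utf8(data: bytes) -> bytes:
    """Undo one extra round of UTF-8 encoding if the bytes look double-encoded."""
    try:
        text = data.decode("utf-8")
        # Every character must fit in one byte to be a misread byte.
        once = text.encode("latin-1")
        # The recovered bytes must themselves be valid UTF-8.
        once.decode("utf-8")
    except (UnicodeDecodeError, UnicodeEncodeError):
        return data  # not cleanly double-encoded: leave untouched
    return once


def repair_lines(data: bytes) -> bytes:
    """Apply the repair to each line separately (assumes '\n' line endings)."""
    return b"\n".join(repair_double_utf8(line) for line in data.split(b"\n"))


if __name__ == "__main__":
    broken = bytes.fromhex("C383C2A9")    # double-encoded é from the first post
    print(repair_double_utf8(broken).hex().upper())
```

A line that is already correct UTF-8 with accented characters fails the second check and is returned unchanged, so running the repair twice should be harmless. A line that mixes correct and double-encoded characters is also left alone, so it would need a finer-grained approach.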
