Unicode index missing for Big5 #61

Open
rarealphacat opened this issue Jun 30, 2021 · 8 comments

Comments

@rarealphacat

<118>utf8_to_big5: did not find unicode index '0xfffd'
Can you add the related definition?

@scandum
Owner

scandum commented Jun 30, 2021

What would be the big5 equivalent?

@rarealphacat
Author

These should be Unicode "symbols" that cannot be mapped through BIG5TOUTF, so there is no Big5 equivalent.

@scandum
Owner

scandum commented Aug 31, 2021

I can't really come up with a good way to fix this, assuming it needs fixing. Suggestions are welcome.

@rarealphacat
Author

Can it fall back to printing the Unicode character, instead of the Big5 equivalent, when the index is missing?

@scandum
Owner

scandum commented Dec 10, 2021

Not sure if I understand what you mean. When you get

<118>utf8_to_big5: did not find unicode index '0xfffd'

You can use: #showme \ufffd to print the unicode character.

@rarealphacat
Author

I mean, can I make tintin++ print the Unicode character automatically when the UTF-8 to Big5 index is missing, either along with the error message or with the message suppressed? It seems that not all Unicode characters have a corresponding Big5 equivalent. Right now it only shows the "did not find unicode index xxxx" message.

@scandum
Owner

scandum commented Dec 12, 2021

You should be able to substitute the message:

#substitute {^utf8_to_big5: did not find unicode index '0x%1'} {<118>utf8_to_big5: did not find unicode character '\u%1' with index '0x%1'}

And you can remove it with #gag.
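
For readers who want to see the requested fallback in isolation, here is a rough sketch in Python rather than TinTin++ script. It is not how tintin++ works internally: it assumes Python's built-in `big5` codec as a stand-in for the client's conversion table (the two tables may differ), and `to_big5_with_fallback` is a hypothetical helper name made up for this illustration.

```python
def to_big5_with_fallback(text: str) -> bytes:
    """Encode text as Big5, writing a \\uXXXX escape for unmappable characters.

    Hypothetical helper: uses Python's built-in "big5" codec, which is only
    an assumption standing in for TinTin++'s own conversion table.
    """
    out = bytearray()
    for ch in text:
        try:
            out += ch.encode("big5")
        except UnicodeEncodeError:
            # No Big5 mapping for this character: fall back to an ASCII
            # escape of its code point instead of reporting an error.
            out += "\\u{:04x}".format(ord(ch)).encode("ascii")
    return bytes(out)


# Example call mixing a mappable character with the code point from the error.
print(to_big5_with_fallback("\u4e2d\ufffd"))
```

The design choice here is to keep converting and mark the unmappable character inline, which is roughly what the #substitute above achieves at the message level.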

@SlySven

SlySven commented Nov 10, 2022

Actually that Unicode codepoint is a special one: - that is used to show a character that cannot be shown (e.g. it doesn't exist in Unicode or possibly just not in the current font) if your OS / Browser can show:
image
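
As a small illustration of where 0xfffd typically comes from, the Python sketch below looks up the code point's official name and shows a lenient UTF-8 decoder substituting it for an invalid byte. Whether this is what happened in the original report is an assumption; the thread does not say how the character reached tintin++.

```python
import unicodedata

REPLACEMENT = "\ufffd"

# Official Unicode name of the code point from the error message.
name = unicodedata.name(REPLACEMENT)

# The byte 0xff can never appear in valid UTF-8, so a decoder running with
# errors="replace" substitutes U+FFFD for it.
decoded = b"abc\xff".decode("utf-8", errors="replace")

print(name)
print(decoded == "abc" + REPLACEMENT)
```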
