Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Character references: 8 hexadecimal digits are too much. #487

Closed
shyouhei opened this issue Aug 7, 2017 · 0 comments
Closed

Character references: 8 hexadecimal digits are too much. #487

shyouhei opened this issue Aug 7, 2017 · 0 comments

Comments

@shyouhei
Copy link
Contributor

shyouhei commented Aug 7, 2017

We allow hexadecimal numeric character references to be at most 8 digits length. However, the current Unicode specification limits its allocation range up to U+10FFFF inclusive. This restriction is due to UTF-16 encoding's structure and will not be relaxed. A collateral evidence is that RFC3629 is defined as such.

So 8 is too much; 6 is enough. Also, 0x10FFFF== 1,114,112 so 8 digits in decimal character references are too much too; 7 is enough.

@jgm jgm closed this as completed in 2104a5d Mar 25, 2018
jgm added a commit to commonmark/commonmark.js that referenced this issue Mar 25, 2018
jgm added a commit to commonmark/cmark that referenced this issue Mar 25, 2018
robinst added a commit to commonmark/commonmark-java that referenced this issue Jul 12, 2019
talum pushed a commit to github/cmark-gfm that referenced this issue Sep 14, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant