-
-
Notifications
You must be signed in to change notification settings - Fork 322
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Character references: 8 hexadecimal digits are too much. #487
Comments
jgm
added a commit
to commonmark/commonmark.js
that referenced
this issue
Mar 25, 2018
jgm
added a commit
to commonmark/cmark
that referenced
this issue
Mar 25, 2018
colinodell
added a commit
to thephpleague/commonmark
that referenced
this issue
Mar 26, 2018
colinodell
added a commit
to thephpleague/commonmark
that referenced
this issue
Mar 26, 2018
colinodell
added a commit
to thephpleague/commonmark
that referenced
this issue
Sep 18, 2018
colinodell
added a commit
to thephpleague/commonmark
that referenced
this issue
Mar 24, 2019
robinst
added a commit
to commonmark/commonmark-java
that referenced
this issue
Jul 12, 2019
talum
pushed a commit
to github/cmark-gfm
that referenced
this issue
Sep 14, 2021
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
We allow hexadecimal numeric character references to be at most 8 digits length. However, the current Unicode specification limits its allocation range up to U+10FFFF inclusive. This restriction is due to UTF-16 encoding's structure and will not be relaxed. A collateral evidence is that RFC3629 is defined as such.
So 8 is too much; 6 is enough. Also, 0x10FFFF== 1,114,112 so 8 digits in decimal character references are too much too; 7 is enough.
The text was updated successfully, but these errors were encountered: