Looking for a table of valid characters for SVG identifiers

The SVG specification refers the reader to the base XML specification for the characters that may appear in the value of the id attribute.

However, the XML specification does not list these characters explicitly, AFAICT. Instead, it makes its recommendations in terms of the Unicode properties ID_Start and ID_Continue.

I am looking for a table (or tables) that explicitly lists the characters that have the ID_Start and/or ID_Continue property.

(If different XML-based applications or standards can define their own character sets in terms of ID_Start and/or ID_Continue, the case I'm interested in is SVG embedded in HTML5.)

+4
2 answers

I found a repo on GitHub that uses Python scripts to generate many different tables from the Unicode standard. For example, here are the tables for ID_START, ID_CONTINUE, XID_START, XID_CONTINUE, etc.: https://github.com/sourtin/libucd/blob/master/src/tables/bool.rs

Edit: I think the tables are parsed from the XML version of the Unicode Character Database provided at: http://www.unicode.org/Public/5.2.0/ucdxml/
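
As a rough sketch of how such a table could be built by hand, the Python below parses lines in the format of the Unicode Character Database's plain-text DerivedCoreProperties.txt file, which lists the ID_Start and ID_Continue ranges. Using that file rather than the XML files, the short embedded SAMPLE, and the helper names parse_ranges and has_property are all assumptions made for illustration; this is not what the linked repo does.

```python
import re

# A few lines in the format used by DerivedCoreProperties.txt.
# This is only a sample; the real file is far longer.
SAMPLE = """\
0041..005A    ; ID_Start # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
00AA          ; ID_Start # Lo       FEMININE ORDINAL INDICATOR
0030..0039    ; ID_Continue # Nd  [10] DIGIT ZERO..DIGIT NINE
0041..005A    ; ID_Continue # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
"""

# "XXXX" or "XXXX..YYYY", then ";", then the property name.
LINE_RE = re.compile(
    r"^([0-9A-Fa-f]{4,6})(?:\.\.([0-9A-Fa-f]{4,6}))?\s*;\s*(\w+)"
)


def parse_ranges(text, prop):
    """Return a list of (first, last) code point ranges that have `prop`."""
    ranges = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line:
            continue
        match = LINE_RE.match(line)
        if match is None or match.group(3) != prop:
            continue
        first = int(match.group(1), 16)
        last = int(match.group(2), 16) if match.group(2) else first
        ranges.append((first, last))
    return ranges


def has_property(ch, ranges):
    """True if the single character `ch` falls inside any of the ranges."""
    cp = ord(ch)
    return any(first <= cp <= last for first, last in ranges)


id_start = parse_ranges(SAMPLE, "ID_Start")
id_continue = parse_ranges(SAMPLE, "ID_Continue")
```

To get the full table, the same function could be fed the contents of the real file in place of SAMPLE.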

+1

The valid range of characters seems to be defined by the following productions in the XML specification:

Attribute Value:

 AttValue ::= '"' ([^<&"] | Reference)* '"' | "'" ([^<&'] | Reference)* "'" 

http://www.w3.org/TR/2008/REC-xml-20081126/#NT-AttValue

Reference:

 Reference ::= EntityRef | CharRef 

http://www.w3.org/TR/2008/REC-xml-20081126/#NT-Reference

CharRef leads us to Char here:

 Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF] /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */ 

http://www.w3.org/TR/2008/REC-xml-20081126/#NT-Char
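
The Char production above can be turned into a small check. The Python below is a minimal sketch: the names is_xml_char and is_literal_attvalue_text are made up for illustration, and the second function is a simplification that rejects every "&" instead of parsing EntityRef and CharRef references. It covers only the productions quoted here, not any further constraint that may apply to id values.

```python
# Code point ranges copied from the Char production quoted above.
CHAR_RANGES = [
    (0x9, 0x9),
    (0xA, 0xA),
    (0xD, 0xD),
    (0x20, 0xD7FF),
    (0xE000, 0xFFFD),
    (0x10000, 0x10FFFF),
]


def is_xml_char(ch):
    """True if the single character `ch` matches the Char production."""
    cp = ord(ch)
    return any(first <= cp <= last for first, last in CHAR_RANGES)


def is_literal_attvalue_text(value, quote='"'):
    """True if `value` could appear literally between `quote` delimiters.

    Simplification: any "&" is rejected, so references such as "&amp;"
    are not recognised here.
    """
    forbidden = "<&" + quote
    return all(is_xml_char(ch) and ch not in forbidden for ch in value)
```

For example, is_literal_attvalue_text("my-id_1") is true, while a value containing "<" or the enclosing quote character is rejected.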

0