Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Special pages
Niidae Wiki
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
Universal Disk Format
(section)
Page
Discussion
English
Read
Edit
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
View history
General
What links here
Related changes
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
== Character set == The UDF specifications<ref name="OSTA - UDF Specifications"/> allow only one Character Set ''OSTA CS0'', which can store any [[Unicode]] [[Code point]] excluding U+FEFF and U+FFFE. Additional character sets defined in ECMA-167 are not used.<ref name=ecma167/>{{rp|at=7.2}} Since Errata DCN-5157, the range of code points was expanded to all code points from Unicode 4.0 (or any newer or older version), which includes [[Plane (Unicode)#Supplementary Multilingual Plane|Plane]] 1β16 characters such as [[Emoji]]. DCN-5157 also recommends [[Unicode equivalence#Normalization|normalizing]] the strings to Normalization Form C.<ref name=dcn-5157>{{cite web|title=UDF 2.60 approved errata|url=http://www.osta.org/specs/pdf/udf260_errata.pdf|access-date=22 April 2018}}</ref> The OSTA CS0 character set stores a 16-bit Unicode string "compressed" into 8-bit or 16-bit units, preceded by a single-byte "compID" tag to indicate the compression type. The 8-bit storage is functionally equivalent to [[ISO-8859-1]], and the 16-bit storage is [[UTF-16]] in big endian. 8-bit-per-character file names save space because they only require half the space per character, so they should be used if the file name contains no special characters that can not be represented with 8 bits only.<ref>[http://www.osta.org/specs/pdf/udf102.pdf UDF 1.02 specification]: 2.1.1 Character Sets (also present in later versions)</ref> The reference algorithm neither checks for forbidden code points nor interprets [[Universal Character Set characters#Surrogates|surrogate pairs]], so like [[NTFS]] the string may be malformed.<ref name="OSTA - UDF Specifications" />{{rp|at=2.1.2, 6.4}} (No specific form of storage is specified by DCN-5157, but UTF-16BE is the only well-known method for storing all of Unicode while being mostly backward compatible with [[UCS-2]].)<ref name="dcn-5157" />
Summary:
Please note that all contributions to Niidae Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Encyclopedia:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Search
Search
Editing
Universal Disk Format
(section)
Add topic