OpenSolaris

Printable Version Enter a New Search
Bug ID 4515010
Synopsis ldterm: GB18030 two byte char and four byte char shares the same leading byte
State 6-Fix Understood (Fix is known)
Category:Subcategory library:l10n-common
Keywords ldterm
Responsible Engineer Ienup Sung
Reported Against s81_47
Duplicate Of
Introduced In
Commit to Fix
Fixed In
Release Fixed
Related Bugs 4512793
Submit Date 16-October-2001
Last Update Date 4-September-2008
Description
please reassign to correct cat/subcat if library/l10n-common isn't, thanks.

GB 18030-2000 consists of two byte characters and four byte characters.

Its two byte characters' byte range is:
	1th byte:   0x81-0xFE
	2th byte:   0x40-0xFE

and its four byte characters' byte range is:
	1th byte: 0x81-0xFE
	2th byte: 0x30-0x39
	3th byte: 0x81-0xFE
	4th byte: 0x30-0x39

Since two byte characters and four byte characters share the same leading byte, ldterm module can't decide only the first byte whether it is two byte character or four byte character, need to enhance the ldterm module to solve it.
Work Around
N/A
Comments
N/A