detect ucs2: U+000a = linefeed U+0a00 = not a character hiragana range: Unicode: U+3041 .. U+3093 UTF-8: e38181 .. e381bf, e38280 .. e38293 EUC-JP: 0xa4a1 .. 0xa4f3 Shift_JIS: 0x829f .. 0x82f1 punctuation: half , 2c . 2e ! 21 ? 3f full sjis eucjp utf8 ucs2le ucs2be , 8141 a1a2 e38081 0130 3001 . 8142 a1a3 e38082 0230 3002 ! 8149 a1aa efbc81 01ff ff01 ? 8148 a1a9 efbc9f 1fff ff1f