[Overview][Constants][Types][Procedures and functions][Variables][Index] Reference for unit 'LConvEncoding' (#lazutils)

CP932ToUTF8

Converts a string value using Code Page 932 to UTF-8.

Declaration

Source position: lconvencoding.pas line 210

function CP932ToUTF8(

  const s: string

):string;

Arguments

s

  

String value using code page 932.

Function result

String value after conversion to UTF-8 encoding.

Description

CP932ToUTF8 is defined for platforms where Asian code pages are enabled. It uses the byte maps defined for the Japanese Shift-JIS Double Byte Character Set as defined for the Microsoft Windows Code Page 932.

The internal implementation ensures that single-byte half-width Katakana characters are handled properly when converted to UTF-8. Specifically, the characters with an initial byte in the ranges $A0..$DF and $FD..$FF. $A0, $FD, $FE, $FF are not allowed as the initial byte in a Shift-JIS character. Values in the range $A1..$DF are treated as single-byte half-width codepoints.

Version info

Modified in LazUtils 4.0 to handle single-byte half-width Katakana character ranges.


Version 4.0 Generated 2025-05-03 Home