RtlUTF8StringToUnicodeString - NtDoc

Native API online documentation, based on the System Informer (formerly Process Hacker) phnt headers
#ifndef _NTRTL_H
#if (PHNT_VERSION >= PHNT_WINDOWS_10_20H1)

NTSYSAPI
NTSTATUS
NTAPI
RtlUTF8StringToUnicodeString(
    _Inout_ PUNICODE_STRING DestinationString,
    _In_ PCUTF8_STRING SourceString,
    _In_ BOOLEAN AllocateDestinationString
    );

#endif
#endif

View code on GitHub
// ntifs.h

NTSYSAPI NTSTATUS RtlUTF8StringToUnicodeString(
  PUNICODE_STRING DestinationString,
  PUTF8_STRING    SourceString,
  BOOLEAN         AllocateDestinationString
);
View the official Windows Driver Kit DDI reference
// wdm.h

NTSYSAPI NTSTATUS RtlUTF8StringToUnicodeString(
  PUNICODE_STRING DestinationString,
  PUTF8_STRING    SourceString,
  BOOLEAN         AllocateDestinationString
);
View the official Windows Driver Kit DDI reference

NtDoc

This function is documented in Windows Driver Kit here and here.

Windows Driver Kit DDI reference (nf-ntifs-rtlutf8stringtounicodestring)

RtlUTF8StringToUnicodeString function (ntifs.h)

Description

The RtlUTF8StringToUnicodeString routine converts the specified UTF-8 string to a Unicode string.

Parameters

DestinationString

Pointer to the buffer in which the converted output Unicode string is stored. The DestinationString->MaximumLength field is set only if AllocateDestinationString is TRUE.

SourceString

Pointer to the UTF-8 source string to be converted to Unicode.

AllocateDestinationString

Boolean value. When set TRUE, RtlUTF8StringToUnicodeString allocates the buffer space for the destination string. Only storage for DestinationString->Buffer is allocated by this API. If RtlUTF8StringToUnicodeString does the buffer allocation, then the caller must deallocate the buffer using RtlFreeUnicodeString.

Return value

This function returns STATUS_SUCCESS when the conversion is successful. Possible error or warning codes include:

Code Description
STATUS_INVALID_PARAMETERX Error: One of the parameter values is invalid.
STATUS_NO_MEMORY Error: RtlUTF8StringToUnicodeString was unable to allocate buffer space.
STATUS_BUFFER_OVERFLOW Warning: The converted string in DestinationString->Buffer is truncated due to insufficient space in the destination buffer.
STATUS_SOME_NOT_MAPPED Warning: The call was successful, but one or more of the input characters were invalid and were converted by the Unicode replacement character, U+FFFD, before being converted to UTF-8.

Remarks

The Unicode output string is null-terminated only if the UTF-8 input string is null-terminated.

RtlUTF8StringToUnicodeString supports Unicode surrogate pairs. However, a surrogate leading word value that is not followed by a trailing word value, or a trailing word value that is not preceded by a leading word value, is not recognized as a valid character and is replaced by the Unicode replacement character, U+FFFD.

RtlUTF8StringToUnicodeString continues to convert the input string to an output string until it reaches the end of the source buffer or the end of the destination buffer, whichever occurs first. The routine converts any null characters in the input string to null characters in the output string. If the input string contains a terminating null character, but the null character is not located at the end of the source buffer, the routine continues past the terminating null character until it reaches the end of the available buffer space.

The RtlUnicodeStringToUTF8String routine converts a Unicode string to a UTF-8 string.

You can use the RtlUTF8StringToUnicodeString and RtlUnicodeStringToUTF8String routines to perform a lossless conversion of valid text strings between the UTF-8 and Unicode formats. However, strings that have arbitrary data values are likely to violate the Unicode rules for encoding surrogate pairs, and any information that is contained in the invalid values in an input string is lost and cannot be recovered from the resulting output string.

See also

RtlFreeUnicodeString

RtlUnicodeStringToUTF8String


Windows Driver Kit DDI reference (nf-wdm-rtlutf8stringtounicodestring)

RtlUTF8StringToUnicodeString function (wdm.h)

Description

The RtlUTF8StringToUnicodeString function converts the specified UTF8 source string into a Unicode string in accordance with the current system locale information.

Parameters

DestinationString

Pointer to a UNICODE_STRING structure to hold the converted Unicode string.

If AllocateDestinationString is TRUE, the routine allocates a new buffer to hold the string data, updates the Buffer member of DestinationString to point to the new buffer, and set the maximum length field. Otherwise, the routine uses the currently-specified buffer to hold the string.

SourceString

Pointer to the UTF8 string to be converted to Unicode.

AllocateDestinationString

Specifies if this routine should allocate the buffer space for the destination string. If it does, the caller must deallocate the buffer by calling RtlFreeUnicodeString.

Return value

If the conversion succeeds, RtlUTF8StringToUnicodeString returns STATUS_SUCCESS. On failure, the routine does not allocate memory or perform a conversion.

See also

RtlFreeUnicodeString