urlstd.parse.utf8_encode#

urlstd.parse.utf8_encode(s: str) bytes#

Encodes a string with utf-8 and returns its byte sequence.

Invalid surrogates will be replaced with U+FFFD.

Parameters:

s – A string to encode with utf-8.

Returns:

A utf-8 encoded byte sequence.

Examples

>>> utf8_encode('\ud83c\udf08').decode()  # surrogate pair
'🌈'
>>> utf8_encode('\udf08\ud83c').decode()  # invalid surrogates
'��'  # '\ufffd\ufffd'