python / cpython

The Python programming language


Correctly fold unknown-8bit originating from encoded words. (#142517)

The unknown-8bit trick was designed to deal with unknown bytes in an
ASCII message, and it works fine for that.  However, I also tried to
extend it to handle bytes that can't be decoded using the charset
specified in an encoded word, and there it fails because there can be
other non-ASCII characters that were *successfully* decoded.  The fix is
simple: do the unknown-8bit encoding using the utf-8 codec.  This is
especially appropriate since anyone trying to do recovery on an unknown
byte string will probably attempt utf-8 first.
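
The failure mode described above can be reproduced with plain codec calls.
The following is a minimal sketch, not the code from this commit: it assumes
the undecodable bytes are carried as surrogateescape code points, and the
helper name reencode_unknown_8bit is hypothetical.

```python
# Illustration only: plain codec calls, not the email package internals.
# Assumption: undecodable bytes are carried through as surrogateescape
# code points (U+DC80..U+DCFF) after a failed charset decode.

def reencode_unknown_8bit(text, codec):
    """Hypothetical helper: turn text back into bytes for an
    unknown-8bit encoded word, restoring any escaped raw bytes."""
    return text.encode(codec, "surrogateescape")

# An encoded-word payload that claims utf-8 but ends in an invalid byte.
raw = "é".encode("utf-8") + b"\xff"

# The valid part decodes successfully to a non-ASCII character; the bad
# byte survives as a lone surrogate.
decoded = raw.decode("utf-8", "surrogateescape")

# ASCII-based re-encoding fails: the surrogate is fine, but the
# successfully decoded non-ASCII character is not ASCII.
try:
    reencode_unknown_8bit(decoded, "ascii")
    ascii_failed = False
except UnicodeEncodeError:
    ascii_failed = True

# utf-8-based re-encoding handles both the decoded character and the
# escaped byte.
recovered = reencode_unknown_8bit(decoded, "utf-8")
```

With utf-8 as the codec, the surrogate is turned back into the original
byte while the successfully decoded character is encoded normally, so the
original payload bytes are recovered in this example.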
R. David Murray committed
1e17ccd030a2285ad53db5952360fffa33a8a877
Parent: d4dc3dd
Committed by GitHub <noreply@github.com> on 12/24/2025, 2:14:39 PM