diff options
Diffstat (limited to 'python/python3-webencodings/README')
-rw-r--r-- | python/python3-webencodings/README | 13 |
1 files changed, 13 insertions, 0 deletions
diff --git a/python/python3-webencodings/README b/python/python3-webencodings/README new file mode 100644 index 0000000000000..736e9a287048c --- /dev/null +++ b/python/python3-webencodings/README @@ -0,0 +1,13 @@ +webencodings is a Python implementation of the WHATWG Encoding standard. + +In order to be compatible with legacy web content when interpreting +something like Content-Type: text/html; charset=latin1, tools need to +use a particular set of aliases for encoding labels as well as some +overriding rules. For example, US-ASCII and iso-8859-1 on the web are +actually aliases for windows-1252, and an UTF-8 or UTF-16 BOM takes +precedence over any other encoding declaration. The Encoding standard +defines all such details so that implementations do not have to +reverse-engineer each other. + +This module has encoding labels and BOM detection, but the actual +implementation for encoders and decoders is Python's. |