Diffstat (limited to 'python/w3lib/README')
-rw-r--r-- | python/w3lib/README | 10 |
1 files changed, 10 insertions, 0 deletions
diff --git a/python/w3lib/README b/python/w3lib/README
new file mode 100644
index 000000000000..b2e25c80c486
--- /dev/null
+++ b/python/w3lib/README
@@ -0,0 +1,10 @@
+This is a Python library of web-related functions, such as:
+
+remove comments, or tags from HTML snippets
+extract base url from HTML snippets
+translate entities on HTML strings
+convert raw HTTP headers to dicts and vice-versa
+construct HTTP auth header
+converting HTML pages to unicode
+sanitize urls (like browsers do)
+extract arguments from urls
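
To make a few of the README's bullet points concrete, here is a minimal standard-library sketch of three of the listed operations: constructing an HTTP auth header, converting raw HTTP headers to a dict, and extracting an argument from a URL. The function names and signatures below are hypothetical stand-ins chosen for illustration; they are not taken from this commit and should not be read as the library's actual API or implementation.

```python
import base64
from urllib.parse import parse_qs, urlsplit


def basic_auth_header(username: str, password: str) -> str:
    """Build the value of an HTTP Basic 'Authorization' header.

    Hypothetical helper: encodes 'username:password' as base64.
    """
    credentials = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(credentials).decode("ascii")


def raw_headers_to_dict(raw: str) -> dict:
    """Parse a raw header block into {name: [values, ...]}.

    Hypothetical helper: lines without a colon are skipped, and
    repeated header names accumulate their values in order.
    """
    headers = {}
    for line in raw.splitlines():
        if ":" not in line:
            continue
        name, value = line.split(":", 1)
        headers.setdefault(name.strip(), []).append(value.strip())
    return headers


def url_argument(url: str, name: str, default=None):
    """Return the first value of query argument `name`, or `default`.

    Hypothetical helper built on urllib.parse.
    """
    values = parse_qs(urlsplit(url).query).get(name)
    return values[0] if values else default
```

Storing each header's values in a list is one possible design choice here: it keeps repeated headers (for example several Set-Cookie lines) from overwriting each other, at the cost of a slightly less convenient lookup.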