Age | Commit message (Collapse) | Author | |
---|---|---|---|
2016-06-16 | [utils] Don't transform numbers not starting with a zero | Yen Chi Hsuan | |
Fix test_Viidea and maybe others | |||
2016-06-10 | [utils] Decode HTML5 entities | Yen Chi Hsuan | |
Used in test_Vporn_1. Also related to #9270 | |||
2016-06-02 | Added sanitization support for Hungarian letters Ő and Ű | bzc6p | |
2016-05-19 | [utils] Allow None in remove_{start,end} | Sergey M․ | |
2016-05-14 | [test_utils] PEP 8 | Sergey M․ | |
2016-05-14 | [utils] Process non-base 10 integers in js_to_json | Sergey M․ | |
2016-05-14 | [utils] js_to_json: various improvements | felix | |
now JS object literals like { /* " */ 0: ",]\xaa<\/p>", } will be correctly converted to JSON. | |||
2016-05-12 | [utils] Add Œ and œ found in French to ACCENT_CHARS | Yen Chi Hsuan | |
Fixes #9463 | |||
2016-05-10 | [utils,compat] Move struct_pack and struct_unpack to compat.py | Yen Chi Hsuan | |
2016-05-02 | Instead of replacing accented characters with an underscore when sanitizing ↵ | Adam Thalhammer | |
file names in restricted mode, replace them with their non-accented equivalents fixes #9347 | |||
2016-05-02 | Instead of replacing accented characters with an underscore when sanitizing ↵ | Adam Thalhammer | |
file names in restricted mode, replace them with their non-accented equivalents fixes #9347 | |||
2016-04-21 | Merge pull request #9110 from remitamine/parse_duration | Sergey M | |
[utils] imporove parse_duration to handle more formats | |||
2016-04-21 | [utils] imporove parse_duration to handle more formats | remitamine | |
2016-04-09 | [test/utils] Add test for date_from_str | Jaime Marquínez Ferrándiz | |
2016-03-23 | [test/test_utils] Update for escape_url change (again) | Yen Chi Hsuan | |
2016-03-23 | [test/test_utils] Update for escape_url change | Yen Chi Hsuan | |
2016-03-19 | [utils] lookup_unit_table: Match word boundary instead of end of string | Jaime Marquínez Ferrándiz | |
2016-03-16 | [utils] PEP 8 | Sergey M․ | |
2016-03-16 | Merge pull request #8092 from bpfoley/twitter-thumbnail | remitamine | |
[utils] Add extract_attributes for extracting html tag attributes | |||
2016-03-13 | [bbc] Generalize unit table lookup and add parse_count | Sergey M․ | |
2016-03-03 | [test/test_utils] add more tests for update_url_query | remitamine | |
2016-03-03 | [test/test_utils] add tests for update_url_query | remitamine | |
2016-03-03 | [utils] Add extract_attributes for extracting html tag attributes | Brian Foley | |
This is much more robust than just using regexps, and handles all the common scenarios, such as empty/no values, repeated attributes, entity decoding, mixed case names, and the different possible value quoting schemes. | |||
2016-02-27 | [utils] Multiple changes to base_n() | Yen Chi Hsuan | |
1. Renamed to encode_base_n() 2. Allow tables longer than 62 characters 3. Raise ValueError instead of AssertionError for invalid input data 4. Return the first character in the table instead of '0' for number 0 5. Add tests | |||
2016-02-25 | [utils] Remove AM/PM from unified_strdate patterns | Sergey M․ | |
2016-02-20 | [utils] Add OHDave's RSA encryption function | Yen Chi Hsuan | |
2016-02-07 | [utils] Allow dot in strip_jsonp | Sergey M․ | |
2016-02-07 | [utils] Add ability to control skipping false values in dict_get | Sergey M․ | |
2016-02-07 | [utils] Add dict_get convenience method | Sergey M․ | |
2015-12-20 | [test_utils] Add tests for encode_compat_str | Sergey M․ | |
2015-12-19 | [utils] Support alternative timestamp format in TTML | Yen Chi Hsuan | |
Fixes #7608 | |||
2015-12-19 | [utils] Fix TTML conversion | Yen Chi Hsuan | |
Tolerate invalid timestamps (closes #7909) | |||
2015-12-14 | [utils] Add remove_quotes | Sergey M․ | |
2015-11-22 | [utils] Check ext with trailing slash against the list of known extensions | Sergey M․ | |
2015-11-22 | [test_utils] Add tests for determine_ext | Sergey M․ | |
2015-11-16 | [utils] Skip invalid/non HTML entities (Closes #7518) | Sergey M․ | |
2015-11-02 | [utils] unified_strdate: Return None if the date format can't be recognized ↵ | Jaime Marquínez Ferrándiz | |
(fixes #7340) This issue was introduced with ae12bc3ebb4cb377c2b4337ec255e652b36f5143, it returned 'None'. | |||
2015-10-31 | Merge pull request #7296 from jaimeMF/xml_attrib_unicode | Sergey M | |
Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x (… | |||
2015-10-31 | [utils] Support list of xpath in xpath_element | Sergey M․ | |
2015-10-28 | [utils] Improve parse_iso8601 | Sergey M․ | |
2015-10-25 | Use a wrapper around xml.etree.ElementTree.fromstring in python 2.x (#7178) | Jaime Marquínez Ferrándiz | |
Attributes aren't unicode objects, so they couldn't be directly used in info_dict fields (for example '--write-description' doesn't work with bytes). | |||
2015-10-20 | [utils:js_to_json] Fix bad escape in double quoted strings | Sergey M․ | |
2015-09-05 | [test_utils] Add tests for cli option converters | Sergey M․ | |
2015-09-05 | [test_utils] Add more tests for xpath | Sergey M․ | |
2015-08-01 | [utils] Make value optional for find_xpath_attr | Sergey M․ | |
This allows selecting particular attributes by name but without specifying the value and similar to xpath syntax `[@attrib]` | |||
2015-07-22 | [utils] Improve parse_duration | Yen Chi Hsuan | |
Now dots are parsed. For example '87 Min.' | |||
2015-05-19 | [utils] Support TTML without default namespace | Yen Chi Hsuan | |
In a strict sense such TTML is invalid, but Yahoo uses it. | |||
2015-05-12 | [utils] Support 'dur' field in TTML | Yen Chi Hsuan | |
2015-05-09 | [utils] Remove sanitize_url_path_consecutive_slashes() | Yen Chi Hsuan | |
This function is used only in SohuIE, which is updated to use a new extraction logic. | |||
2015-05-04 | [NBC] Enhance embedURL extraction (closes #2549) | Yen Chi Hsuan | |