Age | Commit message | Author | |
---|---|---|---|
2017-02-11 | Introduce get_elements_by_class and get_elements_by_attribute utility functions | Thomas Christlieb | |
2017-02-03 | [utils] Improve comments processing in js_to_json (closes #11947) | Sergey M․ | |
2017-02-03 | [utils] Handle single-line comments in js_to_json | Michal Čihař | |
2017-01-26 | [utils] Improve parse_duration | Sergey M․ | |
2017-01-12 | [utils] Add more date formats | Sergey M․ | |
2016-12-20 | [common] fix dash codec information for mixed videos and fragment url construction(#11490) | Remita Amine | |
2016-12-17 | [utils] Improve urljoin | Sergey M․ | |
2016-12-13 | [utils] Add convenience urljoin | Sergey M․ | |
2016-11-17 | Update coding style after pycodestyle 2.1.0 | Yen Chi Hsuan | |
In pycodestyle 2.1.0, E305 was introduced, which requires two blank lines after top level declarations, too. See https://github.com/PyCQA/pycodestyle/issues/400 See also #10689; thanks @stepshal for first mentioning this issue and initial patches | |||
2016-11-02 | [utils] Introduce base_url | Sergey M․ | |
2016-09-29 | [utils] Lower priority for rare date formats and add tests | Sergey M․ | |
2016-09-14 | [utils] Use native french month names | Sergey M․ | |
2016-09-14 | [utils] Improve month_by_name and add tests | Sergey M․ | |
2016-09-02 | [utils] Improve mimetype2ext | Sergey M․ | |
2016-08-20 | [utils] Recognize units with full names in parse_filename | Yen Chi Hsuan | |
Reference: https://en.wikipedia.org/wiki/Template:Quantities_of_bytes | |||
2016-08-19 | [utils] Correct octal/hexadecimal number detection in js_to_json | Yen Chi Hsuan | |
2016-08-18 | [utils] Recognize lowercase units in parse_filesize | Sergey M․ | |
2016-08-13 | [test_utils] add test for option with not str value | Remita Amine | |
2016-08-07 | [utils] Add support TV Parental Guidelines ratings in parse_age_limit | Sergey M․ | |
2016-08-05 | [utils] Fix unified_timestamp for formats parsed by parsedate_tz() | Yen Chi Hsuan | |
2016-07-10 | Merge pull request #8876 from remitamine/html5_media | Yen Chi Hsuan | |
[extractor/common] add helper method to extract html5 media entries | |||
2016-07-06 | [utils] Add get_element_by_class | Yen Chi Hsuan | |
For #9950 | |||
2016-07-04 | [test_utils] add test for smuggling a smuggled url | Remita Amine | |
2016-06-26 | [utils] add helper function for parsing codecs | remitamine | |
2016-06-26 | [utils] Add urshift() | Yen Chi Hsuan | |
Used in IqiyiIE and LeIE | |||
2016-06-25 | [utils] Add unified_timestamp | Sergey M․ | |
2016-06-16 | [utils] Don't transform numbers not starting with a zero | Yen Chi Hsuan | |
Fix test_Viidea and maybe others | |||
2016-06-10 | [utils] Decode HTML5 entities | Yen Chi Hsuan | |
Used in test_Vporn_1. Also related to #9270 | |||
2016-06-02 | Added sanitization support for Hungarian letters Ő and Ű | bzc6p | |
2016-05-19 | [utils] Allow None in remove_{start,end} | Sergey M․ | |
2016-05-14 | [test_utils] PEP 8 | Sergey M․ | |
2016-05-14 | [utils] Process non-base 10 integers in js_to_json | Sergey M․ | |
2016-05-14 | [utils] js_to_json: various improvements | felix | |
now JS object literals like { /* " */ 0: ",]\xaa<\/p>", } will be correctly converted to JSON. | |||
2016-05-12 | [utils] Add Œ and œ found in French to ACCENT_CHARS | Yen Chi Hsuan | |
Fixes #9463 | |||
2016-05-10 | [utils,compat] Move struct_pack and struct_unpack to compat.py | Yen Chi Hsuan | |
2016-05-02 | Instead of replacing accented characters with an underscore when sanitizing file names in restricted mode, replace them with their non-accented equivalents fixes #9347 | Adam Thalhammer | |
2016-05-02 | Instead of replacing accented characters with an underscore when sanitizing file names in restricted mode, replace them with their non-accented equivalents fixes #9347 | Adam Thalhammer | |
2016-04-21 | Merge pull request #9110 from remitamine/parse_duration | Sergey M | |
[utils] imporove parse_duration to handle more formats | |||
2016-04-21 | [utils] imporove parse_duration to handle more formats | remitamine | |
2016-04-09 | [test/utils] Add test for date_from_str | Jaime Marquínez Ferrándiz | |
2016-03-23 | [test/test_utils] Update for escape_url change (again) | Yen Chi Hsuan | |
2016-03-23 | [test/test_utils] Update for escape_url change | Yen Chi Hsuan | |
2016-03-19 | [utils] lookup_unit_table: Match word boundary instead of end of string | Jaime Marquínez Ferrándiz | |
2016-03-16 | [utils] PEP 8 | Sergey M․ | |
2016-03-16 | Merge pull request #8092 from bpfoley/twitter-thumbnail | remitamine | |
[utils] Add extract_attributes for extracting html tag attributes | |||
2016-03-13 | [bbc] Generalize unit table lookup and add parse_count | Sergey M․ | |
2016-03-03 | [test/test_utils] add more tests for update_url_query | remitamine | |
2016-03-03 | [test/test_utils] add tests for update_url_query | remitamine | |
2016-03-03 | [utils] Add extract_attributes for extracting html tag attributes | Brian Foley | |
This is much more robust than just using regexps, and handles all the common scenarios, such as empty/no values, repeated attributes, entity decoding, mixed case names, and the different possible value quoting schemes. | |||
2016-02-27 | [utils] Multiple changes to base_n() | Yen Chi Hsuan | |
1. Renamed to encode_base_n() 2. Allow tables longer than 62 characters 3. Raise ValueError instead of AssertionError for invalid input data 4. Return the first character in the table instead of '0' for number 0 5. Add tests |
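
The five changes listed for `encode_base_n()` can be illustrated with a minimal Python sketch. It is reconstructed from the commit message alone and is not necessarily the code that was committed: the parameter names (`num`, `n`, `table`), the default 62-character table, and the rejection of negative numbers and bases below 2 are assumptions made for this example.

```python
# Hypothetical default table: digits, lowercase, uppercase (62 characters).
DEFAULT_TABLE = '0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ'


def encode_base_n(num, n, table=None):
    """Encode the non-negative integer `num` in base `n` using `table`.

    Sketch of the behaviour described in the commit message:
    - a caller-supplied table may be longer than 62 characters,
    - invalid input raises ValueError rather than AssertionError,
    - zero maps to the first character of the table, not a literal '0'.
    """
    if not table:
        table = DEFAULT_TABLE[:n]
    if n < 2:
        # Assumption: a base below 2 is treated as invalid input.
        raise ValueError('base %d must be at least 2' % n)
    if n > len(table):
        raise ValueError('base %d exceeds table length %d' % (n, len(table)))
    if num < 0:
        # Assumption: negative numbers are rejected in this sketch.
        raise ValueError('cannot encode negative number %d' % num)
    if num == 0:
        return table[0]

    digits = ''
    while num:
        digits = table[num % n] + digits
        num //= n
    return digits


if __name__ == '__main__':
    custom = '9876543210ZYXWVUTSRQPONMLKJIHGFEDCBA'
    print(encode_base_n(80, 30))          # default table
    print(encode_base_n(0, 30, custom))   # first character of the custom table
```

Raising `ValueError` keeps the validation active when Python runs with `-O`, which strips `assert` statements. Returning `table[0]` for zero keeps the output consistent with whatever alphabet the caller supplies.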