path: root/youtube_dl/extractor/common.py
Age  Commit message  Author
2021-02-01  [youtube] Rewrite Extractor  Remita Amine
    - improve format sorting
    - remove unused code(swf parsing, ...)
    - fix series metadata extraction
    - fix trailer video extraction
    - improve error reporting
    - extract video location
2020-12-19  [common] remove unwanted query params from unsigned akamai manifest URLs  Remita Amine
2020-12-13  [extractor/common] Improve JSON-LD interaction statistic extraction (refs #23306)  Sergey M․
2020-12-13  [extractor/common] Document duration meta field for playlists  Sergey M․
2020-12-09  [extractor/common] Fix inline HTML5 media tags processing and add test (closes #27345)  Sergey M․
2020-12-07  [extractor/common] Add support for dl8-* media tags (closes #27283)  Sergey M․
2020-12-07  [extractor/common] Eliminate media tag name regex duplication  Sergey M․
2020-12-07  [extractor/common] Fix media type extraction for HTML5 media tags in start/end form  Sergey M․
2020-12-03  [extractor/commons] improve Akamai HTTP formats extraction  Remita Amine
2020-12-02  [extractor/common] improve Akamai HTTP format extraction  Remita Amine
    - Allow m3u8 manifest without an additional audio format
    - Fix extraction for qualities starting with a number
    Solution provided by @nixxo based on: https://stackoverflow.com/a/5984688
2020-11-22  [extractor/common] add generic support for akamai http format extraction  Remita Amine
2020-11-21  Fix typos (#27084)  Josh Soref
    * spelling: authorization
    * spelling: brightcove
    * spelling: creation
    * spelling: exceeded
    * spelling: exception
    * spelling: extension
    * spelling: extracting
    * spelling: extraction
    * spelling: frontline
    * spelling: improve
    * spelling: length
    * spelling: listsubtitles
    * spelling: multimedia
    * spelling: obfuscated
    * spelling: partitioning
    * spelling: playlist
    * spelling: playlists
    * spelling: restriction
    * spelling: services
    * spelling: split
    * spelling: srmediathek
    * spelling: support
    * spelling: thumbnail
    * spelling: verification
    * spelling: whitespaces
    Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>
2020-11-18  [extractor/common] Output error for invalid URLs in _is_valid_url (refs #21400, refs #24151, refs #25617, refs #25618, refs #25586, refs #26068, refs #27072)  Sergey M․
2020-09-19  [extractor/common] Relax interaction count extraction in _json_ld  Sergey M․
2020-09-19  [extractor/common] Extract author as uploader for VideoObject in _json_ld  Sergey M․
2020-09-18  [extractor/common] Handle ssl.CertificateError in _request_webpage (closes #26601)  Sergey M․
    ssl.CertificateError is raised on some python versions <= 3.7.x
2020-05-05  [extractor/common] Use compat_cookiejar_Cookie for _set_cookie (closes #23256, closes #24776)  Sergey M․
    To always ensure cookie name and value are bytestrings on python 2.
2020-05-02  [extractor/common] Extract multiple JSON-LD entries  Sergey M․
2020-04-07  [extractor/common] Skip malformed ISM manifest XMLs while extracting ISM formats (#24667)  Sergey M․
2020-02-29  Remove no longer needed compat_str around geturl  Sergey M․
2020-02-29  [extractor/common] Convert ISM manifest to unicode before processing on python 2 (#24152)  Sergey M․
2019-11-26  [dailymotion] improve extraction  Remita Amine
    - extract http formats included in m3u8 manifest
    - fix user extraction(closes #3553)(closes #21415)
    - add suport for User Authentication(closes #11491)
    - fix password protected videos extraction(closes #23176)
    - respect age limit option and family filter cookie value(closes #18437)
    - handle video url playlist query param
    - report alowed countries for geo-restricted videos
2019-11-16  [extractor/common] Add data, headers and query to all major extract methods preserving standard order for potential future use  Sergey M․
2019-11-09  [extractor/common] clean jwplayer description HTML tags  Remita Amine
2019-11-06  [common] initialize headers param with empty dict  Remita Amine
2019-11-05  [common] fix typo  Remita Amine
2019-11-05  [common] pass headers to _extract_(m3u8|mpd)_formats methods  Remita Amine
2019-10-03  [extractor/common] Make _is_valid_url more relaxed  Sergey M․
2019-07-17  [extractor/common] Fix typo in thumbnails resolution description (#21817)  Petr Vaněk
2019-05-23  [extractor/common] Strip src attribute for HTML5 entries code (closes #18485, closes #21169)  Sergey M․
2019-05-20  [extractor/common] Add doc string for _apply_first_set_cookie_header  Sergey M․
2019-05-18  [extractor/common] Move workaround for applying first Set-Cookie header into a separate method  Sergey M․
2019-05-11  [extractor/common] Fix typo  Sergey M․
2019-05-11  Fix W504 and disable W503 (closes #20863)  Sergey M․
2019-04-21  [vimeo] add support live streams and improve info extraction(closes #19144)  Remita Amine
2019-03-17  [extractor/common] Improve HTML5 entries extraction and add some realworld tests  Sergey M․
2019-03-15  [extractor/common] Fix url meta field for unfragmented DASH formats (closes #20346)  Sergey M․
2019-03-11  Start moving to ytdl-org  Sergey M․
2019-03-06  Use compat_etree_Element  Sergey M․
2019-03-06  [extractor/common] Fallback url to base URL for DASH formats  Sergey M․
2019-03-05  [extractor/common] Do not fail on invalid data while parsing F4M manifest in non fatal mode  Sergey M․
2019-03-05  [extractor/common] Clarify url and manifest_url meta fields  Sergey M․
2019-03-05  [extractor/common] Return MPD manifest as format's url meta field (#20242)  Sergey M․
    For symmetry with other segmented media
2019-02-08  [malltv] Add extractor (closes #18058)  Ales Jirasek
2019-01-27  [extractor/common] Extract season in _json_ld  Sergey M․
2019-01-20  [extractor/common] improve jwplayer relative url handling(closes #18892)  Remita Amine
2019-01-19  [extractor/common] fix typo  Remita Amine
2019-01-19  [extractor/common] imporove HLS video only format detection(closes #18923)  Remita Amine
2019-01-08  [extractor/common] Use episode name as title in _json_ld  Sergey M․
2019-01-08  [extractor/common] Add support for movies in _json_ld  Sergey M․