aboutsummaryrefslogtreecommitdiff
path: root/youtube_dl/extractor
AgeCommit message (Collapse)Author
2023-02-02[pr0gramm] implement InfoExtractor, Resolves #31433 (#31434)Leon Etienne
* [pr0gramm] implement infoextractor * [pr0gramm] remove misplaced comment, uncapture regex-group * [pr0gramm]: specify utf-8 coding * [pr0gramm]: add trailing comma to lists for maintainability * [pr0gramm]: ie only sets upload_date attribute * [pr0gramm]: add video_id to title * [pr0gramm]: more forgiving _valid_url regex * [pr0gramm]: add uploader to title, if set * Discriminate URL pattern --------- Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02[cammodels] fix and improve extractor (#31453)JChris246
Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02[americastestkitchen] Add support for downloading entire series (#31493)Brian Marks
Also * support new sites and URL patterns * back-port from yt-dlp Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-11-13[generic] Improve KVS (etc) extractiondirkf
2022-11-13[generic] Improve KVS (etc) extractiondirkf
* detect kt_player('kt_player', 'https://.../kt_player.swf?v=5... * detect age limit if 18 USC 2257 is mentioned * test with shooshtime.com Partially resolves #31332.
2022-11-13Added ThisVid.com support (#29187)FraFraFra-LongD
* add ThisVidIE, ThisVidMemberIE, ThisVidPlaylistIE * redirect embed to main page for more metadata * use KVS extraction newly added to GenericIE and remove duplicate tests * also add MrDeepFake etc compat to GenericIE (closes #22390) Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-11-12[generic] Add KVS player extractiondirkf
2022-11-11[common:jwplayer] Improve jwplayer extraction and parsing (#31000)dirkf
* don't crash parser if jwplayer_data is invalid (empty, or no formats) * use `label` in `sources[n]` as `format_id` * relax `jwplayer().setup(...)` RE (also rework PR #27274 enhancement) * detect more manifest formats in _parse_jwplayer_formats() (from PR #29596) * improve metadata extraction (from PR #25433) * remember URLs in a set * use parse_resolution() in format * extract filesize in format (from yt-dlp) Co-authored-by: kikuyan <kikuyan@users.noreply.github.com> Co-authored-by: martin54 <martin54@users.noreply.github.com>
2022-11-09[PeekVids, PlayVids] Add new extractor (#29765)Moises Lima
* Merge back-port from yt-dlp * Merge features from PR #29798 * Improve metadata extraction Co-authored-by: dirkf <fieldhouse@gmx.net> Co-authored by: AXDOOMER
2022-11-04[extractor/ceskatelevize] Back-port extractor from yt-dlp, etc (#30713)dirkf
* back-port extractor, removing CeskaTelevizePoradyIE * follow redirect URL * support liveBroadcast and videobonusDetail in __NEXT__ data * return single video for singleton playlist * fix/add tests
2022-10-30[netease] Support urls shared from mobile app (#31304)Xie Yanbo
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-10-30[netease] Impove error handling (#31303)Xie Yanbo
* add warnings for users outside of China * skip empty song urls Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-10-27[Vimeo] Update variable name in hydration JSON patterndirkf
Fixes #31311
2022-10-20[BongaCams] Support new .net domaindirkf
Resolves #31262.
2022-10-18Fix ADN extractor (#31275)ache
* Rename Anime Digital Network to Animation Digital Network, animationdigitalnetwork.fr * Update the test to an available video * Update the decoding key of subtitles * Keep the support of old URLs * Add a test to match the old URL * Reduce redundancy of the URL name * Fix md5 ^^" * Fix undefined _BASE * Process HTTP error text (eg geo-block) correctly and uniformly in Py3, Py2 * Skip test for CI since geo-blocked Signed-off-by: ache <ache@ache.one> Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-10-13[ManyVids] Support new single-page app structuredirkf
2022-10-13[ManyVids] Support new single-page app structuredirkf
See https://github.com/yt-dlp/yt-dlp/issues/5210#issuecomment-1276919962.
2022-10-12[Motherless] Pull from yt-dlp, etcdirkf
* use username field * loosen regexes * warn on page count 0 in group * avoid reloading group page 1 Closes #29626
2022-10-11[netease] Get netease music download url through player api (#31235)Xie Yanbo
* remove unplayable song from test * compatible with python 2 * using standard User_Agent, fix imports * use hash instead of long description * fix lint * fix hash
2022-10-11[Common:JWPlayer] Fix x1000 scaling errordirkf
See https://github.com/yt-dlp/yt-dlp/issues/5106#issuecomment-1264625161
2022-10-11[ZDF] Overhaul ZDF extractorsdirkf
* pull some yt-dlp changes into ZDFBaseIE._extract_format() * add test cases from yt-dlp to ZDFIE * fix crash in ZDFIE._extract_mobile() when object had no `formitaeten` * improve title extraction in ZDFChannelIE (remove trailing station ident) * avoid extracting non-video playlist items (fixes #31149)
2022-10-10[motherless] Fixed the broken uploader_id in the extractor (#31243)Xiyue
* Fixed the broken uploader_id in the extractor. * Make uploader_id RE looser * Fix uploader_id in test Motherless_3 * Fix group pagination * # coding: utf-8 Co-authored-by: Andy Xuming <xuminic@gmail.com> Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-10-10[manyvids] Improve extraction (#31172)dirkf
* extract all formats from page * extract description, uploader, views, likes * downrate previews * fix tests * use txt_or_none()
2022-10-10[NRK] Remove explicit Accept-Encoding header that invites Brotlidirkf
Fixes #31285
2022-10-04[Telegraaf] Use mobile GraphQL API endpointcoletdjnz
Workaround for Cloudflare 403 Fixes https://github.com/yt-dlp/yt-dlp/issues/5000 Authored by: coletdjnz
2022-08-25[YouTube] Improve error check for n-sig processingdirkf
2022-08-19[infoq] Avoid crash if the page has no `mp3Form`gudata
* proposed fix for issue #31131, aligns with yt-dlp Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-08-19[uktvplay] Support domain without .uktvdirkf
2022-08-09[test, etc] Improve download test logs; also clean up some new flake8 issues ↵dirkf
(#31153) * [test] Identify testcase errors better * [test] Identify download errors better * [extractor/minds] Linter * [extractor/aes] Linter
2022-07-30[aenetworks] Update _THEPLATFORM_KEY and _THEPLATFORM_SECRET (#29749)Wes
Fixes ytdl-org/youtube-dl#29300
2022-07-12[VVVVID] Support video/dash types (#31060)Kyraminol Endyeran
Resolves #31030.
2022-06-26[Mediaset] Support player version number in URL patterndirkf
Ref: https://github.com/yt-dlp/yt-dlp/issues/4141
2022-06-15[NHK] Use new API URLdirkf
2022-06-12[XHamster] Support xhday.com alias, extract `uploader_id`dirkf
* support xhday.com alias for xhamster.com (resolves #31023) Authored by: dirkf * extract `uploader_id`: from https://github.com/yt-dlp/yt-dlp/commit/908b56eaf7872149706dbd7fa071f838d0c786b7 (PR https://github.com/yt-dlp/yt-dlp/pull/844) Authored by: octotherp
2022-06-10[utils, etc] Kill child processes when yt-dl is killedpukkandan
* derived from PR #26592, closes #26592 Authored by: Unrud
2022-06-09[YouTube] Support JSON3 subtitle formatdirkf
* subtitle tests updated to match
2022-06-06[HRFernsehen] Back-port new extractor from yt-dlpdirkf
Closes #26445, where this was originally proposed.
2022-05-28[YouPorn] Improve `upload_date` extractiondirkf
See https://github.com/yt-dlp/yt-dlp/issues/2701#issuecomment-1034341883
2022-05-24[YouTube] Handle player c5a4daa1 with indirect n-function definitiondirkf
* resolves #30976
2022-05-19[wat.tv] Add version `pver` to metadata API calldirkf
Resolves #30959.
2022-05-09[NHK] Support alphabetic characters in 7-char NhkVod IDs (#29682)dirkf
2022-04-29[streamcz] Remove empty `'{}'.format()` for Py2.6 dirkf
Use `'-join()'` here, or `{0}`, ..., in general.
2022-04-28[KTH] Add new extractor for KTH play (#30885)Árni Dagur
* Implement extractor for KTH play * Make KTH Play url regex more relaxed
2022-04-15[Youtube] Fix "n" descrambling for player fae06c11dirkf
Resolves #30856.
2022-04-05[RAI] Fix extraction of http formatsnixxo
From https://github.com/yt-dlp/yt-dlp/pull/3272 Closes https://github.com/yt-dlp/yt-dlp/issues/3270 Authored by: nixxo
2022-04-05[RAI] Extend formats with direct http mp4 link (PR #27990)nixxo
* initial support for creating direct mp4 link * improved regexes and info extraction * added "connection: close" to request headers * updated to https://github.com/yt-dlp/yt-dlp/pull/208
2022-04-05[RAI] Added checks for DRM protected content (PR #27657)nixxo
reviewed by pukkandan (https://github.com/yt-dlp/yt-dlp/pull/150)
2022-04-01[extractor/bilibili] Fix path "/audio/auxxxxx" download return 403lihan7
2022-02-26Avoid skipping ID when unlisted_hash is numericdirkf
Pattern needed a non-greedy match; also replaced a redundant test with one for this, issue 29690
2022-02-24[Alsace20TV] Add new extractors Alsace20TVIE, Alsace20TVEmbedIEdirkf