path: root/youtube_dl/extractor
Age | Commit message | Author
2023-08-31 | [S4C] Add thumbnail extraction, extract series as playlist | dirkf
    Based on https://github.com/yt-dlp/yt-dlp/pull/7776: thx ifan-t, bashonly
2023-08-04 | [S4C] Add extractor for Sianel Pedwar Cymru | dirkf
    * from https://github.com/yt-dlp/yt-dlp/pull/7730, thx ifan-t, bashonly
2023-07-25 | [compat] Use `compat_open()` | dirkf
2023-07-19 | [InfoExtractor] Add `_match_valid_url()` class method and refactor | dirkf
    * API compatible with yt-dlp
    * also support Sequence of patterns in _VALID_URL
    * one place to compile _VALID_URL
    * TODO: remove existing extractor shims
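
The log does not show the implementation, so the following is only a minimal sketch of the idea in the bullets above: compile `_VALID_URL` in one place, cache it per class, and accept either a single pattern or a sequence of patterns. The class names and the example.com patterns are hypothetical and are not taken from youtube-dl.

```python
import re


class InfoExtractorSketch(object):
    """Hypothetical stand-in for an extractor base class (not the real one)."""

    # Either a single pattern string or a sequence of pattern strings.
    _VALID_URL = ()

    @classmethod
    def _match_valid_url(cls, url):
        # Compile once per class; look in cls.__dict__ so that a subclass
        # never reuses the cache of its parent.
        if '_VALID_URL_RE' not in cls.__dict__:
            patterns = cls._VALID_URL
            if isinstance(patterns, str):
                patterns = (patterns,)
            cls._VALID_URL_RE = tuple(re.compile(p) for p in patterns)
        # Return the first successful match object, or None.
        for regex in cls._VALID_URL_RE:
            match = regex.match(url)
            if match:
                return match
        return None

    @classmethod
    def suitable(cls, url):
        return cls._match_valid_url(url) is not None


class ExampleIE(InfoExtractorSketch):
    # Made-up patterns, for illustration only.
    _VALID_URL = (
        r'https?://example\.com/watch/(?P<id>\d+)',
        r'https?://example\.com/v/(?P<id>\d+)',
    )


video_id = ExampleIE._match_valid_url('https://example.com/v/42').group('id')
```

Keeping the compile step inside one class method means individual extractors would no longer need their own matching shims, which is presumably what the TODO refers to.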
2023-07-19 | [InfoExtractor] Add search methods for Next/Nuxt.js from yt-dlp | dirkf
    * add _search_nextjs_data(), from https://github.com/yt-dlp/yt-dlp/pull/1386 thanks selfisekai
    * add _search_nuxt_data(), from https://github.com/yt-dlp/yt-dlp/pull/1921, thanks Lesmiscore, pukkandan
    * add tests for the above
    * also fix HTML5 type recognition and tests, from https://github.com/yt-dlp/yt-dlp/commit/222a230871fe4fe63f35c49590379c9a77116819, thanks Lesmiscore
    * update extractors in PR using above, fix tests.
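
For context, Next.js pages usually embed their state as JSON in a `<script id="__NEXT_DATA__">` element. The sketch below shows one way such a search could work. The function name, the regular expression and the sample page are assumptions made for illustration, not the code added by this commit.

```python
import json
import re


def search_nextjs_data(webpage):
    """Return the parsed __NEXT_DATA__ object from a page, or None.

    Hypothetical helper: the real method's signature is not shown in the log.
    """
    match = re.search(
        r'(?s)<script[^>]+id=["\']__NEXT_DATA__["\'][^>]*>(?P<json>.+?)</script>',
        webpage)
    if not match:
        return None
    return json.loads(match.group('json'))


# Made-up page content for illustration.
page = (
    '<html><body>'
    '<script id="__NEXT_DATA__" type="application/json">'
    '{"props": {"pageProps": {"id": 7}}}'
    '</script></body></html>'
)
data = search_nextjs_data(page)
```

An extractor would then read its metadata from the returned dictionary instead of scraping the rendered HTML.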
2023-07-19 | [Clipchamp] Add new extractor back-ported from yt-dlp | dirkf
2023-07-19 | [DLF] Add site extractors back-ported from yt-dlp | dirkf
    * from https://github.com/yt-dlp/yt-dlp/pull/6697, thanks nick-cd
2023-07-19 | [Whyp] Add extractor back-ported from yt-dlp | dirkf
    * from https://github.com/yt-dlp/yt-dlp/pull/6803, thanks CoryTibbettsDev
2023-07-19 | [GlobalPlayer] Add site extractors back-ported from yt-dlp | dirkf
    * from https://github.com/yt-dlp/yt-dlp/pull/6903, thanks garret1317
2023-07-19 | [InfoExtractor] Support groups in `_search_regex()`, etc | dirkf
2023-06-22 | [YouTube] Avoid crash in author extraction | dirkf
2023-06-22 | [YouTube] Improve nsig function name extraction | pukkandan
    Fixes player b7910ca8, using `,` vs `;`
    See https://github.com/ytdl-org/youtube-dl/issues/32292#issuecomment-1602231170
    Co-authored-by: dirkf
2023-06-18 | [YouTube] Improve fix for ae8ba2c | dirkf
    Thx: https://github.com/yt-dlp/yt-dlp/commit/01aba25
2023-06-17 | [YouTube] Fix `KeyError QV` in signature extraction failed | dirkf
    * temporarily force missing global definition into sig JS
    * improve test: thanks https://github.com/yt-dlp/yt-dlp/issues/7327#issuecomment-1595274615
    * resolves #32314
2023-05-23 | [ITV] Fix UA capitalisation in 384f632 | dirkf
2023-04-23 | [YouTube] Support Releases tab | dirkf
2023-04-12 | [YouTube] Simplify signature patterns | dirkf
2023-03-19 | [extractor/youtube] Bypass throttling for `-f17` | pukkandan
    and related cleanup
    Thanks @AudricV for the finding
    Ref: yt-dlp/yt-dlp/commit/c9abebb
2023-03-19 | [extractor/youtube] Construct fragment list lazily | pukkandan
    Ref: yt-dlp/yt-dlp/commit/e389d17
    See: yt-dlp/yt-dlp#6517
2023-03-14 | [AENetworksBaseIE] Report missing show data instead of crash | dirkf
2023-03-03 | [Youtube] Construct dash formats with `range` query | pukkandan
    See yt-dlp/yt-dlp#6369
2023-02-24 | [YouTube] Support @owner format in uploader_id etc | dirkf
    * implement https://github.com/ytdl-org/youtube-dl/issues/31530#issuecomment-1435734719
    * update affected tests
    * misc clean-ups
2023-02-20 | Escape URLs in `sanitized_Request`, not `sanitize_url` | pukkandan
    d2558234cf5dd12d6896eed5427b7dcdb3ab7b5a added escaping of URLs while sanitizing. However, `sanitize_url` may not always receive an actual URL. E.g., when using `youtube-dl "search query" --default-search ytsearch`, `search query` gets escaped to `search%20query` before being prefixed with `ytsearch:`, which is not the intended behavior. So the escaping is moved to `sanitized_Request` instead.
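
The ordering problem described above can be reproduced with a simplified model. Both helper functions below are hypothetical stand-ins written for Python 3 and are not the project's `sanitize_url` or `sanitized_Request`. They only show why escaping before the default-search prefix is applied changes the query.

```python
from urllib.parse import quote


def escape_url(url):
    # Simplified stand-in for URL escaping: percent-encode unsafe characters
    # such as spaces, leaving common URL punctuation alone.
    return quote(url, safe="%/:=&?~#+!$,;'@()*[]")


def resolve_default_search(arg, default_search='ytsearch'):
    # Simplified: anything that does not look like a URL is treated as a
    # search query and gets the default-search prefix.
    if '://' in arg:
        return arg
    return default_search + ':' + arg


query = 'search query'

# Escaping while sanitizing, before the prefix is added (the reported bug).
escaped_too_early = resolve_default_search(escape_url(query))

# Leaving the argument alone until a request is actually built.
escaped_at_request = resolve_default_search(query)
```

In this model `escaped_too_early` carries `search%20query` into the search, while `escaped_at_request` keeps the text the user typed.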
2023-02-20 | [Vimeo] Fix e19ec52 for tween-age Pythons | df
    * a check in older Pythons (2.7 and earlier, and the 3.3 and 3.4 series) caused "sre_constants.error: nothing to repeat"
    * satisfy the check by avoiding nested qualifiers that can match empty string
    Resolves #31597
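
The Vimeo pattern itself is not shown in this log, so the sketch below uses made-up patterns to illustrate the kind of rewrite described: a repeated group whose body can match the empty string is replaced by an equivalent pattern without the nested qualifier. Newer Python versions accept both forms, so the compile check is wrapped defensively.

```python
import re

# Nested qualifier: the inner group can match the empty string, which the
# older regex compilers named in the commit message reject.
NESTED = r'(?:[a-z]*-?)*\d+'

# Equivalent pattern with no repeated group that can match the empty string.
FLAT = r'[a-z-]*\d+'


def compiles(pattern):
    """Return True if this interpreter accepts the pattern."""
    try:
        re.compile(pattern)
        return True
    except re.error:
        return False


samples = ['user-video-123', '123', 'abc', 'a--b9']
flat_results = [bool(re.match(FLAT + '$', s)) for s in samples]

# Where the nested form compiles at all, it should agree with the flat form.
nested_agrees = (not compiles(NESTED)) or (
    [bool(re.match(NESTED + '$', s)) for s in samples] == flat_results)
```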
2023-02-17 | [YouTube] Avoid crash if uploader_id extraction fails | dirkf
    See #31530.
2023-02-14 | [InfoExtractor] Handle unquoted values in OpenGraph searches | dirkf
2023-02-13 | [StreamsbIE] Add extractor for streamsb.com (viewsb.com) (#31517) | fonkap
    * Add extractor for streamsb.com (viewsb.com)
    * make data url using app.js version
    ---------
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 | [KommunetvIE] Add extractor for kommunetv.no (#31516) | fonkap
    * Add extractor for kommunetv.no
    * Using utils.update_url instead of regex
    ---------
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 | [FileMoonIE] Add extractor for filemoon.sx (#31515) | fonkap
    ---------
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 | [rbgtum] Add new extractor (#31305) | Valentin Metz
    * [rbgtum] Add new extractor
    * Small update, force CI
    ---------
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-13 | [YouTube] Fix tests | dirkf
2023-02-13 | [YouTube] Refresh compat/utils usage | dirkf
    * import parse_qs()
    * import parse_qs in lazy_extractors (clears old TODO)
    * clean up old compiled lazy_extractors for Py2
    * use update_url()
2023-02-13 | [YouTube] Add `signatureTimestamp` for age-gate bypass | dirkf
2023-02-13 | [YouTube] Bypass age-gating for certain restricted videos | dirkf
    * Use TVHTML5_SIMPLY_EMBEDDED_PLAYER client
    * Also add and fix tests
    * Introduce and use new utility function `update_url()`
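
The log names `update_url()` but does not show its signature, so the following is a guess at what such a utility might look like rather than the function that was added. It is written for Python 3 with the standard library only, and the name `update_url_sketch` and its parameters are assumptions.

```python
from urllib.parse import parse_qs, urlencode, urlparse, urlunparse


def update_url_sketch(url, query_update=None, **kwargs):
    """Replace URL components by name and/or merge keys into the query.

    Keyword arguments are urlparse() field names such as scheme, netloc,
    path or query.
    """
    parts = urlparse(url)
    if query_update:
        # Merge new keys into the existing query string.
        query = parse_qs(parts.query)
        query.update(query_update)
        kwargs['query'] = urlencode(query, doseq=True)
    return urlunparse(parts._replace(**kwargs))


with_param = update_url_sketch(
    'https://example.com/watch?v=abc', query_update={'bpctr': '1'})
new_path = update_url_sketch('http://example.com/a', scheme='https', path='/b')
```

Working on parsed components avoids the hand-written regex substitutions that one of the 2023-02-13 commits above mentions replacing.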
2023-02-12 | [Vimeo] Support /user{video_id}/{slug} URL format | dirkf
2023-02-12 | [Vimeo] Fix `Unable to extract info section` redux | dirkf
    * as reported in yt-dlp/yt-dlp#6149
    * also allow newline in target JSON object
2023-02-12 | [IGN] Overhaul extractor to avoid URL redirection loop | dirkf
    Consequently/also:
    * centralise video data extraction
    * detect 404 and 503 expected errors
    * handle the test video in IGNVideo
    * handle two additional page formats for the tests in IGNArticle
2023-02-03 | [ITV] Overhaul ITV extractor (#30266) | dirkf
    * support ITVX URLs (thanks Vangelis66)
    * support legacy ITV Hub URLs
    * include extraction fix 4c57dd2 from sleaux-meaux 3 May 2021
    * include extraction fix 6fbcc16, fix by staubichsauger & pukkandan
    * work-around duration parsing pending fix to utils.parse_duration
    * apply default vanilla UA for pages and media to avoid site blocking
    * also detect and report `Episode not found` instead of generic 404
    * rework ITVBTCCIE with geo-block detection, best effort geo-restriction handling, news article support
    * fix tests
2023-02-02 | [myvideoge] Add new extractor (#31360) | dirkf
    NB download tests on CI servers blocked
    Co-authored-by: Alfonso Solbes <fonk666@gmail.com>
2023-02-02 | [xhamster] add support for new domain xhvid.com (#31370) | afterdelight
2023-02-02 | [FIFA] Back-port extractor from yt-dlp (#31385) | dirkf
2023-02-02 | [Blerp] Add new extractor (#31398) | Epsilonator
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02 | [YouTube] Fix not finding videos listed under a channel's "shorts" subpage. (#31409) | zhangeric-15
    Resolves #31336
    Co-authored-by: Jouni Järvinen <rautamiekka@users.noreply.github.com>
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02 | [Callin] Add new extractor (#31414) | Ruowang Sun
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02 | [pr0gramm] implement InfoExtractor, Resolves #31433 (#31434) | Leon Etienne
    * [pr0gramm] implement infoextractor
    * [pr0gramm] remove misplaced comment, uncapture regex-group
    * [pr0gramm]: specify utf-8 coding
    * [pr0gramm]: add trailing comma to lists for maintainability
    * [pr0gramm]: ie only sets upload_date attribute
    * [pr0gramm]: add video_id to title
    * [pr0gramm]: more forgiving _valid_url regex
    * [pr0gramm]: add uploader to title, if set
    * Discriminate URL pattern
    ---------
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02 | [cammodels] fix and improve extractor (#31453) | JChris246
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2023-02-02 | [americastestkitchen] Add support for downloading entire series (#31493) | Brian Marks
    Also
    * support new sites and URL patterns
    * back-port from yt-dlp
    Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-11-13 | [generic] Improve KVS (etc) extraction | dirkf
2022-11-13 | [generic] Improve KVS (etc) extraction | dirkf
    * detect kt_player('kt_player', 'https://.../kt_player.swf?v=5...
    * detect age limit if 18 USC 2257 is mentioned
    * test with shooshtime.com
    Partially resolves #31332.
2022-11-13 | Added ThisVid.com support (#29187) | FraFraFra-LongD
    * add ThisVidIE, ThisVidMemberIE, ThisVidPlaylistIE
    * redirect embed to main page for more metadata
    * use KVS extraction newly added to GenericIE and remove duplicate tests
    * also add MrDeepFake etc compat to GenericIE (closes #22390)
    Co-authored-by: dirkf <fieldhouse@gmx.net>