path: root/yt_dlp/extractor/generic.py
Age | Commit message | Author
2023-02-04 | [extractor/txxx] Add extractors (#5240) | chio0hai
Authored by: chio0hai Closes #5021
2023-01-29 | [extractor/generic] Avoid catastrophic backtracking in KVS regex | bashonly
Authored by: bashonly
2023-01-02 | [extractor/generic] Decode unicode-escaped embed URLs (#5919) | bashonly
Authored by: bashonly Closes #5854
2023-01-02 | Update to ytdl-commit-195f22f6 | Simon Sawicki
[generic] Improve KVS (etc) extraction https://github.com/ytdl-org/youtube-dl/commit/195f22f679330549882a8234e7234942893a4902 Closes #3716 Authored by: Grub4k, pukkandan
2023-01-01 | [extractor/generic] Use `Accept-Encoding: identity` for initial request | coletdjnz
The existing comment seems to imply this was the desired behavior from the beginning. Partial fix for https://github.com/yt-dlp/yt-dlp/issues/5855, https://github.com/yt-dlp/yt-dlp/issues/5851, https://github.com/yt-dlp/yt-dlp/issues/4748
2022-12-09 | [downloader/ffmpeg] Fix headers for video+audio formats (#5659) | bashonly
Authored by: bashonly, Grub4K
2022-12-09 | [extractor/mediaset] Better embed detection and error messages (#5664) | nixxo
Authored by: nixxo
2022-12-08 | [extractor/sibnet] Separate from VKIE | pukkandan
Fixes https://github.com/yt-dlp/yt-dlp/commit/bfd973ece3369c593b5e82a88cc16de80088a73e#commitcomment-91834251
2022-11-21 | [extractor/generic] Add `fragment_query` extractor arg for DASH and HLS (#5528) | bashonly
* `fragment_query`: Pass through any query in generic mpd/m3u8 manifest URLs to their fragments
* Add support for `extra_param_to_segment_url` to DASH downloader
Authored by: bashonly, pukkandan
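To illustrate what passing a manifest query through to fragments could look like, here is a minimal sketch using only the standard library. It is not the code from this commit; the helper name `pass_query_to_fragment` and the example URLs are hypothetical, and the real extractor may merge or override parameters differently.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def pass_query_to_fragment(manifest_url, fragment_url):
    """Append the manifest URL's query parameters to a fragment URL.

    Hypothetical helper for illustration only; not part of yt-dlp.
    """
    # Query parameters carried by the manifest URL (e.g. an access token)
    manifest_query = parse_qsl(urlsplit(manifest_url).query, keep_blank_values=True)
    parts = urlsplit(fragment_url)
    # Keep the fragment's own parameters first, then add the manifest's
    merged = parse_qsl(parts.query, keep_blank_values=True) + manifest_query
    return urlunsplit(parts._replace(query=urlencode(merged)))


if __name__ == '__main__':
    print(pass_query_to_fragment(
        'https://example.com/v/master.m3u8?token=abc',
        'https://example.com/v/seg1.ts'))
```

This sketch simply appends parameters and does not deduplicate keys; whether duplicates should be dropped or overridden is a design choice it leaves open.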
2022-11-18 | [extractor/generic] Fix JSON LD manifest extraction (#5577) | bashonly
Closes #5572 Authored by: bashonly, pukkandan
2022-11-17 | [extractor/brightcove] Add `BrightcoveNewBaseIE` and fix embed extraction (#5558) | bashonly
* Move Brightcove embed extraction and tests into the IEs
* Split `BrightcoveNewBaseIE` from `BrightcoveNewIE`
* Fix bug in ade1fa70cbaaaadaa4772e5f0564870cea3167ef with the "wrong" spelling of `referrer` being smuggled
Closes #5539
2022-11-17 | [extractor] Deprecate `_sort_formats` | pukkandan
2022-11-17 | [extractor/generic] Don't report redirect to https | pukkandan
2022-11-09 | [extractor/rumble] Add HLS formats and extract more metadata (#5280) | MMM
Closes #5177, #5277 Authored by: flashdagger
2022-10-31 | [extractor] Improve `_generic_title` | pukkandan
2022-10-18 | [cleanup] Misc | pukkandan
Closes #5162
2022-10-14 | [extractor/generic:quoted-html] Add extractor (#5213) | Matthew
Extracts embeds from escaped HTML within `data-html` attribute. Related: https://github.com/ytdl-org/youtube-dl/issues/21294, https://github.com/yt-dlp/yt-dlp/pull/5121 Authored by: coletdjnz Co-authored-by: pukkandan <pukkandan.ytdlp@gmail.com>
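As a rough illustration of the idea described in this commit message (finding embeds inside HTML that is stored escaped in a `data-html` attribute), the following sketch unescapes the attribute value and looks for iframe sources. It is an assumption-laden example, not the extractor's actual code: the function name `extract_quoted_iframe_srcs` and both regular expressions are hypothetical and far simpler than real-world pages require.

```python
import html
import re


def extract_quoted_iframe_srcs(webpage):
    """Return iframe `src` URLs found inside escaped `data-html` attributes.

    Hypothetical helper for illustration only; not part of yt-dlp.
    """
    srcs = []
    # Capture the raw (still escaped) value of each data-html attribute
    for m in re.finditer(r'data-html=(["\'])(?P<escaped>.*?)\1', webpage, re.DOTALL):
        # Turn &lt;iframe ...&gt; back into real markup
        inner = html.unescape(m.group('escaped'))
        srcs.extend(re.findall(r'<iframe[^>]+\bsrc=["\']([^"\']+)', inner))
    return srcs


if __name__ == '__main__':
    page = ('<div data-html="&lt;iframe src=&quot;https://example.com/embed/1&quot;'
            '&gt;&lt;/iframe&gt;"></div>')
    print(extract_quoted_iframe_srcs(page))
```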
2022-10-09 | [extractor/generic] Separate embed extraction into own function (#5176) | pukkandan
2022-10-09 | [extractor/common] Fix `json_ld` type checks (#5145) | Simon Sawicki
Closes #5144, #5143 Authored by: Grub4K
2022-10-03 | [extractor/JWPlatform] Fix extractor (#5112) | coletdjnz
Fix bitrate and filesize extraction and support embeds with unquoted urls. Related: #5106 Authored by: coletdjnz
2022-09-27 | [extractor] Improve json+ld extraction | pukkandan
Related #5035
2022-09-23 | [extractor/wistia] Add support for channels (#4819) | coletdjnz
Fixes https://github.com/yt-dlp/yt-dlp/issues/4748 Related: https://github.com/yt-dlp/yt-dlp/issues/4985 Authored by: coletdjnz
2022-09-22 | [extractor/wistia] Match IDs in embed URLs (#4990) | bashonly
Closes #4985 Authored by: bashonly
2022-09-22 | [docs] Misc improvements | pukkandan
Closes #4987, Closes #4906, Closes #4919, Closes #4977, Closes #4979
2022-09-16 | [extractor/generic] Pass through referer from json-ld | pukkandan
Closes #4941
2022-09-10 | [cleanup] Misc | pukkandan
2022-08-27 | [extractor/generic] Don't return JW player without formats | pukkandan
Closes #4765
2022-08-24 | Fix bug in fe7866d0ed6bfa3904ce12b049a3424fdc0ea1fa | pukkandan
2022-08-14 | [docs] Consistent use of `e.g.` (#4643) | Lesmiscore
Authored by: Lesmiscore
2022-08-02 | [extractors] Use new framework for existing embeds (#4307) | pukkandan
`Brightcove` is difficult to migrate because its subclasses may depend on the signature of the current functions. So it is left as-is for now.
Note: Tests have not been migrated
2022-08-02 | [extractor/html5] Separate into own extractor (#4307) | pukkandan
Closes #4291 Authored by: coletdjnz, pukkandan
2022-08-02 | [extractor/camtasia] Separate into own extractor (#4307) | pukkandan
Authored by: coletdjnz
2022-08-02 | [extractor, test] Basic framework for embed tests (#4307) | pukkandan
and split download tests so they can be more easily run in CI Authored by: coletdjnz
2022-08-02 | [extractor] Framework for embed detection (#4307) | pukkandan
2022-07-07 | [extractor/generic] Remove HEAD request | pukkandan
2022-06-29 | [cleanup] Misc cleanup | pukkandan
2022-06-26 | [cleanup] Fix some typos (#4194) | crazymoose77756
Authored by: crazymoose77756
2022-06-25 | [compat] Remove deprecated functions from core code | pukkandan
2022-06-20 | [cleanup] Misc | pukkandan
2022-06-18 | [extractor/substack] Add extractor (#4011) | Elyse
Closes #3722 Authored by: elyse0
2022-06-13 | [extractor] Handle `json_ld` with multiple `@type`s | pukkandan
Closes: #4022
2022-06-13 | [extractor/rumble] Detect JS embed | pukkandan
Closes #4064
2022-06-13 | [extractor/generic] Revert e6ae51c123897927eb3c9899923d8ffd31c7f85d | pukkandan
85553414ae3007fe866b307b3befd3b9d2423679 made it unnecessary
2022-06-12 | [extractor/jwplatform] Look for `data-video-jw-id` | pukkandan
Closes #3821
2022-05-20 | [tiktok] Detect embeds | pukkandan
Closes #3799
2022-05-18 | [utils] `is_html`: Handle double BOM | pukkandan
Closes #2885
2022-05-18 | [vimeo] Fix extractors | pukkandan
Closes #3037, Closes #2858, Closes #2880, Closes #3712 May also fix #3602, #3360
2022-05-18 | [generic] Refactor `_extract_rss` | pukkandan
Closes #3738
2022-05-17 | [cleanup] Misc cleanup | pukkandan
2022-05-09 | [cleanup] Minor fixes (See desc) | pukkandan
* [youtube] Fix `--youtube-skip-dash-manifest`
* [build] Use `$()` in `Makefile`. Closes #3684
* Fix bug in 385ffb467b2285e85a2a5495b90314ba1f8e0700
* Fix bug in 43d7f5a5d0c77556156a3f8caa6976d3908a1e38
* [cleanup] Remove unnecessary `utf-8` from `str.encode`/`bytes.decode`
* [utils] LazyList: Expose unnecessarily "protected" attributes and other minor cleanup