Sergey M․
cc357c4db8
[extractor/common] Properly handle full URLs
2015-07-16 01:14:52 +06:00
Sergey M․
97f4aecfc1
[extractor/common] Handle malformed f4m manifests
2015-07-16 01:14:08 +06:00
Sergey M․
cf61d96df0
[extractor/common] Add _form_hidden_inputs
2015-07-14 22:38:10 +06:00
Sergey M․
f8da79f828
[extractor/common] Improve _form_hidden_inputs and rename to _hidden_inputs
2015-07-14 22:36:30 +06:00
Sergey M․
27713812a0
[extractor/common] Add method for extracting form hidden input fields as dict
2015-07-10 21:49:09 +06:00
Yen Chi Hsuan
13af92fdc4
[common] Add 'fatal' to _extract_m3u8_formats
2015-07-06 08:39:38 +08:00
Sergey M․
5414623791
[extractor/common] Remove superfluous line
2015-06-29 00:49:19 +06:00
Sergey M․
c342041fba
[extractor/common] Use NO_DEFAULT from utils
2015-06-28 22:56:45 +06:00
Yen Chi Hsuan
621ed9f5f4
[common] Add note and errnote field for _extract_m3u8_formats
2015-06-07 16:33:22 +08:00
Sergey M․
baa43cbaf0
[extractor/common] Relax valid url check verbosity
2015-05-17 02:59:35 +06:00
Yen Chi Hsuan
c1c924abfe
[utils,common] Merge format_srt_time and _subtitles_timecode
...
format_srt_time uses a comma as the delimiter between seconds and
milliseconds while _subtitles_timecode uses a dot. All .srt examples I
found on the Internet uses a comma, so I use a comma in the merged
version. See http://matroska.org/technical/specs/subtitles/srt.html and
http://devel.aegisub.org/wiki/SubtitleFormats/SRT
2015-05-12 13:04:54 +08:00
Yen Chi Hsuan
05d5392cda
[common] Ignore subtitles in m3u8
2015-05-07 18:06:22 +08:00
Sergey M․
74f728249f
[extractor/common] Fallback to empty string for (yet) missing format_id
in _sort_formats
( Closes #5624 )
2015-05-06 21:24:24 +06:00
Jaime Marquínez Ferrándiz
2ddcd88129
Remove code that was only used by the Grooveshark extractor
2015-05-02 17:29:56 +02:00
zouhair
cf0649f8b7
Typo: twice "the the" to "the"
2015-04-29 11:03:10 -04:00
Sergey M․
3ded7bac16
[extractor/common] Add ability to specify custom field preference for _sort_formats
2015-04-20 21:13:31 +06:00
Jaime Marquínez Ferrándiz
08f2a92c9c
InfoExtractor._search_regex: Suggest updating when the regex is not found (suggested in #5442 )
...
Reuse the same message from ExtractorError
2015-04-17 14:55:24 +02:00
Yen Chi Hsuan
c9a779695d
[extractor/common] Add the encoding parameter
...
The QQMusic info extractor need forced encoding for correct working.
2015-04-16 17:34:54 +08:00
Sergey M․
830d53bfae
[utils] Add video_title
for url_result
2015-04-12 23:11:47 +06:00
Sergey M․
e21a55abcc
[extractor/common] Remove f4m section
...
It's now provided by `f4m_id`
2015-04-04 23:05:25 +06:00
Sergey M․
4a34f69ea6
[extractor/common] Add subtitles timecode formatter
2015-03-13 21:38:28 +06:00
Sergey M․
f207019ce5
[extractor/common] Remove 'm3u8' from quality selection URL
2015-03-06 22:53:53 +06:00
Sergey M․
8dc9d361c2
[extractor/common] Fix format_id when last_media
is None and always include m3u8_id
if present
...
The rationale behind `m3u8_id` was to resolve duplicates when processing several m3u8 playlists within the same media that give equal resulting `format_id`'s,
e.g. `youtube-dl http://www.rts.ch/play/tv/passe-moi-les-jumelles/video/la-fee-des-bois-mustang-les-chemins-du-vent?id=3854925 -F`
2015-03-06 22:52:50 +06:00
Philipp Hagemeister
a0bb7c5593
[extractor/common] Improve m3u format IDs ( #5143 )
2015-03-06 10:49:42 +01:00
Sergey M․
2f0f6578c3
[extractor/common] Assume non HTTP(S) URLs valid
2015-03-02 22:38:44 +06:00
Philipp Hagemeister
72a406e7aa
[extractor/common] Pass in video_id ( #5057 )
2015-02-26 01:35:43 +01:00
Antti Ajanki
6f4ba54079
[extractor/common] Extract HTTP (possibly f4m) URLs from a .smil file
2015-02-24 21:22:59 +02:00
Antti Ajanki
637570326b
[extractor/common] Extract the first of a seq of videos in a .smil file
2015-02-24 21:22:59 +02:00
Jaime Marquínez Ferrándiz
bfc993cc91
Merge branch 'subtitles-rework'
...
(Closes PR #4964 )
2015-02-23 17:13:03 +01:00
Sergey M․
9fe6ef7ab2
[extractor/common] Fix preference for m3u8 quality selection URL
2015-02-23 03:30:10 +06:00
Philipp Hagemeister
8fb3ac3649
PEP8: W503
2015-02-21 14:55:13 +01:00
Philipp Hagemeister
77b2986b5b
[extractor/common] Recognize Indian censorship ( #5021 )
2015-02-21 14:51:07 +01:00
Jaime Marquínez Ferrándiz
9868ea4936
[extractor/common] Simplify subtitles handling methods
...
Initially I was going to use a single method for handling both subtitles and automatic captions, that's why I used the 'list_subtitles' and the 'subtitles' variables.
2015-02-17 22:16:29 +01:00
Philipp Hagemeister
fa15607773
PEP8 fixes
2015-02-17 21:46:20 +01:00
Jaime Marquínez Ferrándiz
4cd95bcbc3
[twitch:stream] Prefer the 'source' format ( fixes #4972 )
2015-02-17 18:57:01 +01:00
Sergey M?
4069766c52
[extractor/common] Test URLs with GET
2015-02-17 22:35:27 +06:00
Jaime Marquínez Ferrándiz
360e1ca5cc
[youtube] Convert to new subtitles system
...
The automatic captions are stored in the 'automactic_captions' field, which is used if no normal subtitles are found for an specific language.
2015-02-16 22:47:39 +01:00
Jaime Marquínez Ferrándiz
c84dd8a90d
[YoutubeDL] store the subtitles to download in the 'requested_subtitles' field
...
We need to keep the orginal subtitles information, so that the '--load-info' option can be used to list or select the subtitles again.
We'll also be able to have a separate field for storing the automatic captions info.
2015-02-16 21:51:08 +01:00
Jaime Marquínez Ferrándiz
a504ced097
Improve subtitles support
...
For each language the extractor builds a list with the available formats sorted (like for video formats), then YoutubeDL selects one of them using the '--sub-format' option which now allows giving the format preferences (for example 'ass/srt/best').
For each format the 'url' field can be set so that we only download the contents if needed, or if the contents needs to be processed (like in crunchyroll) the 'data' field can be used.
The reasons for this change are:
* We weren't checking that the format given with '--sub-format' was available, checking it in each extractor would be repetitive.
* It allows to easily support giving a format preference.
* The subtitles were automatically downloaded in the extractor, but I think that if you use for example the '--dump-json' option you want to finish as fast as possible.
Currently only the ted extractor has been updated, but the old system still works.
2015-02-16 21:51:03 +01:00
Philipp Hagemeister
03cd72b007
[extractor/common] Move up filesize
...
filesize and tbr should correlate, so it doesn't make sense to treat them differently.
2015-02-16 04:39:22 +01:00
Jaime Marquínez Ferrándiz
6ca7732d5e
[extractor/common] Fix link to external documentation
2015-02-14 22:20:24 +01:00
Jaime Marquínez Ferrándiz
2d30521ab9
[youtube] Extract average rating ( closes #2362 )
2015-02-11 18:39:31 +01:00
Philipp Hagemeister
9650885be9
[escapist] Filter video differently ( Fixes #4919 )
2015-02-10 15:55:51 +01:00
Philipp Hagemeister
7e5db8c930
[options] Add --no-color
2015-02-10 04:22:10 +01:00
Philipp Hagemeister
3a5bcd0326
[extractor/common] Wrap extractor errors ( Fixes #1194 )
...
For now, we just wrap some common errors. More may follow. We do not want to catch actual programming errors in the extractors, such as 1 // 0.
2015-02-10 01:17:23 +01:00
Naglis Jonaitis
69319969de
[extractor/common] Add new helper method _family_friendly_search
2015-02-08 17:39:00 +02:00
Philipp Hagemeister
1e1896f2de
[extractor/common] Correct sort order.
...
We should look at height and width before ext_preference.
2015-02-06 15:16:45 +01:00
Sergey M․
3900eec27c
[extractor/common] Fix 2.0 manifest extraction ( Closes #4830 )
2015-02-06 04:29:29 +06:00
Sergey M․
60ca389c64
[extractor/common] Prefix f4m/m3u8 entries with identifier
2015-02-05 22:16:27 +06:00
Philipp Hagemeister
9bb8e0a3f9
[wsj] Add new extractor ( Fixes #4854 )
2015-02-03 10:58:28 +01:00