CCExtractor / ccextractor

CCExtractor - Official version maintained by the core team
https://www.ccextractor.org
GNU General Public License v2.0

[QUESTION] How to get Teletext pages with possible subtitles without actual extraction #1034

Open workflowsguy opened 5 years ago

workflowsguy commented 5 years ago

CCExtractor version (using the --version parameter preferably) : 0.87

When running ccextractor against a video file, I get output like this:

Notice: Teletext page with possible subtitles detected: 152
- No teletext page specified, first received suitable page is 152, not guaranteed
Notice: Teletext page with possible subtitles detected: 888
Notice: Teletext page with possible subtitles detected: 151
Notice: Teletext page with possible subtitles detected: 150
Notice: Teletext page with possible subtitles detected: 889

I would like to get this information without ccextractor automatically extracting the subtitles, because I need to choose the page to extract based on service name and desired language. I thought the option -out=report would achieve this, but for the same video file I get this output:

//////// Program #10302: ////////
DVB Subtitles: No
Teletext: Yes
Pages With Subtitles: 
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No

MPEG-4 Timed Text: No

Is this a bug or limitation in the report parameter, or is there a different way to achieve this?
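
A possible stopgap, sketched under assumptions: because a normal run already prints the "Notice: Teletext page with possible subtitles detected: N" lines shown above, a short script can collect the page numbers from captured output. This does not avoid the full parse; it only scrapes the log. The function name `teletext_pages` is hypothetical, and the line format is taken from the sample output in this post, not from any documented interface.

```python
import re
from typing import List

# Pattern copied from the sample log in this post. The exact wording is an
# assumption and may differ between ccextractor versions.
NOTICE_RE = re.compile(
    r"^Notice: Teletext page with possible subtitles detected: (\d+)\s*$",
    re.MULTILINE,
)


def teletext_pages(log_text: str) -> List[int]:
    """Return detected teletext page numbers in order of first appearance."""
    pages: List[int] = []
    for match in NOTICE_RE.finditer(log_text):
        page = int(match.group(1))
        if page not in pages:  # skip repeated notices for the same page
            pages.append(page)
    return pages


# Sample text taken verbatim from the output quoted in this post.
SAMPLE_LOG = """\
Notice: Teletext page with possible subtitles detected: 152
- No teletext page specified, first received suitable page is 152, not guaranteed
Notice: Teletext page with possible subtitles detected: 888
Notice: Teletext page with possible subtitles detected: 151
Notice: Teletext page with possible subtitles detected: 150
Notice: Teletext page with possible subtitles detected: 889
"""

if __name__ == "__main__":
    print(teletext_pages(SAMPLE_LOG))
```

The captured text could come from redirecting ccextractor's console output to a file. Which stream the notices are written to is not stated in this thread, so capturing both stdout and stderr is the safer assumption.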

navimakarov commented 5 years ago

@workflowsguy you can extract captions to the console only with the -stdout parameter. Note that ccextractor will ignore -o, -o1 and -o2 in this case. So the simplest way to use this parameter is: input_file -stdout. You can read more about this option (and other available ccextractor parameters) here: https://ccextractor.org/public:general:command_line_usage

workflowsguy commented 5 years ago

@MakarovGCI2018 sorry, but I do not understand how your answer relates to my question. Using -stdout still causes ccextractor to parse the video file completely, which is not what I want.

navimakarov commented 5 years ago

@workflowsguy sorry for the misunderstanding. So you just want to get info about teletext pages with possible subtitles using -out=report, without actually processing the file, right?

workflowsguy commented 5 years ago

@navimakarov, sorry for the long delay in replying. Yes, I need some way to find out whether the video file contains teletext pages with possible subtitles without actually starting the extraction process.

neilmehta31 commented 3 years ago

Hey guys, I am new to open source. I would like to work on this issue if it is still open. Please guide me how to get started and work on this issue.

cfsmp3 commented 3 years ago

@neilmehta31 By all means go ahead. Usually you want to start by reproducing it.

neilmehta31 commented 3 years ago

Hey @cfsmp3, I got the following output while running against one of the sample recording files given on the website:

Stream Mode: Transport Stream
Program Count: 1
Program Numbers: 4287 
PID: 201, Program: 4287, MPEG-2 video
PID: 202, Program: 4287, MPEG-1 audio
PID: 205, Program: 4287, DVB Subtitles
PID: 206, Program: 4287, MPEG-1 audio
PID: 250, Program: 4287, MPEG-2 private table sections
PID: 7201, Program: 4287, ISO/IEC 13818-6 type B
PID: 7219, Program: 4287, ISO/IEC 13818-6 type B
PID: 7270, Program: 4287, MPEG-2 private table sections
//////// Program #4287: ////////
DVB Subtitles: Yes
Teletext: No
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No

MPEG-4 Timed Text: No

If it's the wrong file, could you please provide a link to the right one so I can reproduce the issue? Thanks
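
When checking whether a sample is suitable for reproducing the issue, report output like the above can be turned into a dictionary so that fields such as "Teletext" can be tested per program. This is a minimal sketch that assumes the layout seen in this thread: a "//////// Program #N: ////////" header followed by "Key: Value" lines, ending at the first blank line. The function name `parse_report` is hypothetical and the format is inferred from the pasted output, not from a documented specification.

```python
import re
from typing import Dict, Optional

# Header layout inferred from the report output pasted in this thread.
HEADER_RE = re.compile(r"^/+ Program #(\d+): /+\s*$")


def parse_report(report_text: str) -> Dict[int, Dict[str, str]]:
    """Map each program number to its "Key: Value" fields."""
    programs: Dict[int, Dict[str, str]] = {}
    current: Optional[Dict[str, str]] = None
    for line in report_text.splitlines():
        header = HEADER_RE.match(line)
        if header:
            current = programs.setdefault(int(header.group(1)), {})
            continue
        if current is None:
            continue  # ignore lines that appear before any program header
        if not line.strip():
            current = None  # assumption: a blank line ends the program block
            continue
        key, sep, value = line.partition(":")
        if sep:
            current[key.strip()] = value.strip()
    return programs


# Excerpt of the report pasted in the comment above.
SAMPLE_REPORT = """\
PID: 205, Program: 4287, DVB Subtitles
PID: 206, Program: 4287, MPEG-1 audio
//////// Program #4287: ////////
DVB Subtitles: Yes
Teletext: No
ATSC Closed Caption: Yes
EIA-608: No
CEA-708: No

MPEG-4 Timed Text: No
"""

if __name__ == "__main__":
    for number, fields in parse_report(SAMPLE_REPORT).items():
        print(number, "has teletext:", fields.get("Teletext") == "Yes")
```

For this sample the "Teletext" field is "No", which suggests the file does not exercise the teletext code path. An empty "Pages With Subtitles" value, as in the original post, would come back as an empty string.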

siv2r commented 3 years ago

When running ccextractor against a video file, I get an output e.g. like this:

Notice: Teletext page with possible subtitles detected: 152
- No teletext page specified, first received suitable page is 152, not guaranteed
Notice: Teletext page with possible subtitles detected: 888
Notice: Teletext page with possible subtitles detected: 151
Notice: Teletext page with possible subtitles detected: 150
Notice: Teletext page with possible subtitles detected: 889

@workflowsguy can you provide the video sample used? I am unable to reproduce this.

I ran ccextractor path_to_file on the video file containing teletext subtitles (provided on ccextractor's TV samples page) and got the following results:

Opening file: linux/tests/teletex_test.mpg
File seems to be a transport stream, enabling TS mode
Analyzing data in general mode
VBI/teletext stream ID 1044 (0x414) for SID 1040 (0x410)
- Programme Identification Data = FAB-TELETEXT SYSTEM 
- Universal Time Co-ordinated = Mon Jan  9 18:27:47 2017
100%  |  24:51
Teletext decoder: 37286 packets processed 

Number of NAL_type_7: 0
Number of VCL_HRD: 0
Number of NAL HRD: 0
Number of jump-in-frames: 0
Number of num_unexpected_sei_length: 0

Min PTS:                04:21:47:620
Max PTS:                04:46:39:020
Length:              00:24:51:400
Done, processing time = 3 seconds

The output contains no line like "Notice: Teletext page with possible subtitles detected: 152".

85ayush commented 3 years ago

I would love to work on this issue. I am a beginner; can you please guide me?

vaishnavi192 commented 9 months ago

Hey, I want to work on this issue, @workflowsguy. Please tell me how to get started; I am a beginner.

workflowsguy commented 9 months ago

@vaishnavi192, I am the wrong person to ask for guidance. I asked this question here 5 years ago and it has not been answered/addressed to my satisfaction since then. I have long since moved on.