~ubuntu-branches/ubuntu/natty/gst-entrans/natty

\*(T<\fB\-\-vn \fR\*(T>\fIstreamnumber[,\&...]\fR, \*(T<\fB\-\-an \fR\*(T>\fIstreamnumber[,\&...]\fR, \*(T<\fB\-\-on \fR\*(T>\fIstreamnumber[,\&...]\fR

274

[multi-valued] Only the video, audio or other streams with (respectively) listed

275

\fIstreamnumber\fR will be considered, others disregarded.

276

Furthermore, streams of each type will be muxed into the target in the order

277

in which their \fIstreamnumber\fRs are given in these options,

278

and in overall first video, then audio and others.

279

.TP

280

\*(T<\fB\-\-sync\-link\fR\*(T>

281

This option is mainly meant for testing and diagnostic purposes.

282

It basically disables the stream(re)-ordering mechanism as implied by and

283

explained in the above option (though still retains stream selection).

284

.TP

285

\*(T<\fB\-\-at \fR\*(T>\fItag[,\&...]\fR

286

[multi-valued] Audio streams can (though need not) be accompanied by a language tag

287

(typically a 2- or 3-letter language code). The regular expression

288

\fItag\fR

289

is matched against each detected audio stream's language tag and only

290

streams without language tag or streams with a matching language tag are

291

processed, others disregarded. This selection method \fIcannot\fR

292

be combined with the above streamnumber based selection.

293

294

.RS

295

\fBNote\fR

296

297

The current (or another?) method of

298

implementing this selection may very well not work with all muxer elements. As

299

such, this option can be given a try, but if not successful, the much more

300

robust \*(T<\fB\-\-an\fR\*(T> should be used instead. The proper

301

number(s) to use for this may be derived from (ordered) tag information that is

302

provided by some elements and reported at start-up.

303

.RE

304

.TP

305

\*(T<\fB\-\-stamp\fR\*(T>

306

This is enabled by default, and makes \fBentrans\fR insert an \fIidentity\fR element

307

(with single-segment True) before the connected pipeline-fragment

308

to perform timestamp re-sequencing. This is typically useful when

309

muxing into a format that records timestamps (and does not hurt

310

when the format does not).

311

.SS "BASIC OPTIONS"

312

Whereas no specific dependencies exist for all other options,

313

the REQUIREMENTS section applies particularly

314

to the following options (in more or less degree).

315

.TP

316

\*(T<\fB\-c \fR\*(T>\fI[[(]format[)]:]t1-t2[,\&...]\fR, \*(T<\fB\-\-cut \fR\*(T>\fI[[(]format[)]:]t1-t2[,\&...]\fR

317

[multi-valued]

318

Only process the indicated sections of the pipeline (by default the complete

319

input is processed). The option \*(T<\fB\-s\fR\*(T> determines the method used

320

for selecting the desired data.

321

322

If no \fIformat\fR is provided,

323

the \fItN\fR parameters can be given in timecode format

324

HH:MM:SS.FRACTION or as a (video)framenumber, which is a number following

325

\*(T<f\*(T> or \*(T<F\*(T>. Any buffer that

326

overlaps with the indicated section is taken into account. The last indicated

327

section may be open-ended, i.e. the end point may be omitted.

328

329

330

However, if \fIformat\fR is the nickname of a pre-defined

331

or custom format (defined by some element in the pipeline), then it is used as

332

unit for the \fItN\fR numbers.

333

In this case, option \*(T<\fB\-s\fR\*(T> below must select a seek-based method,

334

and the seek will be executed in \fIformat\fR if it is provided

335

without matching (\&...), otherwise the given units will be (query) converted

336

to time format first, and these results will be used to seek in time.

337

.TP

338

\*(T<\fB\-s \fR\*(T>\fImethod\fR, \*(T<\fB\-\-section \fR\*(T>\fImethod\fR

339

Possible choices for \fImethod\fR are:

340

.RS

341

.TP

342

\*(T<seek\*(T>

343

Sections are selected by means of regular GStreamer seeks.

344

A flushing seek is performed for the first section,

345

segment seek for the others, and a normal seek for the last section.

346

This is also the default method and is OK for most circumstances.

347

In some cases, however, the other methods may be in order,

348

e.g. when (the driving element in) the pipeline does not support

349

some type of seek that would be used.

350

.TP

351

\*(T<seek\-key\*(T>

352

This is similar to the case above, but each seek is also a keyframe seek,

353

that is, the actual selection may not be exactly the one

354

that was requested but may start at the nearest keyframe

355

preceding the desired start.

356

This would typically be required when performing re-muxing

357

without fully decoding and re-encoding the material.

358

.TP

359

\*(T<cut\-time\*(T>

360

In this case, the pipeline is run from the start (without any seeks)

361

and all but the desired sections are disregarded.

362

The decision whether or not to drop a piece of data is based on its timestamp.

363

This is the recommended (only possible) method to use

364

when performing A/V sync correction using data-timestamp shifting,

365

e.g. by means of the \fIshift\fR element. It can also be used

366

to end a live source(s) driven pipeline after a specified duration.

367

.TP

368

\*(T<cut\*(T>

369

This method applies in case any/all of the above fail, e.g. some element

370

does not support seek or unreliable timestamps are produced in the pipeline.

371

No seeks or anything special is done, the pipeline is run from the start and all but the

372

indicated sections is dropped. Timestamps on the datastream are disregarded,

373

and the incoming data is restamped based on framecount and framerate

374

(video), or on sample count and samplerate (audio).

375

In particular, audio cutting is performed up to sample precision.

376

.RE

377

378

Note that the last 2 methods require the desired sections to be in ascending

379

order, whereas the former methods make it possible to select sections

380

that are out-of-order with regards to the input material.

381

.TP

382

\*(T<\fB\-a\fR\*(T>

383

Perform (audio) sample precision selection, that is, it is possible for only parts of

384

buffers to be passed or dropped. This is done by default

385

in the \*(T<cut\*(T> method above, but not for the other methods.

386

.TP

387

\*(T<\fB\-\-dam\fR\*(T>

388

Indicate, or rather, confirm that \fIdam\fR elements are being used

389

in \fIraw\fR pipelines.

390

Otherwise, surrogate dam elements will be searched for and used instead.

391

These are identified as elements whose name is of the form

392

\*(T<dam\fI<digits>\fR\*(T>.

393

.TP

394

\*(T<\fB\-f \fR\*(T>\fIframerate\fR, \*(T<\fB\-\-framerate \fR\*(T>\fIframerate\fR

395

The framerate that is used for framenumber to time conversion is normally

396

auto-detected (soon enough) within the pipeline. If this were not to succeed,

397

then \fIframerate\fR,

398

specified as \fINUM[/DENOM]\fR,

399

is used as a fallback instead of the otherwise default 25/1.

400

.TP

401

\*(T<\fB\-b\fR\*(T>, \*(T<\fB\-\-block\-overrun\fR\*(T>

402

This makes \fBentrans\fR prevent automatic adjustments to queues

403

(in a \fIdecodebin\fR), thereby keeping them at fixed size.

404

405

Despite all the automagic, the pipeline may stall in exotic cases

406

(e.g. some strange behaving element/muxer, \&...). A good first thing to try

407

then is to configure queues at a larger-than-default setting (see for example

408

following section) and to use this option to ensure they really remain

409

configured as intended without any other influence.

410

.SS "PIPELINE CONFIGURATION"

411

Although element properties are typically set in the pipeline descriptions,

412

the following options can be useful for dynamically created elements (see for

413

instance the dvd example in EXAMPLES) or when it is desired to configure a property

414

syntactically separated from the pipeline on the command line.

415

.TP

416

\*(T<\fB\-\-set\-prop \fR\*(T>\fIelement:prop:value[:time] \fR\*(T<\fB\fR\*(T>

417

Sets property \fIprop\fR of

418

\fIelement\fR to \fIvalue\fR,

419

where \fIelement\fR can either be an element-type, or the

420

name or full path-string of a specific element.

421

If \fIprop\fR is a \*(T<GST_CONTROLLABLE\*(T>

422

property (use \fBgst-inspect\fR(1) to determine this),

423

then a (pipeline media) \fItime\fR at which

424

\fIprop\fR

425

should be set to this particular \fIvalue\fR

426

(using the controller framework) can be given as well.

427

428

In general, a value given for a property within the pipeline description

429

will override any value provided this way. In case of \fIqueue\fRs,

430

however, both \fIdecodebin\fR and \fBentrans\fR will override some

431

properties' values in order to minimize the possibility of blocking.

432

Though it is not recommended, \*(T<\fBset\-prop\fR\*(T> can be used

433

to forcibly overwrite such aforementioned defaults.

434

435

The rank of an element (as a plugin feature), which (a.o.) determines whether or

436

not it is considered for auto-plugging (in dynamic mode) is considered

437

as a pseudo-property \*(T<pf_rank\*(T> and can therefore be set

438

in this way as well.

439

.TP

440

\*(T<\fB\-\-vb \fR\*(T>\fIvkbitrate\fR, \*(T<\fB\-\-ab \fR\*(T>\fIakbitrate\fR

441

Sets the \*(T<bitrate\*(T> property of any video or audio element to

442

\fIvkbitrate\fR [ * 1000] or

443

\fIakbitrate\fR [ * 1000] respectively,

444

depending on whether the property expects to be provided with (k)bitrate.

445

.TP

446

\*(T<\fB\-\-vq \fR\*(T>\fIvquality\fR, \*(T<\fB\-\-aq \fR\*(T>\fIaquality\fR

447

Sets either the \*(T<quality\*(T> or \*(T<quantizer\*(T>

448

property of any video or audio element to \fIvquality\fR or

449

\fIaquality\fR respectively.

450

.TP

451

\*(T<\fB\-\-pass \fR\*(T>\fIpass\fR

452

Sets the \*(T<pass\*(T> property of any video element to

453

\fIpass\fR, which

454

must be \*(T<1\*(T> or \*(T<2\*(T>.

455

.TP

456

\*(T<\fB\-t \fR\*(T>\fItag:value\fR\*(T<\fB \fR\*(T>

457

[multi-valued]

458

\fBentrans\fR locates a \*(T<TagSetter\*(T> in

459

the pipeline and sets each given \fItag\fR to

460

\fIvalue\fR.

461

462

A list of possible tags can be found in

463

.URL http://www.gstreamer.net/data/doc/gstreamer/head/gstreamer/html/gstreamer-GstTagList.html "GStreamer core documentation"

464

\&.

465

.SS "REPORTING OPTIONS"

466

By default, \fBentrans\fR reports the following items at startup:

467

.TP 0.2i

468

\(bu

469

an overview of elements found in the pipeline,

470

along with the current values of properties that differ from their

471

usual (default) setting;

472

tags announced by an element are considered as a pseudo-property

473

\*(T<tag\*(T> of the element

474

.TP 0.2i

475

\(bu

476

a set of distinct caps that have been found flowing through the pipeline,

477

.TP 0.2i

478

\(bu

479

the (video) queues found in the pipeline (with their neighbours),

480

and their maximum capacity settings (size, buffers, time)

481

.PP

482

After that, it provides the following in regular intervals, if already available

483

or applicable:

484

.TP 0.2i

485

\(bu

486

(time) position in input stream,

487

.TP 0.2i

488

\(bu

489

movie time: position in output stream, and the total expected output movie time

490

.TP 0.2i

491

\(bu

492

processing speed: ratio of elapsed (input) stream time to elapsed CPU time

493

.TP 0.2i

494

\(bu

495

ETA: expected time to completion of processing;

496

this calculation \fIalways\fR uses elapsed system clock time,

497

regardless of options below,

498

.TP 0.2i

499

\(bu

500

amount of buffers presently contained in the queues reported at startup,

501

.TP 0.2i

502

\(bu

503

if a specific output file can be identified, (combined) bitrate so far.

504

.PP

505

The following options can influence some of this behaviour:

506

.TP

507

\*(T<\fB\-d \fR\*(T>\fIdelay\fR, \*(T<\fB\-\-delay \fR\*(T>\fIdelay\fR

508

Sets the interval between progress updates to \fIdelay\fR

509

seconds. The default delay is 2 seconds.

510

.TP

511

\*(T<\fB\-\-timeout \fR\*(T>\fItimeout\fR

512

As an \fBentrans\fR run spins up, it transitions through a number of stages, which

513

should normally follow each other in pretty rapid succession.

514

\fItimeout\fR, by default 4, is the maximum interval

515

(in seconds) that will be allowed between such stages. If the interval is exceeded,

516

\fBentrans\fR may simply abort altogether or try to recover the situation

517

(depending on the stage), the success of which depends on the cause

518

(e.g. a misbehaving element, or a badly constructed pipeline, \&...).

519

Evidently, a high \fItimeout\fR value essentially renders

520

this check mute. Setting it to 0 completely disables this check as well

521

as some other mechanisms employed to support it, and is not normally advisable.

522

.TP

523

\*(T<\fB\-\-progress\-fps\fR\*(T>

524

Makes regular reports also provide processing speed in fps, which is calculated

525

using either auto-detected framerate or provided by \*(T<\fB\-f\fR\*(T>.

526

.TP

527

\*(T<\fB\-\-progress\-real\fR\*(T>

528

Calculate processing speed based on elapsed system clock time (instead of CPU-time).

529

.PP

530

In the following, \fIproppattern\fR

531

is a regular expression that will be matched a combination of

532

an \fIelement\fR and (optionally) one its properties

533

\fIprop\fR. More precisely, this combination

534

matches if the regular expression matches any of the following:

535

.TP 0.2i

536

\(bu

537

\fIelement's factory name\fR.\fIprop\fR

538

.TP 0.2i

539

\(bu

540

\fIelement's name\fR.\fIprop\fR

541

.TP 0.2i

542

\(bu

543

\fIelement's path name\fR.\fIprop\fR

544

.PP

545

In each case, the last part is omitted if there is no

546

\fIprop\fR in the context in question.

547

.PP

548

Similarly, \fImsgpattern\fR is matched against

549

expressions as above, but with \fIprop\fR replaced by

550

\fImessage type name\fR.\fImessage structure name\fR

551

Again, in each case, the last part is omitted if there is no structure for the

552

message in question.

553

.TP

554

\*(T<\fB\-m\fR\*(T>

555

Output messages posted on the pipeline's bus

556

.TP

557

\*(T<\fB\-v\fR\*(T>

558

Provide output on property changes of the pipeline's elements.

559

This output can be filtered using \*(T<\fB\-x\fR\*(T>

560

.TP

561

\*(T<\fB\-\-short\-caps\fR\*(T>

562

Do not perform complete caps to string conversion, instead replace e.g. buffers

563

with their (string) type representation.

564

This can make for more comfortable display of e.g. Vorbis and Theora related caps.

565

.TP

566

\*(T<\fB\-\-ignore\-msg \fR\*(T>\fImsgpattern[,\&...]\fR\*(T<\fB \fR\*(T>, \*(T<\fB\-\-display\-msg \fR\*(T>\fImsgpattern[,\&...]\fR\*(T<\fB \fR\*(T>

567

[multi-valued] If message reporting is enabled by \*(T<\fB\-m\fR\*(T>,

568

only report on those that match \*(T<\fB\-\-display\-msg\fR\*(T>

569

or do not match \*(T<\fB\-\-ignore\-msg\fR\*(T>

570

.TP

571

\*(T<\fB\-x \fR\*(T>\fIproppattern[,\&...]\fR\*(T<\fB \fR\*(T>, \*(T<\fB\-\-exclude \fR\*(T>\fIproppattern[,\&...]\fR\*(T<\fB \fR\*(T>, \*(T<\fB\-\-include \fR\*(T>\fIproppattern[,\&...]\fR\*(T<\fB \fR\*(T>

572

[multi-valued] If property change reporting is enabled by \*(T<\fB\-v\fR\*(T>,

573

only report those on properties that match \*(T<\fB\-\-include\fR\*(T>

574

or do not match \*(T<\fB\-\-exclude\fR\*(T>

575

.TP

576

\*(T<\fB\-\-display\-prop \fR\*(T>\fIproppattern[,\&...]\fR\*(T<\fB \fR\*(T>, \*(T<\fB\-\-ignore\-prop \fR\*(T>\fIproppattern[,\&...]\fR\*(T<\fB \fR\*(T>

577

[multi-valued] An element's property (value) is reported at start-up if and only if

578

it matches an expression given in \*(T<\fB\-\-display\-prop\fR\*(T>

579

or its value differs from the usual and it does not match

580

\*(T<\fB\-\-ignore\-prop\fR\*(T>.

581

Also, in any case, an element's presence in the pipeline is at least

582

mentioned by default, unless the element (by itself) matches

583

\*(T<\fB\-\-ignore\-prop\fR\*(T>

584

.SS "CONFIGURATION OPTIONS"

585

Each \fBentrans\fR option \(embe it one affecting \fBentrans\fR'

586

run-time behaviour or affecting pipeline element (properties)\(em

587

can also be provided on a more permanent basis using a configuration

588

file.

589

Such a file consists of sections, led by

590

a \*(T<[section]\*(T> header and

591

followed by \*(T<name: value\*(T> entries,

592

\*(T<name=value\*(T> is also accepted.

593

Note that leading whitespace is removed from values.

594

Lines beginning with \*(T<#\*(T> or

595

\*(T<;\*(T> are ignored and may be used to

596

provide comments.

597

.PP

598

In the special section \*(T<[options]\*(T>,

599

each entry \*(T<name: value\*(T>

600

is equivalent to providing \*(T<\fB\-\-\fR\*(T>\fIname\fR\*(T<\fB

601

\fR\*(T>\fIvalue\fR on the command-line,

602

where \fIname\fR must be an option's longname.

603

The name of any other section is interpreted as an element-type

604

or an element, with each entry providing a value for a property.

605

Otherwise put, each \*(T<prop: value\*(T>

606

in a section

607

\*(T<[\fIelement\fR]\*(T>

608

is equivalent to mentioning it in \*(T<\fB\-\-set\-prop\fR\*(T> as

609

\fIelement:prop:value\fR

610

.PP

611

By default, the configuration file is called \*(T<\fI.gst\-entrans\fR\*(T>

612

and is searched for in the current directory and in the user's home directory

613

(in that order).

614

Any setting provided on the command line for a single-valued option

615

(e.g. a boolean option) overrides a similar value given in a configuration file,

616

whereas values provided for multi-valued ones append to those already provided.

617

.TP

618

\*(T<\fB\-\-config \fR\*(T>\fIfile\fR

619

Use \fIfile\fR \fIinstead\fR of any

620

default configuration file.

621

.TP

622

\*(T<\fB\-\-profile \fR\*(T>\fIfile\fR

623

Load \fIfile\fR \fIafter\fR other

624

configuration files, either default ones or given by \*(T<\fB\-\-config\fR\*(T>.

625

This option derives its name from its use in loading certain export or

626

encoder profiles (e.g. MPEG1, MPEG2, etc),

627

which are mainly a collection of presets for certain properties that can

628

be kept in corresponding profile configuration files.

629

.SS "MISCELLANEOUS OPTIONS"

630

.TP

631

\*(T<\fB\-\-save \fR\*(T>\fImessage:file[:append][,\&...] \fR\*(T<\fB\fR\*(T>

632

[multi-valued]

633

Makes \fBentrans\fR save a custom element \fImessage\fR to

634

\fIfile\fR,

635

that is, the string representation of the message's structure is saved

636

(followed by a \fBlinefeed\fR).

637

If \fIappend\fR is true (e.g. \*(T<1\*(T>, \*(T<t\*(T>,

638

\*(T<T\*(T>), then all messages are appended to \fIfile\fR,

639

otherwise the default is to truncate \fIfile\fR whenever

640

\fImessage\fR is received, thereby saving only the

641

most recently received \fImessage\fR.

642

For good measure,

643

it might be noted here there was a change in version 0.10.15 in a structure's

644

string representation with respect to usage of a terminating \fB;\fR.

645

646

\fIfile\fR will be subject to having \*(T<${n}$\*(T>

647

replaced by the name of the element sending \fImessage\fR.

648

.SH EXAMPLES

649

More examples of pipelines can also be found in \fBgst-launch\fR(1);

650

the ones below are primarily meant to illustrate some of

651

\fBentrans\fR' features.

652

.PP

653

In each case, everything could also be written on one line, without the backslash

654

character. In some cases, some particular quoting is needed to prevent

655

(mis)interpretation by the shell.

656

.PP

657

\fBBasic transcoding\fR

658

.PP

659

.nf

660

\*(T<

661

entrans.py \-i dvd://2 \-o dvd.ogm \-\-short\-caps \-\- \e

662

\-\-video ! theoraenc \-\-audio audioconvert ! vorbisenc

663

\*(T>.fi

664

665

Transcodes complete title 2 from a DVD into a mixed Ogg containing all video

666

and audio tracks (but no subtitles) using default settings, while preventing

667

some extensive caps display.

668

.PP

669

\fBSelected stream transcoding\fR

670

.PP

671

.nf

672

\*(T<

673

entrans.py \-\-set\-prop dvdreadsrc:device:/mnt \-i dvd://2 \-o dvd.mkv \e

674

\-\-set\-prop dvdsubdec:pf_rank:128 \-\-on 6,12 \-\-an 1,2 \-\-vq 4 \-\-ab 256 \-\- \e

675

\-\-video ffenc_mpeg4 \-\-audio audioconvert ! lame \e

676

\-\-other ffmpegcolorspace ! ffenc_mpeg4 ! matroskamux ! filesink location='sub\-${on}.mkv'

677

\*(T>.fi

678

679

Transcodes video and selected audio tracks from a DVD image mounted at /mnt

680

into a Matroska file, using the indicated fixed quantization and bitrate for

681

video and audio respectively. Some selected subtitle tracks are also encoded

682

as separate video streams into other Matroska files. Note that the rank of

683

the \fIdvdsubdec\fR must be increased for a subtitle stream to be

684

considered for decoding and subsequently made available.

685

.PP

686

\fBExtensive progress update transcoding\fR

687

.PP

688

.nf

689

\*(T<

690

entrans.py \-i example\-in.avi \-o example\-out.mkv \-\- \e

691

\-\-video tee name=tee ! queue ! theoraenc \e

692

tee. ! queue ! timeoverlay ! xvimagesink sync=false qos=false \e

693

\-\-audio audioconvert ! lame

694

\*(T>.fi

695

696

Transcodes into Matroska using Theora and MP3 codecs, while providing for

697

(excessive) live preview progress info.

698

.PP

699

\fBPartial stream transcoding\fR

700

.PP

701

.nf

702

\*(T<

703

entrans.py \-i dvd://2 \-o dvd.mkv \-c chapter:5\- \-\-short\-caps \-\- \e

704

\-\-video ! identity single\-segment=true ! ffenc_mpeg4 \e

705

\-\-audio audioconvert ! identity single\-segment=true ! vorbisenc

706

\*(T>.fi

707

708

Transcodes all video and audio tracks from a DVD title 2 (chapter 5 to the end)

709

into Matroska using MPEG4 and Vorbis codecs (while catering for properly

710

timestamped input for the container which records these faithfully for posterity).

711

.PP

712

\fBPass-through transcoding\fR

713

.PP

714

.nf

715

\*(T<

716

entrans.py \-s seek\-key \-c 60\-180 \-\-dam \-\- \-\-raw \e

717

filesrc location=example\-in.avi ! avidemux name=demux \e

718

avimux name=mux ! filesink location=example\-out.avi \e

719

demux.video_00 ! queue ! dam ! queue ! mux.video_0 \e

720

demux.audio_00 ! queue ! dam ! queue ! mad ! audioconvert ! lame ! mux.audio_0

721

\*(T>.fi

722

723

Transcodes a particular section from an AVI file into another AVI file without

724

re-encoding video (but does re-encode audio, which is

725

recommended). The output will range from (approximately) 60 seconds into the

726

input up to 180 seconds; actually there will be a (key)seek to the nearest

727

keyframe just before the 60 seconds point to ensure the integrity of later

728

decoding of the output.

729

In addition, \fBentrans\fR will report changes on any object's properties, except

730

for any (pad's) caps.

731

.PP

732

The pipeline above uses raw mode, and as such must abide

733

by some rules and recommendations in pipeline building (see MUXING PIPELINES),

734

which leads in particular to the above abundance in \fIqueue\fRs.

735

With some extra configuration, pass-through could also be performed in

736

dynamic mode as follows (assuming that \*(T<video/mpeg\*(T> properly

737

describes the encoded video content):

738

739

.nf

740

\*(T<

741

entrans.py \-s seek\-key \-c 60\-180 \-\-decoder decodebin2 \e

742

\-\-set\-prop 'decodebin2:caps:video/mpeg;audio/x\-raw\-int;audio/x\-raw\-float' \e

743

\-i example\-in.avi \-o example\-out.avi \-\- \e

744

\-\-video capsfilter caps=video/mpeg \e

745

\-\-audio audioconvert ! lame

746

\*(T>.fi

747

.PP

748

\fBLive recording\fR

749

.PP

750

.nf

751

\*(T<

752

entrans.py \-s cut\-time \-c 0\-60 \-v \-x '.*caps' \-\-dam \-\- \-\-raw \e

753

v4l2src queue\-size=16 ! video/x\-raw\-yuv,framerate=25/1,width=384,height=576 ! \e

754

stamp sync\-margin=2 silent=false progress=0 ! queue ! \e

755

dam ! ffenc_mpeg4 name=venc \e

756

alsasrc buffer\-time=1000000 ! audio/x\-raw\-int,rate=48000,channels=1 ! queue ! \e

757

dam ! audioconvert ! queue ! lame name=aenc \e

758

avimux name=mux ! filesink location=rec.avi aenc. ! mux. venc. ! mux.

759

\*(T>.fi

760

761

Records 1 minute of video and audio from a video4linux device and features

762

additional synchronization and reporting on object property changes (if any),

763

which includes reports on frame drops or duplications, although

764

(pad's) caps changes are excluded for convenience.

765

.PP

766

\fB2-pass transcoding\fR

767

.PP

768

.nf

769

\*(T<

770

entrans.py \-i example\-in.avi \-o /dev/null \-\-muxer avimux \-\-vb 1200 \-\-pass 1 \e

771

\-\-save astat:astat.log \-\- \-\-video ffenc_mpeg4 \e

772

\-\-audio audioconvert ! astat ! fakesink

773

SCALE="$(cat astat.log | sed 's/astat, scale=(double)//' | sed 's/;//')"

774

entrans.py \-i example\-in.avi \-o example\-out.avi \-\-vb 1200 \-\-pass 2 \e

775

\-\-tag 'comment:2\-pass demonstration' \-\- \e

776

\-\-video ffenc_mpeg4 \-\-audio audioconvert ! volume volume=$SCALE ! lame

777

\*(T>.fi

778

779

Performs 2-pass transcoding from one AVI into another. The first pass also

780

determines and saves the maximum volume scaling that can safely be applied

781

without having to resort to clipping. It does not bother performing audio encoding

782

or producing an output file. Although the particular (encoder compatible) muxer

783

is hardly relevant here, one is nevertheless indicated explicitly

784

as a reasonablechoice cannot be determined

785

from \*(T</dev/null\*(T>. After some scripting to retrieve the

786

saved value from a file, the second pass performs volume scaling and encoding.

787

It also sets a comment (tag) in the resulting file to note its lofty goal.

788

.PP

789

\fBConfiguration file\fR

790

.PP

791

.nf

792

\*(T<

793

[options]

794

ignore\-prop=.*src.*,.*sink.*,dam.*,queue.*,identity.*,.*decodebin.*

795

display\-prop=.*\e.tag,.*\e.bitrate$,.*bframes,.*quantizer,.*\e.pass,.*\e.queue\-size

796

797

[ffenc_mpeg4]

798

me\-method=epzs

799

max\-bframes=0

800

gop\-size=250

801

802

[dvdsubdec]

803

pf_rank = 128

804

\*(T>.fi

805

806

A basic, though adequate configuration file that filters out some

807

(usually less interesting) information on some technical

808

technical elements, while making sure on the other hand that some other

809

settings get displayed in any case. In addition, an element's properties

810

can be given defaults (other than the hardcoded ones), and the rank

811

of \fIdvdsubdec\fR is increased so that subtitles streams will

812

also be provided.

813

.PP

814

\fBProfile configuration\fR

815

.PP

816

Some examples of (encoding) profiles that can be passed to

817

\*(T<\fB\-\-profile\fR\*(T> (each profile is in a separate file).

818

Note that profiles also impose

819

constraints on e.g. width and height which are not automagically

820

enforced; one must still take care of this by means of e.g. proper scaling.

821

Similarly, the elements that are to perform the required encoding

822

must be properly (manually) specified, though their configuration

823

is then taken care of by the examples below.

824

825

.nf

826

\*(T<

827

# vcd:

828

# 352 x 240|288

829

# 44.1kHz, 16b, 2ch, mp2

830

831

[options]

832

vb = 1150

833

ab = 224

834

835

[ffenc_mpeg1video]

836

bitrate = 1150

837

# <= 10

838

gop\-size = 9

839

rc\-min\-rate = 1150

840

rc\-max\-rate = 1150

841

rc\-buffer\-size = 320

842

rc\-buffer\-aggressivity = 99

843

844

[mpeg2enc]

845

format = 1

846

\*(T>.fi

847

.PP

848

.nf

849

\*(T<

850

# \-\-\-\-\-\-\-\-

851

# svcd:

852

# 480 x 480|576

853

# 44.1kHz, 16b, 2ch, mp2

854

855

[options]

856

vb = 2040

857

# >= 64, <= 384

858

ab = 224

859

860

[ffenc_mpeg2video]

861

bitrate = 2040

862

# <= 19

863

gop\-size = 15

864

# ntsc: gop\-size = 18

865

rc\-min\-rate = 0

866

rc\-max\-rate = 2516

867

rc\-buffer\-size = 1792

868

rc\-buffer\-aggressivity = 99

869

flags = scanoffset

870

871

[mpeg2enc]

872

format = 4

873

\*(T>.fi

874

.PP

875

.nf

876

\*(T<

877

# \-\-\-\-\-\-\-\-\-

878

# xvcd:

879

# 480 x 480|576

880

# 32|44.1|48kHz, 16b, 2ch, mp2

881

882

[options]

883

vb = 5000

884

# >= 64, <= 384

885

ab = 224

886

887

[ffenc_mpeg2video]

888

# 1000 <= bitrate <= 9000

889

bitrate = 2040

890

# <= 19

891

gop\-size = 15

892

# ntsc: gop\-size = 18

893

rc\-min\-rate = 0

894

# optional:

895

rc\-max\-rate = 5000

896

897

rc\-buffer\-size = 1792

898

rc\-buffer\-aggressivity = 99

899

flags = scanoffset

900

\*(T>.fi

901

.PP

902

.nf

903

\*(T<

904

# \-\-\-\-\-\-

905

# dvd:

906

# 353|704|720 x 240|288|480|576

907

# 48kHz, 16b, 2ch, mp2|ac3

908

909

[options]

910

vb = 5000

911

# >= 64, <= 384

912

ab = 224

913

914

[ffenc_mpeg2video]

915

# 1000 <= bitrate <= 9800

916

bitrate = 5000

917

# <= 19

918

gop\-size = 15

919

# ntsc: gop\-size = 18

920

rc\-min\-rate = 0

921

rc\-max\-rate = 9000

922

rc\-buffer\-size = 1792

923

rc\-buffer\-aggressivity = 99

924

925

[mpeg2enc]

926

format = 8

927

\*(T>.fi

928

.SH REQUIREMENTS

929

Evidently, there must be a basic GStreamer framework installation,

930

the extent of which and available plugins determine the processing scope that

931

can be achieved.

932

Beyond this, it is highly recommended for the \fIdam\fR element to be available,

933

which is part of the plugins supplied along with \fBentrans\fR.

934

More (technical) details and motivation can be found in the documentation for

935

\fIdam\fR, but suffice it to say that without \fIdam\fR the following notes apply:

936

.TP 0.2i

937

\(bu

938

For \*(T<\fB\-\-seek\fR\*(T>, only the methods

939

\*(T<seek\*(T> and \*(T<seek\-key\*(T> are available.

940

However, for reasons explained in \fIdam\fR documentation and in

941

MUXING PIPELINES, even these methods can not be considered reliable,

942

either functional or technical.

943

As such, only full (uninterrupted) pipeline transcoding is really available.

944

945

As a technical note, this unreliability could be alleviated by

946

having the functionality to drop out-of-segment-bound buffers not only in sinks

947

or in some elements, but as a specific ability in

948

e.g. \fIidentity\fR.

949

.TP 0.2i

950

\(bu

951

The graceful (signal initiated) termination usually

952

also relies on \fIdam\fR, and is quite sturdy in this case.

953

However, and fortunately, there is also an alternative fall-back implementation

954

that will take care of this in the vast majority of circumstances.

955

.PP

956

Note that in case of dynamic pipelines, the availability of the \fIdam\fR element

957

in the system is auto-detected, and the proper pipeline-construction action is

958

taken accordingly. In raw mode, however, it must be explictly confirmed (by

959

\*(T<\fB\-\-dam\fR\*(T>) that \fIdam\fRs are properly used, otherwise none will

960

be assumed present and only restricted operation is then available, as indicated

961

above. Proper usage of \fIdam\fR is subject to comments in MUXING PIPELINES,

962

but basically comes down to putting a \fIdam\fR as upstream as possible in each

963

independent stream, e.g. preceding some queue. Alternatively, such a queue could

964

be used as surrogate dam by naming it

965

\*(T<dam\fI<digits>\fR\*(T>.

966

.SH "GNONLIN COMPARISON"

967

On the one hand, one might compare in the sense that both GNonLin and \fBentrans\fR

968

aim to create media files while allowing for non-linear editing/cutting.

969

On the other hand, \fBentrans\fR is quite complementary and can actually be combined

970

with GNonLin, for example

971

972

.nf

973

\*(T<

974

entrans.py \-\- \-\-raw gnlcomposition. '(' name=comp gnlfilesource \e

975

location=test.avi start=0 duration=20000000000 media\-start=0 \e

976

media\-duration=20000000000 caps=video/x\-raw\-yuv ')' \e

977

comp.src ! ffenc_mpeg4 ! queue ! avimux ! filesink location=test\-gnonlin.avi

978

\*(T>.fi

979

.PP

980

That being the case, why this alternative and yet another program, rather than

981

being content with e.g. GNonLin (and \fBgst-launch\fR(1)) ?

982

.PP

983

First and foremost, as a matter of style. The above example pipeline looks and

984

feels rather contrived (and that's only the video part!), and is not much of a

985

\fIpipeline\fR in that e.g. \fIgnlcomposition\fR is

986

really (deliberately) not so much a pipeline than it is more of a bag (serving

987

its intended purpose well). On the other hand, Un*x pipelines have a long

988

standing beauty (and a.o. adaptability and flexibility), and a (traditional)

989

GStreamer pipeline continues in that tradition. In that sense, \fBentrans\fR'

990

(and \fBgst-launch\fR(1)) approach is an ode to pipeline and its

991

inherent simplicity and clarity of intent.

992

In particular, typical \fBentrans\fR pipelines are only slightly different than

993

playback pipelines, and the cutting/seeking mechanism is pretty identical.

994

In that way, an \fBentrans\fR media file creation is very close to a

995

(non-linear) playback session (and closer than a GNonLin equivalent would be).

996

.PP

997

Secondly, there are some technical reasons. Particular care has been given

998

in \fBentrans\fR to the non-linear cutting functionality to make sure it operates

999

with great precision in a variety of circumstances. In this area;

1000

.TP 0.2i

1001

\(bu

1002

GNonLin, in particular \fIgnlcomposition\fR, may (and has been

1003

known to) experience some frame-dropping near end-of-stream due to its

1004

(asynchronous) technique of EOS-event generation

1005

.TP 0.2i

1006

\(bu

1007

GNonLin only relies on segment seek, as does \fBentrans\fR usually, which

1008

is fine when dealing with elements that support what and how they are supposed

1009

to. However (and unfortunately), elements are not always so perfect (for

1010

example, \fIdvdreadsrc\fR does not support segment seek).

1011

In such cases, \fBentrans\fR has some plain-and-simple alternative methods that can

1012

still get the (cutting) job done.

1013

.PP

1014

So, the idea and intention basically is; if you can play it, you can \fBentrans\fR

1015

it (with a similar pipeline, and precision in various circumstances).

1016

.SH "MUXING PIPELINES"

1017

As mentioned earlier, one might run into some pitfalls when constructing

1018

a (raw) pipeline, most of which are taken care of by \fBentrans\fR when runnning in

1019

dynamic mode.

1020

.PP

1021

\fBBuilding\fR.

1022

To begin with, the pipeline must be properly built and linked,

1023

in particular to the muxer element, which typically only has request pads.

1024

For linking and subsequent negotiation to succeed, a stream must manage

1025

to get the request pad it really needs and is compatible with. That means that

1026

the element connecting to the muxer (or trying so) should provide enough

1027

(static) information to identify the corresponding muxer pad (template).

1028

As this information is typically derived from the connecting element's

1029

pad templates, it is not quite comfortable to try connecting a generic

1030

element (e.g. \fIqueue\fR). If this were needed, or there is some

1031

or the other problem connecting to the muxer, then it may be helpful to use

1032

a capsfilter (aka as a filtered connection) or to specifically

1033

indicate the desired muxer pad (by a name corresponding to the intended

1034

template, as in example Pass-through transcoding),

1035

see also \fBgst-launch\fR(1) for specific syntax in either case.

1036

.PP

1037

\fBMuxer and Queue\fR.

1038

In the particular case of a transcoding pipeline (having some media-file

1039

as source), some other care is needed to make sure it does not stall, i.e.

1040

simply block without any notification (or an application having

1041

a chance to do something about it). Specifically, a muxer will often block a

1042

thread on its incoming pad until it has enough data across all of its pads for

1043

it to make a decision as to what to mux in next.

1044

Evidently, if a single (demuxer) thread were having to provide data to several

1045

pads, then stalling would almost surely occur. As such, each of these

1046

pads/streams should have its own thread. In case of live sources, this is

1047

usually already the case, e.g. separate video and audio capturing threads.

1048

However, when transcoding, and therefore demuxing an input file, a

1049

\fIqueue\fR should be inserted into each outgoing stream to act as

1050

a thread boundary and supply a required separate thread.

1051

In addition, this multi-threading will evidently also spread processing across

1052

several CPUs (if available) and improve performance. In this respect, it

1053

can also be beneficial to use more than 1 \fIqueue\fR/thread per

1054

stream; one that performs filter processing (if any) and one that performs

1055

encoding.

1056

.PP

1057

Note that similar stalling can also occur in some variations of these

1058

circumstances. For instance, \fItee\fR could have the role of

1059

demuxer in the above story, and/or a collection of several sinks in a

1060

pipeline could act as a muxer in the above story, since they will each also

1061

block a thread in PAUSED state (when the pipeline is trying to make it to

1062

that state). Another such situation can arise (temporarily) when pads are

1063

blocked as part of the seek-mechanism. In any case, the remedy is the same;

1064

use \fIqueue\fRs to prevent one thread from having to go around in

1065

too many places, and ending up stuck in one of them.

1066

.PP

1067

\fBMuxer and Time(stamps)\fR.

1068

Even if the above is taken into account, the time-sensitivity of a typical

1069

muxer may lead to (at first sight mysterious) stalling (in transcoding

1070

pipelines). That is, a muxer will usually examine the timestamps of the incoming

1071

data, select the oldest of these to be put into the outgoing stream and then

1072

wait for new data to arrive on this selected input stream (i.e. keep other

1073

streams/threads wait'ing).

1074

It follows easily that if there is a gap, imbalance,

1075

imperfection, irregularity (or however described) in timestamps between various

1076

incoming streams, a muxer will then consistently/continuously

1077

expect/want more data on 1 particular pad (for some

1078

amount proportional to the irregularity).

1079

For some time, this need could be satisfied by the \fIqueue\fRs

1080

that are present in this stream. At some point, however, these may not hold

1081

enough data and will need further input from the upstream demuxer element.

1082

However, for this demuxer to provide data for 1 of its (outgoing) streams, it

1083

will also need to output data for the other streams, and this ends up

1084

into the other streams' \fIqueue\fRs. Unfortunately, in

1085

this situation those queues cannot send out any data, as the muxer is

1086

holding their threads (blocked). Hence, they will (soon) fillup and then in

1087

turn block the demuxer thread trying to insert data, until there is again space

1088

available. Consequently, a deadlock circle ensues; muxer waiting for new

1089

data from queue, waiting for data from demuxer, waiting for space to put this

1090

into another stream queue, waiting to unload data into the muxer.

1091

Note that this deadlock phenomenom will not occur with live (recording) pipelines,

1092

as the various threads are then not joined to a single demuxer (thread),

1093

though it is then likely to manifest itself it by loss of (live) input data.

1094

.PP

1095

There a number of possible causes for the irregularities mentioned above.

1096

.TP 0.2i

1097

\(bu

1098

Rarely, but not impossibly so, the problem may be inherent in the very input

1099

medium itself. Or there could be a problem with a (experimental) demuxer

1100

that might produce incorrect timestamps.

1101

.TP 0.2i

1102

\(bu

1103

If a (segment) seek is performed in a pipeline \fIwithout\fR

1104

using \fIdam\fR, it is quite likely that some stream(s) may perform proper

1105

filtering of out-of-segment-bound data, and that other(s) may not. This would

1106

then cause a typical imbalance between the various streams.

1107

.TP 0.2i

1108

\(bu

1109

If timestamps are being resequenced in some incoming streams (e.g. by

1110

\fIidentity\fR), but not in the other ones, there is an obvious imbalance.

1111

.TP 0.2i

1112

\(bu

1113

Though more exotic, even if \fIdam\fR is being used, one must take care to perform

1114

this filtering \fIbefore\fR (the by now

1115

clearly essential) \fIqueue\fR in the respective stream,

1116

particularly when several distinct sections are to be cut out of the input.

1117

After all, if \fIqueue\fR were placed before \fIdam\fR, then the

1118

latter does not have a chance to drop unneeded buffers soon enough.

1119

As such, if a muxer tries to get the first piece of data on a

1120

particular pad following the gap between the sections, these

1121

\fIqueue\fRs would fillup and be effectively as if they were not

1122

present.

1123

.PP

1124

\fBDynamic mode services\fR.

1125

Dynamic mode takes care of (most of) the above as follows:

1126

.TP 0.2i

1127

\(bu

1128

Pipeline building is performed almost completely automagically.

1129

Of course, this does not mean it can fit a square into a circle, so some

1130

consideration for compatibility and negotiation is still in order.

1131

.TP 0.2i

1132

\(bu

1133

\fIqueue\fRs are inserted where needed and/or useful,

1134

either by \fIdecodebin\fR or by \fBentrans\fR

1135

.TP 0.2i

1136

\(bu

1137

Whenever available (recommended), \fIdam\fR is used and inserted in proper locations.

1138

.SH "SEE ALSO"

1139

.URL http://www.gstreamer.net/ "GStreamer homepage"

1140

1141

\fBgst-launch\fR(1),

1142

.URL http://gentrans.sourceforge.net/ "entrans plugins"

1143

.SH AUTHOR

1144

Mark Nauwelaerts <\*(T<mnauw@users.sourceforge.net\*(T>>

Older »