2
.\" Title: gt-extractfeat
3
.\" Author: [see the "AUTHOR" section]
4
.\" Generator: DocBook XSL Stylesheets v1.76.1 <http://docbook.sf.net/>
10
.TH "GT\-EXTRACTFEAT" "1" "07/11/2012" "\ \&" "\ \&"
11
.\" -----------------------------------------------------------------
12
.\" * Define some portability stuff
13
.\" -----------------------------------------------------------------
14
.\" ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
15
.\" http://bugs.debian.org/507673
16
.\" http://lists.gnu.org/archive/html/groff/2009-02/msg00013.html
17
.\" ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
20
.\" -----------------------------------------------------------------
21
.\" * set default formatting
22
.\" -----------------------------------------------------------------
23
.\" disable hyphenation
25
.\" disable justification (adjust text to left margin only)
27
.\" -----------------------------------------------------------------
28
.\" * MAIN CONTENT STARTS HERE *
29
.\" -----------------------------------------------------------------
31
gt-extractfeat \- Extract features given in GFF3 file from sequence file\&.
34
gt extractfeat [option \&...] [GFF3_file]
39
set type of features to extract (default: undefined)
44
join feature sequences in the same subgraph into a single one (default: no)
49
translate the features (of a DNA sequence) into protein (default: no)
54
add sequence ID of extracted features to FASTA descriptions (default: no)
59
add target ID(s) of extracted features to FASTA descriptions (default: no)
64
set the sequence file from which to extract the features (default: undefined)
69
set the sequence files from which to extract the features use
71
to terminate the list of sequence files
76
match the sequence descriptions from the input files for the desired sequence IDs (in GFF3) (default: no)
81
use sequence descriptions to map the sequence IDs (in GFF3) to actual sequence entries\&. If a description contains a sequence range (e\&.g\&., III:1000001\&.\&.2000000), the first part is used as sequence ID (\fIIII\fR) and the first range position as offset (\fI1000001\fR) (default: no)
86
set file containing sequence\-region to sequence file mapping (default: undefined)
91
be verbose (default: no)
96
set output width for FASTA sequence printing (0 disables formatting) (default: 0)
101
redirect output to specified file (default: undefined)
106
write gzip compressed output file (default: no)
111
write bzip2 compressed output file (default: no)
116
force writing to output file (default: no)
121
display help and exit
126
display version information and exit
130
File format for option \-regionmapping:
132
The file supplied to option \-regionmapping defines a \(lqmapping\(rq\&. A mapping maps the sequence\-regions given in the GFF3_file to a sequence file containing the corresponding sequence\&. Mappings can be defined in one of the following two forms:
134
mapping = { chr1 = "hs_ref_chr1\&.fa\&.gz", chr2 = "hs_ref_chr2\&.fa\&.gz" }
138
function mapping(sequence_region) return "hs_ref_"\&.\&.sequence_region\&.\&."\&.fa\&.gz" end
140
The first form defines a Lua (http://www\&.lua\&.org/) table named \(lqmapping\(rq which maps each sequence region to the corresponding sequence file\&. The second one defines a Lua function \(lqmapping\(rq, which has to return the sequence file name when it is called with the sequence_region as argument\&.
143
Report bugs to <gt\-users@genometools\&.org>\&.