1
Tool for aligning cDNA and genomic DNA
3
Some information about sim4 from the Debian-Med project.
5
sim4 is a similarity-based tool for aligning an expressed DNA sequence
6
(EST, cDNA, mRNA) with a genomic sequence for the gene. It also detects end
7
matches when the two input sequences overlap at one end (i.e., the start of
8
one sequence overlaps the end of the other).
10
sim4 employs a blast-based technique to first determine the basic matching
11
blocks representing the "exon cores". In this first stage, it detects all
12
possible exact matches of W-mers (i.e., DNA words of size W) between the two
13
sequences and extends them to maximal scoring gap-free segments. In the
14
second stage, the exon cores are extended into the adjacent as-yet-unmatched
15
fragments using greedy alignment algorithms, and heuristics are used to favor
16
configurations that conform to the splice-site recognition signals (GT-AG,
17
CT-AC). If necessary, the process is repeated with less stringent parameters
18
on the unmatched fragments.
21
Homepage: http://www.bx.psu.edu/miller_lab/
23
Please read the man page of this project: