3
* shortread.c: Fixed bug in printing null accession for second end when
4
using --fails-as-input flag
6
* gmap_build.pl.in: Checking return codes from system calls
10
* VERSION, index.html: Updated version number
12
* gsnap.c: Changed name of flag from --ambig-splice-notrim to
15
* pair.c: Pushing "0M" between adjacent I and D operations in a cigar string
17
* gmap.c, gsnap.c, uniqscan.c: Using new interface to Splicetrie_setup.
18
Providing --ambig-splice-notrim flag in GSNAP.
20
* splicetrie.c, splicetrie.h: Providing behavior to turn clipping off at
21
ambiguous known splice sites, useful if trying to turn off all soft
24
* stage1hr.c: Fixed debugging statement
28
* stage1.c: Fixed variable name so PMAP could compile
32
* dynprog.c, maxent_hr.c, maxent_hr.h, pair.c, stage1hr.c, stage3.c,
33
substring.c: Checking for case where splice_pos minus margin goes beyond
34
beginning of chromosome
36
* VERSION, index.html: Updated version number
38
* maxent_hr.c: Checking for case where splice_pos is smaller than margin
40
* pair.c, samprint.c: Changed value of NM tag in SAM output to be edit
41
distance (mismatches plus gaps)
43
* stage1hr.c: Fixed debugging statement
47
* VERSION, index.html: Updated version number
49
* gsnap.c: Calling SAM_setup
51
* outbuffer.c: For GMAP, when quiet_if_excessive_p is true and npaths >
52
maxpaths, not printing any output
54
* pair.c, pair.h, stage3.c: Printing HI tag in SAM output
56
* samprint.c, samprint.h: When quiet_if_excessive_p is true and npaths_mate
57
> maxpaths, setting MATE_UNMAPPED in flag. Printing HI tag. Added
62
* VERSION, index.html: Updated version number
64
* stage3.c: For GMAP, using QUERYEND_INDELS instead of QUERYEND_NOGAPS
66
* stage1.c: Added check for querylength being too short
68
* gmap.c: Changed information in --help about how to turn off chimera
71
* stage3.c: In Stage3_mergeable, added check for firstpart_npairs or
72
secondpart_npairs being 0
76
* index.html: Updated version number
78
* VERSION: Updated version number
80
* stage1.c: Made direct calls to fields in Match_T objects for speed. Made
81
changes so debugging macros work.
83
* dynprog.c: Made special traceback procedure for queryend_nogaps
87
* stage3hr.c: Fixed bug where Stage3end_remove_overlaps was not keeping ends
88
where paired_usedp was true. In Stage3end_remove_overlaps and
89
Stage3pair_remove_overlaps, terminal alignments lose to all other types.
91
* stage3hr.h: Put GMAP hittype before TERMINAL hittype
93
* gsnap.c: Increased default values for max_gmap_pairsearch,
94
max_gmap_terminal, and max_gmap_improvement
96
* stage3hr.c: Changed code for Stage3end_remove_overlaps to parallel that
97
for Stage3pair_remove_overlaps
99
* samprint.c: Allowing for GMAP alignments to be printed for single-end
102
* VERSION, config.site.rescomp.prd, config.site.rescomp.tst: Updated version
105
* stage3.c: Added comment about pass 8 and 9
107
* index.html: Made changes for 2011-11-20 version
109
* uniqscan.c: Using new interface to Stage1_single_read
111
* stage3hr.c: Changed the code for Stage3end_optimal_score to be consistent
112
with that of Stage3pair_optimal_score. Allowing terminal alignments to be
113
considered along with other alignments.
115
* stage1hr.c: For single-end alignment, initializing done_level to
116
user_maxlevel, rather than opt_level, to be consistent with the code for
117
paired-end alignment. However, the values are the same anyway.
119
* gsnap.c: Changed the default value for terminal_threshold from 3 to 2.
120
Expanded the --help entry for --terminal-threshold.
122
* index.html: Made changes for version 2011-11-17
124
* stage3.c: Added more restrictions on non-canonical splices at ends of read
128
* stage3hr.c, stage3hr.h: Added functions Stage1end_start_endtype and
129
Stage1end_end_endtype for terminal alignments
131
* gsnap.c: Using new interface to Stage1_single_read
133
* gmap.c, stage3.c, stage3.h: Changed name of variable
135
* stage1hr.c, stage1hr.h: Implemented GMAP terminal mode for single-end
138
* dynprog.c: Lowered rewards for canonical introns, to help find
139
non-canonical introns
141
* stage3.c: In fill_in_gaps, if splicingp is false, then filling in a
142
deletion, not an intron
146
* stage3.c: Moved trim_noncanonical procedures from path_trim to
147
path_compute. After trim_noncanonical procedures, doing an extension using
150
* stage1hr.c: Using -3*nmismatches and -4 for an indel in evaluating end
151
indels, corresponding to default values for trim_mismatch_score and
154
* stage3.c: Using endalign instead of to_queryend_p. When endalign is
155
QUERY_NOGAPS, not doing peelback. In pass 8, changed extendp from false
156
to true (QUERYEND_INDELS).
158
* gsnap.c, uniqscan.c: Added flag --trim-indel-score
160
* dynprog.c, dynprog.h: Replaced to_queryend_p with endalign, with types
161
BEST_LOCAL, QUERYEND_INDELS, and QUERYEND_NOGAPS. For extensions to
162
queryend, always returning gappairs.
164
* atoiindex.c, cmetindex.c: Updated --help to indicate how --kmer and -D are
169
* VERSION: Updated version number
171
* gmap.c, gsnap.c, stage1hr.c, stage3.c, stage3.h, uniqscan.c: Revised
172
calculation of insertlength inside trim_noncanonical_ends procedures to
173
compensate for using maximum overlap in computing genomicseg.
175
* stage3.c: Made fill_in_gaps procedure replace short non-canonical introns
176
with deletions. Removed this feature from assign_gap_types. Added
177
additional checks on translation coordinates to stay within array bounds.
179
* dynprog.c: Added debugging statements
181
* substring.c: Added comment
183
* shortread.c, shortread.h, stage1hr.c: For GMAP algorithm in GSNAP,
184
assuming maximum overlap, rather than trying to compute overlap
186
* outbuffer.c: Printing all paths when -E flag is given to GMAP
188
* stage3hr.c: Improved error message when ambig end splicetype has an
191
* dynprog.c: Removed unused code for END_KNOWNSPLICING_SHORTCUT. Always
192
assigning *ambig_end_length in Dynprog_end5_known and Dynprog_end3_known.
196
* uniqscan.c: Using new interface to Stage3hr_setup
198
* stage3.c: Not substituting for long deletions at end of path_compute
200
* stage1hr.c, stage3hr.c, stage3hr.h: Removed end_indel_p parameters to
201
Stage3end_new_insertion and Stage3end_new_deletion
203
* stage1hr.c: Preventing call to Genome_count_mismatches_limit by
204
find_doublesplices where pos5 >= pos3
206
* VERSION: Updated version number
208
* configure.ac: Added flag --enable-popcnt
210
* dynprog.c: Initialing value of ambig_end_length
212
* stage3.c: Skipping gap pairs at the beginning of alignments in
215
* goby.c: Using new interface to Result_array
219
* VERSION: Updated version number
221
* stage1hr.c, stage3hr.c: Checking in Stage3end_new_gmap if genomicstart or
222
genomicend exceeds chrhigh, and if so, returns NULL.
224
* stage3.c: Moved assigning of gap types to end of path_compute, rather than
225
beginning of path_trim
227
* stage3.c: Inserting gap pairs and adding gap types at beginning of
232
* gsnap.c, outbuffer.c, outbuffer.h, samprint.c, samprint.h, stage1hr.c,
233
stage3hr.c, stage3hr.h: Divided DISTANT_SPLICE type to SAMECHR_SPLICE and
234
TRANSLOC_SPLICE. Making merge_samechr_p act only at print time, which
235
allows SAMECHR_SPLICE to undergo pair_up_concordant again. Stopping
236
clip-overlap on distant splices.
238
* VERSION: Updated version number
240
* stage3hr.c: Keeping hits in optimal scoring if nmatches_posttrim is
241
sufficiently high relative to the best hit
243
* gsnap.c, uniqscan.c: Increased value of max_deletionlength from 30 to 50
245
* gmap.c, stage1hr.c: Using new interface to Stage3_compute
247
* stage3.c, stage3.h: Changed flow to path_compute on both cdna directions,
248
then pick_cdna_direction, then path_trim, which removes non-canonical end
251
* stage3hr.c, stage3hr.h, substring.c, substring.h: Added nmatches_posttrim
252
and using it to break ties resulting from equal values of nmatches. Added
253
a general test for Substring_new based on matches and mismatches before
256
* pair.c: Modified Pair_nmatches to add penalties for an indel and for a
257
splice site with low probabilities. Modified compute_md_string on I
258
tokens to skip only for insertion pairs.
260
* dynprog.c: Fixed procedure find_best_endpoint_to_queryend to look only at
265
* stage3hr.c: Turned off separate treatment of terminal alignments in
266
Stage3pair_optimal_score. Always trimming ends of insertions and
267
deletions (previously depended on value of end_indel_p).
271
* stage3.c: Using a sliding scale in trimming end exons
273
* pair.c: Added debugging macro for compute_md_string
275
* pair.c, pair.h: Taking cdna_direction as a parameter in
276
Pair_print_exonsummary, instead of determining sense separately for each
279
* stage3.c: Restored alignment score in pick_cdna_direction with different
280
values of significant difference for GMAP and GSNAP. Considering a
281
non-canonical intron as canonical if both splice site probabilities are
284
* splicing-score.c: Fixed getopt so -D takes an argument
286
* pairpool.c, pairpool.h: Revised Pairpool_count_bounded to return number of
289
* gmap.c: Added comments
291
* stage3.c: Restored use of alignment scores in pick_cdna_direction, after
292
comparing number of noncanonical splices
294
* VERSION: Updated version number
296
* shortread.c: No longer printing warning message about not finding "/1" or
299
* gsnap.c, uniqscan.c: Added max_deletionlength variable
301
* gmap.c, outbuffer.c, outbuffer.h, pair.c, pair.h, samprint.c: No longer
302
converting short noncanonical splices from type N to type D, since this is
303
now performed in stage 3. Removed cigar_noncanonical_splices_p variable.
305
* stage3.c, stage3.h: In assign_gap_types, converting noncanonical splices
306
smaller than max_deletionlength into deletions
308
* gsnap.c, stage3hr.c, stage3hr.h, uniqscan.c: Treating distant splices on
309
the same chromosome as a translocation by default. Added a flag
310
--merge-distant-samechr to get previous behavior.
314
* stage3.c: Using maxintronlen in trimming end exons
316
* stage3.c: Removed alignment scores from pick_cdna_direction. Revised
317
procedures for trimming noncanonical end exons and doing distal/medial
318
comparison by adding procedures canonicalp and good_end_intron_p, with
319
latter using probabilities.
321
* gmap.c: Increased parameter for maxpeelback_distalmedial from 24 to 100
323
* dynprog.c: Added debugging statements
325
* pair.c: Computing MD string from cigar tokens
327
* stage2.c: Restored querydist penalty
331
* VERSION: Revised version number
333
* index.html: Added entry for new version
335
* stage1hr.c: Commented out second round of terminal alignments
337
* gmap.c, gsnap.c, uniqscan.c: Set trim_indel_score to -4 to be consistent
340
* gmap.c, gsnap.c, uniqscan.c: Calling Pair_setup
342
* substring.c: Extending trimming toward ends in case of ties
344
* stage3.c: Extending ends completely before final trim
346
* pair.h: Extending trimming toward ends in case of ties. Added Pair_setup
347
function to use trim_mismatch_score value provided by user.
349
* pair.c: Extending trimming toward ends in case of ties. Added Pair_setup
350
function to use trim_mismatch_score value provided by user.
352
* stage2.c: Put code for suboptimal starts into a compiler directive
354
* gmap.c, gsnap.c, uniqscan.c: Provided new defaults for
355
suboptimal_score_start and suboptimal_score_end, based on simulations
357
* gmap.c, gsnap.c, stage2.c, stage2.h, uniqscan.c: Introduced parameters for
358
suboptimal_score_end and suboptimal_score_start
360
* gmap.c: Added flag for --suboptimal-score
364
* gmap.c, gsnap.c, outbuffer.c, pair.c, pair.h, result.c, result.h,
365
resulthr.c, resulthr.h, samprint.c, samprint.h, stage1hr.c, stage3.c,
366
stage3.h, stage3hr.c, stage3hr.h, uniqscan.c: Restoring old MAPQ score.
367
Making absolute MAPQ score a separate calculation, and printing it in an
372
* stage1hr.c: Added comments
374
* stage3hr.c: Added field indel_low, and using to prefer indels at low
377
* stage1hr.c: Fixed computation of firstbound and lastbound so end indels
378
are found on short reads, such as 36-mers.
382
* pair.c: Added code to compute_cigar to merge duplicate token types
384
* stage2.c: Fixed uninitialized value for last_canonicalp
386
* stage2.c: Implemented ability to generate suboptimal paths based on
387
different initial positions
389
* gmapindex.c: Removed unnecessary file open for -P flag
391
* gmapindex.c: Fixed memory leaks for -G flag
393
* gmapindex.c: Fixed memory leaks for -A flag
397
* pairpool.c: Commented out copy of shortexonp
399
* dynprog.c, dynprog.h, gmap.c, gsnap.c, stage3.c, uniqscan.c: Removed
400
endlength requirement for microexons. Returning prob2 and prob3 from
401
Dynprog_microexon_int and applying two standards, depending on whether an
402
indel was originally present
404
* pair.c: Printing shortexon information
406
* pairpool.c: Copying shortexon information in copying pairs
408
* smooth.c: Removed unused parameters
410
* stage3.c: Adding endlength requirement for finding microexons
412
* smooth.c: Printing result of smoothing as pairs
414
* stage3.c: Made penalties harsher for indels at end near poor splice sites
416
* stage2.c: Computing best overall score during dynamic programming process
418
* uniqscan.c: Using new interface to Stage3hr_setup
420
* stage2.c: Made changes so PMAP could compile
422
* stage3.h: Added parameter favor_mode to Stage3_compute
424
* stage3.c: Counting indel near splice as 2 mismatches. Added endlength
425
requirement of 12 for indel near splice.
427
* stage2.c: Going through all hits to accumulate cells. No longer using
428
number of links to set root scores.
430
* gmap.c, stage1hr.c: Passing value of favor_mode to Stage3_compute
432
* smooth.c: Relaxing probability requirement for end exons in GSNAP from
437
* stage3.c: In trimming non-canonical end exons, not combining nearindelp
438
with splice probs, requiring 1 mismatch or less for bingop, and allowing
439
AT-AC introns. Extending alignments to queryend before trimming
440
non-canonical end exons.
442
* stage3.c: In trimming end exons, using a sliding scale based on intron
443
length. Also penalizing for indels and mismatches close to exon-exon
448
* gsnap.c, stage1hr.c, stage1hr.h, stage3hr.c, stage3hr.h: Providing
449
expected_pairlength and pairlength_deviation values in Stage3hr_setup, and
450
removing from Stage1hr procedures.
452
* stage3.c: In end exons, checking if indel present, and if so, requiring
453
that splice probabilities both be greater than 0.9.
455
* gsnap.c, stage3hr.c, stage3hr.h: Restoring pairlength deviation. Using
456
expected pairlength and pairlength deviation to discriminate among
459
* stage2.c: Adding all hits from final querypos directly to celllist, rather
460
than updating rootscores. Fixed update of rootscore information to use
461
current querypos and hit. Dynamic programming starting from querypos 0,
462
rather than querypos 1.
464
* diag.c: Restricted update of diagonal in middle region, to avoid affecting
465
subsequent beginning and end regions
467
* splicetrie_build.c: Fixed handling of intron intervals, by introducing
470
* stage3hr.c: Made other fixes to allow copying of GMAP hit types
472
* stage1hr.c: Making copies where necessary for multiple GMAP subpaths, and
473
freeing old Stage3pair_T objects at the appropriate time. Calling
474
Stage3pair_remove_overlaps on double GMAP alignments.
476
* pairpool.c: Using CALLOC_OUT in Pairpool_copy_array
478
* gsnap.c: Printing sequence name when debugging memusage
480
* stage1hr.c: Handling multiple subpaths from stage 2 computation
482
* stage2.c: Removed restriction on number of positions for final non-zero
485
* stage3hr.c: Allowing Stage3end_T objects of type GMAP to be copied
487
* pairpool.c, pairpool.h: Added function Pairpool_copy_array
491
* stage2.c: Fixed location of compiler directive for PMAP
493
* stage3.c: Restoring negative points for non-canonical introns in computing
496
* stage2.c: Adding root scores for final non-zero and specific querypos
498
* gmap.c, list.c, list.h, stage1hr.c, stage1hr.h, stage2.c, stage2.h: Going
499
back to changes from revision 50508 for multiple subpaths from stage2,
500
plus revisions 50504 to 50909 from branches/gmap-2011-10-24-mult-stage2
501
for a root position method for finding optimal and suboptimal subpaths
505
* gmap.c, list.c, list.h, src, stage1hr.c, stage1hr.h, stage2.c, stage2.h:
506
Reverted to version 50507, before changes made to allow multiple paths
507
from a stage2 computation
511
* VERSION: Updated version number
513
* archive.html, index.html: Added changes for 2011-10-24 version
515
* gmap.c, list.c, list.h, stage1hr.c, stage1hr.h, stage2.c, stage2.h: Merged
516
revisions 50469 to 50504 from branches/2011-10-24-mult-stage2 to allow for
517
multiple stage2 results from the same genomic segment for GMAP.
519
* stage3.c: Assigning sensedir in all cases, based on intron scores if
522
* gmap.c, gsnap.c, outbuffer.c, pair.c, pair.h, result.c, result.h,
523
resulthr.c, resulthr.h, samprint.c, samprint.h, stage1hr.c, stage1hr.h,
524
stage3hr.c, stage3hr.h, uniqscan.c: Computing MAPQ score relative to best
525
alignment. Printing X2 field in SAM output to provide second best MAPQ
528
* stage3.c, stage3.h: When sense_try is provided, assigning sensedir
532
* VERSION: Updated version number
534
* index.html: Added comment about change to GMAP
536
* gmap.c: Sorting stage3list before evaluating for chimeras
540
* VERSION: Updated version number
542
* index.html: Updated for new version
544
* configure.ac: Grouped together checks for built-in procedures
546
* splicetrie_build.c, splicetrie_build.h: Checking for splice sites being
547
beyond the chromosome length boundary
549
* splicetrie.c: Handling case where trieoffsets has a NULL_POINTER, which
550
can occur with intron-type splicing
552
* splicetrie_build.c: Handling case where nsites from a given splice site is
557
* popcnt.m4: Checking each built-in instruction only if available
559
* asm-bsr.m4: Assigning a value to x
3
561
* VERSION: Updated version number
5
563
* gsnap.c: Removed 'S' from getopt