Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.

Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS,
GSS,environmental samples or phase 0, 1 or 2 HTGS sequences)
14,049,258 sequences; 36,075,095,184 total letters

                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value

gi|194750627|ref|XM_001957596.1| Drosophila ananassae GF23929 (D...  42.8    2.1
gi|118568029|gb|CP000325.1| Mycobacterium ulcerans Agy99, comple...  42.8    2.1
gi|24395301|emb|AL928579.6| Mouse DNA sequence from clone RP23-3...  42.8    2.1
gi|326411376|gb|CP002568.1| Polymorphum gilvum SL003B-26A1, comp...  41.0    7.4
gi|317165637|gb|CP002447.1| Mesorhizobium ciceri biovar biserrul...  41.0    7.4
gi|312283773|gb|AC243242.1| Panicum virgatum clone PV_ABa051-K05...  41.0    7.4
gi|303315262|ref|XM_003067593.1| Coccidioides posadasii C735 del...  41.0    7.4
gi|297153409|gb|CP002047.1| Streptomyces bingchenggensis BCW-1, ...  41.0    7.4
gi|295434944|gb|CP002013.1| Burkholderia sp. CCGE1002 chromosome...  41.0    7.4
gi|283945692|gb|CP001854.1| Conexibacter woesei DSM 14684, compl...  41.0    7.4
gi|194346582|gb|CP001111.1| Stenotrophomonas maltophilia R551-3,...  41.0    7.4
gi|166857509|gb|CP000926.1| Pseudomonas putida GB-1, complete ge...  41.0    7.4
gi|148498119|gb|CP000699.1| Sphingomonas wittichii RW1, complete...  41.0    7.4
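
Each row of the summary table above carries a truncated description followed by the bit score and the E-value. A minimal sketch of splitting such a row, assuming the last two whitespace-separated fields are always the score and the E-value (the function name `parse_hit_line` is hypothetical and not part of BLAST):

```python
def parse_hit_line(line):
    """Split one summary row into (description, bit_score, e_value).

    Assumption: the last two whitespace-separated fields are the bit
    score and the E-value, as in the rows of this report.
    """
    description, bits, evalue = line.rstrip().rsplit(None, 2)
    return description, float(bits), float(evalue)


row = "gi|118568029|gb|CP000325.1| Mycobacterium ulcerans Agy99, comple... 42.8 2.1"
desc, bits, evalue = parse_hit_line(row)
# The fourth "|"-separated field of the description is the accession.
print(desc.split("|")[3], bits, evalue)
```

Splitting from the right keeps the sketch independent of how many spaces separate the columns.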

>gi|194750627|ref|XM_001957596.1| Drosophila ananassae GF23929 (Dana\GF23929), mRNA

Score = 42.8 bits (46), Expect = 2.1
Identities = 37/45 (83%), Gaps = 1/45 (2%)

Query  236  ACGCCGACGACAAGCGGCGAGTCCCTTGTGGACGATCTCGGCCAC  280
            |||||||||||||| ||||||||| ||  ||||  ||||  ||||
Sbjct  137  ACGCCGACGACAAG-GGCGAGTCCATTTGGGACTTTCTCACCCAC  180

>gi|118568029|gb|CP000325.1| Mycobacterium ulcerans Agy99, complete genome

Score = 42.8 bits (46), Expect = 2.1
Identities = 36/43 (84%), Gaps = 1/43 (2%)

Query  69      CCATGACTGCCGGCAGAACATCGACCGGCCAG-TATGTCGGCC  110
               ||||| ||||||||||||| |||| || || | |||| |||||
Sbjct  890715  CCATGTCTGCCGGCAGAACTTCGAGCGACCCGATATGGCGGCC  890673

>gi|24395301|emb|AL928579.6| Mouse DNA sequence from clone RP23-302M15 on chromosome 4, complete

Score = 42.8 bits (46), Expect = 2.1
Identities = 31/35 (89%), Gaps = 1/35 (2%)

Query  437     AGCTGCAACATGAATGTGCTCGCAGATCTTCAGGA  471
               |||||||||||| ||||||| ||||| |||| |||
Sbjct  146437  AGCTGCAACATGGATGTGCTAGCAGA-CTTCTGGA  146470

>gi|326411376|gb|CP002568.1| Polymorphum gilvum SL003B-26A1, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 33/40 (83%), Gaps = 0/40 (0%)

Query  238      GCCGACGACAAGCGGCGAGTCCCTTGTGGACGATCTCGGC  277
                |||||||||  ||||||||||| | |||  |||| |||||
Sbjct  2023755  GCCGACGACGCGCGGCGAGTCCGTGGTGCTCGATGTCGGC  2023794

>gi|317165637|gb|CP002447.1| Mesorhizobium ciceri biovar biserrulae WSM1271, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 22/22 (100%), Gaps = 0/22 (0%)

Query  300      TTGATGCCGAGGCGGATGCCAT  321
                ||||||||||||||||||||||
Sbjct  4510008  TTGATGCCGAGGCGGATGCCAT  4510029

>gi|312283773|gb|AC243242.1| Panicum virgatum clone PV_ABa051-K05, complete sequence

Score = 41.0 bits (44), Expect = 7.4
Identities = 28/32 (88%), Gaps = 0/32 (0%)

Query  271     TCTCGGCCACCTGTTTGAGTGGTATTCTTTTG  302
               ||| |||||||||||||||| ||||| || ||
Sbjct  137850  TCTTGGCCACCTGTTTGAGTTGTATTTTTCTG  137881

>gi|303315262|ref|XM_003067593.1| Coccidioides posadasii C735 delta SOWgp Sugar transporter family

Score = 41.0 bits (44), Expect = 7.4
Identities = 31/37 (84%), Gaps = 0/37 (0%)

Query  289  GTGGTATTCTTTTGATGCCGAGGCGGATGCCATTGCA  325
            ||||| ||| | ||||||||||||| | ||| |||||
Sbjct  84   GTGGTCTTCATATGATGCCGAGGCGTAAGCCTTTGCA  48

>gi|297153409|gb|CP002047.1| Streptomyces bingchenggensis BCW-1, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 27/30 (90%), Gaps = 0/30 (0%)

Query  38       GTCGGCAGCGGATCTCCGAAAACCATCAAG  67
                |||||||||||| |||||||  ||||||||
Sbjct  3073377  GTCGGCAGCGGAACTCCGAATTCCATCAAG  3073406

>gi|295434944|gb|CP002013.1| Burkholderia sp. CCGE1002 chromosome 1, complete sequence

Score = 41.0 bits (44), Expect = 7.4
Identities = 33/40 (83%), Gaps = 0/40 (0%)

Query  212      ATGGGCGAATCGATCACCTTCAACACGCCGACGACAAGCG  251
                || |||||||||||| | |||||||||| | |||  ||||
Sbjct  2243828  ATCGGCGAATCGATCGCGTTCAACACGCTGGCGATGAGCG  2243789

>gi|283945692|gb|CP001854.1| Conexibacter woesei DSM 14684, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 35/43 (82%), Gaps = 3/43 (6%)

Query  101      TATGTCGGCCATGGCTTCTACAGTCCGGACATGATCGACGGCG  143
                ||||||||||| | |||||||   ||| || || |||||||||
Sbjct  2039361  TATGTCGGCCACGACTTCTAC---CCGAACCTGGTCGACGGCG  2039322

>gi|194346582|gb|CP001111.1| Stenotrophomonas maltophilia R551-3, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 32/36 (89%), Gaps = 2/36 (5%)

Query  102      ATGTCGGCCATGGC-TTCTACAGTCCGGACATGATC  136
                |||||||||||||| ||  |||| ||||||||||||
Sbjct  3897524  ATGTCGGCCATGGCGTTGCACAG-CCGGACATGATC  3897490

>gi|166857509|gb|CP000926.1| Pseudomonas putida GB-1, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 35/41 (86%), Gaps = 2/41 (4%)

Query  222      CGATCACCTTCAACACG-CCGACGACAAGCGGCGAGTCCCT  261
                ||||||||||| ||||| ||||||| ||||| | |||| ||
Sbjct  5087565  CGATCACCTTCGACACGTCCGACGA-AAGCGCCAAGTCGCT  5087526

>gi|148498119|gb|CP000699.1| Sphingomonas wittichii RW1, complete genome

Score = 41.0 bits (44), Expect = 7.4
Identities = 32/38 (85%), Gaps = 3/38 (7%)

Query  208      GACGATG---GGCGAATCGATCACCTTCAACACGCCGA  242
                |||||||   ||||  |||||||||||||| |||||||
Sbjct  4265124  GACGATGTGGGGCGGCTCGATCACCTTCAAGACGCCGA  4265161
Effective search space used: 15727850050068
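
The Expect values listed for this query can be related to the bit scores and to the effective search space printed on the preceding line. A minimal sketch, assuming the standard relation E = N * 2^(-S') between the E-value, the effective search space N, and the bit score S' (the function name `expect_from_bits` is illustrative, not a BLAST API):

```python
def expect_from_bits(bit_score, search_space):
    """E-value under the assumed relation E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score)


# "Effective search space used" as printed for this query.
SEARCH_SPACE = 15727850050068

for bits in (42.8, 41.0):
    print(bits, round(expect_from_bits(bits, SEARCH_SPACE), 2))
```

This gives roughly 2.05 for 42.8 bits and 7.15 for 41.0 bits, close to the printed 2.1 and 7.4. The remaining gap is consistent with the report rounding bit scores to one decimal place before printing them.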
Query= GFAVMM201A1JOH

                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value

gi|22416007|emb|AL772315.4| Mouse DNA sequence from clone RP23-5...  41.0    0.42
gi|117168385|gb|AC190175.4| Pan troglodytes BAC clone CH251-542A...  39.2    1.5
gi|45238691|gb|AC145377.2| Pan troglodytes BAC clone RP43-11P19 ...  39.2    1.5
gi|4926907|gb|AC004936.2| Homo sapiens PAC clone RP5-959C21 from...  39.2    1.5
gi|31414500|emb|AL831797.4| Oryza sativa chromosome 12, . BAC OS...  39.2    1.5
gi|158967071|gb|CP000033.3| Lactobacillus acidophilus NCFM, comp...  37.4    5.1

>gi|22416007|emb|AL772315.4| Mouse DNA sequence from clone RP23-54A13 on chromosome 4, complete

Score = 41.0 bits (44), Expect = 0.42
Identities = 28/31 (91%), Gaps = 2/31 (6%)

Query  6      CGTTCCAGTCACGATACATACCAATAACGAC  36
              ||||||||| |  ||||||||||||||||||
Sbjct  26821  CGTTCCAGTGA--ATACATACCAATAACGAC  26793

>gi|117168385|gb|AC190175.4| Pan troglodytes BAC clone CH251-542A7 from chromosome 7, complete

Score = 39.2 bits (42), Expect = 1.5
Identities = 26/29 (90%), Gaps = 0/29 (0%)

Query  19      ATACATACCAATAACGACTTCTATATGTA  47
               ||||||||||||||  | |||||||||||
Sbjct  166678  ATACATACCAATAAATATTTCTATATGTA  166650

>gi|45238691|gb|AC145377.2| Pan troglodytes BAC clone RP43-11P19 from chromosome 7, complete

Score = 39.2 bits (42), Expect = 1.5
Identities = 26/29 (90%), Gaps = 0/29 (0%)

Query  19      ATACATACCAATAACGACTTCTATATGTA  47
               ||||||||||||||  | |||||||||||
Sbjct  192749  ATACATACCAATAAATATTTCTATATGTA  192777

>gi|4926907|gb|AC004936.2| Homo sapiens PAC clone RP5-959C21 from 7, complete sequence

Score = 39.2 bits (42), Expect = 1.5
Identities = 26/29 (90%), Gaps = 0/29 (0%)

Query  19     ATACATACCAATAACGACTTCTATATGTA  47
              ||||||||||||||  | |||||||||||
Sbjct  12654  ATACATACCAATAAATATTTCTATATGTA  12626

>gi|31414500|emb|AL831797.4| Oryza sativa chromosome 12, . BAC OSJNBa0010M16 of library OSJNBa
from chromosome 12 of cultivar Nipponbare of ssp. japonica
of Oryza sativa (rice), complete sequence

Score = 39.2 bits (42), Expect = 1.5
Identities = 23/24 (96%), Gaps = 0/24 (0%)

Query  18      GATACATACCAATAACGACTTCTA  41
               ||||||||||||||| ||||||||
Sbjct  126346  GATACATACCAATAAAGACTTCTA  126369

>gi|158967071|gb|CP000033.3| Lactobacillus acidophilus NCFM, complete genome

Score = 37.4 bits (40), Expect = 5.1
Identities = 20/20 (100%), Gaps = 0/20 (0%)

Query  14      TCACGATACATACCAATAAC  33
               ||||||||||||||||||||
Sbjct  756012  TCACGATACATACCAATAAC  755993
Effective search space used: 890637973200
Query= GFAVMM201D933Z

                                                                     Score    E
Sequences producing significant alignments:                         (Bits)  Value

gi|332189094|gb|CP002684.1| Arabidopsis thaliana chromosome 1, c...  41.0    1.4
gi|313747610|gb|AC237431.3| Macaca mulatta Y Chr BAC RMAEX-106G5...  41.0    1.4
gi|168693784|gb|AC213321.3| MACACA MULATTA BAC clone CH250-541A1...  41.0    1.4
gi|8778954|gb|AC007932.3|F11A17 Arabidopsis thaliana chromosome ...  41.0    1.4
gi|256052438|ref|XM_002569731.1| Schistosoma mansoni GTP-binding...  39.2    5.0
gi|238867551|gb|CP001600.1| Edwardsiella ictaluri 93-146, comple...  39.2    5.0
gi|224465197|ref|NG_009929.1| Homo sapiens like-glycosyltransfer...  39.2    5.0
gi|147784643|emb|AM469007.2| Vitis vinifera contig VV78X098884.1...  39.2    5.0
gi|123663234|emb|AM455601.1| Vitis vinifera, whole genome shotgu...  39.2    5.0
gi|118344518|gb|AC192748.2| Gallus gallus BAC clone CH261-166H21...  39.2    5.0
gi|90309320|gb|AY607844.2| Edwardsiella ictaluri transposase-lik...  39.2    5.0
gi|55908997|gb|AC102414.8| Mus musculus chromosome 15, clone RP2...  39.2    5.0
gi|72096292|gb|AC162857.5| Mus musculus chromosome 15, clone RP2...  39.2    5.0
gi|22415845|emb|AL139016.8| Human DNA sequence from clone RP4-65...  39.2    5.0
gi|6572307|emb|Z82173.2| Human DNA sequence from clone SC22CB-1D...  39.2    5.0
gi|56410822|gb|AC140454.4| Mus musculus BAC clone RP24-163I14 fr...  39.2    5.0
gi|38707373|emb|BX255930.3| Zebrafish DNA sequence from clone DK...  39.2    5.0

>gi|332189094|gb|CP002684.1| Arabidopsis thaliana chromosome 1, complete sequence

Score = 41.0 bits (44), Expect = 1.4
Identities = 25/27 (93%), Gaps = 0/27 (0%)

Query  70        AACAGAAACCACTACGAAAGGAAAGAA  96
                 |||||||||  ||||||||||||||||
Sbjct  17871124  AACAGAAACTTCTACGAAAGGAAAGAA  17871098

>gi|313747610|gb|AC237431.3| Macaca mulatta Y Chr BAC RMAEX-106G5 complete sequence

Score = 41.0 bits (44), Expect = 1.4
Identities = 28/31 (91%), Gaps = 2/31 (6%)

Query  75     AAACCACTACGAAAGGAAAGAAATGCCTCTA  105
              ||||||  ||||||||||| |||||||||||
Sbjct  60448  AAACCA--ACGAAAGGAAATAAATGCCTCTA  60420

>gi|168693784|gb|AC213321.3| MACACA MULATTA BAC clone CH250-541A11 from chromosome y, complete

Score = 41.0 bits (44), Expect = 1.4
Identities = 28/31 (91%), Gaps = 2/31 (6%)

Query  75     AAACCACTACGAAAGGAAAGAAATGCCTCTA  105
              ||||||  ||||||||||| |||||||||||
Sbjct  62327  AAACCA--ACGAAAGGAAATAAATGCCTCTA  62355

>gi|8778954|gb|AC007932.3|F11A17 Arabidopsis thaliana chromosome 1 BAC F11A17 sequence, complete

Score = 41.0 bits (44), Expect = 1.4
Identities = 25/27 (93%), Gaps = 0/27 (0%)

Query  70     AACAGAAACCACTACGAAAGGAAAGAA  96
              |||||||||  ||||||||||||||||
Sbjct  32940  AACAGAAACTTCTACGAAAGGAAAGAA  32966

>gi|256052438|ref|XM_002569731.1| Schistosoma mansoni GTP-binding protein era, putative (Smp_164220)

Score = 39.2 bits (42), Expect = 5.0
Identities = 21/21 (100%), Gaps = 0/21 (0%)

Query  40   TTCTCTTTTCGTATTTGGAAC  60
            |||||||||||||||||||||
Sbjct  159  TTCTCTTTTCGTATTTGGAAC  139

>gi|238867551|gb|CP001600.1| Edwardsiella ictaluri 93-146, complete genome

Score = 39.2 bits (42), Expect = 5.0
Identities = 31/37 (84%), Gaps = 3/37 (8%)

Query  58       AACAGCTGTTATA---ACAGAAACCACTACGAAAGGA  91
                |||||| ||||||   ||| ||||||||||| |||||
Sbjct  1978968  AACAGCCGTTATATCCACATAAACCACTACGCAAGGA  1979004

>gi|224465197|ref|NG_009929.1| Homo sapiens like-glycosyltransferase (LARGE), RefSeqGene on

Score = 39.2 bits (42), Expect = 5.0
Identities = 30/36 (84%), Gaps = 0/36 (0%)

Query  66      TTATAACAGAAACCACTACGAAAGGAAAGAAATGCC  101
               ||| |||||||||||  || |||||||| ||| |||
Sbjct  603836  TTAAAACAGAAACCAAAACCAAAGGAAAAAAAAGCC  603871

>gi|147784643|emb|AM469007.2| Vitis vinifera contig VV78X098884.11, whole genome shotgun sequence

Score = 39.2 bits (42), Expect = 5.0
Identities = 24/26 (93%), Gaps = 0/26 (0%)

Query  71    ACAGAAACCACTACGAAAGGAAAGAA  96
             ||| |||||||||| |||||||||||
Sbjct  6827  ACAAAAACCACTACAAAAGGAAAGAA  6802

>gi|123663234|emb|AM455601.1| Vitis vinifera, whole genome shotgun sequence, contig VV78X123660.5,

Score = 39.2 bits (42), Expect = 5.0
Identities = 24/26 (93%), Gaps = 0/26 (0%)

Query  71     ACAGAAACCACTACGAAAGGAAAGAA  96
              ||| |||||||||| |||||||||||
Sbjct  34120  ACAAAAACCACTACAAAAGGAAAGAA  34145

>gi|118344518|gb|AC192748.2| Gallus gallus BAC clone CH261-166H21 from chromosome z, complete

Score = 39.2 bits (42), Expect = 5.0
Identities = 27/31 (88%), Gaps = 0/31 (0%)

Query  58      AACAGCTGTTATAACAGAAACCACTACGAAA  88
               ||||||||| ||||||||||||  || ||||
Sbjct  178051  AACAGCTGTCATAACAGAAACCCATATGAAA  178081

>gi|90309320|gb|AY607844.2| Edwardsiella ictaluri transposase-like protein gene, complete
cds; urease operon, complete sequence; and ammonium transporter
(amtB) and TnpA (tnpA) genes, complete cds

Score = 39.2 bits (42), Expect = 5.0
Identities = 31/37 (84%), Gaps = 3/37 (8%)

Query  58    AACAGCTGTTATA---ACAGAAACCACTACGAAAGGA  91
             |||||| ||||||   ||| ||||||||||| |||||
Sbjct  8030  AACAGCCGTTATATCCACATAAACCACTACGCAAGGA  7994

>gi|55908997|gb|AC102414.8| Mus musculus chromosome 15, clone RP24-474D5, complete sequence

Score = 39.2 bits (42), Expect = 5.0
Identities = 24/26 (93%), Gaps = 0/26 (0%)

Query  70     AACAGAAACCACTACGAAAGGAAAGA  95
              ||||||||||||||| |||||| |||
Sbjct  71851  AACAGAAACCACTACAAAAGGAGAGA  71876

>gi|72096292|gb|AC162857.5| Mus musculus chromosome 15, clone RP23-333C11, complete sequence

Score = 39.2 bits (42), Expect = 5.0
Identities = 24/26 (93%), Gaps = 0/26 (0%)

Query  70    AACAGAAACCACTACGAAAGGAAAGA  95
             ||||||||||||||| |||||| |||
Sbjct  3370  AACAGAAACCACTACAAAAGGAGAGA  3345

>gi|22415845|emb|AL139016.8| Human DNA sequence from clone RP4-658C17 on chromosome 1p11.1-13.3,

Score = 39.2 bits (42), Expect = 5.0
Identities = 22/23 (96%), Gaps = 0/23 (0%)

Query  87     AAGGAAAGAAATGCCTCTANAAG  109
              ||||||||||||||||||| |||
Sbjct  22086  AAGGAAAGAAATGCCTCTAGAAG  22108

>gi|6572307|emb|Z82173.2| Human DNA sequence from clone SC22CB-1D7 on chromosome 22 Contains
a novel gene and part of the LARGE gene for like-glycosyltransferase,

Score = 39.2 bits (42), Expect = 5.0
Identities = 30/36 (84%), Gaps = 0/36 (0%)

Query  66     TTATAACAGAAACCACTACGAAAGGAAAGAAATGCC  101
              ||| |||||||||||  || |||||||| ||| |||
Sbjct  15530  TTAAAACAGAAACCAAAACCAAAGGAAAAAAAAGCC  15495

>gi|56410822|gb|AC140454.4| Mus musculus BAC clone RP24-163I14 from 1, complete sequence

Score = 39.2 bits (42), Expect = 5.0
Identities = 28/32 (88%), Gaps = 4/32 (12%)

Query  66      TTATAACAGAAACCACTACGAAAGGAAAGAAA  97
               ||||||||||||||||    ||||||||||||
Sbjct  140248  TTATAACAGAAACCAC----AAAGGAAAGAAA  140275

>gi|38707373|emb|BX255930.3| Zebrafish DNA sequence from clone DKEY-101K6 in linkage group
9 Contains the 3' end of the mcm3ap gene for MCM3 minichromosome
maintenance deficient 3 (S. cerevisiae) associated protein,
the gene for a novel protein similar to vertebrate lanosterol
synthase (2,3-oxidosqualene-lanosterol cyclase) (LSS),
the gene for a novel protein similar to human and mouse leishmanolysin-like
(metallopeptidase M8 family) (LMLN), the
gene for a novel protein similar to vertebrate oxysterol binding
protein-like 10 (OSBPL10), the gene for a novel protein
similar to vertebrate transforming growth factor, beta receptor
II (70/80kDa) (TGFBR2), the snx4 gene for sorting nexin
4, the gene for a novel protein similar to vertebrate zinc
finger protein 148 (ZNF148) and the 5' end of the gene for a
novel protein similar to vertebrate solute carrier family 12
(potassium/chloride transporters), member 8 (SLC12A8), complete

Score = 39.2 bits (42), Expect = 5.0
Identities = 21/21 (100%), Gaps = 0/21 (0%)

Query  62      GCTGTTATAACAGAAACCACT  82
               |||||||||||||||||||||
Sbjct  124003  GCTGTTATAACAGAAACCACT  124023
Effective search space used: 3062586391620

Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS,
GSS,environmental samples or phase 0, 1 or 2 HTGS sequences)
Posted date: May 21, 2011 4:39 PM
Number of letters in database: 36,075,095,184
Number of sequences in database: 14,049,258

Matrix: blastn matrix 2 -3
Gap Penalties: Existence: 5, Extension: 2
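
The scoring parameters above (match 2, mismatch -3, gap existence 5, gap extension 2) can be checked against the raw scores printed in parentheses on each "Score =" line. A minimal sketch, assuming affine gap costs in which a gap of length k costs existence + k * extension; the function name `raw_score` is hypothetical, and ambiguity codes such as N are not given any special treatment:

```python
def raw_score(query, sbjct, match=2, mismatch=-3, gap_open=5, gap_extend=2):
    """Score two aligned rows of equal length ('-' marks a gap)."""
    score = 0
    gap_row = None  # which row the current gap run is in, if any
    for q, s in zip(query, sbjct):
        if q == "-" or s == "-":
            row = "query" if q == "-" else "sbjct"
            if gap_row != row:
                score -= gap_open  # charged once per gap run
                gap_row = row
            score -= gap_extend    # charged for every gap column
        else:
            gap_row = None
            score += match if q == s else mismatch
    return score


# Aligned rows of the Drosophila ananassae hit (printed raw score: 46).
query = "ACGCCGACGACAAGCGGCGAGTCCCTTGTGGACGATCTCGGCCAC"
sbjct = "ACGCCGACGACAAG-GGCGAGTCCATTTGGGACTTTCTCACCCAC"
print(raw_score(query, sbjct))
```

For this alignment the arithmetic is 37 matches, 7 mismatches, and one gap of length 1, which reproduces the printed 46. For some hits, for example Polymorphum gilvum with 33/40 identities and no gaps, the same arithmetic gives 45 where the report prints 44. Every raw score in this report is even, which suggests that odd scores are rounded down under this scoring scheme, although the output itself does not state this.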