# /hgtech/tools/solaris8/bin/fasta34_t -T 8 -b50 -d10 -E0.01 -H -Oha06115.fasta.nr -Q ha06115.ptfa /cdna2/lib/nr/nr 2
FASTA searches a protein or DNA sequence data bank
 version 34.26.5 April 26, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

ha06115, 125 aa
 vs /cdna2/lib/nr/nr library

2779448989 residues in 8089198 sequences
 statistics sampled from 60000 to 8088693 sequences
 Expectation_n fit: rho(ln(x))= 5.0242+/-0.000181; mu= 5.5443+/- 0.010
 mean_var=71.4004+/-13.830, 0's: 28 Z-trim: 29 B-trim: 0 in 0/66
 Lambda= 0.151783

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 36, opt: 24, open/ext: -10/-2, width: 16
The best scores are:                                       opt bits E(8089198)
gi|71152351|sp|Q9H246.1|CA021_HUMAN RecName: Full= ( 121)  789 181.0   4e-44
gi|124829072|gb|AAI33419.1| Chromosome 1 open read ( 121)  788 180.8 4.6e-44
gi|71152352|sp|Q8K207.1|CA021_MOUSE RecName: Full= ( 121)  774 177.7 3.9e-43
gi|149058409|gb|EDM09566.1| similar to RIKEN cDNA  ( 121)  773 177.5 4.5e-43
gi|50751176|ref|XP_422292.1| PREDICTED: similar to ( 121)  762 175.1 2.4e-42
gi|126306329|ref|XP_001366801.1| PREDICTED: simila ( 121)  759 174.4 3.8e-42
gi|224056992|ref|XP_002191100.1| PREDICTED: chromo ( 121)  757 174.0 5.1e-42
gi|149636353|ref|XP_001516128.1| PREDICTED: simila ( 121)  748 172.0   2e-41
gi|115313447|gb|AAI23936.1| Hypothetical protein M ( 121)  621 144.2 4.7e-33
gi|49115020|gb|AAH72863.1| MGC80268 protein [Xenop ( 126)  613 142.5 1.6e-32
gi|33416796|gb|AAH56102.1| MGC69115 protein [Xenop ( 128)  600 139.6 1.2e-31
gi|51858814|gb|AAH81606.1| Zgc:92140 [Danio rerio] ( 118)  473 111.8 2.6e-23
gi|47213907|emb|CAF95849.1| unnamed protein produc ( 112)  437 103.9   6e-21
gi|50925104|gb|AAH78652.1| Zgc:55943 protein [Dani ( 113)  406  97.1 6.7e-19
gi|118094202|ref|XP_001233452.1| PREDICTED: simila ( 133)  393  94.3 5.4e-18
gi|55957217|emb|CAI17843.1| chromosome 1 open read (  87)  381  91.5 2.4e-17
gi|148707513|gb|EDL39460.1| RIKEN cDNA 1700025G04, ( 132)  374  90.2 9.7e-17
gi|27881980|gb|AAH44550.1| Zgc:55943 [Danio rerio] ( 102)  297  73.2 9.5e-12
gi|47227935|emb|CAF97564.1| unnamed protein produc (  35)  205  52.7 4.9e-06
gi|47227936|emb|CAF97565.1| unnamed protein produc (  78)  171  45.5  0.0016
gi|210126703|gb|EEA74389.1| hypothetical protein B ( 109)  163  43.9  0.0068

>>gi|71152351|sp|Q9H246.1|CA021_HUMAN RecName: Full=Unch (121 aa)
 initn: 789 init1: 789 opt: 789 Z-score: 948.7 bits: 181.0 E(): 4e-44
Smith-Waterman score: 789; 99.174% identity (100.000% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           ::::::::::::::::::::::::::::.:::::::::::::::::::::::::::
gi|711     MGCASAKHVATVQNEEEAQKGKNYQNGDVFGDEYRIKPVEEVKYMKNGAEEEQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
gi|711 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::::
gi|711 EEDIT
        120

>>gi|124829072|gb|AAI33419.1| Chromosome 1 open reading (121 aa)
 initn: 788 init1: 788 opt: 788 Z-score: 947.5 bits: 180.8 E(): 4.6e-44
Smith-Waterman score: 788; 98.347% identity (100.000% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           ::::::::::::::::::::::::::::.:::::::::::::::::::::::::::
gi|124     MGCASAKHVATVQNEEEAQKGKNYQNGDVFGDEYRIKPVEEVKYMKNGAEEEQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       ::::::::::::::::::::::.:::::::::::::::::::::::::::::::::::::
gi|124 ARNQENLEKSASSNVRLKTNKEIPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::::
gi|124 EEDIT
        120

>>gi|71152352|sp|Q8K207.1|CA021_MOUSE RecName: Full=Unch (121 aa)
 initn: 774 init1: 774 opt: 774 Z-score: 930.9 bits: 177.7 E(): 3.9e-43
Smith-Waterman score: 774; 95.868% identity (100.000% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           :::::::::::::::::::.::.:::::.:::::::::::::::::::::::::::
gi|711     MGCASAKHVATVQNEEEAQRGKSYQNGDVFGDEYRIKPVEEVKYMKNGAEEEQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       ::::::::::::::.:::::::.:::::::::::::::::::::::::::::::::::::
gi|711 ARNQENLEKSASSNTRLKTNKEIPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::::
gi|711 EEDIT
        120

>>gi|149058409|gb|EDM09566.1| similar to RIKEN cDNA 1700 (121 aa)
 initn: 773 init1: 773 opt: 773 Z-score: 929.7 bits: 177.5 E(): 4.5e-43
Smith-Waterman score: 773; 95.868% identity (100.000% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           ::::::::::::::::::::::.:::::.:::::::::::::::::::::::::::
gi|149     MGCASAKHVATVQNEEEAQKGKSYQNGDVFGDEYRIKPVEEVKYMKNGAEEEQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       ::::::::::::::.:::::::.::.::::::::::::::::::::::::::::::::::
gi|149 ARNQENLEKSASSNTRLKTNKEIPGFVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::::
gi|149 EEDIT
        120

>>gi|50751176|ref|XP_422292.1| PREDICTED: similar to C1o (121 aa)
 initn: 762 init1: 762 opt: 762 Z-score: 916.7 bits: 175.1 E(): 2.4e-42
Smith-Waterman score: 762; 92.562% identity (100.000% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           :::::::::.:::::::.::::::::::.:::::::::::::::::::.:..::::
gi|507     MGCASAKHVSTVQNEEETQKGKNYQNGDVFGDEYRIKPVEEVKYMKNGGEDDQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       ::::::::::::::::::.:::.:::::::::::::::::::::::::::::::::::::
gi|507 ARNQENLEKSASSNVRLKSNKEIPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::.:
gi|507 EEDVT
        120

>>gi|126306329|ref|XP_001366801.1| PREDICTED: similar to (121 aa)
 initn: 759 init1: 759 opt: 759 Z-score: 913.2 bits: 174.4 E(): 3.8e-42
Smith-Waterman score: 759; 95.041% identity (99.174% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           ::::::::::::::::::::::::::::.:.:::::::::::::::::.:.:::::
gi|126     MGCASAKHVATVQNEEEAQKGKNYQNGDVFADEYRIKPVEEVKYMKNGGEDEQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       :::::::::::::::::: .::::::::::::::::::::::::::::::::::::::::
gi|126 ARNQENLEKSASSNVRLKPTKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::::
gi|126 EEDIT
        120

>>gi|224056992|ref|XP_002191100.1| PREDICTED: chromosome (121 aa)
 initn: 757 init1: 757 opt: 757 Z-score: 910.8 bits: 174.0 E(): 5.1e-42
Smith-Waterman score: 757; 91.736% identity (100.000% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           :::::::::.:::::::.::::::::::.:::::::::::::::::::.:..::::
gi|224     MGCASAKHVSTVQNEEETQKGKNYQNGDVFGDEYRIKPVEEVKYMKNGGEDDQKIA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       ::::::::::::::.:::.:::.:::::::::::::::::::::::::::::::::::::
gi|224 ARNQENLEKSASSNTRLKSNKEIPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
         60        70        80        90       100       110

ha0611 EEDIT
       :::.:
gi|224 EEDVT
        120

>>gi|149636353|ref|XP_001516128.1| PREDICTED: similar to (121 aa)
 initn: 781 init1: 748 opt: 748 Z-score: 900.1 bits: 172.0 E(): 2e-41
Smith-Waterman score: 748; 90.909% identity (99.174% similar) in 121 aa overlap (5-125:1-121)

               10        20        30        40        50        60
ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA
           :::::.:::.::::::::::::::::::.:::::::::::::::::::::.:::.:
gi|149     MGCASGKHVSTVQNEEEAQKGKNYQNGDVFGDEYRIKPVEEVKYMKNGAEDEQKVA
                   10        20        30        40        50

               70        80        90       100       110       120
ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE
       .:::::::::::.::::: .:::::::::::.::::::::::::::::::::::::::::
VRNQENLEKSASTNVRLKPTKEVPGLVHQPRTNMHISESQQEFFRMLDEKIEKGRDYCSE 60 70 80 90 100 110 ha0611 EEDIT :::.: gi|149 EEDVT 120 >>gi|115313447|gb|AAI23936.1| Hypothetical protein MGC14 (121 aa) initn: 621 init1: 621 opt: 621 Z-score: 749.8 bits: 144.2 E(): 4.7e-33 Smith-Waterman score: 621; 76.860% identity (89.256% similar) in 121 aa overlap (5-125:1-121) 10 20 30 40 50 60 ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA :::::::::.:::::..::.:::::::: : ::::::::::::::::: :::::.. gi|115 MGCASAKHVSTVQNEDDAQNGKNYQNGDAFCDEYRIKPVEEVKYMKNGEEEEQKVV 10 20 30 40 50 70 80 90 100 110 120 ha0611 ARNQENLEKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGRDYCSE .:::::::::.. .: ..: :. : : : :.::::::::::::::::::::.::::: gi|115 SRNQENLEKSVTHAARSRSNVEAAGAGHPYRINIHISESQQEFFRMLDEKIEKGQDYCSE 60 70 80 90 100 110 ha0611 EEDIT ::::: gi|115 EEDIT 120 >>gi|49115020|gb|AAH72863.1| MGC80268 protein [Xenopus l (126 aa) initn: 619 init1: 356 opt: 613 Z-score: 740.1 bits: 142.5 E(): 1.6e-32 Smith-Waterman score: 613; 76.190% identity (85.714% similar) in 126 aa overlap (5-125:1-126) 10 20 30 40 50 60 ha0611 PNETMGCASAKHVATVQNEEEAQKGKNYQNGDLFGDEYRIKPVEEVKYMKNGAEEEQKIA :::::::::.:::::..::.:::::::: : ::::::::::::::::: :::::: gi|491 MGCASAKHVSTVQNEDDAQNGKNYQNGDAFCDEYRIKPVEEVKYMKNGEEEEQKIL 10 20 30 40 50 70 80 90 100 110 ha0611 ARNQENL-----EKSASSNVRLKTNKEVPGLVHQPRANMHISESQQEFFRMLDEKIEKGR ..::::: ::::. ..: :.: :. : : : :.::::::::::::::::::::: gi|491 SKNQENLVSCLEEKSATHTARSKSNTEAAGAGHPYRINIHISESQQEFFRMLDEKIEKGR 60 70 80 90 100 110 120 ha0611 DYCSEEEDIT :::::::::: gi|491 DYCSEEEDIT 120 125 residues in 1 query sequences 2779448989 residues in 8089198 library sequences Tcomplib [34.26] (8 proc) start: Thu Apr 16 16:03:10 2009 done: Thu Apr 16 16:06:47 2009 Total Scan time: 616.600 Total Display time: 0.010 Function used was FASTA [version 34.26.5 April 26, 2007]