# /hgtech/tools/solaris8/bin/fasta34_t -T 8 -b50 -d10 -E0.01 -H -Oah01915.fasta.nr -Q ah01915.ptfa /cdna2/lib/nr/nr 2
FASTA searches a protein or DNA sequence data bank
 version 34.26.5 April 26, 2007
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

ah01915, 207 aa
 vs /cdna2/lib/nr/nr library

3071326396 residues in 8985982 sequences
 statistics sampled from 60000 to 8971999 sequences
 Expectation_n fit: rho(ln(x))= 5.7531+/-0.000194; mu= 6.1151+/- 0.011
 mean_var=103.5978+/-19.842, 0's: 46 Z-trim: 106 B-trim: 33 in 1/66
 Lambda= 0.126008

FASTA (3.5 Sept 2006) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 36, opt: 24, open/ext: -10/-2, width: 16
The best scores are: opt bits E(8985982)
gi|37537960|sp|Q8NHV9.1|RHXF1_HUMAN RecName: Full= ( 184) 1279 242.0 4.9e-62
gi|109132103|ref|XP_001084578.1| PREDICTED: simila ( 185) 1062 202.5 3.7e-50
gi|3900848|gb|AAC78617.1| match to EST AA361117 (N ( 148) 1021 195.0 5.5e-48
gi|81674226|gb|AAI09513.1| ESX homeobox 1 [Bos tau ( 279) 395 81.5 1.6e-13
gi|74008892|ref|XP_538156.2| PREDICTED: similar to ( 516) 375 78.1 3e-12
gi|114690017|ref|XP_001138831.1| PREDICTED: simila ( 51) 336 70.1 7.7e-11
gi|74008894|ref|XP_853890.1| PREDICTED: similar to ( 327) 322 68.2 1.7e-09
gi|116241356|sp|Q8N693.3|ESX1_HUMAN RecName: Full= ( 406) 317 67.4 3.8e-09
gi|31566395|gb|AAH53599.1| ESX homeobox 1 [Homo sa ( 406) 317 67.4 3.8e-09
gi|109131769|ref|XP_001092145.1| PREDICTED: simila ( 361) 307 65.6 1.2e-08
gi|119924820|ref|XP_001253626.1| PREDICTED: simila ( 197) 303 64.6 1.3e-08
gi|194228162|ref|XP_001493708.2| PREDICTED: simila ( 449) 299 64.2 4e-08
gi|109512386|ref|XP_001055261.1| PREDICTED: simila ( 324) 288 62.1 1.2e-07
gi|190360188|sp|P0C7M4.1|RHF2B_HUMAN RecName: Full ( 288) 280 60.6 3.1e-07
gi|18996777|gb|AAL83210.1|AF465939_1 paired-like h ( 286) 279 60.4 3.5e-07
gi|37537987|sp|Q9BQY4.1|RHXF2_HUMAN RecName: Full= ( 288) 279 60.4 3.6e-07
gi|229288886|gb|EEN59578.1| hypothetical protein B ( 275) 278 60.2 3.9e-07
gi|11526760|gb|AAG36768.1|AF201698_1 homeobox prot ( 227) 272 59.0 7.3e-07
gi|219465754|ref|XP_002230526.1| hypothetical prot ( 286) 272 59.1 8.6e-07
gi|38565041|gb|AAR23915.1| peml [Marmota monax] ( 182) 269 58.4 9e-07
gi|109132107|ref|XP_001084818.1| PREDICTED: simila ( 287) 266 58.0 1.8e-06
gi|167874594|gb|EDS37977.1| conserved hypothetical ( 172) 262 57.1 2.1e-06
gi|109510351|ref|XP_001077350.1| PREDICTED: simila ( 234) 263 57.4 2.3e-06
gi|110559940|gb|ABG76208.1| Shox2 [Xenopus tropica ( 311) 262 57.3 3.2e-06
gi|109132105|ref|XP_001084696.1| PREDICTED: simila ( 283) 261 57.1 3.4e-06
gi|2739454|gb|AAB94670.1| paired-like homeodomain ( 314) 261 57.1 3.7e-06
gi|123222504|emb|CAM27212.1| extraembryonic sperma ( 314) 261 57.1 3.7e-06
gi|148691948|gb|EDL23895.1| extraembryonic, sperma ( 318) 261 57.1 3.7e-06
gi|148691949|gb|EDL23896.1| extraembryonic, sperma ( 338) 261 57.2 3.9e-06
gi|3599505|gb|AAC35366.1| homeobox protein SPX1 [M ( 382) 261 57.2 4.2e-06
gi|4102918|gb|AAD01621.1| homeodomain protein EPX ( 387) 261 57.2 4.3e-06
gi|145559528|sp|O60902.4|SHOX2_HUMAN RecName: Full ( 331) 260 57.0 4.3e-06
gi|126342299|ref|XP_001371776.1| PREDICTED: simila ( 400) 261 57.2 4.4e-06
gi|167234447|ref|NP_001107838.1| aristaless [Tribo ( 309) 259 56.8 4.7e-06
gi|229295137|gb|EEN65785.1| retinal homeobox prote ( 320) 259 56.8 4.8e-06
gi|119884923|ref|XP_878452.2| PREDICTED: similar t ( 319) 258 56.6 5.4e-06
gi|219487155|ref|XP_002240541.1| retinal homeobox ( 320) 258 56.6 5.4e-06
gi|119884921|ref|XP_878355.2| PREDICTED: similar t ( 331) 258 56.6 5.6e-06
gi|66932865|gb|AAY58267.1| reproductive homeobox o ( 227) 255 55.9 6.2e-06
gi|56694798|gb|AAW23061.1| Prop-a [Oikopleura dioi ( 315) 256 56.2 6.9e-06
gi|224091815|ref|XP_002188119.1| PREDICTED: retina ( 316) 255 56.1 7.8e-06
gi|148683583|gb|EDL15530.1| short stature homeobox ( 309) 254 55.9 8.8e-06
gi|149048355|gb|EDM00931.1| short stature homeobox ( 319) 254 55.9 9e-06
gi|26345180|dbj|BAC36240.1| unnamed protein produc ( 227)
252 55.4 9e-06 gi|18202340|sp|P70390.1|SHOX2_MOUSE RecName: Full= ( 331) 254 55.9 9.2e-06 gi|149048354|gb|EDM00930.1| short stature homeobox ( 331) 254 55.9 9.2e-06 gi|194208258|ref|XP_001491292.2| PREDICTED: simila ( 202) 251 55.2 9.4e-06 gi|156538657|ref|XP_001607712.1| PREDICTED: simila ( 413) 255 56.2 9.5e-06 gi|109465829|ref|XP_001059871.1| PREDICTED: simila ( 312) 253 55.7 1e-05 gi|229276193|gb|EEN47004.1| Q50 paired-like homeod ( 320) 253 55.7 1e-05 >>gi|37537960|sp|Q8NHV9.1|RHXF1_HUMAN RecName: Full=Rhox (184 aa) initn: 1279 init1: 1279 opt: 1279 Z-score: 1271.1 bits: 242.0 E(): 4.9e-62 Smith-Waterman score: 1279; 100.000% identity (100.000% similar) in 184 aa overlap (24-207:1-184) 10 20 30 40 50 60 ah0191 LHPLTPTPAADATEFVQGRSAPAMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHV ::::::::::::::::::::::::::::::::::::: gi|375 MARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHV 10 20 30 70 80 90 100 110 120 ah0191 GQGAPGLMGNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEEPAQAAMEGPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|375 GQGAPGLMGNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEEPAQAAMEGPQ 40 50 60 70 80 90 130 140 150 160 170 180 ah0191 PENMQPRTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRAR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|375 PENMQPRTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRAR 100 110 120 130 140 150 190 200 ah0191 CRRHQRELMLANELRADPDDCVYIVVD ::::::::::::::::::::::::::: gi|375 CRRHQRELMLANELRADPDDCVYIVVD 160 170 180 >>gi|109132103|ref|XP_001084578.1| PREDICTED: similar to (185 aa) initn: 1131 init1: 631 opt: 1062 Z-score: 1057.9 bits: 202.5 E(): 3.7e-50 Smith-Waterman score: 1062; 85.870% identity (91.304% similar) in 184 aa overlap (24-207:1-183) 10 20 30 40 50 60 ah0191 LHPLTPTPAADATEFVQGRSAPAMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHV ::::::::: ::::.::::::: ::.::::: .:::: gi|109 MARSLVHDTKFYCLNVYQVKISSTPELGAASRVEGHV 10 20 30 70 80 90 100 110 120 ah0191 GQGAPGLMGNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEEPAQAAMEGPQ 
::::::::::::::::::::: ::: :::: ::::::: ::: :: ::::::::: :: gi|109 GQGAPGLMGNMNPEGGVNHENYMNRYGGMIHEGGGGNQGPRQLQQPL-EEPAQAAMEDPQ 40 50 60 70 80 90 130 140 150 160 170 180 ah0191 PENMQPRTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRAR ::::::: :: ::: :::.::::::..::::::::::::::::::::::::::::::::: gi|109 PENMQPRIRRKKFTPLQVQELESVFQRTQYPDVPTRRELAENLGVTEDKVRVWFKNKRAR 100 110 120 130 140 150 190 200 ah0191 CRRHQRELMLANELRADPDDCVYIVVD :::.:::::::::: ::::.::::..: gi|109 CRRYQRELMLANELLADPDNCVYIILDEP 160 170 180 >>gi|3900848|gb|AAC78617.1| match to EST AA361117 (NID:g (148 aa) initn: 1021 init1: 1021 opt: 1021 Z-score: 1018.9 bits: 195.0 E(): 5.5e-48 Smith-Waterman score: 1021; 100.000% identity (100.000% similar) in 148 aa overlap (24-171:1-148) 10 20 30 40 50 60 ah0191 LHPLTPTPAADATEFVQGRSAPAMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHV ::::::::::::::::::::::::::::::::::::: gi|390 MARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHV 10 20 30 70 80 90 100 110 120 ah0191 GQGAPGLMGNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEEPAQAAMEGPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: gi|390 GQGAPGLMGNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEEPAQAAMEGPQ 40 50 60 70 80 90 130 140 150 160 170 180 ah0191 PENMQPRTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRAR ::::::::::::::::::::::::::::::::::::::::::::::::::: gi|390 PENMQPRTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVR 100 110 120 130 140 190 200 ah0191 CRRHQRELMLANELRADPDDCVYIVVD >>gi|81674226|gb|AAI09513.1| ESX homeobox 1 [Bos taurus] (279 aa) initn: 425 init1: 295 opt: 395 Z-score: 400.2 bits: 81.5 E(): 1.6e-13 Smith-Waterman score: 395; 43.038% identity (67.722% similar) in 158 aa overlap (41-192:38-191) 20 30 40 50 60 ah0191 DATEFVQGRSAPAMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHVGQGAPGLM-- ... :: : .::..: .. :.. .:. gi|816 IGFHNLGIGEDERHDVEPTLISEVLNGVEDETRSSPEPGEAAAAAAANYFGEADSNLLDD 10 20 30 40 50 60 70 80 90 100 110 120 ah0191 ----GNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEEPAQAAMEGPQPENM :: : : :. . :: : ::.::. :: : : :::. :..::: . 
gi|816 ENREGNENRAGDENRAGDENRAGDENREGNGGDIEP---P-PQQEEPGPQAVQGPQNAAV 70 80 90 100 110 120 130 140 150 160 170 180 ah0191 QPRTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRARCRRH .:: :: :: ::. ::: :::..::::: .:...:. :...: .:.:::.:.::. ::. gi|816 KPRRCRTVFTQLQLLELERVFRRVQYPDVFAREDIARRLNLAERRVQVWFQNRRAKWRRY 130 140 150 160 170 180 190 200 ah0191 QRELMLANELRADPDDCVYIVVD :: ::. : gi|816 QRALMFRNVHPAALGHPMGVFFNGPYHVFQPGWRYVPAVPRPGLPPGVPPPPVLPAPLPP 190 200 210 220 230 240 >>gi|74008892|ref|XP_538156.2| PREDICTED: similar to ext (516 aa) initn: 366 init1: 267 opt: 375 Z-score: 377.1 bits: 78.1 E(): 3e-12 Smith-Waterman score: 375; 48.175% identity (71.533% similar) in 137 aa overlap (62-198:372-501) 40 50 60 70 80 90 ah0191 TVFYCLSVYQVKISPTPQLGAASSAEGHVGQGAPGLMGNMNPEGGVNHENGMNRDGGMIP .: .: :..: :: :.:. .:..: : gi|740 EGHFNLEGDINLEGDFNRQGHFNLEGDFNHEGHFNLEGDINCEGDFNREGIVNHEGEGIH 350 360 370 380 390 400 100 110 120 130 140 150 ah0191 EGGGGNQEPRQQPQPPPEEPAQAAMEGPQPENMQPRTRRTKFTLLQVEELESVFRHTQYP : ::: :: : : : : .: ..:::. . :..:.::: .::.::::.:.::::: gi|740 EVGGGVQELGQL--APEELPQAVAARAPQPRRRR-RAQRNKFTQVQVQELESAFQHTQYP 410 420 430 440 450 160 170 180 190 200 ah0191 DVPTRRELAENLGVTEDKVRVWFKNKRARCRRHQRELMLANELRADPDDCVYIVVD :: ::.:::. . ::: .:.:::::.::. 
.: .: :..:: : gi|740 DVLTRQELARRMDVTEIRVQVWFKNRRAKYKRDER----ASKLRNTPPTNLNHLFILMLD 460 470 480 490 500 510 gi|740 GP >>gi|114690017|ref|XP_001138831.1| PREDICTED: similar to (51 aa) initn: 336 init1: 336 opt: 336 Z-score: 351.9 bits: 70.1 E(): 7.7e-11 Smith-Waterman score: 336; 98.000% identity (100.000% similar) in 50 aa overlap (158-207:2-51) 130 140 150 160 170 180 ah0191 TRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRARCRRHQRE :::::::::::::::::::::::::::::: gi|114 MELAENLGVTEDKVRVWFKNKRARCRRHQRE 10 20 30 190 200 ah0191 LMLANELRADPDDCVYIVVD ::::.::::::::::::::: gi|114 LMLASELRADPDDCVYIVVD 40 50 >>gi|74008894|ref|XP_853890.1| PREDICTED: similar to PEP (327 aa) initn: 235 init1: 235 opt: 322 Z-score: 327.6 bits: 68.2 E(): 1.7e-09 Smith-Waterman score: 322; 41.221% identity (69.466% similar) in 131 aa overlap (57-186:112-238) 30 40 50 60 70 80 ah0191 SLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHVGQGAPGLMGNMNPEGGVNHENGMNRD .: ::: . . . :: . :..:. . gi|740 THEGAAAEFTPDHGGGADQGQRDGDGQGQRDGDDGQG----QRDRDGEGQRDGEEAMGDE 90 100 110 120 130 90 100 110 120 130 140 ah0191 GGMIP-EGGGGNQEPRQQPQPPPEEPAQAAMEGPQPENMQPRTRRTKFTLLQVEELESVF .. .: .: :. . . . : : .: ::.. : : : : .: :. .:..:::::: gi|740 AAGFPLPAGDGTPQGHGDQGPAPGRPPQATVACPLPGNGQQAGQRIVFSRVQLHELESVF 140 150 160 170 180 190 150 160 170 180 190 200 ah0191 RHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRARCRRHQRELMLANELRADPDDCVYIV ..::::..:::.:::. . :.: .:.:::::.::. ::::: gi|740 QRTQYPSAPTRQELARFMDVSEARVQVWFKNRRAKWRRHQRAVRFRTMPPVALVPPIVIN 200 210 220 230 240 250 ah0191 VD gi|740 LGGPCRTILIQEPNRIWVLQEPLLLGPPQPLMPSFPVVFLPPLPWLPPPLPLCGYPPVAG 260 270 280 290 300 310 >>gi|116241356|sp|Q8N693.3|ESX1_HUMAN RecName: Full=Home (406 aa) initn: 263 init1: 263 opt: 317 Z-score: 321.5 bits: 67.4 E(): 3.8e-09 Smith-Waterman score: 356; 43.478% identity (63.975% similar) in 161 aa overlap (73-197:51-210) 50 60 70 80 90 ah0191 KISPTPQLGAASSAEGHVGQGAPGLMGNMNPEGGVNHENGMNRDGGMIP------EGGGG :: :.. ::... .:. 
.: ::::: gi|116 EDIEEVNDEKLTVTSLMARGGEDEENTRSKPEYGTEAENNVGTEGS-VPSDDQDREGGGG 30 40 50 60 70 100 110 120 ah0191 N-----QE------PRQQPQPPP--------EEPAQAAMEGPQP-------ENMQP---- . :: :.:: . :: ::: :...::::: :. :: gi|116 HEPEQQQEEPPLTKPEQQQEEPPLLELKQEQEEPPQTTVEGPQPAEGPQTAEGPQPPERK 80 90 100 110 120 130 130 140 150 160 170 180 ah0191 RTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRARCRRHQR : ::: :: .:..:::. : ..::::: .:..:: :..:::.:.:::.:.::. .:.:: gi|116 RRRRTAFTQFQLQELENFFDESQYPDVVARERLAARLNLTEDRVQVWFQNRRAKWKRNQR 140 150 160 170 180 190 190 200 ah0191 ELMLANELRADPDDCVYIVVD ::: : :: gi|116 VLMLRNTATADLAHPLDMFLGGAYYAAPALDPALCVHLVPQLPRPPVLPVPPMPPRPPMV 200 210 220 230 240 250 >>gi|31566395|gb|AAH53599.1| ESX homeobox 1 [Homo sapien (406 aa) initn: 263 init1: 263 opt: 317 Z-score: 321.5 bits: 67.4 E(): 3.8e-09 Smith-Waterman score: 356; 43.478% identity (63.975% similar) in 161 aa overlap (73-197:51-210) 50 60 70 80 90 ah0191 KISPTPQLGAASSAEGHVGQGAPGLMGNMNPEGGVNHENGMNRDGGMIP------EGGGG :: :.. ::... .:. .: ::::: gi|315 EDIEEVNDEKLTVTSLMARGGEDEENTRSKPEYGTEAENNVGTEGS-VPSDDQDREGGGG 30 40 50 60 70 100 110 120 ah0191 N-----QE------PRQQPQPPP--------EEPAQAAMEGPQP-------ENMQP---- . :: :.:: . :: ::: :...::::: :. :: gi|315 HEPEQQQEEPPLTKPEQQQEEPPLLELKQEQEEPPQTTVEGPQPAEGPQTAEGPQPPERK 80 90 100 110 120 130 130 140 150 160 170 180 ah0191 RTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRARCRRHQR : ::: :: .:..:::. : ..::::: .:..:: :..:::.:.:::.:.::. 
.:.::
gi|315 RRRRTAFTQFQLQELENFFDESQYPDVVARERLAARLNLTEDRVQVWFQNRRAKWKRNQR
140 150 160 170 180 190 190 200
ah0191 ELMLANELRADPDDCVYIVVD
::: : ::
gi|315 VLMLRNTATADLAHPLDMFLGGAYYAAPALDPALCVHLVPQLPRPPVLPVPPMPPRPPMV
200 210 220 230 240 250

>>gi|109131769|ref|XP_001092145.1| PREDICTED: similar to (361 aa)
 initn: 377 init1: 250 opt: 307 Z-score: 312.3 bits: 65.6 E(): 1.2e-08
Smith-Waterman score: 360; 42.675% identity (66.879% similar) in 157 aa overlap (48-192:51-205)

20 30 40 50 60 70
ah0191 GRSAPAMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHVGQGAPGLMGNMNPEGGV
:. :: ::..:: . ... :::
gi|109 EDIEEVNDEKLTVTSLMARGGEDEENARSEPEYGAE--AENNVGTVGFVPSDDQDREGGG
30 40 50 60 70 80 90 100 110 120
ah0191 NHENGMNRDGGMIPEGGGGNQEPRQ-QPQPPPEEPAQAAMEGPQP-------ENMQP---
.:: .... .:: ..:: .:. ::: ::..::::: :. ::
gi|109 GHEPEQQQEEPPLPEPEQQQEEPPLLEPKQEQEEPPQATVEGPQPAEGPQTAEGPQPPER
80 90 100 110 120 130 130 140 150 160 170 180
ah0191 -RTRRTKFTLLQVEELESVFRHTQYPDVPTRRELAENLGVTEDKVRVWFKNKRARCRRHQ
: ::: :: .:..:::. : ..::::: .:..:: :..:::.:.:::.:.::. .:.:
gi|109 KRRRRTAFTQFQLQELENFFDEAQYPDVVARERLAARLNLTEDRVQVWFQNRRAKWKRNQ
140 150 160 170 180 190 190 200
ah0191 RELMLANELRADPDDCVYIVVD
: ::: :
gi|109 RVLMLRNIAAAALARPAEVFLGGPYNATPSLDPALCVHLVPQLPRPPVPPMPPRPPMVPM
200 210 220 230 240 250

207 residues in 1 query sequences
3071326396 residues in 8985982 library sequences
 Tcomplib [34.26] (8 proc)
 start: Wed Jun 17 19:23:53 2009 done: Wed Jun 17 19:27:38 2009
 Total Scan time: 848.040 Total Display time: 0.030

Function used was FASTA [version 34.26.5 April 26, 2007]