BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q60952|Q60952 Intranuclear protein. (446 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q60952|Q60952 Intranuclear protein. 470 e-135 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 31 0.003 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 26 0.072 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 21 2.4 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 470 bits (1210), Expect = e-135 Identities = 446/446 (100%), Positives = 446/446 (100%) Query: 1 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL 60 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL Sbjct: 1 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL 60 Query: 61 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT 120 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT Sbjct: 61 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT 120 Query: 121 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA 180 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA Sbjct: 121 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA 180 Query: 181 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL 240 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL Sbjct: 181 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL 240 Query: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA Sbjct: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 Query: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA Sbjct: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 Query: 361 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA 420 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA Sbjct: 361 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA 420 Query: 421 LLRAQLALERKRGRTTLHGPCRPAVS 446 LLRAQLALERKRGRTTLHGPCRPAVS Sbjct: 421 LLRAQLALERKRGRTTLHGPCRPAVS 446 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 30.8 bits (68), Expect = 0.003 Identities = 45/184 (24%), Positives = 83/184 (44%), Gaps = 8/184 (4%) Query: 10 QLAAGQATLTEQEQELAA-LRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLL 68 +L +AT E++ + L E Q + A + +AL L+ A++ + Q L Sbjct: 152 KLKEKEATFVSMEKKFQSELLNERDIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQ 211 Query: 69 EQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELC 128 ++ ++ L LQ L + ++L+E L+ + IQ E L E+ Sbjct: 212 KEKRIAEELIVQIEGLQNNLMQMKEDKKKLQEELKEKLDLIQVL----QEKITLLTTEIK 267 Query: 129 QKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEEILM 188 KE L+ K + S+ + + S+ QE Q++ L +KEL++ V +RE E+ + Sbjct: 268 DKEASLQSTTSKLAEKESE---VDKLSSMYQESQDQLMNLTSEIKELKVEVQKRERELEL 324 Query: 189 LREA 192 RE+ Sbjct: 325 KRES 328 Score = 20.4 bits (41), Expect = 4.4 Identities = 83/395 (21%), Positives = 159/395 (40%), Gaps = 32/395 (8%) Query: 5 QDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKE 64 +D L + + L E+E E+ L + Q S+ Q S+ + L+ + K + L+ K Sbjct: 267 KDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKRERELELKR 326 Query: 65 QSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQ 124 +S + L+ L + + A ++ E I E ++ + A Sbjct: 327 ES---EDNLNVRLNSLLVERDESKKELDAIQKEYSEFKSISEKKVASDA----------- 372 Query: 125 QELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRL----LESLKELQLTVA 180 + L ++E+ L QL E+ G + + + +E RR+ LE++ +L+L V Sbjct: 373 KLLGEQEKRLHQLEEQLGTASDEVRKNNVLIADLTQEKENLRRMLDAELENISKLKLEVQ 432 Query: 181 QREEEILMLREASSPSHRALPAEKPALQPLPA-----QQELERLQTALRQTEAREIEWRE 235 +E + R +S + L + L A Q ELE +T+LR+ E + Sbjct: 433 VTQETLEKSRSDASDIAQQLQQSRHLCSKLEAEVSKLQMELEETRTSLRRNID---ETKR 489 Query: 236 KAQDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPH 295 A+ LA L ++ + E + + LQ ELV + E Sbjct: 490 GAELLAAELTTTRELLKKTNEEMHTMSHELAAVTENCDNLQTELVDVYKKAERAA----- 544 Query: 296 STSRADQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNA 355 + ++ V E + +E + + E ++ L + LER +++ E+ + L Sbjct: 545 DELKQEKNIVVTLEKELTFLEAQITREKESRKNLEEELERATESLDEMNRNAFALAKELE 604 Query: 356 QLRTALEQVERERRKLKRDSVRASRAGSLEARETM 390 + + +E ER L++ SV + S E+RE + Sbjct: 605 LANSHISSLEDEREVLQK-SVSEQKQISQESRENL 638 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 26.2 bits (56), Expect = 0.072 Identities = 63/301 (20%), Positives = 141/301 (45%), Gaps = 19/301 (6%) Query: 5 QDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQA--------EALQEALSKA 56 Q G L + L+ Q +E+ A E+ + +E++A A E+L++ + A Sbjct: 77 QHNMGLLLLEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKKREESLKKDVGIA 136 Query: 57 QAALQEKEQSLLE-QAELSHTLEASTAALQATLDTCQARARQLEEA-LRIREGE-IQAQA 113 + + E++L E +AE + T ++ + + + ++L +A ++R E +QA+A Sbjct: 137 KECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEA 196 Query: 114 LQHHEVTQHLQQELCQKEEEL-RQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESL 172 ++H + + +E+ +E++L R+L ++EN + ++ ER++ ++ E L Sbjct: 197 NRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERL 256 Query: 173 KELQLTVAQREEEILMLREASSPSHRALPAEKPALQPLPAQQELER--LQTALRQTEARE 230 + Q+++ QRE+ I + + + L K + E ++ L+ AL RE Sbjct: 257 LDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKRE 316 Query: 231 IEWREKAQDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQ 290 L + ++S L + + +V ERES + ++EL+++ + + ++ Sbjct: 317 EVCFYSHNSLLFLVLHYRSSKKFLGD-----KIAVSERESSLLKKEQELLVAEEKIASKE 371 Query: 291 S 291 S Sbjct: 372 S 372 Score = 22.3 bits (46), Expect = 1.1 Identities = 32/194 (16%), Positives = 90/194 (45%), Gaps = 1/194 (0%) Query: 22 EQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAEL-SHTLEAS 80 E+E + L+ E + +E+ A ++E +Q L+ + L++++ + + E S ++E Sbjct: 348 ERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVE 407 Query: 81 TAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQLLEK 140 + + + + +Q E+ + +E +++ Q+ E + + ++ +E+ + L+ Sbjct: 408 IESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVAT 467 Query: 141 AGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEEILMLREASSPSHRAL 200 + +++++ ++ E ++ L SL++ + V +++ L+ +S Sbjct: 468 EEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLE 527 Query: 201 PAEKPALQPLPAQQ 214 K L L AQ+ Sbjct: 528 MKLKEELDDLRAQK 541 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 21.2 bits (43), Expect = 2.4 Identities = 24/86 (27%), Positives = 34/86 (38%), Gaps = 13/86 (15%) Query: 238 QDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHST 297 +D A SLA+ S S + S E E + ++L+L LE P Sbjct: 62 EDEADSLAKEYGSPSVI---------SSYEDEPPNRGEIDQLILMEPVLENN----PERR 108 Query: 298 SRADQGPKVGQESQPGEVETEPSPGV 323 GP G+ES+P E +P V Sbjct: 109 FVVVTGPGAGKESEPAPSIVETTPPV 134 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q60952|Q60952 Intranuclear protein. 441 e-127 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 180 2e-48 Sequences not found previously or not previously below threshold: swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 34 3e-04 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 29 0.010 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 24 0.24 swiss|Q07856|VE6_HPV65 E6 protein. 22 1.8 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 19 9.8 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 441 bits (1135), Expect = e-127 Identities = 446/446 (100%), Positives = 446/446 (100%) Query: 1 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL 60 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL Sbjct: 1 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL 60 Query: 61 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT 120 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT Sbjct: 61 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT 120 Query: 121 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA 180 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA Sbjct: 121 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA 180 Query: 181 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL 240 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL Sbjct: 181 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL 240 Query: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA Sbjct: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 Query: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA Sbjct: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 Query: 361 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA 420 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA Sbjct: 361 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA 420 Query: 421 LLRAQLALERKRGRTTLHGPCRPAVS 446 LLRAQLALERKRGRTTLHGPCRPAVS Sbjct: 421 LLRAQLALERKRGRTTLHGPCRPAVS 446 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 180 bits (457), Expect = 2e-48 Identities = 45/184 (24%), Positives = 83/184 (44%), Gaps = 8/184 (4%) Query: 10 QLAAGQATLTEQEQELAA-LRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLL 68 +L +AT E++ + L E Q + A + +AL L+ A++ + Q L Sbjct: 152 KLKEKEATFVSMEKKFQSELLNERDIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQ 211 Query: 69 EQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELC 128 ++ ++ L LQ L + ++L+E L+ + IQ E L E+ Sbjct: 212 KEKRIAEELIVQIEGLQNNLMQMKEDKKKLQEELKEKLDLIQVL----QEKITLLTTEIK 267 Query: 129 QKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEEILM 188 KE L+ K + S+ + + S+ QE Q++ L +KEL++ V +RE E+ + Sbjct: 268 DKEASLQSTTSKLAEKESE---VDKLSSMYQESQDQLMNLTSEIKELKVEVQKRERELEL 324 Query: 189 LREA 192 RE+ Sbjct: 325 KRES 328 Score = 35.5 bits (80), Expect = 1e-04 Identities = 74/390 (18%), Positives = 149/390 (37%), Gaps = 25/390 (6%) Query: 10 QLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLE 69 ++ +A+L +LA +E + + Q L + + + +Q++E+ L Sbjct: 265 EIKDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKRERELEL 324 Query: 70 QAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQ 129 + E L +L D + +++ + + L + Sbjct: 325 KRESEDNLNVRLNSLLVERDESKKELDAIQKEYSE-------FKSISEKKVASDAKLLGE 377 Query: 130 KEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRL----LESLKELQLTVAQREEE 185 +E+ L QL E+ G + + + +E RR+ LE++ +L+L V +E Sbjct: 378 QEKRLHQLEEQLGTASDEVRKNNVLIADLTQEKENLRRMLDAELENISKLKLEVQVTQET 437 Query: 186 ILMLREASSPSHRALPAEKPALQPLPA-----QQELERLQTALRQTEAREIEWREKAQDL 240 + R +S + L + L A Q ELE +T+LR+ E + A+ L Sbjct: 438 LEKSRSDASDIAQQLQQSRHLCSKLEAEVSKLQMELEETRTSLRRNID---ETKRGAELL 494 Query: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 A L ++ + E + + LQ ELV + E + Sbjct: 495 AAELTTTRELLKKTNEEMHTMSHELAAVTENCDNLQTELVDVYKKAERAA-----DELKQ 549 Query: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 ++ V E + +E + + E ++ L + LER +++ E+ + L + Sbjct: 550 EKNIVVTLEKELTFLEAQITREKESRKNLEEELERATESLDEMNRNAFALAKELELANSH 609 Query: 361 LEQVERERRKLKRDSVRASRAGSLEARETM 390 + +E ER L++ SV + S E+RE + Sbjct: 610 ISSLEDEREVLQK-SVSEQKQISQESRENL 638 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 34.4 bits (77), Expect = 3e-04 Identities = 36/173 (20%), Positives = 64/173 (36%), Gaps = 9/173 (5%) Query: 33 QYSRRQEEAAV-SQAEALQEALSKAQAALQEKEQSLLEQAELSHTLEASTAALQATLDTC 91 Q+ R + + A SQA + A ++ Q L E + + S +L + A+Q L Sbjct: 377 QWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLP 436 Query: 92 QARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQLLEKAGARRSQENGI 151 +A ++ + R R A L + + L + + N + Sbjct: 437 EAERDRIYQDERER----SLNAASAMGPAPLLSTPPSRPPQVKTATLATERNGKPENNTM 492 Query: 152 QEKQSLEQERQEETRRLLESLKELQLTVAQREE----EILMLREASSPSHRAL 200 S+ E Q+E +R S A + + E+L +E SP +R L Sbjct: 493 NINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSPENRTL 545 Score = 20.9 bits (42), Expect = 2.8 Identities = 29/161 (18%), Positives = 59/161 (36%), Gaps = 24/161 (14%) Query: 18 LTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHTL 77 L Q+ +A L + R + L +++ + EQ + E+S + Sbjct: 316 LVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKPLEQQVSTNTEVSSEI 375 Query: 78 EASTAALQATLDTCQARARQLEEALRIREGEIQAQ-ALQHHEVTQHLQQELCQKEEELRQ 136 + + + L+ R G QA A TQ L E+ +KEE+ + Sbjct: 376 Y-----------------QWVRDELK-RAGISQAVFARVAFNRTQGLLSEILRKEEDPKT 417 Query: 137 LLEKAGAR-RSQENGIQ----EKQSLEQERQEETRRLLESL 172 + R+ +N +Q E+ + Q+ +E + ++ Sbjct: 418 ASQSLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAM 458 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 29.0 bits (63), Expect = 0.010 Identities = 59/301 (19%), Positives = 132/301 (43%), Gaps = 19/301 (6%) Query: 5 QDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQA--------EALQEALSKA 56 Q G L + L+ Q +E+ A E+ + +E++A A E+L++ + A Sbjct: 77 QHNMGLLLLEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKKREESLKKDVGIA 136 Query: 57 QAALQEKEQSLLEQAELSHTLEASTAALQATLDTCQAR--ARQLEEALRIREGE-IQAQA 113 + + E++L E + S + + + + ++R E +QA+A Sbjct: 137 KECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEA 196 Query: 114 LQHHEVTQHLQQELCQKEEEL-RQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESL 172 ++H + + +E+ +E++L R+L ++EN + ++ ER++ ++ E L Sbjct: 197 NRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERL 256 Query: 173 KELQLTVAQREEEILMLREASSPSHRALPAEKPALQPLPAQQELER--LQTALRQTEARE 230 + Q+++ QRE+ I + + + L K + E ++ L+ AL RE Sbjct: 257 LDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKRE 316 Query: 231 IEWREKAQDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQ 290 L + ++S L + + +V ERES + ++EL+++ + + ++ Sbjct: 317 EVCFYSHNSLLFLVLHYRSSKKFLGD-----KIAVSERESSLLKKEQELLVAEEKIASKE 371 Query: 291 S 291 S Sbjct: 372 S 372 Score = 27.4 bits (59), Expect = 0.028 Identities = 45/240 (18%), Positives = 108/240 (44%), Gaps = 26/240 (10%) Query: 11 LAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQ 70 ++ +++L ++EQEL L AE E+ A ++E +Q L+ + L++++ + + Sbjct: 346 VSERESSLLKKEQEL--LVAE-------EKIASKESELIQNVLANQEVILRKRKSDVEAE 396 Query: 71 AEL-SHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQ 129 E S ++E + + + + +Q E+ + +E +++ Q+ E + + ++ Sbjct: 397 LECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFN 456 Query: 130 KEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEEILML 189 +E+ + L+ + E I K ++ ++ +E R+ L L + Q + Sbjct: 457 LDEKEKNLV-------ATEEDINRKTTMLEDEKERLRK-------LDLELQQSLTSLEDK 502 Query: 190 REASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDLALSLAQSKA 249 R+ + + L A K L + +L+ L A+++E +A L + A+ +A Sbjct: 503 RKRVDSATQKLEALKSETSELSTLE--MKLKEELDDLRAQKLEMLAEADRLKVEKAKFEA 560 Score = 23.6 bits (49), Expect = 0.45 Identities = 31/192 (16%), Positives = 72/192 (37%), Gaps = 30/192 (15%) Query: 12 AAGQATLTEQEQELAALRAENQYSRRQEEAAVS----QAEALQEALSKAQAALQEKEQS- 66 +A + E ++ E R Q + V + E + + + K Q Sbjct: 581 RQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRE 640 Query: 67 --------LLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHE 118 +++ EL + +E L+ + + R + E+ ++ E IQ+ + Sbjct: 641 RADFLLGIEMQKRELEYCIENKREELENS---SRDREKAFEQEKKLEEERIQSLKEMAEK 697 Query: 119 VTQHLQQELCQKEEE--------------LRQLLEKAGARRSQENGIQEKQSLEQERQEE 164 +H+Q EL + + E +L + + Q ++ ++ + + ++E Sbjct: 698 ELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDE 757 Query: 165 TRRLLESLKELQ 176 R +E LK+L+ Sbjct: 758 IRHEIEELKKLE 769 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 24.4 bits (51), Expect = 0.24 Identities = 24/86 (27%), Positives = 34/86 (38%), Gaps = 13/86 (15%) Query: 238 QDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHST 297 +D A SLA+ S S + S E E + ++L+L LE P Sbjct: 62 EDEADSLAKEYGSPSVI---------SSYEDEPPNRGEIDQLILMEPVLENN----PERR 108 Query: 298 SRADQGPKVGQESQPGEVETEPSPGV 323 GP G+ES+P E +P V Sbjct: 109 FVVVTGPGAGKESEPAPSIVETTPPV 134 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 21.7 bits (44), Expect = 1.8 Identities = 11/36 (30%), Positives = 14/36 (38%) Query: 74 SHTLEASTAALQATLDTCQARARQLEEALRIREGEI 109 S L S Q C +A LEE + + EI Sbjct: 61 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEI 96 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 19.0 bits (37), Expect = 9.8 Identities = 8/37 (21%), Positives = 18/37 (48%) Query: 40 EAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHT 76 + + Q + QE L+ + ++K++ E+ E S Sbjct: 128 KEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q60952|Q60952 Intranuclear protein. 392 e-112 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 212 7e-58 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 150 3e-39 Sequences not found previously or not previously below threshold: trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 28 0.017 swiss|P14733|LAM1_MOUSE Lamin B1. 22 1.8 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 20 3.5 CONVERGED! >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 392 bits (1006), Expect = e-112 Identities = 446/446 (100%), Positives = 446/446 (100%) Query: 1 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL 60 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL Sbjct: 1 MRQKQDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAAL 60 Query: 61 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT 120 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT Sbjct: 61 QEKEQSLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVT 120 Query: 121 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA 180 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA Sbjct: 121 QHLQQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVA 180 Query: 181 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL 240 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL Sbjct: 181 QREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDL 240 Query: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA Sbjct: 241 ALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRA 300 Query: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA Sbjct: 301 DQGPKVGQESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTA 360 Query: 361 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA 420 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA Sbjct: 361 LEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVA 420 Query: 421 LLRAQLALERKRGRTTLHGPCRPAVS 446 LLRAQLALERKRGRTTLHGPCRPAVS Sbjct: 421 LLRAQLALERKRGRTTLHGPCRPAVS 446 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 212 bits (539), Expect = 7e-58 Identities = 36/173 (20%), Positives = 64/173 (36%), Gaps = 9/173 (5%) Query: 33 QYSRRQEEAAV-SQAEALQEALSKAQAALQEKEQSLLEQAELSHTLEASTAALQATLDTC 91 Q+ R + + A SQA + A ++ Q L E + + S +L + A+Q L Sbjct: 377 QWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLP 436 Query: 92 QARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQLLEKAGARRSQENGI 151 +A ++ + R R A L + + L + + N + Sbjct: 437 EAERDRIYQDERER----SLNAASAMGPAPLLSTPPSRPPQVKTATLATERNGKPENNTM 492 Query: 152 QEKQSLEQERQEETRRLLESLKELQLTVAQREE----EILMLREASSPSHRAL 200 S+ E Q+E +R S A + + E+L +E SP +R L Sbjct: 493 NINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSPENRTL 545 Score = 42.8 bits (99), Expect = 8e-07 Identities = 18/90 (20%), Positives = 36/90 (40%), Gaps = 9/90 (10%) Query: 33 QYSRRQEEAA-VSQAEALQEALSKAQAALQEKEQSLLEQAELSHTLEASTAALQATLDTC 91 +++ + A VSQA + A +K+Q L E + + + + TL + + ++ L Sbjct: 500 DEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSPENRTLWENLSMIRRFLSLP 559 Query: 92 QARARQLEEALRIREGEIQAQALQHHEVTQ 121 Q + E + A+ HH Sbjct: 560 QPERDAIYEQE--------SNAVHHHGDRP 581 Score = 37.0 bits (84), Expect = 4e-05 Identities = 20/101 (19%), Positives = 34/101 (32%), Gaps = 6/101 (5%) Query: 124 QQELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQRE 183 QQ L R + + + S + + Q ++E +R S R Sbjct: 344 QQYLNHPPPVSRSMNKPLEQQVSTNTEVS--SEIYQWVRDELKRAGISQAVFARVAFNRT 401 Query: 184 E----EILMLREASSPSHRALPAEKPALQPLPAQQELERLQ 220 + EIL E + ++L A+Q E ER + Sbjct: 402 QGLLSEILRKEEDPKTASQSLLVNLRAMQNFLQLPEAERDR 442 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 150 bits (379), Expect = 3e-39 Identities = 85/430 (19%), Positives = 175/430 (39%), Gaps = 26/430 (6%) Query: 10 QLAAGQATLTEQEQELAA-LRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLL 68 +L +AT E++ + L E Q + A + +AL L+ A++ + Q L Sbjct: 152 KLKEKEATFVSMEKKFQSELLNERDIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQ 211 Query: 69 EQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELC 128 ++ ++ L LQ L + ++L+E L+ + IQ E L E+ Sbjct: 212 KEKRIAEELIVQIEGLQNNLMQMKEDKKKLQEELKEKLDLIQVL----QEKITLLTTEIK 267 Query: 129 QKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEEILM 188 KE L+ K + S+ + + S+ QE Q++ L +KEL++ V +RE E+ + Sbjct: 268 DKEASLQSTTSKLAEKESE---VDKLSSMYQESQDQLMNLTSEIKELKVEVQKRERELEL 324 Query: 189 LREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDLAL------ 242 RE+ + L + ++ +++EL+ +Q +E + I ++ A D L Sbjct: 325 KRESEDNLNVRLNSL--LVERDESKKELDAIQKEY--SEFKSISEKKVASDAKLLGEQEK 380 Query: 243 SLAQSKASISSLQEITMFLQASVLERESEQQRLQEEL-----VLSSQALEEQQSGGPHST 297 L Q + + + + + + E++ L+ L +S LE Q + Sbjct: 381 RLHQLEEQLGTASDEVRKNNVLIADLTQEKENLRRMLDAELENISKLKLEVQVTQETLEK 440 Query: 298 SRADQGPKVGQESQPGEVETEPSPGVEE-KERLTQRLERLQQAVAELEVDRSKLQCHNAQ 356 SR+D Q Q + ++ V + + L + L++ + E + L Sbjct: 441 SRSDASDIAQQLQQSRHLCSKLEAEVSKLQMELEETRTSLRRNIDETKRGAELLAAELTT 500 Query: 357 LRTALEQVERERRKLKRDSVRASRAGSLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQ 416 R L++ E + + + + + +T ++ R + + +VV L+ Sbjct: 501 TRELLKKTNEEMHTMSHELAAVTE--NCDNLQTELVDVYKKAERAADELKQEKNIVVTLE 558 Query: 417 REVALLRAQL 426 +E+ L AQ+ Sbjct: 559 KELTFLEAQI 568 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 28.2 bits (61), Expect = 0.017 Identities = 64/361 (17%), Positives = 151/361 (41%), Gaps = 14/361 (3%) Query: 22 EQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAEL-SHTLEAS 80 E+E + L+ E + +E+ A ++E +Q L+ + L++++ + + E S ++E Sbjct: 348 ERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVE 407 Query: 81 TAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQLLEK 140 + + + + +Q E+ + +E +++ Q+ E + + ++ +E+ + L+ Sbjct: 408 IESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVAT 467 Query: 141 AGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEEILMLREASSPSHRAL 200 + +++++ ++ E ++ L SL++ + V +++ L+ +S Sbjct: 468 EEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELSTLE 527 Query: 201 PAEKPALQPLPAQQ-----ELERLQTALRQTEAREIEWREKAQDL---ALSLAQSKASIS 252 K L L AQ+ E +RL+ + EA K ++L A + + + + S Sbjct: 528 MKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITRQREAFS 587 Query: 253 S-LQEITMFLQASVLERESEQQR----LQEELVLSSQALEEQQSGGPHSTSRADQGPKVG 307 L++ ++ ++ + L E + E+ S R +G Sbjct: 588 MYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLG 647 Query: 308 QESQPGEVETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTALEQVERE 367 E Q E+E EE E ++ E+ + +LE +R + A+ QVE + Sbjct: 648 IEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVELK 707 Query: 368 R 368 R Sbjct: 708 R 708 Score = 22.8 bits (47), Expect = 0.73 Identities = 87/464 (18%), Positives = 179/464 (37%), Gaps = 43/464 (9%) Query: 5 QDPAGQLAAGQATLTEQEQELAALRAENQYSRRQEEAAVSQA--------EALQEALSKA 56 Q G L + L+ Q +E+ A E+ + +E++A A E+L++ + A Sbjct: 77 QHNMGLLLLEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKKREESLKKDVGIA 136 Query: 57 QAALQEKEQSLLEQAELSHTLEASTAAL--QATLDTCQARARQLEEALRIREGE-IQAQA 113 + + E++L E + S + +A + A + + ++R E +QA+A Sbjct: 137 KECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKMRAAEALQAEA 196 Query: 114 LQHHEVTQHLQQELCQKEEELRQLLE--------KAGARRSQENGIQEKQSLEQERQEET 165 ++H + + +E+ +E++L + L K + + E++ Q+ E Sbjct: 197 NRYHRIAERKLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERL 256 Query: 166 RRLLESLKELQLTVAQREEEILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQ 225 SL + + + R +E+ L + + E+ A + + E+ A R+ Sbjct: 257 LDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKRE 316 Query: 226 TEA---------REIEWREKAQDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQ 276 + +R + L +A S+ S L++ L A E + +Q Sbjct: 317 EVCFYSHNSLLFLVLHYRSSKKFLGDKIAVSERESSLLKKEQELLVAEEKIASKESELIQ 376 Query: 277 EELVLSSQALEEQQSGGPHSTSRADQGPKVGQESQPGEVETEPSPGVEEKERL---TQRL 333 L L +++S + +V ES+ E + ++ + L Sbjct: 377 NVLANQEVILRKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDL 436 Query: 334 ERLQQAVAELEVDRSK-----------LQCHNAQLRTALEQVERERRKLKRDSVRASRAG 382 E +A+AE E D ++ L + +E E+ +L++ + ++ Sbjct: 437 EVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSL 496 Query: 383 -SLEARETMTSSPTQQDGRGSQRGSSDSVLVVELQREVALLRAQ 425 SLE + S TQ+ S S L ++L+ E+ LRAQ Sbjct: 497 TSLEDKRKRVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQ 540 Score = 22.0 bits (45), Expect = 1.3 Identities = 61/378 (16%), Positives = 137/378 (36%), Gaps = 7/378 (1%) Query: 10 QLAAGQATLTEQEQELAALRAENQYSRRQE----EAAVSQAEALQEALSKAQAALQEKEQ 65 +L ++ + + LA+ ++E + + + + ++LQ+ + A Q Sbjct: 206 KLKEVESREDDLTRRLASFKSECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQ 265 Query: 66 SLLEQAELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQ 125 S L A T T R+ E + A + EV + Sbjct: 266 REDHIFARSQEL-AELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEVCFYSHN 324 Query: 126 ELCQKEEELRQLLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREEE 185 L R + G + + EQE ++ EL V +E Sbjct: 325 SLLFLVLHYRSSKKFLGDKIAVSERESSLLKKEQELLVAEEKIASKESELIQNVLANQEV 384 Query: 186 ILMLREASSPSHRALPAEKPALQPLPAQQELERLQTALRQTEAREIEWREKAQDLALSLA 245 IL R++ + ++ ++ ++ E + ++Q E E + + +LA Sbjct: 385 ILRKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALA 444 Query: 246 QSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHS-TSRADQGP 304 + + I+ + +++ E + R L + L + S TS D+ Sbjct: 445 EKEKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRK 504 Query: 305 KVGQESQPGE-VETEPSPGVEEKERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTALEQ 363 +V +Q E +++E S + +L + L+ L+ E+ + +L+ A+ E Sbjct: 505 RVDSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEH 564 Query: 364 VERERRKLKRDSVRASRA 381 ++ +R +L++++ +R Sbjct: 565 IDVKREELRKEAEYITRQ 582 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 21.6 bits (44), Expect = 1.8 Identities = 34/185 (18%), Positives = 66/185 (35%), Gaps = 5/185 (2%) Query: 211 PAQQELERLQ-----TALRQTEAREIEWREKAQDLALSLAQSKASISSLQEITMFLQASV 265 P QQ+ + T L T ++ +E+ ++L LA + SL+ LQ V Sbjct: 5 PVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSALQLQV 64 Query: 266 LERESEQQRLQEELVLSSQALEEQQSGGPHSTSRADQGPKVGQESQPGEVETEPSPGVEE 325 ERE + R L + T+R ++ E + ++ Sbjct: 65 TEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKK 124 Query: 326 KERLTQRLERLQQAVAELEVDRSKLQCHNAQLRTALEQVERERRKLKRDSVRASRAGSLE 385 + L+ +L++ A L + L ++ +E + ++ + S A Sbjct: 125 ESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQL 184 Query: 386 ARETM 390 A ET+ Sbjct: 185 ADETL 189 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 20.5 bits (41), Expect = 3.5 Identities = 23/99 (23%), Positives = 31/99 (31%), Gaps = 10/99 (10%) Query: 245 AQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHSTSRADQGP 304 +Q+ SI I A R S++ Q E SS A + + P Q Sbjct: 89 SQTTQSIGRTVSI-----AEATRRASQESPRQTETAGSSSAPDALSTQQPPLEHSTQQNI 143 Query: 305 KVGQESQPGEVE-----TEPSPGVEEKERLTQRLERLQQ 338 S VE S E +ER + E Q Sbjct: 144 PTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEASTQ 182 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.294 0.129 0.309 Lambda K H 0.267 0.0395 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 45,508 Number of Sequences: 43 Number of extensions: 2471 Number of successful extensions: 239 Number of sequences better than 10.0: 38 Number of HSP's better than 10.0 without gapping: 38 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 88 length of query: 446 length of database: 17,163 effective HSP length: 58 effective length of query: 388 effective length of database: 14,669 effective search space: 5691572 effective search space used: 5691572 T: 11 A: 40 X1: 16 ( 6.8 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 38 (19.1 bits) S2: 38 (19.3 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. (563 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 842 0.0 swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 30 0.007 trembl|Q60952|Q60952 Intranuclear protein. 25 0.19 swiss|P25233|NECD_MOUSE Necdin. 22 2.0 swiss|P30735|VE6_MNPV E6 protein. 20 7.9 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 842 bits (2176), Expect = 0.0 Identities = 563/563 (100%), Positives = 563/563 (100%) Query: 1 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS 60 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS Sbjct: 1 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS 60 Query: 61 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS 120 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS Sbjct: 61 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS 120 Query: 121 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK 180 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK Sbjct: 121 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK 180 Query: 181 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP 240 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP Sbjct: 181 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP 240 Query: 241 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK 300 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK Sbjct: 241 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK 300 Query: 301 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC 360 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC Sbjct: 301 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC 360 Query: 361 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC 420 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC Sbjct: 361 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC 420 Query: 421 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ 480 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ Sbjct: 421 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ 480 Query: 481 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA 540 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA Sbjct: 481 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA 540 Query: 541 EHHKDICLTVTTSTIQVEHLASS 563 EHHKDICLTVTTSTIQVEHLASS Sbjct: 541 EHHKDICLTVTTSTIQVEHLASS 563 >swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). Length = 212 Score = 30.0 bits (66), Expect = 0.007 Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 6/65 (9%) Query: 493 QQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPAEHHKDICLTVTT 552 ++++ P PP PP+ S + G++ P P+ FDLT P + HK C +VT Sbjct: 72 EERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPE--FDLT--DPDDFHK--CFSVTY 125 Query: 553 STIQV 557 S + + Sbjct: 126 SALSL 130 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 25.4 bits (54), Expect = 0.19 Identities = 17/53 (32%), Positives = 27/53 (50%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQ 497 L +Q Q AA A Q + + A +QA+A +A ++AQA + Q +Q Sbjct: 18 LTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQ 70 Score = 23.5 bits (49), Expect = 0.59 Identities = 16/52 (30%), Positives = 27/52 (51%) Query: 446 QQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQ 497 Q+Q AA A + + A +QA+A +A ++AQA Q + S +Q + Sbjct: 21 QEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAE 72 Score = 22.7 bits (47), Expect = 0.99 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 444 DLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQA-QASQASQQQQQQ 497 +L ++A+ AA QA+A+ +A + + QAQA Q + +Q QQ+ Sbjct: 72 ELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQE 126 Score = 22.3 bits (46), Expect = 1.5 Identities = 15/54 (27%), Positives = 25/54 (45%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQQ 498 L+ A A QA+A+ +A + + QAQA + +Q QQ+ Q+ Sbjct: 77 LEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQK 130 Score = 21.6 bits (44), Expect = 2.6 Identities = 13/47 (27%), Positives = 26/47 (54%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQAS 491 L+ + Q + A +QA+A +A ++AQA Q + Q+ + ++ S Sbjct: 28 LRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELS 74 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 21.9 bits (45), Expect = 2.0 Identities = 10/38 (26%), Positives = 14/38 (36%) Query: 499 PPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTP 536 PP + P AP+G PP + D+ P Sbjct: 34 PPASLAANLAGPPCAPEGPMAAQQASPPPEERIEDVDP 71 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 20.0 bits (40), Expect = 7.9 Identities = 7/11 (63%), Positives = 7/11 (63%) Query: 420 CTICSRAYTSE 430 CT CSR T E Sbjct: 95 CTFCSRFLTQE 105 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 928 0.0 Sequences not found previously or not previously below threshold: swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 33 0.001 trembl|Q60952|Q60952 Intranuclear protein. 27 0.039 swiss|P25233|NECD_MOUSE Necdin. 23 0.72 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 928 bits (2397), Expect = 0.0 Identities = 563/563 (100%), Positives = 563/563 (100%) Query: 1 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS 60 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS Sbjct: 1 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS 60 Query: 61 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS 120 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS Sbjct: 61 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS 120 Query: 121 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK 180 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK Sbjct: 121 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK 180 Query: 181 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP 240 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP Sbjct: 181 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP 240 Query: 241 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK 300 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK Sbjct: 241 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK 300 Query: 301 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC 360 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC Sbjct: 301 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC 360 Query: 361 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC 420 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC Sbjct: 361 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC 420 Query: 421 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ 480 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ Sbjct: 421 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ 480 Query: 481 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA 540 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA Sbjct: 481 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA 540 Query: 541 EHHKDICLTVTTSTIQVEHLASS 563 EHHKDICLTVTTSTIQVEHLASS Sbjct: 541 EHHKDICLTVTTSTIQVEHLASS 563 >swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). Length = 212 Score = 32.8 bits (73), Expect = 0.001 Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 6/65 (9%) Query: 493 QQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPAEHHKDICLTVTT 552 ++++ P PP PP+ S + G++ P P+ FDLT P + HK C +VT Sbjct: 72 EERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPE--FDLT--DPDDFHK--CFSVTY 125 Query: 553 STIQV 557 S + + Sbjct: 126 SALSL 130 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 27.4 bits (59), Expect = 0.039 Identities = 17/53 (32%), Positives = 27/53 (50%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQ 497 L +Q Q AA A Q + + A +QA+A +A ++AQA + Q +Q Sbjct: 18 LTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQ 70 Score = 25.5 bits (54), Expect = 0.14 Identities = 16/52 (30%), Positives = 27/52 (51%) Query: 446 QQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQ 497 Q+Q AA A + + A +QA+A +A ++AQA Q + S +Q + Sbjct: 21 QEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAE 72 Score = 25.5 bits (54), Expect = 0.17 Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 444 DLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQA-QASQASQQQQQQ 497 +L ++A+ AA QA+A+ +A + + QAQA Q + +Q QQ+ Sbjct: 72 ELSHTLEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQE 126 Score = 24.3 bits (51), Expect = 0.40 Identities = 15/54 (27%), Positives = 25/54 (45%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQQ 498 L+ A A QA+A+ +A + + QAQA + +Q QQ+ Q+ Sbjct: 77 LEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQK 130 Score = 23.2 bits (48), Expect = 0.73 Identities = 13/47 (27%), Positives = 26/47 (54%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQAS 491 L+ + Q + A +QA+A +A ++AQA Q + Q+ + ++ S Sbjct: 28 LRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELS 74 Score = 19.7 bits (39), Expect = 7.6 Identities = 11/39 (28%), Positives = 21/39 (53%) Query: 460 QAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQQ 498 +AQ A + AQ++A + + QAS ++ +QQ+ Sbjct: 236 KAQDLALSLAQSKASISSLQEITMFLQASVLERESEQQR 274 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 23.2 bits (48), Expect = 0.72 Identities = 10/38 (26%), Positives = 14/38 (36%) Query: 499 PPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTP 536 PP + P AP+G PP + D+ P Sbjct: 34 PPASLAANLAGPPCAPEGPMAAQQASPPPEERIEDVDP 71 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 911 0.0 swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 106 1e-25 Sequences not found previously or not previously below threshold: trembl|Q60952|Q60952 Intranuclear protein. 27 0.046 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 20 7.2 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 20 9.6 CONVERGED! >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 911 bits (2354), Expect = 0.0 Identities = 563/563 (100%), Positives = 563/563 (100%) Query: 1 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS 60 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS Sbjct: 1 MEESHFNSNPYFWPSIPTVSGQIENTMFINKMKDQLLPEKGCGLAPPHYPTLLTVPASVS 60 Query: 61 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS 120 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS Sbjct: 61 LSSGISMDTESKSEQLTPHSQASVTQNITVVPVPSTGLMTAGPGLVITSPSGSLVTTASS 120 Query: 121 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK 180 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK Sbjct: 121 AQTFPISTPMIVSALPPGSQALQVVPDLSKKVASTLTEEGGGGGGGGGTVAPPKPPRGRK 180 Query: 181 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP 240 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP Sbjct: 181 KKRMLESGLPEMNDPYVLAPGDDDDHQKDGKTYRCRMCSLTFYSKSEMQIHSKSHTETKP 240 Query: 241 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK 300 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK Sbjct: 241 HKCPHCSKTFANSSYLAQHIRIHSGAKPYSCNFCEKSFRQLSHLQQHTRIHSKMHTETIK 300 Query: 301 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC 360 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC Sbjct: 301 PHKCPHCSKTFANTSYLAQHLRIHSGAKPYNCSYCQKAFRQLSHLQQHTRIHTGDRPYKC 360 Query: 361 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC 420 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC Sbjct: 361 AHPGCEKAFTQLSNLQSHRRQHNKDKPFKCHNCHRAYTDAASLEAHLSTHTVKHAKVYTC 420 Query: 421 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ 480 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ Sbjct: 421 TICSRAYTSETYLMKHMRKHNPPDLQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQ 480 Query: 481 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA 540 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA Sbjct: 481 AQAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPA 540 Query: 541 EHHKDICLTVTTSTIQVEHLASS 563 EHHKDICLTVTTSTIQVEHLASS Sbjct: 541 EHHKDICLTVTTSTIQVEHLASS 563 >swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). Length = 212 Score = 106 bits (263), Expect = 1e-25 Identities = 20/65 (30%), Positives = 33/65 (50%), Gaps = 6/65 (9%) Query: 493 QQQQQQPPPPQPPHFQSPGAAPQGGGGGDSNQNPPPQCSFDLTPYKPAEHHKDICLTVTT 552 ++++ P PP PP+ S + G++ P P+ FDLT P + HK C +VT Sbjct: 72 EERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPE--FDLT--DPDDFHK--CFSVTY 125 Query: 553 STIQV 557 S + + Sbjct: 126 SALSL 130 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 27.4 bits (59), Expect = 0.046 Identities = 17/53 (32%), Positives = 27/53 (50%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQQQQ 497 L +Q Q AA A Q + + A +QA+A +A ++AQA + Q +Q Sbjct: 18 LTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQ 70 Score = 24.0 bits (50), Expect = 0.49 Identities = 14/50 (28%), Positives = 23/50 (46%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQASQQQ 494 L+ A A QA+A+ +A + + QAQA + +Q QQ+ Sbjct: 77 LEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQE 126 Score = 23.2 bits (48), Expect = 0.82 Identities = 13/47 (27%), Positives = 26/47 (54%) Query: 445 LQQQVQAAAAAAAVAQAQAQAQAQAQAQAQAQAQAQAQAQAQASQAS 491 L+ + Q + A +QA+A +A ++AQA Q + Q+ + ++ S Sbjct: 28 LRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELS 74 Score = 21.6 bits (44), Expect = 2.2 Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 2/49 (4%) Query: 448 QVQAAAAAAAVAQAQAQAQAQAQAQ--AQAQAQAQAQAQAQASQASQQQ 494 + AA A + QA+A+ +A + + QAQA + +Q QQ+ Sbjct: 78 EASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQE 126 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 20.1 bits (40), Expect = 7.2 Identities = 4/40 (10%), Positives = 18/40 (45%) Query: 482 QAQAQASQASQQQQQQQPPPPQPPHFQSPGAAPQGGGGGD 521 + ++++ + +++++ P P+P + + D Sbjct: 204 RTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATSTSD 243 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 19.7 bits (39), Expect = 9.6 Identities = 9/30 (30%), Positives = 15/30 (50%) Query: 172 PPKPPRGRKKKRMLESGLPEMNDPYVLAPG 201 P K P G+K++ E ++P +AP Sbjct: 143 PAKKPAGKKQESEEEDDEESEDEPMEVAPA 172 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.311 0.126 0.380 Lambda K H 0.267 0.0387 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 67,026 Number of Sequences: 43 Number of extensions: 3579 Number of successful extensions: 511 Number of sequences better than 10.0: 28 Number of HSP's better than 10.0 without gapping: 28 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 110 length of query: 563 length of database: 17,163 effective HSP length: 60 effective length of query: 503 effective length of database: 14,583 effective search space: 7335249 effective search space used: 7335249 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (20.5 bits) S2: 39 (19.7 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. (504 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 1014 0.0 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 30 0.006 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 25 0.25 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 8.9 swiss|P14733|LAM1_MOUSE Lamin B1. 19 9.8 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 1014 bits (2623), Expect = 0.0 Identities = 504/504 (100%), Positives = 504/504 (100%) Query: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP Sbjct: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 Query: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL Sbjct: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 Query: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA Sbjct: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 Query: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG 240 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG Sbjct: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG 240 Query: 241 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ 300 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ Sbjct: 241 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ 300 Query: 301 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP 360 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP Sbjct: 301 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP 360 Query: 361 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD 420 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD Sbjct: 361 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD 420 Query: 421 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG 480 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG Sbjct: 421 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG 480 Query: 481 VAFGHHAKFIASTGMDRSLKFYSL 504 VAFGHHAKFIASTGMDRSLKFYSL Sbjct: 481 VAFGHHAKFIASTGMDRSLKFYSL 504 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 30.0 bits (66), Expect = 0.006 Identities = 26/106 (24%), Positives = 45/106 (41%), Gaps = 4/106 (3%) Query: 125 TAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLT 184 ++A +AL+T +P Q +P+ PS A E + + E ++ + T Sbjct: 122 SSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAST 181 Query: 185 TERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLC 230 E +T E L K E + YR + S + ++P I L +C Sbjct: 182 QE----AETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMC 223 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 24.6 bits (52), Expect = 0.25 Identities = 11/20 (55%), Positives = 16/20 (80%) Query: 171 EIIQKLQDKATVLTTERKKR 190 ++IQ LQ+K T+LTTE K + Sbjct: 250 DLIQVLQEKITLLTTEIKDK 269 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 19.6 bits (39), Expect = 8.9 Identities = 9/24 (37%), Positives = 12/24 (49%) Query: 183 LTTERKKRGKTVPEELVKPEELSK 206 L RK+RG T +E +P K Sbjct: 901 LEAGRKRRGNTSGDETSEPSNNKK 924 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 19.2 bits (38), Expect = 9.8 Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 40 DPINNQPLSEEQLIDIKVA 58 D + Q EQL+D+K+A Sbjct: 351 DQMQQQLSDYEQLLDVKLA 369 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 1069 0.0 Sequences not found previously or not previously below threshold: swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 32 0.001 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 26 0.12 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 5.6 swiss|P14733|LAM1_MOUSE Lamin B1. 20 6.2 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 1069 bits (2764), Expect = 0.0 Identities = 504/504 (100%), Positives = 504/504 (100%) Query: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP Sbjct: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 Query: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL Sbjct: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 Query: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA Sbjct: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 Query: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG 240 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG Sbjct: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG 240 Query: 241 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ 300 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ Sbjct: 241 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ 300 Query: 301 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP 360 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP Sbjct: 301 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP 360 Query: 361 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD 420 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD Sbjct: 361 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD 420 Query: 421 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG 480 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG Sbjct: 421 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG 480 Query: 481 VAFGHHAKFIASTGMDRSLKFYSL 504 VAFGHHAKFIASTGMDRSLKFYSL Sbjct: 481 VAFGHHAKFIASTGMDRSLKFYSL 504 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 32.4 bits (72), Expect = 0.001 Identities = 26/106 (24%), Positives = 45/106 (41%), Gaps = 4/106 (3%) Query: 125 TAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLT 184 ++A +AL+T +P Q +P+ PS A E + + E ++ + T Sbjct: 122 SSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAST 181 Query: 185 TERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLC 230 E +T E L K E + YR + S + ++P I L +C Sbjct: 182 QE----AETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMC 223 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 25.8 bits (55), Expect = 0.12 Identities = 11/20 (55%), Positives = 16/20 (80%) Query: 171 EIIQKLQDKATVLTTERKKR 190 ++IQ LQ+K T+LTTE K + Sbjct: 250 DLIQVLQEKITLLTTEIKDK 269 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 20.0 bits (40), Expect = 5.6 Identities = 9/24 (37%), Positives = 12/24 (49%) Query: 183 LTTERKKRGKTVPEELVKPEELSK 206 L RK+RG T +E +P K Sbjct: 901 LEAGRKRRGNTSGDETSEPSNNKK 924 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 20.0 bits (40), Expect = 6.2 Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 40 DPINNQPLSEEQLIDIKVA 58 D + Q EQL+D+K+A Sbjct: 351 DQMQQQLSDYEQLLDVKLA 369 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 1035 0.0 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 174 1e-46 Sequences not found previously or not previously below threshold: trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 23 0.68 swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). 23 0.77 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 22 1.4 swiss|P14733|LAM1_MOUSE Lamin B1. 20 6.2 CONVERGED! >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 1035 bits (2676), Expect = 0.0 Identities = 504/504 (100%), Positives = 504/504 (100%) Query: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP Sbjct: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 Query: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL Sbjct: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 Query: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA Sbjct: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 Query: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG 240 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG Sbjct: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTG 240 Query: 241 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ 300 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ Sbjct: 241 GADKNVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIWSVPNASCVQ 300 Query: 301 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP 360 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP Sbjct: 301 VVRAHESAVTGLSLHATGDYLLSSSDDQYWAFSDIQTGRVLTKVTDETSGCSLTCAQFHP 360 Query: 361 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD 420 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD Sbjct: 361 DGLIFGTGTMDSQIKIWDLKERTNVANFPGHSGPITSIAFSENGYYLATAADDSSVKLWD 420 Query: 421 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG 480 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG Sbjct: 421 LRKLKNFKTLQLDNNFEVKSLIFDQSGTYLALGGTDVQIYICKQWTEILHFTEHSGLTTG 480 Query: 481 VAFGHHAKFIASTGMDRSLKFYSL 504 VAFGHHAKFIASTGMDRSLKFYSL Sbjct: 481 VAFGHHAKFIASTGMDRSLKFYSL 504 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 174 bits (442), Expect = 1e-46 Identities = 26/106 (24%), Positives = 45/106 (41%), Gaps = 4/106 (3%) Query: 125 TAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLT 184 ++A +AL+T +P Q +P+ PS A E + + E ++ + T Sbjct: 122 SSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAST 181 Query: 185 TERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLC 230 E +T E L K E + YR + S + ++P I L +C Sbjct: 182 QE----AETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMC 223 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 23.1 bits (48), Expect = 0.68 Identities = 14/48 (29%), Positives = 20/48 (41%), Gaps = 2/48 (4%) Query: 173 IQKLQDKATVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSAS 220 K +D T E++ K +E + +E SK QV S H S Sbjct: 487 RSKERDHETTKEKEKQLDSKGKDQERSRSKENSK--QVESKSNEHDHS 532 >swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). Length = 178 Score = 23.1 bits (48), Expect = 0.77 Identities = 8/30 (26%), Positives = 16/30 (52%) Query: 187 RKKRGKTVPEELVKPEELSKYRQVASHVGL 216 R K+ KT+P+ L+ + + + +H L Sbjct: 5 RTKKLKTIPDTLLIGSIFTIFENLEAHAYL 34 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 22.0 bits (45), Expect = 1.4 Identities = 12/30 (40%), Positives = 19/30 (63%) Query: 171 EIIQKLQDKATVLTTERKKRGKTVPEELVK 200 ++IQ LQ+K T+LTTE K + ++ K Sbjct: 250 DLIQVLQEKITLLTTEIKDKEASLQSTTSK 279 Score = 21.6 bits (44), Expect = 2.2 Identities = 19/95 (20%), Positives = 40/95 (42%), Gaps = 2/95 (2%) Query: 93 RQQLQTTRQELSHALYQHDAACRVIARLTKEVTAAREALATLKPQAGLIVPQAVPSSQPS 152 ++ + +EL A D R L KE+ A +++L+ + ++ ++V + Sbjct: 572 KESRKNLEEELERATESLDEMNRNAFALAKELELANSHISSLEDERE-VLQKSVSEQKQI 630 Query: 153 VVGAGEPM-DLGELVGMTPEIIQKLQDKATVLTTE 186 + E + D LV + + L+ +A L E Sbjct: 631 SQESRENLEDAHSLVMKLGKERESLEKRAKKLEDE 665 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 20.1 bits (40), Expect = 6.2 Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 40 DPINNQPLSEEQLIDIKVA 58 D + Q EQL+D+K+A Sbjct: 351 DQMQQQLSDYEQLLDVKLA 369 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.311 0.131 0.375 Lambda K H 0.267 0.0400 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 51,812 Number of Sequences: 43 Number of extensions: 2095 Number of successful extensions: 48 Number of sequences better than 10.0: 17 Number of HSP's better than 10.0 without gapping: 17 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 21 length of query: 504 length of database: 17,163 effective HSP length: 59 effective length of query: 445 effective length of database: 14,626 effective search space: 6508570 effective search space used: 6508570 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (20.5 bits) S2: 39 (19.7 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q9PSQ1|Q9PSQ1 M57K=57 kDa NUCLEAR MATRIX PROTEIN (FRAGMENTS). (63 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q9PSQ1|Q9PSQ1 M57K=57 kDa NUCLEAR MATRIX PROTEIN (FRAGME... 114 2e-29 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 20 0.34 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 17 2.6 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 17 3.4 swiss|P33418|LOS1_YEAST LOS1 protein. 16 4.7 swiss|O08579|EMD_MOUSE Emerin. 16 6.4 >trembl|Q9PSQ1|Q9PSQ1 M57K=57 kDa NUCLEAR MATRIX PROTEIN (FRAGMENTS). Length = 63 Score = 114 bits (285), Expect = 2e-29 Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 SDVVELSDADFESGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATANDVPSPYEVKGFPT 60 SDVVELSDADFESGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATANDVPSPYEVKGFPT Sbjct: 1 SDVVELSDADFESGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATANDVPSPYEVKGFPT 60 Query: 61 IYF 63 IYF Sbjct: 61 IYF 63 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 20.0 bits (40), Expect = 0.34 Identities = 13/46 (28%), Positives = 19/46 (41%), Gaps = 6/46 (13%) Query: 13 SGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATAN------DVPSP 52 + L RP F + +TF+ L+D A D+PSP Sbjct: 284 TSLCARPRKTGHALFLLGGQTFMCDKLYLVDQKAKEIIPKADIPSP 329 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 17.3 bits (33), Expect = 2.6 Identities = 6/13 (46%), Positives = 9/13 (69%) Query: 7 SDADFESGLAERP 19 SD D+ G ++RP Sbjct: 470 SDEDYVEGPSKRP 482 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 16.5 bits (31), Expect = 3.4 Identities = 6/24 (25%), Positives = 14/24 (58%) Query: 40 KLMDATANDVPSPYEVKGFPTIYF 63 ++ + + N+ S E +G P ++F Sbjct: 48 RVSEGSENESGSGEEYEGAPHVHF 71 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 16.2 bits (30), Expect = 4.7 Identities = 6/10 (60%), Positives = 8/10 (80%) Query: 25 EFFAMIAKTF 34 EF ++ AKTF Sbjct: 380 EFDSITAKTF 389 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 15.8 bits (29), Expect = 6.4 Identities = 6/15 (40%), Positives = 10/15 (66%) Query: 30 IAKTFLDAGHKLMDA 44 ++K+F G L+DA Sbjct: 114 MSKSFRQPGTSLVDA 128 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9PSQ1|Q9PSQ1 M57K=57 kDa NUCLEAR MATRIX PROTEIN (FRAGME... 123 2e-32 Sequences not found previously or not previously below threshold: swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 20 0.35 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 18 1.3 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 17 2.2 swiss|O08579|EMD_MOUSE Emerin. 17 3.7 swiss|P33418|LOS1_YEAST LOS1 protein. 16 4.7 CONVERGED! >trembl|Q9PSQ1|Q9PSQ1 M57K=57 kDa NUCLEAR MATRIX PROTEIN (FRAGMENTS). Length = 63 Score = 123 bits (309), Expect = 2e-32 Identities = 63/63 (100%), Positives = 63/63 (100%) Query: 1 SDVVELSDADFESGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATANDVPSPYEVKGFPT 60 SDVVELSDADFESGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATANDVPSPYEVKGFPT Sbjct: 1 SDVVELSDADFESGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATANDVPSPYEVKGFPT 60 Query: 61 IYF 63 IYF Sbjct: 61 IYF 63 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 20.0 bits (40), Expect = 0.35 Identities = 13/46 (28%), Positives = 19/46 (41%), Gaps = 6/46 (13%) Query: 13 SGLAERPGLVLEEFFAMIAKTFLDAGHKLMDATAN------DVPSP 52 + L RP F + +TF+ L+D A D+PSP Sbjct: 284 TSLCARPRKTGHALFLLGGQTFMCDKLYLVDQKAKEIIPKADIPSP 329 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 18.0 bits (35), Expect = 1.3 Identities = 6/13 (46%), Positives = 9/13 (69%) Query: 7 SDADFESGLAERP 19 SD D+ G ++RP Sbjct: 470 SDEDYVEGPSKRP 482 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 17.3 bits (33), Expect = 2.2 Identities = 6/24 (25%), Positives = 14/24 (58%) Query: 40 KLMDATANDVPSPYEVKGFPTIYF 63 ++ + + N+ S E +G P ++F Sbjct: 48 RVSEGSENESGSGEEYEGAPHVHF 71 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 16.5 bits (31), Expect = 3.7 Identities = 6/15 (40%), Positives = 10/15 (66%) Query: 30 IAKTFLDAGHKLMDA 44 ++K+F G L+DA Sbjct: 114 MSKSFRQPGTSLVDA 128 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 16.1 bits (30), Expect = 4.7 Identities = 6/10 (60%), Positives = 8/10 (80%) Query: 25 EFFAMIAKTF 34 EF ++ AKTF Sbjct: 380 EFDSITAKTF 389 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.319 0.139 0.396 Lambda K H 0.267 0.0424 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 4071 Number of Sequences: 43 Number of extensions: 117 Number of successful extensions: 14 Number of sequences better than 10.0: 14 Number of HSP's better than 10.0 without gapping: 14 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 14 length of query: 63 length of database: 17,163 effective HSP length: 39 effective length of query: 24 effective length of database: 15,486 effective search space: 371664 effective search space used: 371664 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 28 (15.7 bits) S2: 28 (15.3 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. (116 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. 207 4e-57 swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 18 3.4 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 18 4.2 >swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. Length = 116 Score = 207 bits (527), Expect = 4e-57 Identities = 116/116 (100%), Positives = 116/116 (100%) Query: 1 MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFS 60 MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFS Sbjct: 1 MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFS 60 Query: 61 LYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVHLIDLHFEVLDNLLE 116 LYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVHLIDLHFEVLDNLLE Sbjct: 61 LYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVHLIDLHFEVLDNLLE 116 >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 18.1 bits (35), Expect = 3.4 Identities = 5/13 (38%), Positives = 10/13 (76%) Query: 24 IRDLLRDILRRWR 36 + +LL ++LR W+ Sbjct: 23 VANLLENLLRTWK 35 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 18.1 bits (35), Expect = 4.2 Identities = 9/27 (33%), Positives = 16/27 (58%) Query: 12 ALEQIFTMAGLNIRDLLRDILRRWRDE 38 ++E+ F LN RD+ + L+R +E Sbjct: 162 SMEKKFQSELLNERDIRNNQLKRAGEE 188 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. 218 2e-60 Sequences not found previously or not previously below threshold: trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 19 2.2 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 18 3.4 swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 18 3.8 CONVERGED! >swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. Length = 116 Score = 218 bits (555), Expect = 2e-60 Identities = 116/116 (100%), Positives = 116/116 (100%) Query: 1 MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFS 60 MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFS Sbjct: 1 MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFS 60 Query: 61 LYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVHLIDLHFEVLDNLLE 116 LYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVHLIDLHFEVLDNLLE Sbjct: 61 LYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVHLIDLHFEVLDNLLE 116 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 18.7 bits (37), Expect = 2.2 Identities = 8/42 (19%), Positives = 22/42 (52%), Gaps = 1/42 (2%) Query: 24 IRDLLRDILRRWRDENYLGMVEGAGMFIEEIHPEGFSLYVHL 65 +R + ++ ++ D+ +V G +F++E+H + +L Sbjct: 275 LRGEINKVVNKYIDQGIAELVPGV-LFVDEVHMLDIECFTYL 315 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 18.3 bits (36), Expect = 3.4 Identities = 9/27 (33%), Positives = 16/27 (58%) Query: 12 ALEQIFTMAGLNIRDLLRDILRRWRDE 38 ++E+ F LN RD+ + L+R +E Sbjct: 162 SMEKKFQSELLNERDIRNNQLKRAGEE 188 >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 18.0 bits (35), Expect = 3.8 Identities = 5/13 (38%), Positives = 10/13 (76%) Query: 24 IRDLLRDILRRWR 36 + +LL ++LR W+ Sbjct: 23 VANLLENLLRTWK 35 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.329 0.146 0.438 Lambda K H 0.267 0.0448 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 8382 Number of Sequences: 43 Number of extensions: 313 Number of successful extensions: 7 Number of sequences better than 10.0: 7 Number of HSP's better than 10.0 without gapping: 7 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 7 length of query: 116 length of database: 17,163 effective HSP length: 46 effective length of query: 70 effective length of database: 15,185 effective search space: 1062950 effective search space used: 1062950 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 32 (18.0 bits) S2: 32 (16.8 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu (589 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 1231 0.0 swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). 23 0.60 swiss|P33418|LOS1_YEAST LOS1 protein. 21 3.2 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 21 4.7 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 1231 bits (3185), Expect = 0.0 Identities = 589/589 (100%), Positives = 589/589 (100%) Query: 1 MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRLFTDVLLHAGNRTFPCH 60 MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRLFTDVLLHAGNRTFPCH Sbjct: 1 MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRLFTDVLLHAGNRTFPCH 60 Query: 61 RAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLLDYAYSSRVIINEENAESL 120 RAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLLDYAYSSRVIINEENAESL Sbjct: 61 RAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLLDYAYSSRVIINEENAESL 120 Query: 121 LEAGDMLEFQDIRDACAEFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR 180 LEAGDMLEFQDIRDACAEFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR Sbjct: 121 LEAGDMLEFQDIRDACAEFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR 180 Query: 181 KNEDFLQLPQDMVVQLLSSEELETEDERLVYESAINWISYDLKKRYCYLPELLQTVRLAL 240 KNEDFLQLPQDMVVQLLSSEELETEDERLVYESAINWISYDLKKRYCYLPELLQTVRLAL Sbjct: 181 KNEDFLQLPQDMVVQLLSSEELETEDERLVYESAINWISYDLKKRYCYLPELLQTVRLAL 240 Query: 241 LPAIYLMENVAMEELITKQRKSKEIVEEAIRCKLKILQNDGVVTSLCARPRKTGHALFLL 300 LPAIYLMENVAMEELITKQRKSKEIVEEAIRCKLKILQNDGVVTSLCARPRKTGHALFLL Sbjct: 241 LPAIYLMENVAMEELITKQRKSKEIVEEAIRCKLKILQNDGVVTSLCARPRKTGHALFLL 300 Query: 301 GGQTFMCDKLYLVDQKAKEIIPKADIPSPRKEFSACAIGCKVYITGGRGSENGVSKDVWV 360 GGQTFMCDKLYLVDQKAKEIIPKADIPSPRKEFSACAIGCKVYITGGRGSENGVSKDVWV Sbjct: 301 GGQTFMCDKLYLVDQKAKEIIPKADIPSPRKEFSACAIGCKVYITGGRGSENGVSKDVWV 360 Query: 361 YDTLHEEWSKAAPMLVARFGHGSAELKHCLYVVGGHTAATGCLPASPSVSLKQVEHYDPT 420 YDTLHEEWSKAAPMLVARFGHGSAELKHCLYVVGGHTAATGCLPASPSVSLKQVEHYDPT Sbjct: 361 YDTLHEEWSKAAPMLVARFGHGSAELKHCLYVVGGHTAATGCLPASPSVSLKQVEHYDPT 420 Query: 421 INKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQ 480 INKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQ Sbjct: 421 INKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQ 480 Query: 481 PWRYTAAAVLGNQIFIMGGDTEFSACSAYKFNSETYQWTKVGDVTAKRMSCHAVASGNKL 540 PWRYTAAAVLGNQIFIMGGDTEFSACSAYKFNSETYQWTKVGDVTAKRMSCHAVASGNKL Sbjct: 481 PWRYTAAAVLGNQIFIMGGDTEFSACSAYKFNSETYQWTKVGDVTAKRMSCHAVASGNKL 540 Query: 541 YVVGGYFGIQRCKTLDCYDPTLDVWNSITTVPYSLIPTAFVSTWKHLPS 589 YVVGGYFGIQRCKTLDCYDPTLDVWNSITTVPYSLIPTAFVSTWKHLPS Sbjct: 541 YVVGGYFGIQRCKTLDCYDPTLDVWNSITTVPYSLIPTAFVSTWKHLPS 589 >swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). Length = 178 Score = 23.5 bits (49), Expect = 0.60 Identities = 12/40 (30%), Positives = 21/40 (52%) Query: 417 YDPTINKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSH 456 Y PT + T+ L EG + +V+ ++++L F V H Sbjct: 55 YLPTRDGETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPH 94 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 21.2 bits (43), Expect = 3.2 Identities = 6/13 (46%), Positives = 12/13 (92%) Query: 186 LQLPQDMVVQLLS 198 +QLPQ++ +QL++ Sbjct: 1062 IQLPQELTIQLVN 1074 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 20.8 bits (42), Expect = 4.7 Identities = 7/15 (46%), Positives = 10/15 (66%) Query: 128 EFQDIRDACAEFLEK 142 EF +++DA EF K Sbjct: 244 EFDELKDALREFFSK 258 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 1264 0.0 Sequences not found previously or not previously below threshold: swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). 24 0.58 swiss|P33418|LOS1_YEAST LOS1 protein. 21 3.2 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 21 4.7 CONVERGED! >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 1264 bits (3270), Expect = 0.0 Identities = 589/589 (100%), Positives = 589/589 (100%) Query: 1 MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRLFTDVLLHAGNRTFPCH 60 MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRLFTDVLLHAGNRTFPCH Sbjct: 1 MSVSVHENRKSRASSGSINIYLFHKSSYADSVLTHLNLLRQQRLFTDVLLHAGNRTFPCH 60 Query: 61 RAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLLDYAYSSRVIINEENAESL 120 RAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLLDYAYSSRVIINEENAESL Sbjct: 61 RAVLAACSRYFEAMFSGGLKESQDSEVNFDNSIHPEVLELLLDYAYSSRVIINEENAESL 120 Query: 121 LEAGDMLEFQDIRDACAEFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR 180 LEAGDMLEFQDIRDACAEFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR Sbjct: 121 LEAGDMLEFQDIRDACAEFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR 180 Query: 181 KNEDFLQLPQDMVVQLLSSEELETEDERLVYESAINWISYDLKKRYCYLPELLQTVRLAL 240 KNEDFLQLPQDMVVQLLSSEELETEDERLVYESAINWISYDLKKRYCYLPELLQTVRLAL Sbjct: 181 KNEDFLQLPQDMVVQLLSSEELETEDERLVYESAINWISYDLKKRYCYLPELLQTVRLAL 240 Query: 241 LPAIYLMENVAMEELITKQRKSKEIVEEAIRCKLKILQNDGVVTSLCARPRKTGHALFLL 300 LPAIYLMENVAMEELITKQRKSKEIVEEAIRCKLKILQNDGVVTSLCARPRKTGHALFLL Sbjct: 241 LPAIYLMENVAMEELITKQRKSKEIVEEAIRCKLKILQNDGVVTSLCARPRKTGHALFLL 300 Query: 301 GGQTFMCDKLYLVDQKAKEIIPKADIPSPRKEFSACAIGCKVYITGGRGSENGVSKDVWV 360 GGQTFMCDKLYLVDQKAKEIIPKADIPSPRKEFSACAIGCKVYITGGRGSENGVSKDVWV Sbjct: 301 GGQTFMCDKLYLVDQKAKEIIPKADIPSPRKEFSACAIGCKVYITGGRGSENGVSKDVWV 360 Query: 361 YDTLHEEWSKAAPMLVARFGHGSAELKHCLYVVGGHTAATGCLPASPSVSLKQVEHYDPT 420 YDTLHEEWSKAAPMLVARFGHGSAELKHCLYVVGGHTAATGCLPASPSVSLKQVEHYDPT Sbjct: 361 YDTLHEEWSKAAPMLVARFGHGSAELKHCLYVVGGHTAATGCLPASPSVSLKQVEHYDPT 420 Query: 421 INKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQ 480 INKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQ Sbjct: 421 INKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSHDKLPKVQCYDQCENRWTVPATCPQ 480 Query: 481 PWRYTAAAVLGNQIFIMGGDTEFSACSAYKFNSETYQWTKVGDVTAKRMSCHAVASGNKL 540 PWRYTAAAVLGNQIFIMGGDTEFSACSAYKFNSETYQWTKVGDVTAKRMSCHAVASGNKL Sbjct: 481 PWRYTAAAVLGNQIFIMGGDTEFSACSAYKFNSETYQWTKVGDVTAKRMSCHAVASGNKL 540 Query: 541 YVVGGYFGIQRCKTLDCYDPTLDVWNSITTVPYSLIPTAFVSTWKHLPS 589 YVVGGYFGIQRCKTLDCYDPTLDVWNSITTVPYSLIPTAFVSTWKHLPS Sbjct: 541 YVVGGYFGIQRCKTLDCYDPTLDVWNSITTVPYSLIPTAFVSTWKHLPS 589 >swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). Length = 178 Score = 23.9 bits (50), Expect = 0.58 Identities = 12/40 (30%), Positives = 21/40 (52%) Query: 417 YDPTINKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSH 456 Y PT + T+ L EG + +V+ ++++L F V H Sbjct: 55 YLPTRDGETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPH 94 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 21.2 bits (43), Expect = 3.2 Identities = 6/13 (46%), Positives = 12/13 (92%) Query: 186 LQLPQDMVVQLLS 198 +QLPQ++ +QL++ Sbjct: 1062 IQLPQELTIQLVN 1074 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 20.8 bits (42), Expect = 4.7 Identities = 7/15 (46%), Positives = 10/15 (66%) Query: 128 EFQDIRDACAEFLEK 142 EF +++DA EF K Sbjct: 244 EFDELKDALREFFSK 258 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.320 0.134 0.410 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 42,429 Number of Sequences: 43 Number of extensions: 1877 Number of successful extensions: 33 Number of sequences better than 10.0: 11 Number of HSP's better than 10.0 without gapping: 11 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 11 length of query: 589 length of database: 17,163 effective HSP length: 60 effective length of query: 529 effective length of database: 14,583 effective search space: 7714407 effective search space used: 7714407 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (20.9 bits) S2: 39 (19.6 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P24836|VE6_HPV56 E6 protein. (155 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P24836|VE6_HPV56 E6 protein. 286 6e-81 swiss|Q07856|VE6_HPV65 E6 protein. 59 2e-12 swiss|P30735|VE6_MNPV E6 protein. 46 3e-08 swiss|P06931|VE6_BPV1 E6 protein. 43 2e-07 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 20 1.5 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 20 2.0 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 286 bits (733), Expect = 6e-81 Identities = 155/155 (100%), Positives = 155/155 (100%) Query: 1 MEPQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDD 60 MEPQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDD Sbjct: 1 MEPQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDD 60 Query: 61 FPYAVCRVCLLFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQL 120 FPYAVCRVCLLFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQL Sbjct: 61 FPYAVCRVCLLFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQL 120 Query: 121 HCDRKRRFHLIAHGWTGSCLGCWRQTSREPRESTV 155 HCDRKRRFHLIAHGWTGSCLGCWRQTSREPRESTV Sbjct: 121 HCDRKRRFHLIAHGWTGSCLGCWRQTSREPRESTV 155 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 59.3 bits (142), Expect = 2e-12 Identities = 37/135 (27%), Positives = 63/135 (46%) Query: 11 RPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCL 70 RP +L +I DL L+C++C + ++ +F +L LV+R YA C CL Sbjct: 5 RPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECL 64 Query: 71 LFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHL 130 ++ + Y+ S+ LE + ++++ ++ IRC C L EK + +L Sbjct: 65 RLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYL 124 Query: 131 IAHGWTGSCLGCWRQ 145 I W G C C R+ Sbjct: 125 IRGLWRGYCRNCIRK 139 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 45.8 bits (107), Expect = 3e-08 Identities = 35/130 (26%), Positives = 54/130 (40%) Query: 13 RSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLF 72 R++H E L IP DL L C +C + LT+ E+ F + LV+R + +C C Sbjct: 75 RTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARV 134 Query: 73 YSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHLIA 132 + + + ++ S A + L L RC C L+ EK +R F + Sbjct: 135 CASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVR 194 Query: 133 HGWTGSCLGC 142 W C C Sbjct: 195 GQWRARCRIC 204 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 42.7 bits (99), Expect = 2e-07 Identities = 34/119 (28%), Positives = 48/119 (39%), Gaps = 1/119 (0%) Query: 31 LSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLFYSKVRKYRYYDYSVYGAT 90 L C++C++ LT + + + +V R+ Y C +CL + + V G Sbjct: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 Query: 91 LESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRF-HLIAHGWTGSCLGCWRQTSR 148 E + K L L IRC C LT EK H F A+ G C C R SR Sbjct: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCRHGSR 133 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 20.0 bits (40), Expect = 1.5 Identities = 11/39 (28%), Positives = 17/39 (43%) Query: 77 RKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTP 115 RK Y ++ L KK+ ++L+R C L P Sbjct: 129 RKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVP 167 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 19.6 bits (39), Expect = 2.0 Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 7/82 (8%) Query: 3 PQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFP 62 P + PQE R + + + + + L CV C K L A + N A T + P Sbjct: 759 PPCSGPQETARKWYGVWDSYKGDFVPLFEVCVGCVKML-EAVLPNLANTLYPIS-----P 812 Query: 63 YAVCRVCLLFYSKVRKYRYYDY 84 C L Y+ RK R+ +Y Sbjct: 813 EPFIAYCQLHYAPGRK-RFIEY 833 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P24836|VE6_HPV56 E6 protein. 212 2e-58 swiss|Q07856|VE6_HPV65 E6 protein. 170 8e-46 swiss|P30735|VE6_MNPV E6 protein. 164 4e-44 swiss|P06931|VE6_BPV1 E6 protein. 148 2e-39 Sequences not found previously or not previously below threshold: trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 19 2.8 swiss|P33418|LOS1_YEAST LOS1 protein. 18 5.7 swiss|P25233|NECD_MOUSE Necdin. 18 6.0 CONVERGED! >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 212 bits (541), Expect = 2e-58 Identities = 155/155 (100%), Positives = 155/155 (100%) Query: 1 MEPQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDD 60 MEPQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDD Sbjct: 1 MEPQFNNPQERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDD 60 Query: 61 FPYAVCRVCLLFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQL 120 FPYAVCRVCLLFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQL Sbjct: 61 FPYAVCRVCLLFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQL 120 Query: 121 HCDRKRRFHLIAHGWTGSCLGCWRQTSREPRESTV 155 HCDRKRRFHLIAHGWTGSCLGCWRQTSREPRESTV Sbjct: 121 HCDRKRRFHLIAHGWTGSCLGCWRQTSREPRESTV 155 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 170 bits (431), Expect = 8e-46 Identities = 37/135 (27%), Positives = 63/135 (46%) Query: 11 RPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCL 70 RP +L +I DL L+C++C + ++ +F +L LV+R YA C CL Sbjct: 5 RPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECL 64 Query: 71 LFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHL 130 ++ + Y+ S+ LE + ++++ ++ IRC C L EK + +L Sbjct: 65 RLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYL 124 Query: 131 IAHGWTGSCLGCWRQ 145 I W G C C R+ Sbjct: 125 IRGLWRGYCRNCIRK 139 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 164 bits (417), Expect = 4e-44 Identities = 35/130 (26%), Positives = 54/130 (40%) Query: 13 RSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLF 72 R++H E L IP DL L C +C + LT+ E+ F + LV+R + +C C Sbjct: 75 RTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARV 134 Query: 73 YSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHLIA 132 + + + ++ S A + L L RC C L+ EK +R F + Sbjct: 135 CASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVR 194 Query: 133 HGWTGSCLGC 142 W C C Sbjct: 195 GQWRARCRIC 204 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 148 bits (375), Expect = 2e-39 Identities = 34/119 (28%), Positives = 48/119 (39%), Gaps = 1/119 (0%) Query: 31 LSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLFYSKVRKYRYYDYSVYGAT 90 L C++C++ LT + + + +V R+ Y C +CL + + V G Sbjct: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 Query: 91 LESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRF-HLIAHGWTGSCLGCWRQTSR 148 E + K L L IRC C LT EK H F A+ G C C R SR Sbjct: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCRHGSR 133 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 19.0 bits (38), Expect = 2.8 Identities = 8/27 (29%), Positives = 12/27 (43%) Query: 5 FNNPQERPRSLHHLSEVLEIPLIDLRL 31 F E+ +HH E +P L+L Sbjct: 854 FKTSPEKSTLMHHYEEEGGVPSEKLKL 880 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 18.2 bits (36), Expect = 5.7 Identities = 7/20 (35%), Positives = 8/20 (40%), Gaps = 2/20 (10%) Query: 99 LCDLLIRCYRCQSPLTPEEK 118 L + I CY C L K Sbjct: 263 LKETKISCYNC--ILAIISK 280 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 18.2 bits (36), Expect = 6.0 Identities = 6/35 (17%), Positives = 13/35 (37%) Query: 113 LTPEEKQLHCDRKRRFHLIAHGWTGSCLGCWRQTS 147 L ++K++ + + C R+TS Sbjct: 114 LVKDQKRMVLWFPDMVKEVMGSYKKWCRSILRRTS 148 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.322 0.162 0.503 Lambda K H 0.267 0.0495 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 14,421 Number of Sequences: 43 Number of extensions: 763 Number of successful extensions: 30 Number of sequences better than 10.0: 16 Number of HSP's better than 10.0 without gapping: 16 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 21 length of query: 155 length of database: 17,163 effective HSP length: 49 effective length of query: 106 effective length of database: 15,056 effective search space: 1595936 effective search space used: 1595936 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 33 (18.0 bits) S2: 34 (17.4 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|Q07856|VE6_HPV65 E6 protein. (140 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|Q07856|VE6_HPV65 E6 protein. 229 8e-64 swiss|P30735|VE6_MNPV E6 protein. 59 4e-12 swiss|P24836|VE6_HPV56 E6 protein. 58 5e-12 swiss|P06931|VE6_BPV1 E6 protein. 38 6e-06 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 17 8.6 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 229 bits (585), Expect = 8e-64 Identities = 140/140 (100%), Positives = 140/140 (100%) Query: 1 MADGRPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACC 60 MADGRPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACC Sbjct: 1 MADGRPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACC 60 Query: 61 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQ 120 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQ Sbjct: 61 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQ 120 Query: 121 ACYLIRGLWRGYCRNCIRKQ 140 ACYLIRGLWRGYCRNCIRKQ Sbjct: 121 ACYLIRGLWRGYCRNCIRKQ 140 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 58.5 bits (140), Expect = 4e-12 Identities = 37/125 (29%), Positives = 60/125 (47%) Query: 12 FCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFE 71 F R I DL L C FCS + ++L +F +LV+RG C + C+ C R+ A + Sbjct: 80 FVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLD 139 Query: 72 QENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYLIRGLWRG 131 + Q S ++ + + RC C+++L + EKL+ ++ +RG WR Sbjct: 140 LFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQWRA 199 Query: 132 YCRNC 136 CR C Sbjct: 200 RCRIC 204 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 57.8 bits (138), Expect = 5e-12 Identities = 37/135 (27%), Positives = 63/135 (46%) Query: 5 RPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECL 64 RP +L +I DL L+C++C + ++ +F +L LV+R YA C CL Sbjct: 11 RPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCL 70 Query: 65 RLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYL 124 ++ + Y+ S+ LE + ++++ ++ IRC C L EK + +L Sbjct: 71 LFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHL 130 Query: 125 IRGLWRGYCRNCIRK 139 I W G C C R+ Sbjct: 131 IAHGWTGSCLGCWRQ 145 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 37.7 bits (86), Expect = 6e-06 Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 1/115 (0%) Query: 25 LTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFEQENYFQCSIKAVN 84 L C++C + D +K +V R GC Y C+ CL E+ + + Sbjct: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 Query: 85 LEEVAQRKIKEICIRCICCLRLLDIVEK-LDLLYSDQACYLIRGLWRGYCRNCIR 138 E + + + +CIRC C L EK +L+++ C + RG C +C R Sbjct: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCR 129 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 17.3 bits (33), Expect = 8.6 Identities = 6/16 (37%), Positives = 11/16 (68%) Query: 81 KAVNLEEVAQRKIKEI 96 +A +A+RK+KE+ Sbjct: 195 EANRYHRIAERKLKEV 210 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P24836|VE6_HPV56 E6 protein. 168 3e-45 swiss|Q07856|VE6_HPV65 E6 protein. 163 9e-44 swiss|P30735|VE6_MNPV E6 protein. 149 1e-39 swiss|P06931|VE6_BPV1 E6 protein. 132 2e-34 Sequences not found previously or not previously below threshold: swiss|P33418|LOS1_YEAST LOS1 protein. 17 8.7 CONVERGED! >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 168 bits (426), Expect = 3e-45 Identities = 37/135 (27%), Positives = 63/135 (46%) Query: 5 RPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECL 64 RP +L +I DL L+C++C + ++ +F +L LV+R YA C CL Sbjct: 11 RPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCL 70 Query: 65 RLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYL 124 ++ + Y+ S+ LE + ++++ ++ IRC C L EK + +L Sbjct: 71 LFYSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHL 130 Query: 125 IRGLWRGYCRNCIRK 139 I W G C C R+ Sbjct: 131 IAHGWTGSCLGCWRQ 145 Score = 18.2 bits (36), Expect = 4.9 Identities = 7/36 (19%), Positives = 16/36 (44%) Query: 76 FQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVE 111 + +L EV + + ++ + C+ C + L E Sbjct: 9 QERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAE 44 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 163 bits (413), Expect = 9e-44 Identities = 140/140 (100%), Positives = 140/140 (100%) Query: 1 MADGRPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACC 60 MADGRPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACC Sbjct: 1 MADGRPAALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACC 60 Query: 61 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQ 120 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQ Sbjct: 61 SECLRLSARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQ 120 Query: 121 ACYLIRGLWRGYCRNCIRKQ 140 ACYLIRGLWRGYCRNCIRKQ Sbjct: 121 ACYLIRGLWRGYCRNCIRKQ 140 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 149 bits (378), Expect = 1e-39 Identities = 37/130 (28%), Positives = 61/130 (46%) Query: 7 AALDDFCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRL 66 + F R I DL L C FCS + ++L +F +LV+RG C + C+ C R+ Sbjct: 75 RTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARV 134 Query: 67 SARFEQENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYLIR 126 A + + Q S ++ + + RC C+++L + EKL+ ++ +R Sbjct: 135 CASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVR 194 Query: 127 GLWRGYCRNC 136 G WR CR C Sbjct: 195 GQWRARCRIC 204 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 132 bits (332), Expect = 2e-34 Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 1/115 (0%) Query: 25 LTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFEQENYFQCSIKAVN 84 L C++C + D +K +V R GC Y C+ CL E+ + + Sbjct: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 Query: 85 LEEVAQRKIKEICIRCICCLRLLDIVEK-LDLLYSDQACYLIRGLWRGYCRNCIR 138 E + + + +CIRC C L EK +L+++ C + RG C +C R Sbjct: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCR 129 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 17.4 bits (34), Expect = 8.7 Identities = 8/20 (40%), Positives = 11/20 (55%), Gaps = 2/20 (10%) Query: 93 IKEICIRCICCLRLLDIVEK 112 +KE I C C +L I+ K Sbjct: 263 LKETKISCYNC--ILAIISK 280 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.324 0.166 0.517 Lambda K H 0.267 0.0507 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 12,851 Number of Sequences: 43 Number of extensions: 652 Number of successful extensions: 30 Number of sequences better than 10.0: 14 Number of HSP's better than 10.0 without gapping: 14 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 17 length of query: 140 length of database: 17,163 effective HSP length: 48 effective length of query: 92 effective length of database: 15,099 effective search space: 1389108 effective search space used: 1389108 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 33 (18.0 bits) S2: 34 (17.4 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P30735|VE6_MNPV E6 protein. (207 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P30735|VE6_MNPV E6 protein. 376 e-108 swiss|Q07856|VE6_HPV65 E6 protein. 59 5e-12 swiss|P24836|VE6_HPV56 E6 protein. 45 6e-08 swiss|P06931|VE6_BPV1 E6 protein. 29 0.005 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 19 5.0 trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 18 5.6 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 376 bits (966), Expect = e-108 Identities = 207/207 (100%), Positives = 207/207 (100%) Query: 1 HPAWHCLRRTCCRDLYRVTVLGSTNTIRFGQDRYGCECWQQSPLHTFFHCTGIVQTYIRR 60 HPAWHCLRRTCCRDLYRVTVLGSTNTIRFGQDRYGCECWQQSPLHTFFHCTGIVQTYIRR Sbjct: 1 HPAWHCLRRTCCRDLYRVTVLGSTNTIRFGQDRYGCECWQQSPLHTFFHCTGIVQTYIRR 60 Query: 61 PHWLWICYCGSRMDRTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWR 120 PHWLWICYCGSRMDRTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWR Sbjct: 61 PHWLWICYCGSRMDRTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWR 120 Query: 121 GRCAHGICTACARVCASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEK 180 GRCAHGICTACARVCASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEK Sbjct: 121 GRCAHGICTACARVCASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEK 180 Query: 181 LECAERGESFAKVRGQWRARCRICKPV 207 LECAERGESFAKVRGQWRARCRICKPV Sbjct: 181 LECAERGESFAKVRGQWRARCRICKPV 207 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 58.9 bits (141), Expect = 5e-12 Identities = 37/125 (29%), Positives = 60/125 (47%) Query: 80 FVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLD 139 F R I DL L C FCS + ++L +F +LV+RG C + C+ C R+ A + Sbjct: 12 FCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFE 71 Query: 140 LFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQWRA 199 + Q S ++ + + RC C+++L + EKL+ ++ +RG WR Sbjct: 72 QENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYLIRGLWRG 131 Query: 200 RCRIC 204 CR C Sbjct: 132 YCRNC 136 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 45.1 bits (105), Expect = 6e-08 Identities = 35/130 (26%), Positives = 54/130 (40%) Query: 75 RTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARV 134 R++H E L IP DL L C +C + LT+ E+ F + LV+R + +C C Sbjct: 13 RSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLF 72 Query: 135 CASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVR 194 + + + ++ S A + L L RC C L+ EK +R F + Sbjct: 73 YSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHLIA 132 Query: 195 GQWRARCRIC 204 W C C Sbjct: 133 HGWTGSCLGC 142 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 28.9 bits (63), Expect = 0.005 Identities = 32/113 (28%), Positives = 42/113 (36%), Gaps = 1/113 (0%) Query: 93 LPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLDLFLHHQNSRPLAD 152 L C +C LT+ + F++V R C +G CT C C + + L + Sbjct: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 Query: 153 VLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQ-WRARCRIC 204 TL L RC C L+ EK E F K R R RC C Sbjct: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDC 127 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 18.9 bits (37), Expect = 5.0 Identities = 8/14 (57%), Positives = 9/14 (64%) Query: 78 HSFVERLGIPREDL 91 HS V +LG RE L Sbjct: 642 HSLVMKLGKERESL 655 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 18.5 bits (36), Expect = 5.6 Identities = 7/11 (63%), Positives = 7/11 (63%) Query: 95 CTFCSRFLTQE 105 CT CSR T E Sbjct: 420 CTICSRAYTSE 430 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P30735|VE6_MNPV E6 protein. 301 4e-85 swiss|P24836|VE6_HPV56 E6 protein. 170 1e-45 swiss|Q07856|VE6_HPV65 E6 protein. 156 2e-41 swiss|P06931|VE6_BPV1 E6 protein. 139 3e-36 Sequences not found previously or not previously below threshold: trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 19 5.2 CONVERGED! >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 301 bits (771), Expect = 4e-85 Identities = 207/207 (100%), Positives = 207/207 (100%) Query: 1 HPAWHCLRRTCCRDLYRVTVLGSTNTIRFGQDRYGCECWQQSPLHTFFHCTGIVQTYIRR 60 HPAWHCLRRTCCRDLYRVTVLGSTNTIRFGQDRYGCECWQQSPLHTFFHCTGIVQTYIRR Sbjct: 1 HPAWHCLRRTCCRDLYRVTVLGSTNTIRFGQDRYGCECWQQSPLHTFFHCTGIVQTYIRR 60 Query: 61 PHWLWICYCGSRMDRTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWR 120 PHWLWICYCGSRMDRTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWR Sbjct: 61 PHWLWICYCGSRMDRTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWR 120 Query: 121 GRCAHGICTACARVCASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEK 180 GRCAHGICTACARVCASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEK Sbjct: 121 GRCAHGICTACARVCASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEK 180 Query: 181 LECAERGESFAKVRGQWRARCRICKPV 207 LECAERGESFAKVRGQWRARCRICKPV Sbjct: 181 LECAERGESFAKVRGQWRARCRICKPV 207 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 170 bits (431), Expect = 1e-45 Identities = 35/130 (26%), Positives = 54/130 (40%) Query: 75 RTVHSFVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARV 134 R++H E L IP DL L C +C + LT+ E+ F + LV+R + +C C Sbjct: 13 RSLHHLSEVLEIPLIDLRLSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLF 72 Query: 135 CASLDLFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVR 194 + + + ++ S A + L L RC C L+ EK +R F + Sbjct: 73 YSKVRKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRFHLIA 132 Query: 195 GQWRARCRIC 204 W C C Sbjct: 133 HGWTGSCLGC 142 Score = 19.0 bits (38), Expect = 4.3 Identities = 9/36 (25%), Positives = 14/36 (38%) Query: 144 HQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTE 179 + R L + + L L+ C C K L+ E Sbjct: 9 QERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAE 44 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 156 bits (396), Expect = 2e-41 Identities = 37/125 (29%), Positives = 60/125 (47%) Query: 80 FVERLGIPREDLLLPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLD 139 F R I DL L C FCS + ++L +F +LV+RG C + C+ C R+ A + Sbjct: 12 FCRRFDISFFDLHLTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFE 71 Query: 140 LFLHHQNSRPLADVLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQWRA 199 + Q S ++ + + RC C+++L + EKL+ ++ +RG WR Sbjct: 72 QENYFQCSIKAVNLEEVAQRKIKEICIRCICCLRLLDIVEKLDLLYSDQACYLIRGLWRG 131 Query: 200 RCRIC 204 CR C Sbjct: 132 YCRNC 136 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 139 bits (350), Expect = 3e-36 Identities = 32/113 (28%), Positives = 42/113 (36%), Gaps = 1/113 (0%) Query: 93 LPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLDLFLHHQNSRPLAD 152 L C +C LT+ + F++V R C +G CT C C + + L + Sbjct: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 Query: 153 VLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQ-WRARCRIC 204 TL L RC C L+ EK E F K R R RC C Sbjct: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDC 127 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 18.6 bits (37), Expect = 5.2 Identities = 11/36 (30%), Positives = 17/36 (46%), Gaps = 7/36 (19%) Query: 65 WICYCGSRMDRT-------VHSFVERLGIPREDLLL 93 WI C + + +T +H + E G+P E L L Sbjct: 845 WIKRCTNLIFKTSPEKSTLMHHYEEEGGVPSEKLKL 880 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.329 0.158 0.546 Lambda K H 0.267 0.0485 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 18,255 Number of Sequences: 43 Number of extensions: 966 Number of successful extensions: 33 Number of sequences better than 10.0: 16 Number of HSP's better than 10.0 without gapping: 16 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 21 length of query: 207 length of database: 17,163 effective HSP length: 52 effective length of query: 155 effective length of database: 14,927 effective search space: 2313685 effective search space used: 2313685 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (19.3 bits) S2: 35 (17.8 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P06931|VE6_BPV1 E6 protein. (137 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P06931|VE6_BPV1 E6 protein. 249 9e-70 swiss|P24836|VE6_HPV56 E6 protein. 42 2e-07 swiss|Q07856|VE6_HPV65 E6 protein. 38 4e-06 swiss|P30735|VE6_MNPV E6 protein. 29 0.002 trembl|Q15219|Q15219 Protein p84. 19 2.8 >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 249 bits (636), Expect = 9e-70 Identities = 137/137 (100%), Positives = 137/137 (100%) Query: 1 MDLKPFARTNPFSGLDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLAT 60 MDLKPFARTNPFSGLDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLAT Sbjct: 1 MDLKPFARTNPFSGLDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLAT 60 Query: 61 ERRLWQGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANII 120 ERRLWQGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANII Sbjct: 61 ERRLWQGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANII 120 Query: 121 RGRCYDCCRHGSRSKYP 137 RGRCYDCCRHGSRSKYP Sbjct: 121 RGRCYDCCRHGSRSKYP 137 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 42.4 bits (98), Expect = 2e-07 Identities = 34/119 (28%), Positives = 48/119 (39%), Gaps = 1/119 (0%) Query: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 L C++C++ LT + + + +V R+ Y C +CL + + V G Sbjct: 31 LSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLFYSKVRKYRYYDYSVYGAT 90 Query: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCRHGSR 133 E + K L L IRC C LT EK H F A+ G C C R SR Sbjct: 91 LESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRF-HLIAHGWTGSCLGCWRQTSR 148 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 38.1 bits (87), Expect = 4e-06 Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 1/115 (0%) Query: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 L C++C + D +K +V R GC Y C+ CL E+ + + Sbjct: 25 LTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFEQENYFQCSIKAVN 84 Query: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCR 129 E + + + +CIRC C L EK +L+++ C + RG C +C R Sbjct: 85 LEEVAQRKIKEICIRCICCLRLLDIVEK-LDLLYSDQACYLIRGLWRGYCRNCIR 138 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 29.3 bits (64), Expect = 0.002 Identities = 32/113 (28%), Positives = 42/113 (36%), Gaps = 1/113 (0%) Query: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 L C +C LT+ + F++V R C +G CT C C + + L + Sbjct: 93 LPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLDLFLHHQNSRPLAD 152 Query: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDC 127 TL L RC C L+ EK E F K R R RC C Sbjct: 153 VLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQ-WRARCRIC 204 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 18.9 bits (37), Expect = 2.8 Identities = 7/13 (53%), Positives = 9/13 (68%) Query: 69 PVTGEEAELLHGK 81 PVTGE+ E+ K Sbjct: 568 PVTGEQIEVFANK 580 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P06931|VE6_BPV1 E6 protein. 180 8e-49 swiss|P24836|VE6_HPV56 E6 protein. 149 2e-39 swiss|P30735|VE6_MNPV E6 protein. 132 1e-34 swiss|Q07856|VE6_HPV65 E6 protein. 132 1e-34 Sequences not found previously or not previously below threshold: swiss|P33418|LOS1_YEAST LOS1 protein. 18 5.4 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 17 7.5 CONVERGED! >swiss|P06931|VE6_BPV1 E6 protein. Length = 137 Score = 180 bits (457), Expect = 8e-49 Identities = 137/137 (100%), Positives = 137/137 (100%) Query: 1 MDLKPFARTNPFSGLDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLAT 60 MDLKPFARTNPFSGLDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLAT Sbjct: 1 MDLKPFARTNPFSGLDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLAT 60 Query: 61 ERRLWQGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANII 120 ERRLWQGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANII Sbjct: 61 ERRLWQGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANII 120 Query: 121 RGRCYDCCRHGSRSKYP 137 RGRCYDCCRHGSRSKYP Sbjct: 121 RGRCYDCCRHGSRSKYP 137 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 149 bits (376), Expect = 2e-39 Identities = 34/119 (28%), Positives = 48/119 (39%), Gaps = 1/119 (0%) Query: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 L C++C++ LT + + + +V R+ Y C +CL + + V G Sbjct: 31 LSCVYCKKELTRAEVYNFACTELKLVYRDDFPYAVCRVCLLFYSKVRKYRYYDYSVYGAT 90 Query: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCRHGSR 133 E + K L L IRC C LT EK H F A+ G C C R SR Sbjct: 91 LESITKKQLCDLLIRCYRCQSPLTPEEKQLHCDRKRRF-HLIAHGWTGSCLGCWRQTSR 148 Score = 18.6 bits (37), Expect = 3.3 Identities = 10/36 (27%), Positives = 15/36 (40%) Query: 66 QGVPVTGEEAELLHGKTLDRLCIRCCYCGGKLTKNE 101 Q P + + L L + C YC +LT+ E Sbjct: 9 QERPRSLHHLSEVLEIPLIDLRLSCVYCKKELTRAE 44 >swiss|P30735|VE6_MNPV E6 protein. Length = 207 Score = 132 bits (334), Expect = 1e-34 Identities = 32/113 (28%), Positives = 42/113 (36%), Gaps = 1/113 (0%) Query: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 L C +C LT+ + F++V R C +G CT C C + + L + Sbjct: 93 LPCTFCSRFLTQEELTAFDFSAFNLVWRGRCAHGICTACARVCASLDLFLHHQNSRPLAD 152 Query: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDC 127 TL L RC C L+ EK E F K R R RC C Sbjct: 153 VLRDENLTLHGLKARCRVCMKILSVTEKLECAERGESFAKVRGQ-WRARCRIC 204 >swiss|Q07856|VE6_HPV65 E6 protein. Length = 140 Score = 132 bits (333), Expect = 1e-34 Identities = 30/115 (26%), Positives = 49/115 (42%), Gaps = 1/115 (0%) Query: 15 LDCLWCREPLTEVDAFRCMVKDFHVVIREGCRYGACTICLENCLATERRLWQGVPVTGEE 74 L C++C + D +K +V R GC Y C+ CL E+ + + Sbjct: 25 LTCIFCSHTVDLQDLASFYLKKLSLVFRGGCYYACCSECLRLSARFEQENYFQCSIKAVN 84 Query: 75 AELLHGKTLDRLCIRCCYCGGKLTKNEKHRHVLFNEPFCKTRANIIRGRCYDCCR 129 E + + + +CIRC C L EK +L+++ C + RG C +C R Sbjct: 85 LEEVAQRKIKEICIRCICCLRLLDIVEK-LDLLYSDQACYLIRGLWRGYCRNCIR 138 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 17.8 bits (35), Expect = 5.4 Identities = 6/20 (30%), Positives = 6/20 (30%), Gaps = 2/20 (10%) Query: 83 LDRLCIRCCYCGGKLTKNEK 102 L I C C L K Sbjct: 263 LKETKISCYNC--ILAIISK 280 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 17.4 bits (34), Expect = 7.5 Identities = 4/11 (36%), Positives = 4/11 (36%) Query: 110 EPFCKTRANII 120 E F K I Sbjct: 9 ERFGKAIRKIW 19 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.325 0.162 0.512 Lambda K H 0.267 0.0495 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 12,415 Number of Sequences: 43 Number of extensions: 618 Number of successful extensions: 32 Number of sequences better than 10.0: 15 Number of HSP's better than 10.0 without gapping: 15 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 22 length of query: 137 length of database: 17,163 effective HSP length: 48 effective length of query: 89 effective length of database: 15,099 effective search space: 1343811 effective search space used: 1343811 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 33 (18.1 bits) S2: 33 (17.0 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|O08579|EMD_MOUSE Emerin. (259 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|O08579|EMD_MOUSE Emerin. 429 e-123 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 1.3 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 20 4.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 19 6.6 swiss|P14733|LAM1_MOUSE Lamin B1. 19 6.6 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 429 bits (1103), Expect = e-123 Identities = 259/259 (100%), Positives = 259/259 (100%) Query: 1 MDDYAVLSDTELAAVLRQYNIPHGPIVGSTRKLYEKKIFEYETQRRRLLPPNSSSSSFSY 60 MDDYAVLSDTELAAVLRQYNIPHGPIVGSTRKLYEKKIFEYETQRRRLLPPNSSSSSFSY Sbjct: 1 MDDYAVLSDTELAAVLRQYNIPHGPIVGSTRKLYEKKIFEYETQRRRLLPPNSSSSSFSY 60 Query: 61 QFSDLDSAAVDSDMYDLPKKEDALLYQSKDYNDDYYEESYLTTKTYGEPESVGMSKSFRQ 120 QFSDLDSAAVDSDMYDLPKKEDALLYQSKDYNDDYYEESYLTTKTYGEPESVGMSKSFRQ Sbjct: 61 QFSDLDSAAVDSDMYDLPKKEDALLYQSKDYNDDYYEESYLTTKTYGEPESVGMSKSFRQ 120 Query: 121 PGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGL 180 PGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGL Sbjct: 121 PGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGL 180 Query: 181 SYYPTSSTSSVSSSSSSPSSWLTRRAIRPEKQAPAAALGQDRQVPLWGQLLLFLVFAAFL 240 SYYPTSSTSSVSSSSSSPSSWLTRRAIRPEKQAPAAALGQDRQVPLWGQLLLFLVFAAFL Sbjct: 181 SYYPTSSTSSVSSSSSSPSSWLTRRAIRPEKQAPAAALGQDRQVPLWGQLLLFLVFAAFL 240 Query: 241 LFVYYSIQAEEGNPFWMDP 259 LFVYYSIQAEEGNPFWMDP Sbjct: 241 LFVYYSIQAEEGNPFWMDP 259 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 21.2 bits (43), Expect = 1.3 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 4/66 (6%) Query: 116 KSFRQPGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSR 175 KS +Q L+DA +Q D IF+ +E + L G D+A + R + Sbjct: 247 KSLQQEHERLLDAQVSLNQREDHIFARSQELAE----LEKGLDTAKTTFEEERKAFEDKK 302 Query: 176 SSLGLS 181 S+L ++ Sbjct: 303 SNLEIA 308 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 19.6 bits (39), Expect = 4.0 Identities = 6/16 (37%), Positives = 11/16 (68%) Query: 19 YNIPHGPIVGSTRKLY 34 Y++ HGP+ + R L+ Sbjct: 120 YDVGHGPLASAMRMLW 135 Score = 18.5 bits (36), Expect = 7.9 Identities = 9/32 (28%), Positives = 14/32 (43%) Query: 49 LPPNSSSSSFSYQFSDLDSAAVDSDMYDLPKK 80 + P S+ Y F +S + D D + P K Sbjct: 449 IDPADLDESWDYIFETTESPSSDEDYVEGPSK 480 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 18.9 bits (37), Expect = 6.6 Identities = 7/20 (35%), Positives = 10/20 (50%) Query: 148 KDRERLIYGQDSAYQSIAHY 167 +DR G D YQ + H+ Sbjct: 442 RDRREYPIGDDPRYQQLDHH 461 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 18.9 bits (37), Expect = 6.6 Identities = 12/52 (23%), Positives = 25/52 (48%) Query: 138 DIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGLSYYPTSSTS 189 +I ++++ D E+L+ + + I+ YR + L LS P+S + Sbjct: 348 EIRDQMQQQLSDYEQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVT 399 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|O08579|EMD_MOUSE Emerin. 473 e-137 Sequences not found previously or not previously below threshold: trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 23 0.33 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 21 1.3 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 20 2.3 swiss|P14733|LAM1_MOUSE Lamin B1. 20 2.5 CONVERGED! >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 473 bits (1217), Expect = e-137 Identities = 259/259 (100%), Positives = 259/259 (100%) Query: 1 MDDYAVLSDTELAAVLRQYNIPHGPIVGSTRKLYEKKIFEYETQRRRLLPPNSSSSSFSY 60 MDDYAVLSDTELAAVLRQYNIPHGPIVGSTRKLYEKKIFEYETQRRRLLPPNSSSSSFSY Sbjct: 1 MDDYAVLSDTELAAVLRQYNIPHGPIVGSTRKLYEKKIFEYETQRRRLLPPNSSSSSFSY 60 Query: 61 QFSDLDSAAVDSDMYDLPKKEDALLYQSKDYNDDYYEESYLTTKTYGEPESVGMSKSFRQ 120 QFSDLDSAAVDSDMYDLPKKEDALLYQSKDYNDDYYEESYLTTKTYGEPESVGMSKSFRQ Sbjct: 61 QFSDLDSAAVDSDMYDLPKKEDALLYQSKDYNDDYYEESYLTTKTYGEPESVGMSKSFRQ 120 Query: 121 PGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGL 180 PGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGL Sbjct: 121 PGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGL 180 Query: 181 SYYPTSSTSSVSSSSSSPSSWLTRRAIRPEKQAPAAALGQDRQVPLWGQLLLFLVFAAFL 240 SYYPTSSTSSVSSSSSSPSSWLTRRAIRPEKQAPAAALGQDRQVPLWGQLLLFLVFAAFL Sbjct: 181 SYYPTSSTSSVSSSSSSPSSWLTRRAIRPEKQAPAAALGQDRQVPLWGQLLLFLVFAAFL 240 Query: 241 LFVYYSIQAEEGNPFWMDP 259 LFVYYSIQAEEGNPFWMDP Sbjct: 241 LFVYYSIQAEEGNPFWMDP 259 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 23.1 bits (48), Expect = 0.33 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 4/66 (6%) Query: 116 KSFRQPGTSLVDADTFHHQVRDDIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSR 175 KS +Q L+DA +Q D IF+ +E + L G D+A + R + Sbjct: 247 KSLQQEHERLLDAQVSLNQREDHIFARSQELAE----LEKGLDTAKTTFEEERKAFEDKK 302 Query: 176 SSLGLS 181 S+L ++ Sbjct: 303 SNLEIA 308 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 21.2 bits (43), Expect = 1.3 Identities = 6/16 (37%), Positives = 11/16 (68%) Query: 19 YNIPHGPIVGSTRKLY 34 Y++ HGP+ + R L+ Sbjct: 120 YDVGHGPLASAMRMLW 135 Score = 20.1 bits (40), Expect = 2.9 Identities = 9/32 (28%), Positives = 14/32 (43%) Query: 49 LPPNSSSSSFSYQFSDLDSAAVDSDMYDLPKK 80 + P S+ Y F +S + D D + P K Sbjct: 449 IDPADLDESWDYIFETTESPSSDEDYVEGPSK 480 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 20.4 bits (41), Expect = 2.3 Identities = 7/20 (35%), Positives = 10/20 (50%) Query: 148 KDRERLIYGQDSAYQSIAHY 167 +DR G D YQ + H+ Sbjct: 442 RDRREYPIGDDPRYQQLDHH 461 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 20.1 bits (40), Expect = 2.5 Identities = 12/52 (23%), Positives = 25/52 (48%) Query: 138 DIFSSLEEEGKDRERLIYGQDSAYQSIAHYRPISNVSRSSLGLSYYPTSSTS 189 +I ++++ D E+L+ + + I+ YR + L LS P+S + Sbjct: 348 EIRDQMQQQLSDYEQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVT 399 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.314 0.131 0.373 Lambda K H 0.267 0.0400 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 18,575 Number of Sequences: 43 Number of extensions: 779 Number of successful extensions: 21 Number of sequences better than 10.0: 13 Number of HSP's better than 10.0 without gapping: 13 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 17 length of query: 259 length of database: 17,163 effective HSP length: 54 effective length of query: 205 effective length of database: 14,841 effective search space: 3042405 effective search space used: 3042405 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 36 (19.2 bits) S2: 36 (18.5 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P33418|LOS1_YEAST LOS1 protein. (1100 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P33418|LOS1_YEAST LOS1 protein. 1961 0.0 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 21 8.6 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 1961 bits (5079), Expect = 0.0 Identities = 1100/1100 (100%), Positives = 1100/1100 (100%) Query: 1 MLERIQQLVNAVNDPRSDVATKRQAIELLNGIKSSENALEIFISLVINENSNDLLKFYGL 60 MLERIQQLVNAVNDPRSDVATKRQAIELLNGIKSSENALEIFISLVINENSNDLLKFYGL Sbjct: 1 MLERIQQLVNAVNDPRSDVATKRQAIELLNGIKSSENALEIFISLVINENSNDLLKFYGL 60 Query: 61 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPDFLMNKISEVLTTLFMLM 120 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPDFLMNKISEVLTTLFMLM Sbjct: 61 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPDFLMNKISEVLTTLFMLM 120 Query: 121 YSDCNGNQWNSFFDDLMSLFQVDSAISNTSPSTDGNILLGLEFFNKLCLMINSEIADQSF 180 YSDCNGNQWNSFFDDLMSLFQVDSAISNTSPSTDGNILLGLEFFNKLCLMINSEIADQSF Sbjct: 121 YSDCNGNQWNSFFDDLMSLFQVDSAISNTSPSTDGNILLGLEFFNKLCLMINSEIADQSF 180 Query: 181 IRSKESQLKNNNIKDWMRDNDIMKLSNVWFQCLKLDEQIVSQCPGLINSTLDCIGSFISW 240 IRSKESQLKNNNIKDWMRDNDIMKLSNVWFQCLKLDEQIVSQCPGLINSTLDCIGSFISW Sbjct: 181 IRSKESQLKNNNIKDWMRDNDIMKLSNVWFQCLKLDEQIVSQCPGLINSTLDCIGSFISW 240 Query: 241 IDINLIIDANNYYLQLIYKFLNLKETKISCYNCILAIISKKMKPMDKLAFLNMINLTNEL 300 IDINLIIDANNYYLQLIYKFLNLKETKISCYNCILAIISKKMKPMDKLAFLNMINLTNEL Sbjct: 241 IDINLIIDANNYYLQLIYKFLNLKETKISCYNCILAIISKKMKPMDKLAFLNMINLTNEL 300 Query: 301 TYYHQAISMNPQIITFDNLEVWESLTKLITSFGIEFTIIIEQVNDDQKLDTLYKQSVISN 360 TYYHQAISMNPQIITFDNLEVWESLTKLITSFGIEFTIIIEQVNDDQKLDTLYKQSVISN Sbjct: 301 TYYHQAISMNPQIITFDNLEVWESLTKLITSFGIEFTIIIEQVNDDQKLDTLYKQSVISN 360 Query: 361 VDSILLEKIIPILLEFMNNEFDSITAKTFPFWSNYLAFLKKYKASSPNFVPLHKDFLDNF 420 VDSILLEKIIPILLEFMNNEFDSITAKTFPFWSNYLAFLKKYKASSPNFVPLHKDFLDNF Sbjct: 361 VDSILLEKIIPILLEFMNNEFDSITAKTFPFWSNYLAFLKKYKASSPNFVPLHKDFLDNF 420 Query: 421 QQICFKRMKFSDDEVTQDDFEEFNETVRFKLKNFQEIIVVIDPSLFLNNISQEISANLMN 480 QQICFKRMKFSDDEVTQDDFEEFNETVRFKLKNFQEIIVVIDPSLFLNNISQEISANLMN Sbjct: 421 QQICFKRMKFSDDEVTQDDFEEFNETVRFKLKNFQEIIVVIDPSLFLNNISQEISANLMN 480 Query: 481 CKNESWQIFELTIYQIFNLSECTKNNYFGLNKNEIMTSQPSLTLVRFLNELLMMKDFLLA 540 CKNESWQIFELTIYQIFNLSECTKNNYFGLNKNEIMTSQPSLTLVRFLNELLMMKDFLLA Sbjct: 481 CKNESWQIFELTIYQIFNLSECTKNNYFGLNKNEIMTSQPSLTLVRFLNELLMMKDFLLA 540 Query: 541 IDNEQIQILFMELIVKNYNFIFSTSANTANATDDDEKYLLILNIFMSSFAMFNKRENVRL 600 IDNEQIQILFMELIVKNYNFIFSTSANTANATDDDEKYLLILNIFMSSFAMFNKRENVRL Sbjct: 541 IDNEQIQILFMELIVKNYNFIFSTSANTANATDDDEKYLLILNIFMSSFAMFNKRENVRL 600 Query: 601 RSWYLFTRFLKLTRINLKKILFANKNLVNEITNKISPLLHIKVTSINAQGTDDNDTIFDN 660 RSWYLFTRFLKLTRINLKKILFANKNLVNEITNKISPLLHIKVTSINAQGTDDNDTIFDN Sbjct: 601 RSWYLFTRFLKLTRINLKKILFANKNLVNEITNKISPLLHIKVTSINAQGTDDNDTIFDN 660 Query: 661 QLYIFEGIGFIITLNNSSQELTAATANTPIDYDILDQILTPLFTQLEGCITQGASPVVIL 720 QLYIFEGIGFIITLNNSSQELTAATANTPIDYDILDQILTPLFTQLEGCITQGASPVVIL Sbjct: 661 QLYIFEGIGFIITLNNSSQELTAATANTPIDYDILDQILTPLFTQLEGCITQGASPVVIL 720 Query: 721 ECHHILMAIGTLARGLHIGLVPENQVNNMVVNKKLINDSLIHKFSNIAEVILVTFSFFNK 780 ECHHILMAIGTLARGLHIGLVPENQVNNMVVNKKLINDSLIHKFSNIAEVILVTFSFFNK Sbjct: 721 ECHHILMAIGTLARGLHIGLVPENQVNNMVVNKKLINDSLIHKFSNIAEVILVTFSFFNK 780 Query: 781 FENIRDASRFTFARLIPILSNKILPFINKLIELILSSTDLKSWEMIDFLGFLSQLIHMFH 840 FENIRDASRFTFARLIPILSNKILPFINKLIELILSSTDLKSWEMIDFLGFLSQLIHMFH Sbjct: 781 FENIRDASRFTFARLIPILSNKILPFINKLIELILSSTDLKSWEMIDFLGFLSQLIHMFH 840 Query: 841 TDTDCYQLFNQLLTPLINKVHSIIEEIDEQHDQQSSSNKPIDTAVTATSVNKNIVVTDSY 900 TDTDCYQLFNQLLTPLINKVHSIIEEIDEQHDQQSSSNKPIDTAVTATSVNKNIVVTDSY Sbjct: 841 TDTDCYQLFNQLLTPLINKVHSIIEEIDEQHDQQSSSNKPIDTAVTATSVNKNIVVTDSY 900 Query: 901 RDKILLKKAYCTFLQSFTNNSVTSILLSDINRAILPVILNDLVTYTPQEIQETSMMKVSL 960 RDKILLKKAYCTFLQSFTNNSVTSILLSDINRAILPVILNDLVTYTPQEIQETSMMKVSL Sbjct: 901 RDKILLKKAYCTFLQSFTNNSVTSILLSDINRAILPVILNDLVTYTPQEIQETSMMKVSL 960 Query: 961 NVLCNFIKCFGNGTCLDNDDINKDPNLKIDGLNEYFIMKCVPIIFEIPFNPIYKFNIKEG 1020 NVLCNFIKCFGNGTCLDNDDINKDPNLKIDGLNEYFIMKCVPIIFEIPFNPIYKFNIKEG Sbjct: 961 NVLCNFIKCFGNGTCLDNDDINKDPNLKIDGLNEYFIMKCVPIIFEIPFNPIYKFNIKEG 1020 Query: 1021 NFKTMAYDLARLLRELFIVSSNPTTNENECVKYLTQIYLPQIQLPQELTIQLVNMLTTMG 1080 NFKTMAYDLARLLRELFIVSSNPTTNENECVKYLTQIYLPQIQLPQELTIQLVNMLTTMG Sbjct: 1021 NFKTMAYDLARLLRELFIVSSNPTTNENECVKYLTQIYLPQIQLPQELTIQLVNMLTTMG 1080 Query: 1081 QKQFEKWFVDNFISVLKQGQ 1100 QKQFEKWFVDNFISVLKQGQ Sbjct: 1081 QKQFEKWFVDNFISVLKQGQ 1100 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 20.8 bits (42), Expect = 8.6 Identities = 9/22 (40%), Positives = 14/22 (62%) Query: 455 QEIIVVIDPSLFLNNISQEISA 476 QE+ DPS LNN+ E+++ Sbjct: 218 QELPPEEDPSRELNNVQHEVNS 239 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P33418|LOS1_YEAST LOS1 protein. 2108 0.0 Sequences not found previously or not previously below threshold: swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 22 3.4 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 21 6.8 CONVERGED! >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 2108 bits (5462), Expect = 0.0 Identities = 1100/1100 (100%), Positives = 1100/1100 (100%) Query: 1 MLERIQQLVNAVNDPRSDVATKRQAIELLNGIKSSENALEIFISLVINENSNDLLKFYGL 60 MLERIQQLVNAVNDPRSDVATKRQAIELLNGIKSSENALEIFISLVINENSNDLLKFYGL Sbjct: 1 MLERIQQLVNAVNDPRSDVATKRQAIELLNGIKSSENALEIFISLVINENSNDLLKFYGL 60 Query: 61 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPDFLMNKISEVLTTLFMLM 120 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPDFLMNKISEVLTTLFMLM Sbjct: 61 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPDFLMNKISEVLTTLFMLM 120 Query: 121 YSDCNGNQWNSFFDDLMSLFQVDSAISNTSPSTDGNILLGLEFFNKLCLMINSEIADQSF 180 YSDCNGNQWNSFFDDLMSLFQVDSAISNTSPSTDGNILLGLEFFNKLCLMINSEIADQSF Sbjct: 121 YSDCNGNQWNSFFDDLMSLFQVDSAISNTSPSTDGNILLGLEFFNKLCLMINSEIADQSF 180 Query: 181 IRSKESQLKNNNIKDWMRDNDIMKLSNVWFQCLKLDEQIVSQCPGLINSTLDCIGSFISW 240 IRSKESQLKNNNIKDWMRDNDIMKLSNVWFQCLKLDEQIVSQCPGLINSTLDCIGSFISW Sbjct: 181 IRSKESQLKNNNIKDWMRDNDIMKLSNVWFQCLKLDEQIVSQCPGLINSTLDCIGSFISW 240 Query: 241 IDINLIIDANNYYLQLIYKFLNLKETKISCYNCILAIISKKMKPMDKLAFLNMINLTNEL 300 IDINLIIDANNYYLQLIYKFLNLKETKISCYNCILAIISKKMKPMDKLAFLNMINLTNEL Sbjct: 241 IDINLIIDANNYYLQLIYKFLNLKETKISCYNCILAIISKKMKPMDKLAFLNMINLTNEL 300 Query: 301 TYYHQAISMNPQIITFDNLEVWESLTKLITSFGIEFTIIIEQVNDDQKLDTLYKQSVISN 360 TYYHQAISMNPQIITFDNLEVWESLTKLITSFGIEFTIIIEQVNDDQKLDTLYKQSVISN Sbjct: 301 TYYHQAISMNPQIITFDNLEVWESLTKLITSFGIEFTIIIEQVNDDQKLDTLYKQSVISN 360 Query: 361 VDSILLEKIIPILLEFMNNEFDSITAKTFPFWSNYLAFLKKYKASSPNFVPLHKDFLDNF 420 VDSILLEKIIPILLEFMNNEFDSITAKTFPFWSNYLAFLKKYKASSPNFVPLHKDFLDNF Sbjct: 361 VDSILLEKIIPILLEFMNNEFDSITAKTFPFWSNYLAFLKKYKASSPNFVPLHKDFLDNF 420 Query: 421 QQICFKRMKFSDDEVTQDDFEEFNETVRFKLKNFQEIIVVIDPSLFLNNISQEISANLMN 480 QQICFKRMKFSDDEVTQDDFEEFNETVRFKLKNFQEIIVVIDPSLFLNNISQEISANLMN Sbjct: 421 QQICFKRMKFSDDEVTQDDFEEFNETVRFKLKNFQEIIVVIDPSLFLNNISQEISANLMN 480 Query: 481 CKNESWQIFELTIYQIFNLSECTKNNYFGLNKNEIMTSQPSLTLVRFLNELLMMKDFLLA 540 CKNESWQIFELTIYQIFNLSECTKNNYFGLNKNEIMTSQPSLTLVRFLNELLMMKDFLLA Sbjct: 481 CKNESWQIFELTIYQIFNLSECTKNNYFGLNKNEIMTSQPSLTLVRFLNELLMMKDFLLA 540 Query: 541 IDNEQIQILFMELIVKNYNFIFSTSANTANATDDDEKYLLILNIFMSSFAMFNKRENVRL 600 IDNEQIQILFMELIVKNYNFIFSTSANTANATDDDEKYLLILNIFMSSFAMFNKRENVRL Sbjct: 541 IDNEQIQILFMELIVKNYNFIFSTSANTANATDDDEKYLLILNIFMSSFAMFNKRENVRL 600 Query: 601 RSWYLFTRFLKLTRINLKKILFANKNLVNEITNKISPLLHIKVTSINAQGTDDNDTIFDN 660 RSWYLFTRFLKLTRINLKKILFANKNLVNEITNKISPLLHIKVTSINAQGTDDNDTIFDN Sbjct: 601 RSWYLFTRFLKLTRINLKKILFANKNLVNEITNKISPLLHIKVTSINAQGTDDNDTIFDN 660 Query: 661 QLYIFEGIGFIITLNNSSQELTAATANTPIDYDILDQILTPLFTQLEGCITQGASPVVIL 720 QLYIFEGIGFIITLNNSSQELTAATANTPIDYDILDQILTPLFTQLEGCITQGASPVVIL Sbjct: 661 QLYIFEGIGFIITLNNSSQELTAATANTPIDYDILDQILTPLFTQLEGCITQGASPVVIL 720 Query: 721 ECHHILMAIGTLARGLHIGLVPENQVNNMVVNKKLINDSLIHKFSNIAEVILVTFSFFNK 780 ECHHILMAIGTLARGLHIGLVPENQVNNMVVNKKLINDSLIHKFSNIAEVILVTFSFFNK Sbjct: 721 ECHHILMAIGTLARGLHIGLVPENQVNNMVVNKKLINDSLIHKFSNIAEVILVTFSFFNK 780 Query: 781 FENIRDASRFTFARLIPILSNKILPFINKLIELILSSTDLKSWEMIDFLGFLSQLIHMFH 840 FENIRDASRFTFARLIPILSNKILPFINKLIELILSSTDLKSWEMIDFLGFLSQLIHMFH Sbjct: 781 FENIRDASRFTFARLIPILSNKILPFINKLIELILSSTDLKSWEMIDFLGFLSQLIHMFH 840 Query: 841 TDTDCYQLFNQLLTPLINKVHSIIEEIDEQHDQQSSSNKPIDTAVTATSVNKNIVVTDSY 900 TDTDCYQLFNQLLTPLINKVHSIIEEIDEQHDQQSSSNKPIDTAVTATSVNKNIVVTDSY Sbjct: 841 TDTDCYQLFNQLLTPLINKVHSIIEEIDEQHDQQSSSNKPIDTAVTATSVNKNIVVTDSY 900 Query: 901 RDKILLKKAYCTFLQSFTNNSVTSILLSDINRAILPVILNDLVTYTPQEIQETSMMKVSL 960 RDKILLKKAYCTFLQSFTNNSVTSILLSDINRAILPVILNDLVTYTPQEIQETSMMKVSL Sbjct: 901 RDKILLKKAYCTFLQSFTNNSVTSILLSDINRAILPVILNDLVTYTPQEIQETSMMKVSL 960 Query: 961 NVLCNFIKCFGNGTCLDNDDINKDPNLKIDGLNEYFIMKCVPIIFEIPFNPIYKFNIKEG 1020 NVLCNFIKCFGNGTCLDNDDINKDPNLKIDGLNEYFIMKCVPIIFEIPFNPIYKFNIKEG Sbjct: 961 NVLCNFIKCFGNGTCLDNDDINKDPNLKIDGLNEYFIMKCVPIIFEIPFNPIYKFNIKEG 1020 Query: 1021 NFKTMAYDLARLLRELFIVSSNPTTNENECVKYLTQIYLPQIQLPQELTIQLVNMLTTMG 1080 NFKTMAYDLARLLRELFIVSSNPTTNENECVKYLTQIYLPQIQLPQELTIQLVNMLTTMG Sbjct: 1021 NFKTMAYDLARLLRELFIVSSNPTTNENECVKYLTQIYLPQIQLPQELTIQLVNMLTTMG 1080 Query: 1081 QKQFEKWFVDNFISVLKQGQ 1100 QKQFEKWFVDNFISVLKQGQ Sbjct: 1081 QKQFEKWFVDNFISVLKQGQ 1100 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 21.9 bits (45), Expect = 3.4 Identities = 20/89 (22%), Positives = 35/89 (38%), Gaps = 19/89 (21%) Query: 61 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPD----------FLM--NK 108 S ++ + G++A +N ++ E + W + L N+ P FL+ K Sbjct: 249 SEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKK 308 Query: 109 ISEVLT-------TLFMLMYSDCNGNQWN 130 I + T LF + +C G WN Sbjct: 309 ILQTFTKQKDTARALFYINLKECLGTSWN 337 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 21.1 bits (43), Expect = 6.8 Identities = 9/22 (40%), Positives = 14/22 (62%) Query: 455 QEIIVVIDPSLFLNNISQEISA 476 QE+ DPS LNN+ E+++ Sbjct: 218 QELPPEEDPSRELNNVQHEVNS 239 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.322 0.138 0.397 Lambda K H 0.267 0.0423 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 72,719 Number of Sequences: 43 Number of extensions: 3101 Number of successful extensions: 19 Number of sequences better than 10.0: 7 Number of HSP's better than 10.0 without gapping: 5 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 6 Number of HSP's gapped (non-prelim): 7 length of query: 1100 length of database: 17,163 effective HSP length: 65 effective length of query: 1035 effective length of database: 14,368 effective search space: 14870880 effective search space used: 14870880 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 42 (20.7 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P14733|LAM1_MOUSE Lamin B1. (587 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P14733|LAM1_MOUSE Lamin B1. 865 0.0 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 28 0.018 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 28 0.037 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 20 9.6 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 865 bits (2235), Expect = 0.0 Identities = 587/587 (100%), Positives = 587/587 (100%) Query: 1 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL 60 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL Sbjct: 1 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL 60 Query: 61 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN 120 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN Sbjct: 61 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN 120 Query: 121 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA 180 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA Sbjct: 121 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA 180 Query: 181 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY 240 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY Sbjct: 181 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY 240 Query: 241 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI 300 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI Sbjct: 241 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI 300 Query: 301 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY 360 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY Sbjct: 301 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY 360 Query: 361 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD 420 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD Sbjct: 361 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD 420 Query: 421 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV 480 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV Sbjct: 421 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV 480 Query: 481 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA 540 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA Sbjct: 481 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA 540 Query: 541 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM 587 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM Sbjct: 541 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM 587 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 28.5 bits (62), Expect = 0.018 Identities = 59/321 (18%), Positives = 130/321 (40%), Gaps = 30/321 (9%) Query: 33 KEELRELNDRLAVYIDKVRSLETE---NSALQLQVTEREEVRGRELTGLKALYE------ 83 +EEL+E D + V +K+ L TE A T + + E+ L ++Y+ Sbjct: 242 QEELKEKLDLIQVLQEKITLLTTEIKDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQL 301 Query: 84 ----TELADARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYE 139 +E+ + + + RE + + LL+ + + +L Q + E++ Sbjct: 302 MNLTSEIKELKVEVQKRERELELKRESEDNLNVRLNSLLVERDESKKELDAIQKEYSEFK 361 Query: 140 AALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQ 199 + K A+ A LG+++ ++ QLE L A ++ +L DL + Sbjct: 362 SISEKKVASDAKLLGEQEK----------RLHQLEEQLGTASDEVRKNNVLIADLTQEKE 411 Query: 200 SLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRL 259 +L L+ + E + ET I + + ++ L E +++++ Sbjct: 412 NLRRMLDAELENISKLKLEVQVTQETLEKSRSDASDIAQQLQQSRHLCSKLEAEVSKLQM 471 Query: 260 YKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLS-------NLQK 312 EE + ++ + +E+ + + + RE L ++ + ++S +L+ NLQ Sbjct: 472 ELEETRTSLRRNIDETKRGAELLAAELTTTRELLKKTNEEMHTMSHELAAVTENCDNLQT 531 Query: 313 ESRACLERIQELEDMLAKERD 333 E ++ + D L +E++ Sbjct: 532 ELVDVYKKAERAADELKQEKN 552 Score = 24.6 bits (52), Expect = 0.29 Identities = 59/314 (18%), Positives = 125/314 (39%), Gaps = 47/314 (14%) Query: 28 SRLQEKE-ELRELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETEL 86 S+L EKE E+ +L+ D++ +L +E L+++V +RE Sbjct: 278 SKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKRE------------------ 319 Query: 87 ADARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEAALNSKD 146 R L+ L + L E D+ + +L Q + E+++ K Sbjct: 320 ----RELELKRESEDNLNVRLNSLLVERDE-------SKKELDAIQKEYSEFKSISEKKV 368 Query: 147 AALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQSLTEDLE 206 A+ A LG+++ ++ QLE L A ++ +L DL ++L L+ Sbjct: 369 ASDAKLLGEQEK----------RLHQLEEQLGTASDEVRKNNVLIADLTQEKENLRRMLD 418 Query: 207 FRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQ 266 + E + ET I + + ++ L E +++++ EE Sbjct: 419 AELENISKLKLEVQVTQETLEKSRSDASDIAQQLQQSRHLCSKLEAEVSKLQMELEETRT 478 Query: 267 TYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLS-------NLQKESRACLE 319 + ++ + +E+ + + + RE L ++ + ++S +L+ NLQ E + Sbjct: 479 SLRRNIDETKRGAELLAAELTTTRELLKKTNEEMHTMSHELAAVTENCDNLQTELVDVYK 538 Query: 320 RIQELEDMLAKERD 333 + + D L +E++ Sbjct: 539 KAERAADELKQEKN 552 Score = 23.9 bits (50), Expect = 0.57 Identities = 52/247 (21%), Positives = 110/247 (44%), Gaps = 37/247 (14%) Query: 150 ATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRC--------QSL 201 A A +K + +E +K+++ + EA+ + +K+ E L + D+ N Q+L Sbjct: 133 ALARNEKAVSDATIESMKNKLKEKEATFVSMEKKFQSELLNERDIRNNQLKRAGEERQAL 192 Query: 202 TEDLEFRKNMYE---EEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVR 258 L K+ +E+ + +R E +V+++ L L +M+E Sbjct: 193 VNQLNSAKSTVTNLGQELQKEKRIAEELIVQIEG---------LQNNLMQMKEDKKKLQE 243 Query: 259 LYKEELEQTYHAKLENARLSSEM--NTSTVNSAREELMESRMRIESLSS-------QLSN 309 KE+L+ + + L++E+ +++ S +L E ++ LSS QL N Sbjct: 244 ELKEKLDLIQVLQEKITLLTTEIKDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQLMN 303 Query: 310 LQKESRACLERIQELEDMLAKERDNS-------RRMLSDREREMAEIRDQMQQQLSDYEQ 362 L E + +Q+ E L +R++ +L +R+ E+ D +Q++ S+++ Sbjct: 304 LTSEIKELKVEVQKRERELELKRESEDNLNVRLNSLLVERDESKKEL-DAIQKEYSEFKS 362 Query: 363 LLDVKLA 369 + + K+A Sbjct: 363 ISEKKVA 369 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 27.7 bits (60), Expect = 0.037 Identities = 75/341 (21%), Positives = 147/341 (42%), Gaps = 37/341 (10%) Query: 32 EKEELR----ELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETELA 87 EKE LR EL L DK + +++ L+ +E EL+ L+ + EL Sbjct: 481 EKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETS-----ELSTLEMKLKEELD 535 Query: 88 DARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEA------- 140 D R + E +L++E KF+AE + + + K+E A+ R+ EA Sbjct: 536 DLRAQKLEMLAEADRLKVEKAKFEAEWEHIDV---KREELRKEAEYITRQREAFSMYLKD 592 Query: 141 ----ALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLEN 196 +DA D +SL + E+ +++ + + + ++ + LL ++++ Sbjct: 593 ERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERADFLLGIEMQK 652 Query: 197 RCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQ 256 R +LE+ EE+ + R E + + + E + Q+L EM E+ Sbjct: 653 R------ELEYCIENKREELENSSRDREKAFEQ-----EKKLEEERIQSLKEMAEKELEH 701 Query: 257 VRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRA 316 V++ + L+ +++ R E + + + EEL R ++E+ L + E R Sbjct: 702 VQVELKRLDAE-RLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAERDEIRH 760 Query: 317 CLERIQELEDMLAKERDNS--RRMLSDREREMAEIRDQMQQ 355 +E +++LE++ D S + LS+ ER ++ Q+ Sbjct: 761 EIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQK 801 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 19.6 bits (39), Expect = 9.6 Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 351 DQMQQQLSDYEQLLDVKLA 369 D + Q EQL+D+K+A Sbjct: 40 DPINNQPLSEEQLIDIKVA 58 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P14733|LAM1_MOUSE Lamin B1. 973 0.0 Sequences not found previously or not previously below threshold: swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 37 6e-05 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 37 6e-05 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 21 2.9 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 21 5.2 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 973 bits (2514), Expect = 0.0 Identities = 587/587 (100%), Positives = 587/587 (100%) Query: 1 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL 60 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL Sbjct: 1 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL 60 Query: 61 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN 120 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN Sbjct: 61 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN 120 Query: 121 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA 180 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA Sbjct: 121 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA 180 Query: 181 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY 240 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY Sbjct: 181 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY 240 Query: 241 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI 300 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI Sbjct: 241 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI 300 Query: 301 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY 360 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY Sbjct: 301 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY 360 Query: 361 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD 420 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD Sbjct: 361 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD 420 Query: 421 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV 480 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV Sbjct: 421 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV 480 Query: 481 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA 540 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA Sbjct: 481 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA 540 Query: 541 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM 587 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM Sbjct: 541 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM 587 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 36.7 bits (83), Expect = 6e-05 Identities = 79/405 (19%), Positives = 172/405 (41%), Gaps = 53/405 (13%) Query: 33 KEELRELNDRLAVYIDKVRSLETE----NSALQLQVTEREEVRGRELTGLKALYETELAD 88 +EEL+E D + V +K+ L TE ++LQ ++ E + E+ L ++Y+ + Sbjct: 242 QEELKEKLDLIQVLQEKITLLTTEIKDKEASLQSTTSKLAE-KESEVDKLSSMYQ----E 296 Query: 89 ARRALDDTARERAKLQIELGKFKAEHD--------------QLLLNYAKKESDLSGAQIK 134 ++ L + E +L++E+ K + E + LL+ + + +L Q + Sbjct: 297 SQDQLMNLTSEIKELKVEVQKRERELELKRESEDNLNVRLNSLLVERDESKKELDAIQKE 356 Query: 135 LREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDL 194 E+++ K A+ A LG+++ ++ QLE L A ++ +L DL Sbjct: 357 YSEFKSISEKKVASDAKLLGEQEK----------RLHQLEEQLGTASDEVRKNNVLIADL 406 Query: 195 ENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHD 254 ++L L+ + E + ET I + + ++ L E Sbjct: 407 TQEKENLRRMLDAELENISKLKLEVQVTQETLEKSRSDASDIAQQLQQSRHLCSKLEAEV 466 Query: 255 AQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLS------ 308 +++++ EE + ++ + +E+ + + + RE L ++ + ++S +L+ Sbjct: 467 SKLQMELEETRTSLRRNIDETKRGAELLAAELTTTRELLKKTNEEMHTMSHELAAVTENC 526 Query: 309 -NLQKESRACLERIQELEDMLAKER------DNSRRMLSDREREMAEIRDQMQQQLSDYE 361 NLQ E ++ + D L +E+ + L + E R ++++L Sbjct: 527 DNLQTELVDVYKKAERAADELKQEKNIVVTLEKELTFLEAQITREKESRKNLEEELERAT 586 Query: 362 QLLD------VKLALDMEIS-AYRKLLEGEEERLKLSPSPSSRVT 399 + LD LA ++E++ ++ LE E E L+ S S +++ Sbjct: 587 ESLDEMNRNAFALAKELELANSHISSLEDEREVLQKSVSEQKQIS 631 Score = 34.4 bits (77), Expect = 3e-04 Identities = 95/393 (24%), Positives = 176/393 (44%), Gaps = 38/393 (9%) Query: 28 SRLQEKE-ELRELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETEL 86 S+L EKE E+ +L+ D++ +L +E L+++V +RE REL LK E L Sbjct: 278 SKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKRE----RELE-LKRESEDNL 332 Query: 87 ADARRAL----DDTARERAKLQIELGKFKAEHDQLLLNYAK----KESDLSGAQIKLREY 138 +L D++ +E +Q E +FK+ ++ + + AK +E L + +L Sbjct: 333 NVRLNSLLVERDESKKELDAIQKEYSEFKSISEKKVASDAKLLGEQEKRLHQLEEQLGTA 392 Query: 139 EAALNSKDAALATALGDKKS----LEGDLED-----LKDQIAQ--LEASLSAAK---KQL 184 + + +A +K++ L+ +LE+ L+ Q+ Q LE S S A +QL Sbjct: 393 SDEVRKNNVLIADLTQEKENLRRMLDAELENISKLKLEVQVTQETLEKSRSDASDIAQQL 452 Query: 185 ADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQI--EYEYKL 242 L LE L +LE + I+ET+R E E+ + R++ + ++ Sbjct: 453 QQSRHLCSKLEAEVSKLQMELEETRTSLRRNIDETKRGAELLAAELTTTRELLKKTNEEM 512 Query: 243 AQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIES 302 HE+ + L E ++ A+ L E N +E+++ E Sbjct: 513 HTMSHELAAVTENCDNLQTELVDVYKKAERAADELKQEKNIVVTLEKELTFLEAQITREK 572 Query: 303 LSSQLSNLQKESRACLERIQELED---MLAKERDNSRRMLS--DREREMAEIRDQMQQQL 357 S + NL++E E + E+ LAKE + + +S + ERE+ + Q+Q+ Sbjct: 573 ESRK--NLEEELERATESLDEMNRNAFALAKELELANSHISSLEDEREVLQKSVSEQKQI 630 Query: 358 S-DYEQLLDVKLALDMEISAYRKLLEGEEERLK 389 S + + L+ +L M++ R+ LE ++L+ Sbjct: 631 SQESRENLEDAHSLVMKLGKERESLEKRAKKLE 663 Score = 32.1 bits (71), Expect = 0.002 Identities = 52/247 (21%), Positives = 110/247 (44%), Gaps = 37/247 (14%) Query: 150 ATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRC--------QSL 201 A A +K + +E +K+++ + EA+ + +K+ E L + D+ N Q+L Sbjct: 133 ALARNEKAVSDATIESMKNKLKEKEATFVSMEKKFQSELLNERDIRNNQLKRAGEERQAL 192 Query: 202 TEDLEFRKNMYE---EEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVR 258 L K+ +E+ + +R E +V+++ L L +M+E Sbjct: 193 VNQLNSAKSTVTNLGQELQKEKRIAEELIVQIEG---------LQNNLMQMKEDKKKLQE 243 Query: 259 LYKEELEQTYHAKLENARLSSEM--NTSTVNSAREELMESRMRIESLSS-------QLSN 309 KE+L+ + + L++E+ +++ S +L E ++ LSS QL N Sbjct: 244 ELKEKLDLIQVLQEKITLLTTEIKDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQLMN 303 Query: 310 LQKESRACLERIQELEDMLAKERDNS-------RRMLSDREREMAEIRDQMQQQLSDYEQ 362 L E + +Q+ E L +R++ +L +R+ E+ D +Q++ S+++ Sbjct: 304 LTSEIKELKVEVQKRERELELKRESEDNLNVRLNSLLVERDESKKEL-DAIQKEYSEFKS 362 Query: 363 LLDVKLA 369 + + K+A Sbjct: 363 ISEKKVA 369 Score = 24.4 bits (51), Expect = 0.34 Identities = 39/192 (20%), Positives = 83/192 (42%), Gaps = 17/192 (8%) Query: 264 LEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSN----LQKESRACLE 319 +E+ + ++L N R ++ + + A EE ++ S S ++N LQKE R E Sbjct: 163 MEKKFQSELLNER---DIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQKEKRIAEE 219 Query: 320 RIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDYEQLLDVKLA-LDMEISAYR 378 I ++E + ++N +M D+++ E+++++ D Q+L K+ L EI Sbjct: 220 LIVQIEGL----QNNLMQMKEDKKKLQEELKEKL-----DLIQVLQEKITLLTTEIKDKE 270 Query: 379 KLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVDVEESEASSSVSISHSASA 438 L+ +L S +++ S + + K + VE + + + + Sbjct: 271 ASLQSTTSKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKRERELELKRESED 330 Query: 439 TGNVCIEEIDVD 450 NV + + V+ Sbjct: 331 NLNVRLNSLLVE 342 Score = 21.3 bits (43), Expect = 3.5 Identities = 39/173 (22%), Positives = 70/173 (39%), Gaps = 6/173 (3%) Query: 19 ATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGL 78 A L+ TR + EE+ ++ LA + +L+TE + + ER ++ + Sbjct: 495 AAELTTTRELLKKTNEEMHTMSHELAAVTENCDNLQTELVDV-YKKAERAADELKQEKNI 553 Query: 79 KALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN--YAKKESDLSGAQIKLR 136 E EL + R L+ EL + D++ N KE +L+ + I Sbjct: 554 VVTLEKELTFLEAQITREKESRKNLEEELERATESLDEMNRNAFALAKELELANSHISSL 613 Query: 137 EYEAALNSKDAALATALG--DKKSLEGDLEDLKDQIAQLEASLSAAKKQLADE 187 E E + K + + +++LE D L ++ + SL K+L DE Sbjct: 614 EDEREVLQKSVSEQKQISQESRENLE-DAHSLVMKLGKERESLEKRAKKLEDE 665 Score = 20.1 bits (40), Expect = 6.9 Identities = 33/116 (28%), Positives = 47/116 (40%), Gaps = 18/116 (15%) Query: 254 DAQVRL--YKEELEQTYHAKLENARLS---------SEMNTSTVNSAREELMESRMRIES 302 D+QV+ KEE EQT EN S S + S AR E S IES Sbjct: 89 DSQVKAQPQKEETEQTIQGNAENPFFSLLNGLGVFGSGVLGSLYALARNEKAVSDATIES 148 Query: 303 LSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLS 358 + ++L KE A +E E N R + +++ + E R + QL+ Sbjct: 149 MKNKL----KEKEATF---VSMEKKFQSELLNERDIRNNQLKRAGEERQALVNQLN 197 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 36.7 bits (83), Expect = 6e-05 Identities = 81/343 (23%), Positives = 151/343 (43%), Gaps = 41/343 (11%) Query: 32 EKEELR----ELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETELA 87 EKE LR EL L DK + +++ L+ +E EL+ L+ + EL Sbjct: 481 EKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETS-----ELSTLEMKLKEELD 535 Query: 88 DARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEA------- 140 D R + E +L++E KF+AE + + + K+E A+ R+ EA Sbjct: 536 DLRAQKLEMLAEADRLKVEKAKFEAEWEHIDV---KREELRKEAEYITRQREAFSMYLKD 592 Query: 141 ----ALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEAS-LSAAKKQLADETLLKVDLE 195 +DA D +SL + E+ +++ + + LS +++ AD LL ++++ Sbjct: 593 ERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERAD-FLLGIEMQ 651 Query: 196 NRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDA 255 R +LE+ EE+ + R E + + + E + Q+L EM E+ Sbjct: 652 KR------ELEYCIENKREELENSSRDREKAFEQ-----EKKLEEERIQSLKEMAEKELE 700 Query: 256 QVRLYKEELE-QTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKES 314 V++ + L+ + KL+ R E + + + EEL R ++E+ L + E Sbjct: 701 HVQVELKRLDAERLEIKLDRERREREW--AELKDSVEELKVQREKLETQRHMLRAERDEI 758 Query: 315 RACLERIQELEDMLAKERDNS--RRMLSDREREMAEIRDQMQQ 355 R +E +++LE++ D S + LS+ ER ++ Q+ Sbjct: 759 RHEIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQK 801 Score = 26.3 bits (56), Expect = 0.091 Identities = 81/377 (21%), Positives = 160/377 (41%), Gaps = 56/377 (14%) Query: 35 ELRELN--DRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETELADARRA 92 ELRE++ R + +K LE ++ AL + + E K L TE R+ Sbjct: 416 ELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRKT 475 Query: 93 --LDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEAALNSKDAALA 150 L+D KL +EL Q L + K + A KL AL S+ + L+ Sbjct: 476 TMLEDEKERLRKLDLEL-------QQSLTSLEDKRKRVDSATQKLE----ALKSETSELS 524 Query: 151 TALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQSLTEDLEF--- 207 T + L+ +L+DL+ Q ++ A K + A ++ + + L ++ E+ Sbjct: 525 TL---EMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEAEYITR 581 Query: 208 -----------RKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHE-----MRE 251 ++ +EE + R +H+ + ++ R+ E+ K+ + E RE Sbjct: 582 QREAFSMYLKDERDNIKEERDALRNQHKNDVESLNRERE-EFMNKMVEEHSEWLSKIQRE 640 Query: 252 QHD--AQVRLYKEELE---QTYHAKLENA------------RLSSEMNTSTVNSAREELM 294 + D + + K ELE + +LEN+ +L E S A +EL Sbjct: 641 RADFLLGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKELE 700 Query: 295 ESRMRIESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQ 354 ++ ++ L ++ ++ + EL+D + + + ++ + R AE RD+++ Sbjct: 701 HVQVELKRLDAERLEIKLDRERREREWAELKDSVEELKVQREKLETQRHMLRAE-RDEIR 759 Query: 355 QQLSDYEQLLDVKLALD 371 ++ + ++L ++K+ALD Sbjct: 760 HEIEELKKLENLKVALD 776 Score = 25.1 bits (53), Expect = 0.23 Identities = 52/248 (20%), Positives = 101/248 (39%), Gaps = 23/248 (9%) Query: 101 AKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEAALNSKDAALATALGDKK--- 157 AKL+ E+ ++ LLL + S + + E + + +A +AL + K Sbjct: 67 AKLESEVYDYQHNMGLLLLEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKKRE 126 Query: 158 -SLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQSLTEDLEFR-------- 208 SL+ D+ K+ I+ LE +L + + A+ + + + ED + Sbjct: 127 ESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDALKKLADAEAKM 186 Query: 209 --KNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDA-----QVRLYK 261 + E N R E +L EV+S R+ + +LA E + + Q + Sbjct: 187 RAAEALQAEANRYHRIAERKLKEVES-REDDLTRRLASFKSECETKENEMVIERQTLNER 245 Query: 262 EELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRACLERI 321 + Q H +L +A++S + + +EL E + L + + ++E +A ++ Sbjct: 246 RKSLQQEHERLLDAQVSLNQREDHIFARSQELAELE---KGLDTAKTTFEEERKAFEDKK 302 Query: 322 QELEDMLA 329 LE LA Sbjct: 303 SNLEIALA 310 Score = 24.7 bits (52), Expect = 0.26 Identities = 92/408 (22%), Positives = 166/408 (40%), Gaps = 54/408 (13%) Query: 28 SRLQEKEELRE-LNDRLAVYIDKVRSLETENSALQLQV---TEREEV---RGRELTGLKA 80 +R QE EL + L+ + ++ ++ E + S L++ + +REEV L L Sbjct: 272 ARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEVCFYSHNSLLFLVL 331 Query: 81 LYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESD-----LSGAQIKL 135 Y + + + RE + L+ E AE A KES+ L+ ++ L Sbjct: 332 HYRSSKKFLGDKIAVSERESSLLKKEQELLVAEE-----KIASKESELIQNVLANQEVIL 386 Query: 136 R----EYEAALNSKDAALATALGDKKSL----EGDLEDLKDQIAQLEASLSAAKKQLA-- 185 R + EA L K ++ + K+ E D++ +D + + E L + LA Sbjct: 387 RKRKSDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEK 446 Query: 186 --DETLLKVDLENRCQSLT---EDLEFRKNMYEEEINETRR----------KHETRLVEV 230 D T +L+ + ++L ED+ + M E+E R+ E + V Sbjct: 447 EKDITEKSFNLDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRV 506 Query: 231 DSGRQIEYEYK-----LAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTST 285 DS Q K L+ +++E+ D E L + K+E A+ +E Sbjct: 507 DSATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEW--EH 564 Query: 286 VNSAREELMESRMRIESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLS-DRER 344 ++ REEL + I S K+ R + I+E D L + N L+ +RE Sbjct: 565 IDVKREELRKEAEYITRQREAFSMYLKDER---DNIKEERDALRNQHKNDVESLNREREE 621 Query: 345 EMAEIRDQMQQQLSDYE-QLLDVKLALDMEISAYRKLLEGEEERLKLS 391 M ++ ++ + LS + + D L ++M+ +E + E L+ S Sbjct: 622 FMNKMVEEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREELENS 669 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 21.3 bits (43), Expect = 2.9 Identities = 11/32 (34%), Positives = 13/32 (40%) Query: 195 ENRCQSLTEDLEFRKNMYEEEINETRRKHETR 226 ENR Q +LE R + NE TR Sbjct: 220 ENRLQQKKSELEGRTEVNHANTNEEVPSRRTR 251 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 20.5 bits (41), Expect = 5.2 Identities = 8/19 (42%), Positives = 12/19 (63%) Query: 351 DQMQQQLSDYEQLLDVKLA 369 D + Q EQL+D+K+A Sbjct: 40 DPINNQPLSEEQLIDIKVA 58 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P14733|LAM1_MOUSE Lamin B1. 711 0.0 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 283 4e-79 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 250 2e-69 Sequences not found previously or not previously below threshold: trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 27 0.075 swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 23 1.0 CONVERGED! >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 711 bits (1835), Expect = 0.0 Identities = 587/587 (100%), Positives = 587/587 (100%) Query: 1 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL 60 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL Sbjct: 1 ATATPVQQQRAGSRASAPATPLSPTRLSRLQEKEELRELNDRLAVYIDKVRSLETENSAL 60 Query: 61 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN 120 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN Sbjct: 61 QLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLN 120 Query: 121 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA 180 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA Sbjct: 121 YAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAA 180 Query: 181 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY 240 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY Sbjct: 181 KKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEY 240 Query: 241 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI 300 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI Sbjct: 241 KLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRI 300 Query: 301 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY 360 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY Sbjct: 301 ESLSSQLSNLQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDY 360 Query: 361 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD 420 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD Sbjct: 361 EQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVD 420 Query: 421 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV 480 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV Sbjct: 421 VEESEASSSVSISHSASATGNVCIEEIDVDGKFIRLKNTSEQDQPMGGWEMIRKIGDTSV 480 Query: 481 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA 540 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA Sbjct: 481 SYKYTSRYVLKAGQTVTVWAANAGVTASPPTDLIWKNQNSWGTGEDVKVILKNSQGEEVA 540 Query: 541 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM 587 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM Sbjct: 541 QRSTVFKTTIPEEEEEEEEEPIGVAVEEERFHQQGAPRASNKSCAIM 587 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 283 bits (724), Expect = 4e-79 Identities = 79/405 (19%), Positives = 172/405 (41%), Gaps = 53/405 (13%) Query: 33 KEELRELNDRLAVYIDKVRSLETE----NSALQLQVTEREEVRGRELTGLKALYETELAD 88 +EEL+E D + V +K+ L TE ++LQ ++ E + E+ L ++Y+ + Sbjct: 242 QEELKEKLDLIQVLQEKITLLTTEIKDKEASLQSTTSKLAE-KESEVDKLSSMYQ----E 296 Query: 89 ARRALDDTARERAKLQIELGKFKAEHD--------------QLLLNYAKKESDLSGAQIK 134 ++ L + E +L++E+ K + E + LL+ + + +L Q + Sbjct: 297 SQDQLMNLTSEIKELKVEVQKRERELELKRESEDNLNVRLNSLLVERDESKKELDAIQKE 356 Query: 135 LREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDL 194 E+++ K A+ A LG+++ ++ QLE L A ++ +L DL Sbjct: 357 YSEFKSISEKKVASDAKLLGEQEK----------RLHQLEEQLGTASDEVRKNNVLIADL 406 Query: 195 ENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHD 254 ++L L+ + E + ET I + + ++ L E Sbjct: 407 TQEKENLRRMLDAELENISKLKLEVQVTQETLEKSRSDASDIAQQLQQSRHLCSKLEAEV 466 Query: 255 AQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLS------ 308 +++++ EE + ++ + +E+ + + + RE L ++ + ++S +L+ Sbjct: 467 SKLQMELEETRTSLRRNIDETKRGAELLAAELTTTRELLKKTNEEMHTMSHELAAVTENC 526 Query: 309 -NLQKESRACLERIQELEDMLAKER------DNSRRMLSDREREMAEIRDQMQQQLSDYE 361 NLQ E ++ + D L +E+ + L + E R ++++L Sbjct: 527 DNLQTELVDVYKKAERAADELKQEKNIVVTLEKELTFLEAQITREKESRKNLEEELERAT 586 Query: 362 QLLD------VKLALDMEIS-AYRKLLEGEEERLKLSPSPSSRVT 399 + LD LA ++E++ ++ LE E E L+ S S +++ Sbjct: 587 ESLDEMNRNAFALAKELELANSHISSLEDEREVLQKSVSEQKQIS 631 Score = 35.4 bits (80), Expect = 1e-04 Identities = 51/350 (14%), Positives = 123/350 (34%), Gaps = 14/350 (4%) Query: 58 SALQLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKAEHDQL 117 + ++ + +E ++ ++ +++EL + R ++ + + E + + Sbjct: 144 ATIESMKNKLKEKEATFVS-MEKKFQSELLNERDIRNNQLKRAGE---ERQALVNQLNSA 199 Query: 118 LLNYAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASL 177 +L + E + L DKK L+ +L++ D I L+ + Sbjct: 200 KSTVTNLGQELQKEKRIAEELIVQIEGLQNNLMQMKEDKKKLQEELKEKLDLIQVLQEKI 259 Query: 178 SAAKKQLADETLLKVDLENRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIE 237 + ++ D+ ++ +++ +MY+E ++ ++ E Sbjct: 260 TLLTTEIKDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKRE 319 Query: 238 YEYKLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMNTSTVNSAREELMESR 297 E +L + + + + ++E ++ A + ++ V S + L E Sbjct: 320 RELELKRESEDNLNVRLNSLLVERDESKKELDAIQKEYSEFKSISEKKVASDAKLLGEQE 379 Query: 298 MRIESLSSQLSNLQKESR-------ACLERIQELEDMLAKERDN-SRRMLSDREREM--A 347 R+ L QL E R + + L ML E +N S+ L + + Sbjct: 380 KRLHQLEEQLGTASDEVRKNNVLIADLTQEKENLRRMLDAELENISKLKLEVQVTQETLE 439 Query: 348 EIRDQMQQQLSDYEQLLDVKLALDMEISAYRKLLEGEEERLKLSPSPSSR 397 + R +Q + L+ E+S + LE L+ + + R Sbjct: 440 KSRSDASDIAQQLQQSRHLCSKLEAEVSKLQMELEETRTSLRRNIDETKR 489 Score = 25.4 bits (54), Expect = 0.16 Identities = 31/132 (23%), Positives = 50/132 (37%), Gaps = 15/132 (11%) Query: 53 LETENSALQLQVTEREEVRGRELTGLKALYETELADARRALDDTARERAKLQIELGKFKA 112 LE E + L+ Q+T +E R E EL A +LD+ R L EL + Sbjct: 557 LEKELTFLEAQITREKESRKN--------LEEELERATESLDEMNRNAFALAKELELANS 608 Query: 113 EHDQLLLNYAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLEDLKDQIAQ 172 L E + Q + E + L A L + E L+ + + Sbjct: 609 HISSL-------EDEREVLQKSVSEQKQISQESRENLEDAHSLVMKLGKERESLEKRAKK 661 Query: 173 LEASLSAAKKQL 184 LE +++AK ++ Sbjct: 662 LEDEMASAKGEI 673 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 250 bits (639), Expect = 2e-69 Identities = 83/351 (23%), Positives = 156/351 (43%), Gaps = 41/351 (11%) Query: 32 EKEELR----ELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETELA 87 EKE LR EL L DK + +++ L+ +E EL+ L+ + EL Sbjct: 481 EKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETS-----ELSTLEMKLKEELD 535 Query: 88 DARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEA------- 140 D R + E +L++E KF+AE + + + K+E A+ R+ EA Sbjct: 536 DLRAQKLEMLAEADRLKVEKAKFEAEWEHIDV---KREELRKEAEYITRQREAFSMYLKD 592 Query: 141 ----ALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEAS-LSAAKKQLADETLLKVDLE 195 +DA D +SL + E+ +++ + + LS +++ AD LL ++++ Sbjct: 593 ERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSKIQRERAD-FLLGIEMQ 651 Query: 196 NRCQSLTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDA 255 R +LE+ EE+ + R E + + + E + Q+L EM E+ Sbjct: 652 KR------ELEYCIENKREELENSSRDREKAFEQ-----EKKLEEERIQSLKEMAEKELE 700 Query: 256 QVRLYKEELE-QTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKES 314 V++ + L+ + KL+ R E + + + EEL R ++E+ L + E Sbjct: 701 HVQVELKRLDAERLEIKLDRERREREW--AELKDSVEELKVQREKLETQRHMLRAERDEI 758 Query: 315 RACLERIQELEDMLAKERDNS--RRMLSDREREMAEIRDQMQQQLSDYEQL 363 R +E +++LE++ D S + LS+ ER ++ Q+ +S ++L Sbjct: 759 RHEIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQKVVSRDDEL 809 Score = 42.8 bits (99), Expect = 1e-06 Identities = 59/382 (15%), Positives = 127/382 (32%), Gaps = 33/382 (8%) Query: 39 LNDRLAVYIDKVRSLETENSAL---QLQVTEREEVRGRELTGLKALYETELADARRALDD 95 L D++AV + L+ E L + ++ E+ L + + +D + Sbjct: 340 LGDKIAVSERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEA---E 396 Query: 96 TARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKL---REYEAALNSKDAALATA 152 + +++E+ + + ++ ++E + + L A + Sbjct: 397 LECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFN 456 Query: 153 LGDKKSLEGDLEDLKDQ----IAQLEASLSAAKKQLADETLLKVDLENRCQSLTEDLEFR 208 L +K+ E+ ++ + + L +L D R S T+ LE Sbjct: 457 LDEKEKNLVATEEDINRKTTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEAL 516 Query: 209 KNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQT- 267 K+ E + E ++ E + E + + +EEL + Sbjct: 517 KSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDVKREELRKEA 576 Query: 268 -------------YHAKLENARLSSEMNTSTVNSAREELMESRMR-----IESLSSQLSN 309 + +N + + + + E L R +E S LS Sbjct: 577 EYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMVEEHSEWLSK 636 Query: 310 LQKESRACLERIQELEDMLAKERDNSRRMLSDREREMAEIRDQMQQQLSDYEQLLDVKLA 369 +Q+E L I+ + L +N R L + R+ + +Q ++ + Q L Sbjct: 637 IQRERADFLLGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAE 696 Query: 370 LDME-ISAYRKLLEGEEERLKL 390 ++E + K L+ E +KL Sbjct: 697 KELEHVQVELKRLDAERLEIKL 718 Score = 33.1 bits (74), Expect = 9e-04 Identities = 73/411 (17%), Positives = 155/411 (36%), Gaps = 10/411 (2%) Query: 27 LSRLQE-KEELRELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKALYETE 85 +S L E K+ L + + + + SLE ++ + E + G ++ + E Sbjct: 116 VSALAEAKKREESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIEDA 175 Query: 86 LADARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKL-REYEAALNS 144 L A R LQ E ++ ++ L +E DL+ E E N Sbjct: 176 LKKLADAEAKM-RAAEALQAEANRYHRIAERKLKEVESREDDLTRRLASFKSECETKENE 234 Query: 145 KDAALATALGDKKSLEGDLEDLKD---QIAQLEASLSAAKKQLADETLLKVDLENRCQSL 201 T +KSL+ + E L D + Q E + A ++LA+ + + Sbjct: 235 MVIERQTLNERRKSLQQEHERLLDAQVSLNQREDHIFARSQELAELEKGLDTAKTTFEEE 294 Query: 202 TEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYK 261 + E +K+ E + ++ E +S + Y+ ++ + + Sbjct: 295 RKAFEDKKSNLEIALALCAKREEVCFYSHNSLLFLVLHYRSSKKFLGDKIAVSERESSLL 354 Query: 262 EELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRACLERI 321 ++ ++ A+ + A SE+ + + + L + + +E+ S + R Sbjct: 355 KKEQELLVAEEKIASKESELIQNVLANQEVILRKRKSDVEAELECKSKSVEVEIESKRRA 414 Query: 322 QELEDMLAKERD----NSRRMLSDREREMAEIRDQMQQQLSDYEQLLDVKLALDMEISAY 377 EL ++ K+R+ L + R +AE + ++ + ++ +A + +I+ Sbjct: 415 WELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEKSFNLDEKEKNLVATEEDINRK 474 Query: 378 RKLLEGEEERLKLSPSPSSRVTVSRASSSRSVRTTRGKRKRVDVEESEASS 428 +LE E+ERL+ + S + V + K + + E SE S+ Sbjct: 475 TTMLEDEKERLRKLDLELQQSLTSLEDKRKRVDSATQKLEALKSETSELST 525 Score = 22.4 bits (46), Expect = 1.3 Identities = 40/279 (14%), Positives = 92/279 (32%), Gaps = 11/279 (3%) Query: 27 LSRLQEKEE------LRELNDRLAVYIDKVRSLETENSALQLQVTEREEVRGRELTGLKA 80 LS++Q + + + +K LE + + + +++ + LK Sbjct: 634 LSKIQRERADFLLGIEMQKRELEYCIENKREELENSSRDREKAFEQEKKLEEERIQSLKE 693 Query: 81 LYETELADARRALDDTARERAKLQIELGKFKAEHDQLLLNYAKKESDLSGAQIKLREYEA 140 + E EL + L ER +++++ + + E +L E + + Sbjct: 694 MAEKELEHVQVELKRLDAERLEIKLDRERREREWAEL---KDSVEELKVQREKLETQRHM 750 Query: 141 ALNSKDAALATALGDKKSLEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQS 200 +D ++ +L+ D ++ + LS ++ + LK + +R Sbjct: 751 LRAERDEIR--HEIEELKKLENLKVALDDMSMAKMQLSNLERSWEKVSALKQKVVSRDDE 808 Query: 201 LTEDLEFRKNMYEEEINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLY 260 L E+ + + + L + + E+ Sbjct: 809 LDLQNGVSTVSNSEDGYNSSMERQNGLTPSSATPFSWIKRCTNLIFKTSPEKSTLMHHYE 868 Query: 261 KEELEQTYHAKLENARLSSEMNTSTVNSAREELMESRMR 299 +E + KLE++R + T ++ A E L R R Sbjct: 869 EEGGVPSEKLKLESSRREEKAYTEGLSIAVERLEAGRKR 907 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 26.6 bits (57), Expect = 0.075 Identities = 18/62 (29%), Positives = 24/62 (38%), Gaps = 5/62 (8%) Query: 19 ATPLSPTRLSRLQEKEELRELN----DRLAVYIDKVRSLETENSALQLQVTEREEVRGRE 74 A P S + E +L EL + + DK L TE V E E V+ E Sbjct: 145 AVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLTTERKKRGKTVPE-ELVKPEE 203 Query: 75 LT 76 L+ Sbjct: 204 LS 205 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 22.7 bits (47), Expect = 1.0 Identities = 13/69 (18%), Positives = 25/69 (35%), Gaps = 2/69 (2%) Query: 35 ELRELNDRLAVYIDKVRSLETEN--SALQLQVTEREEVRGRELTGLKALYETELADARRA 92 E +L + V L E + +V + + R E + + + D +RA Sbjct: 173 EADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRA 232 Query: 93 LDDTARERA 101 L + +E Sbjct: 233 LALSRQEID 241 Score = 22.7 bits (47), Expect = 1.1 Identities = 30/139 (21%), Positives = 53/139 (37%), Gaps = 21/139 (15%) Query: 177 LSAAKKQLADETLLKVDL-----ENRCQSLTEDLEFRKNMYEEEI--NETRRKHET---- 225 L+ +++ ++K DL + Q + R + EE+ + +R H+T Sbjct: 152 LAQLQQEGYSIFVVKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLER 211 Query: 226 -RLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQTYHAKLENARLSSEMN-- 282 SG E E L +AL R+ ++ + EE + +L S ++ Sbjct: 212 MLEANDGSGMLDEDEEDLQRALALSRQ----EIDMEDEEADLRRAIQLSMQGSSRNISQD 267 Query: 283 ---TSTVNSAREELMESRM 298 TS N EEL + R Sbjct: 268 MTQTSGTNLTSEELRKRRE 286 Score = 21.6 bits (44), Expect = 2.6 Identities = 19/119 (15%), Positives = 38/119 (30%), Gaps = 10/119 (8%) Query: 118 LLNYAKKESDLSGAQIKLREYEAALNSKDAALATALGDKKSLEGDLE--DLKDQIAQLEA 175 L + + L +++ + + A L K LE LE D + + E Sbjct: 168 DLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEE 227 Query: 176 SLSAAKKQLADETLLKVDLENRCQSLTEDLEFRKN----MYEEEINETRRKHETRLVEV 230 L A E +D+E+ L ++ +++ +T + T Sbjct: 228 DLQRALALSRQE----IDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELR 282 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.303 0.132 0.328 Lambda K H 0.267 0.0403 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 64,200 Number of Sequences: 43 Number of extensions: 3436 Number of successful extensions: 354 Number of sequences better than 10.0: 32 Number of HSP's better than 10.0 without gapping: 32 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 116 length of query: 587 length of database: 17,163 effective HSP length: 60 effective length of query: 527 effective length of database: 14,583 effective search space: 7685241 effective search space used: 7685241 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (20.0 bits) S2: 39 (19.7 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P49449|CENA_BOVIN Centromere protein A (CENP-A) (Centromere autoanti (93 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P49449|CENA_BOVIN Centromere protein A (CENP-A) (Centrome... 161 2e-43 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 17 4.4 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 17 5.1 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 16 9.1 >swiss|P49449|CENA_BOVIN Centromere protein A (CENP-A) (Centromere autoanti Length = 93 Score = 161 bits (408), Expect = 2e-43 Identities = 93/93 (100%), Positives = 93/93 (100%) Query: 1 VLKEIRTLQKTTHLLLRKSPFCRLAREICVQFTRGVDFNWQAQALLALQEAAEAFLVHLF 60 VLKEIRTLQKTTHLLLRKSPFCRLAREICVQFTRGVDFNWQAQALLALQEAAEAFLVHLF Sbjct: 1 VLKEIRTLQKTTHLLLRKSPFCRLAREICVQFTRGVDFNWQAQALLALQEAAEAFLVHLF 60 Query: 61 EDAYLLSLHAGRVTLFPKDVQLARRIRGIQEGL 93 EDAYLLSLHAGRVTLFPKDVQLARRIRGIQEGL Sbjct: 61 EDAYLLSLHAGRVTLFPKDVQLARRIRGIQEGL 93 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 17.3 bits (33), Expect = 4.4 Identities = 6/25 (24%), Positives = 13/25 (52%) Query: 40 WQAQALLALQEAAEAFLVHLFEDAY 64 W L + +A +A + ++ +D Y Sbjct: 137 WNPVPLSYVTDAPDATVANMLQDVY 161 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 17.3 bits (33), Expect = 5.1 Identities = 7/16 (43%), Positives = 11/16 (68%) Query: 45 LLALQEAAEAFLVHLF 60 L ALQ+ +A ++H F Sbjct: 75 LKALQDEWDAVMLHSF 90 Score = 16.9 bits (32), Expect = 6.2 Identities = 7/27 (25%), Positives = 13/27 (47%) Query: 7 TLQKTTHLLLRKSPFCRLAREICVQFT 33 T Q+ +H L + CR+ + + T Sbjct: 99 TRQELSHALYQHDAACRVIARLTKEVT 125 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 16.2 bits (30), Expect = 9.1 Identities = 6/18 (33%), Positives = 11/18 (60%) Query: 74 TLFPKDVQLARRIRGIQE 91 TLF K++ + + +QE Sbjct: 325 TLFVKNIPYSTTVEELQE 342 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P49449|CENA_BOVIN Centromere protein A (CENP-A) (Centrome... 169 8e-46 Sequences not found previously or not previously below threshold: swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 18 4.1 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 17 5.4 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 16 9.7 CONVERGED! >swiss|P49449|CENA_BOVIN Centromere protein A (CENP-A) (Centromere autoanti Length = 93 Score = 169 bits (428), Expect = 8e-46 Identities = 93/93 (100%), Positives = 93/93 (100%) Query: 1 VLKEIRTLQKTTHLLLRKSPFCRLAREICVQFTRGVDFNWQAQALLALQEAAEAFLVHLF 60 VLKEIRTLQKTTHLLLRKSPFCRLAREICVQFTRGVDFNWQAQALLALQEAAEAFLVHLF Sbjct: 1 VLKEIRTLQKTTHLLLRKSPFCRLAREICVQFTRGVDFNWQAQALLALQEAAEAFLVHLF 60 Query: 61 EDAYLLSLHAGRVTLFPKDVQLARRIRGIQEGL 93 EDAYLLSLHAGRVTLFPKDVQLARRIRGIQEGL Sbjct: 61 EDAYLLSLHAGRVTLFPKDVQLARRIRGIQEGL 93 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 17.6 bits (34), Expect = 4.1 Identities = 6/25 (24%), Positives = 13/25 (52%) Query: 40 WQAQALLALQEAAEAFLVHLFEDAY 64 W L + +A +A + ++ +D Y Sbjct: 137 WNPVPLSYVTDAPDATVANMLQDVY 161 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 17.2 bits (33), Expect = 5.4 Identities = 7/16 (43%), Positives = 11/16 (68%) Query: 45 LLALQEAAEAFLVHLF 60 L ALQ+ +A ++H F Sbjct: 75 LKALQDEWDAVMLHSF 90 Score = 16.9 bits (32), Expect = 6.5 Identities = 7/27 (25%), Positives = 13/27 (47%) Query: 7 TLQKTTHLLLRKSPFCRLAREICVQFT 33 T Q+ +H L + CR+ + + T Sbjct: 99 TRQELSHALYQHDAACRVIARLTKEVT 125 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 16.1 bits (30), Expect = 9.7 Identities = 6/18 (33%), Positives = 11/18 (60%) Query: 74 TLFPKDVQLARRIRGIQE 91 TLF K++ + + +QE Sbjct: 325 TLFVKNIPYSTTVEELQE 342 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.330 0.142 0.416 Lambda K H 0.267 0.0433 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 5835 Number of Sequences: 43 Number of extensions: 188 Number of successful extensions: 10 Number of sequences better than 10.0: 8 Number of HSP's better than 10.0 without gapping: 8 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 10 length of query: 93 length of database: 17,163 effective HSP length: 44 effective length of query: 49 effective length of database: 15,271 effective search space: 748279 effective search space used: 748279 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 31 (17.6 bits) S2: 31 (16.5 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P48988|CENB_CRIGR Major centromere autoantigen B (Centromere protein (606 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P48988|CENB_CRIGR Major centromere autoantigen B (Centrom... 929 0.0 trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 26 0.12 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 24 0.52 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 23 1.1 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 22 2.0 >swiss|P48988|CENB_CRIGR Major centromere autoantigen B (Centromere protein Length = 606 Score = 929 bits (2402), Expect = 0.0 Identities = 606/606 (100%), Positives = 606/606 (100%) Query: 1 MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG 60 MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG Sbjct: 1 MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG 60 Query: 61 VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN 120 VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN Sbjct: 61 VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN 120 Query: 121 GWLDRFRRRHGVVACSGVTRSRARTSTPRAPAAPAGPAAVPSEGSGGSTPGWRTREEQPP 180 GWLDRFRRRHGVVACSGVTRSRARTSTPRAPAAPAGPAAVPSEGSGGSTPGWRTREEQPP Sbjct: 121 GWLDRFRRRHGVVACSGVTRSRARTSTPRAPAAPAGPAAVPSEGSGGSTPGWRTREEQPP 180 Query: 181 SVAEGYASQDVFSATETSLWYDFLSDQASGLWGGDGTARQATQRLSVLLCANRDGSEKLP 240 SVAEGYASQDVFSATETSLWYDFLSDQASGLWGGDGTARQATQRLSVLLCANRDGSEKLP Sbjct: 181 SVAEGYASQDVFSATETSLWYDFLSDQASGLWGGDGTARQATQRLSVLLCANRDGSEKLP 240 Query: 241 PLVAGKSAKPRASQGGLPCDYTANSKGGVTTQALAKYLKALDTRMAAESRRVLLLAGRLA 300 PLVAGKSAKPRASQGGLPCDYTANSKGGVTTQALAKYLKALDTRMAAESRRVLLLAGRLA Sbjct: 241 PLVAGKSAKPRASQGGLPCDYTANSKGGVTTQALAKYLKALDTRMAAESRRVLLLAGRLA 300 Query: 301 AQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVKGHYRQAMLLKAMAALEGQDPSGLQLG 360 AQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVKGHYRQAMLLKAMAALEGQDPSGLQLG Sbjct: 301 AQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVKGHYRQAMLLKAMAALEGQDPSGLQLG 360 Query: 361 LVEALHFVAAAWQAVEPADIATCFREAGFGGGLNATITTSFKSEGEEEEEEEEEEEEEEE 420 LVEALHFVAAAWQAVEPADIATCFREAGFGGGLNATITTSFKSEGEEEEEEEEEEEEEEE Sbjct: 361 LVEALHFVAAAWQAVEPADIATCFREAGFGGGLNATITTSFKSEGEEEEEEEEEEEEEEE 420 Query: 421 EEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEEEESSSEGLEAE 480 EEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEEEESSSEGLEAE Sbjct: 421 EEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEEEESSSEGLEAE 480 Query: 481 DWAQGVVEASGGFGGYSVQEEAQCPTLHFLEGGEDSDSDSDEEEEDEEEDEEDEEDDDDD 540 DWAQGVVEASGGFGGYSVQEEAQCPTLHFLEGGEDSDSDSDEEEEDEEEDEEDEEDDDDD Sbjct: 481 DWAQGVVEASGGFGGYSVQEEAQCPTLHFLEGGEDSDSDSDEEEEDEEEDEEDEEDDDDD 540 Query: 541 EDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDRVQSHILHLEHDLVHVTRKNHAWQAGVR 600 EDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDRVQSHILHLEHDLVHVTRKNHAWQAGVR Sbjct: 541 EDGDEVPVPSFGEAMAYFAMVKRYLTSFPIDDRVQSHILHLEHDLVHVTRKNHAWQAGVR 600 Query: 601 GLGHQS 606 GLGHQS Sbjct: 601 GLGHQS 606 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 25.8 bits (55), Expect = 0.12 Identities = 16/76 (21%), Positives = 37/76 (48%), Gaps = 3/76 (3%) Query: 402 KSEGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEE---EVEEEGDES 458 K + E++ +E + E +E+E++ + +G + E +E +VE + +E Sbjct: 470 KERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEH 529 Query: 459 DEEEEEEEEEEEESSS 474 D + +E++ +S S Sbjct: 530 DHSKSKEKDRRAQSRS 545 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 23.9 bits (50), Expect = 0.52 Identities = 12/41 (29%), Positives = 26/41 (63%) Query: 429 EEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEE 469 E ++EE ++ E E++ ++ +E+ D+ D+EE+E E+ Sbjct: 125 ELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREK 165 Score = 21.6 bits (44), Expect = 2.6 Identities = 12/37 (32%), Positives = 26/37 (69%), Gaps = 5/37 (13%) Query: 443 EEVGEEEEVEEEG----DESDEEEEEEEEEEEESSSE 475 EE+ ++ ++E+E + DE++++ ++EE+ESS E Sbjct: 129 EEI-KQRQIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 Score = 20.0 bits (40), Expect = 7.4 Identities = 12/49 (24%), Positives = 29/49 (58%), Gaps = 5/49 (10%) Query: 395 ATITTSFKSEGEEEEEEEEEEEE-----EEEEEGEGEEEEEEEEEGEEE 438 A I ++F +EE ++ + E+E ++++E + + ++EE+E E+ Sbjct: 117 AGIPSAFLELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREK 165 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 22.7 bits (47), Expect = 1.1 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 12/50 (24%) Query: 456 DESDEEEEE------------EEEEEEESSSEGLEAEDWAQGVVEASGGF 493 D++D EEEE EE E +++S GL+A + +E F Sbjct: 226 DDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESF 275 Score = 21.9 bits (45), Expect = 2.1 Identities = 18/69 (26%), Positives = 26/69 (37%) Query: 410 EEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEE 469 ++ + EEEE GE EE E + G + + E +ES E E Sbjct: 226 DDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNE 285 Query: 470 EESSSEGLE 478 +S LE Sbjct: 286 LRTSPTQLE 294 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 21.9 bits (45), Expect = 2.0 Identities = 22/74 (29%), Positives = 35/74 (46%), Gaps = 3/74 (4%) Query: 417 EEEEEEGE-GEEE-EEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEEEESSS 474 E E+++G G++E EE+ E EV E + E DE E + E+ +S + Sbjct: 207 EREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWGEIDSLCEKWQSEA 266 Query: 475 EG-LEAEDWAQGVV 487 E EAE A ++ Sbjct: 267 EDQTEAEIIADRII 280 Score = 19.6 bits (39), Expect = 8.8 Identities = 17/61 (27%), Positives = 31/61 (49%), Gaps = 7/61 (11%) Query: 405 GEEEEEEEEEEEEE----EEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDE 460 G++E EE+ E + E E++ E +EG GE + + E+ + E E D+++ Sbjct: 216 GDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEG--LWGEIDSLCEKWQSEAE-DQTEA 272 Query: 461 E 461 E Sbjct: 273 E 273 Score = 19.6 bits (39), Expect = 9.7 Identities = 12/47 (25%), Positives = 20/47 (42%) Query: 449 EEVEEEGDESDEEEEEEEEEEEESSSEGLEAEDWAQGVVEASGGFGG 495 E +++G D+E EE+ E ++ E + EA G G Sbjct: 207 EREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWG 253 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P48988|CENB_CRIGR Major centromere autoantigen B (Centrom... 838 0.0 Sequences not found previously or not previously below threshold: trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 29 0.014 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 26 0.098 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 25 0.30 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 24 0.69 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 23 0.77 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 23 1.2 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 21 3.3 swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 20 6.2 CONVERGED! >swiss|P48988|CENB_CRIGR Major centromere autoantigen B (Centromere protein Length = 606 Score = 838 bits (2166), Expect = 0.0 Identities = 402/402 (100%), Positives = 402/402 (100%) Query: 1 MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG 60 MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG Sbjct: 1 MGPKRRQLTFREKSRIIQEVEENPDLRKGEIARRFNIPPSTLSTILKNKRAILASERKYG 60 Query: 61 VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN 120 VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN Sbjct: 61 VASTCRKTNKLSPYDKLEGLLIAWFQQIRAAGLPVKGIILKEKALRIAEELGMDDFTASN 120 Query: 121 GWLDRFRRRHGVVACSGVTRSRARTSTPRAPAAPAGPAAVPSEGSGGSTPGWRTREEQPP 180 GWLDRFRRRHGVVACSGVTRSRARTSTPRAPAAPAGPAAVPSEGSGGSTPGWRTREEQPP Sbjct: 121 GWLDRFRRRHGVVACSGVTRSRARTSTPRAPAAPAGPAAVPSEGSGGSTPGWRTREEQPP 180 Query: 181 SVAEGYASQDVFSATETSLWYDFLSDQASGLWGGDGTARQATQRLSVLLCANRDGSEKLP 240 SVAEGYASQDVFSATETSLWYDFLSDQASGLWGGDGTARQATQRLSVLLCANRDGSEKLP Sbjct: 181 SVAEGYASQDVFSATETSLWYDFLSDQASGLWGGDGTARQATQRLSVLLCANRDGSEKLP 240 Query: 241 PLVAGKSAKPRASQGGLPCDYTANSKGGVTTQALAKYLKALDTRMAAESRRVLLLAGRLA 300 PLVAGKSAKPRASQGGLPCDYTANSKGGVTTQALAKYLKALDTRMAAESRRVLLLAGRLA Sbjct: 241 PLVAGKSAKPRASQGGLPCDYTANSKGGVTTQALAKYLKALDTRMAAESRRVLLLAGRLA 300 Query: 301 AQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVKGHYRQAMLLKAMAALEGQDPSGLQLG 360 AQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVKGHYRQAMLLKAMAALEGQDPSGLQLG Sbjct: 301 AQSLDTSGLRHVQLAFFPPGTVHPLERGVVQQVKGHYRQAMLLKAMAALEGQDPSGLQLG 360 Query: 361 LVEALHFVAAAWQAVEPADIATCFREAGFGGGLNATITTSFK 402 LVEALHFVAAAWQAVEPADIATCFREAGFGGGLNATITTSFK Sbjct: 361 LVEALHFVAAAWQAVEPADIATCFREAGFGGGLNATITTSFK 402 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 29.3 bits (64), Expect = 0.014 Identities = 16/76 (21%), Positives = 37/76 (48%), Gaps = 3/76 (3%) Query: 402 KSEGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEE---EVEEEGDES 458 K + E++ +E + E +E+E++ + +G + E +E +VE + +E Sbjct: 470 KERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEH 529 Query: 459 DEEEEEEEEEEEESSS 474 D + +E++ +S S Sbjct: 530 DHSKSKEKDRRAQSRS 545 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 26.2 bits (56), Expect = 0.098 Identities = 12/41 (29%), Positives = 26/41 (63%) Query: 429 EEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEE 469 E ++EE ++ E E++ ++ +E+ D+ D+EE+E E+ Sbjct: 125 ELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREK 165 Score = 25.1 bits (53), Expect = 0.25 Identities = 12/37 (32%), Positives = 26/37 (69%), Gaps = 5/37 (13%) Query: 443 EEVGEEEEVEEEG----DESDEEEEEEEEEEEESSSE 475 EE+ ++ ++E+E + DE++++ ++EE+ESS E Sbjct: 129 EEI-KQRQIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 Score = 22.8 bits (47), Expect = 1.2 Identities = 12/49 (24%), Positives = 29/49 (58%), Gaps = 5/49 (10%) Query: 395 ATITTSFKSEGEEEEEEEEEEEE-----EEEEEGEGEEEEEEEEEGEEE 438 A I ++F +EE ++ + E+E ++++E + + ++EE+E E+ Sbjct: 117 AGIPSAFLELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREK 165 Score = 20.8 bits (42), Expect = 3.8 Identities = 8/37 (21%), Positives = 23/37 (61%) Query: 402 KSEGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEE 438 K E ++ + E+E+ ++++ + ++ ++EE+E E Sbjct: 128 KEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 Score = 20.4 bits (41), Expect = 5.0 Identities = 7/29 (24%), Positives = 20/29 (68%) Query: 406 EEEEEEEEEEEEEEEEEGEGEEEEEEEEE 434 E+E+ ++++E++++ + EE+E E+ Sbjct: 137 EQEKLASLKKQDEDKDKRDKEEKESSREK 165 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 24.7 bits (52), Expect = 0.30 Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 12/50 (24%) Query: 456 DESDEEEEE------------EEEEEEESSSEGLEAEDWAQGVVEASGGF 493 D++D EEEE EE E +++S GL+A + +E F Sbjct: 226 DDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESF 275 Score = 23.1 bits (48), Expect = 0.91 Identities = 18/69 (26%), Positives = 26/69 (37%) Query: 410 EEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEE 469 ++ + EEEE GE EE E + G + + E +ES E E Sbjct: 226 DDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNE 285 Query: 470 EESSSEGLE 478 +S LE Sbjct: 286 LRTSPTQLE 294 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 23.5 bits (49), Expect = 0.69 Identities = 20/83 (24%), Positives = 32/83 (38%) Query: 398 TTSFKSEGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDE 457 T+ KS EEE E+ + EE+ EG E E G + GDE Sbjct: 856 TSPEKSTLMHHYEEEGGVPSEKLKLESSRREEKAYTEGLSIAVERLEAGRKRRGNTSGDE 915 Query: 458 SDEEEEEEEEEEEESSSEGLEAE 480 + E ++ + + + EA+ Sbjct: 916 TSEPSNNKKRKHDVTQKYSDEAD 938 Score = 21.2 bits (43), Expect = 3.3 Identities = 20/75 (26%), Positives = 36/75 (47%), Gaps = 3/75 (4%) Query: 404 EGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGE---EVGEEEEVEEEGDESDE 460 E E E + + E E E + E E + ++ E+ GE E EV E+E D +++ Sbjct: 394 EAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDITEK 453 Query: 461 EEEEEEEEEEESSSE 475 +E+E+ ++E Sbjct: 454 SFNLDEKEKNLVATE 468 Score = 20.8 bits (42), Expect = 4.3 Identities = 17/58 (29%), Positives = 26/58 (44%), Gaps = 2/58 (3%) Query: 407 EEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEE 464 E + EE E + E+ +E++ EEE + E+ E E V+ E D E E Sbjct: 660 ENKREELENSSRDREKAFEQEKKLEEERIQSLKEMAEK--ELEHVQVELKRLDAERLE 715 Score = 20.1 bits (40), Expect = 7.0 Identities = 21/78 (26%), Positives = 37/78 (46%) Query: 403 SEGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEE 462 S E+ E+E++ EEE + E E+E E + E + E +++ E E + E Sbjct: 670 SRDREKAFEQEKKLEEERIQSLKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAE 729 Query: 463 EEEEEEEEESSSEGLEAE 480 ++ EE + E LE + Sbjct: 730 LKDSVEELKVQREKLETQ 747 Score = 19.7 bits (39), Expect = 9.2 Identities = 11/37 (29%), Positives = 19/37 (50%) Query: 402 KSEGEEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEE 438 K E E + E+ E+E++ E E + +E E+E Sbjct: 662 KREELENSSRDREKAFEQEKKLEEERIQSLKEMAEKE 698 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 23.1 bits (48), Expect = 0.77 Identities = 22/74 (29%), Positives = 35/74 (46%), Gaps = 3/74 (4%) Query: 417 EEEEEEGE-GEEE-EEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEEEESSS 474 E E+++G G++E EE+ E EV E + E DE E + E+ +S + Sbjct: 207 EREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWGEIDSLCEKWQSEA 266 Query: 475 EG-LEAEDWAQGVV 487 E EAE A ++ Sbjct: 267 EDQTEAEIIADRII 280 Score = 21.2 bits (43), Expect = 3.4 Identities = 17/61 (27%), Positives = 31/61 (49%), Gaps = 7/61 (11%) Query: 405 GEEEEEEEEEEEEE----EEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDE 460 G++E EE+ E + E E++ E +EG GE + + E+ + E E D+++ Sbjct: 216 GDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEG--LWGEIDSLCEKWQSEAE-DQTEA 272 Query: 461 E 461 E Sbjct: 273 E 273 Score = 20.4 bits (41), Expect = 6.0 Identities = 12/47 (25%), Positives = 20/47 (42%) Query: 449 EEVEEEGDESDEEEEEEEEEEEESSSEGLEAEDWAQGVVEASGGFGG 495 E +++G D+E EE+ E ++ E + EA G G Sbjct: 207 EREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWG 253 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 22.8 bits (47), Expect = 1.2 Identities = 12/34 (35%), Positives = 15/34 (43%) Query: 398 TTSFKSEGEEEEEEEEEEEEEEEEEGEGEEEEEE 431 TTS +E E E E +E E E +EE Sbjct: 162 TTSRAAETRERSPEHTEASTQEAETAEEALNKEE 195 Score = 21.2 bits (43), Expect = 3.2 Identities = 13/67 (19%), Positives = 25/67 (36%) Query: 411 EEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEEVEEEGDESDEEEEEEEEEEE 470 E +++ ++ E E E E E G GEE V + + EE + Sbjct: 29 EMDDDSDDAEAVFPTSEPSRVSEGSENESGSGEEYEGAPHVHFQESTATEEPRTPSKTPA 88 Query: 471 ESSSEGL 477 +++ + Sbjct: 89 SQTTQSI 95 Score = 20.1 bits (40), Expect = 7.9 Identities = 12/35 (34%), Positives = 13/35 (36%) Query: 410 EEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEE 444 E E E E G GEE E +E EE Sbjct: 45 EPSRVSEGSENESGSGEEYEGAPHVHFQESTATEE 79 Score = 19.7 bits (39), Expect = 8.5 Identities = 11/28 (39%), Positives = 13/28 (46%) Query: 453 EEGDESDEEEEEEEEEEEESSSEGLEAE 480 E D+SD+ E E SEG E E Sbjct: 29 EMDDDSDDAEAVFPTSEPSRVSEGSENE 56 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 21.2 bits (43), Expect = 3.3 Identities = 9/40 (22%), Positives = 16/40 (39%) Query: 498 VQEEAQCPTLHFLEGGEDSDSDSDEEEEDEEEDEEDEEDD 537 ++E C + E D D EE+++ E + D Sbjct: 142 IKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQD 181 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 20.4 bits (41), Expect = 6.2 Identities = 11/45 (24%), Positives = 21/45 (46%) Query: 406 EEEEEEEEEEEEEEEEEGEGEEEEEEEEEGEEEGGEGEEVGEEEE 450 E+ + + E E +G G +E+EE+ +E+ E+E Sbjct: 201 EQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEIDMEDE 245 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.309 0.130 0.367 Lambda K H 0.267 0.0398 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 59,982 Number of Sequences: 43 Number of extensions: 4695 Number of successful extensions: 1390 Number of sequences better than 10.0: 28 Number of HSP's better than 10.0 without gapping: 28 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 222 length of query: 606 length of database: 17,163 effective HSP length: 60 effective length of query: 546 effective length of database: 14,583 effective search space: 7962318 effective search space used: 7962318 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (20.3 bits) S2: 39 (19.7 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti (402 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 759 0.0 swiss|P25233|NECD_MOUSE Necdin. 22 1.1 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 6.7 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 20 6.8 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 759 bits (1959), Expect = 0.0 Identities = 402/402 (100%), Positives = 402/402 (100%) Query: 1 KSEQSSFSSSSSVRNELSVYHNSRQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQ 60 KSEQSSFSSSSSVRNELSVYHNSRQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQ Sbjct: 1 KSEQSSFSSSSSVRNELSVYHNSRQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQ 60 Query: 61 VLYAKDSGGAQNESLRSNEADLAKKKNPNPSGDTGSSKNQDSMAAQNVHQKSQMSVETCT 120 VLYAKDSGGAQNESLRSNEADLAKKKNPNPSGDTGSSKNQDSMAAQNVHQKSQMSVETCT Sbjct: 61 VLYAKDSGGAQNESLRSNEADLAKKKNPNPSGDTGSSKNQDSMAAQNVHQKSQMSVETCT 120 Query: 121 TPSKSNLILESLGLQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE 180 TPSKSNLILESLGLQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE Sbjct: 121 TPSKSNLILESLGLQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE 180 Query: 181 SKMHHKLVLPSYTPNVRRTMRTRSKPLEYWRGERIDYQARPSGGFVIGGILSPDTVSSKR 240 SKMHHKLVLPSYTPNVRRTMRTRSKPLEYWRGERIDYQARPSGGFVIGGILSPDTVSSKR Sbjct: 181 SKMHHKLVLPSYTPNVRRTMRTRSKPLEYWRGERIDYQARPSGGFVIGGILSPDTVSSKR 240 Query: 241 KAKGNLGRIITTANRKRICLENAPIKNKFMVNLNIPLGDPLQPTRVKDPETQETVLMDLI 300 KAKGNLGRIITTANRKRICLENAPIKNKFMVNLNIPLGDPLQPTRVKDPETQETVLMDLI Sbjct: 241 KAKGNLGRIITTANRKRICLENAPIKNKFMVNLNIPLGDPLQPTRVKDPETQETVLMDLI 300 Query: 301 RPRDTYQFCVEHDELKVYKTLDTPFFSSGKLIIGPLQEKGKQHVGLDTLVFYVNLGYLLC 360 RPRDTYQFCVEHDELKVYKTLDTPFFSSGKLIIGPLQEKGKQHVGLDTLVFYVNLGYLLC Sbjct: 301 RPRDTYQFCVEHDELKVYKTLDTPFFSSGKLIIGPLQEKGKQHVGLDTLVFYVNLGYLLC 360 Query: 361 TLHETPYIVTTGDSFYVPSGNYYNIKNLLNEERVLLFTQIKS 402 TLHETPYIVTTGDSFYVPSGNYYNIKNLLNEERVLLFTQIKS Sbjct: 361 TLHETPYIVTTGDSFYVPSGNYYNIKNLLNEERVLLFTQIKS 402 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 22.3 bits (46), Expect = 1.1 Identities = 9/25 (36%), Positives = 13/25 (52%) Query: 153 ERHSDVDDNVVQENLNDSRGEIPDS 177 ER DVD ++Q+ + R P S Sbjct: 64 ERIEDVDPKILQQAAEEGRAHQPQS 88 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 19.6 bits (39), Expect = 6.7 Identities = 8/24 (33%), Positives = 15/24 (62%) Query: 1 KSEQSSFSSSSSVRNELSVYHNSR 24 +SE+ + S++ N L++ NSR Sbjct: 6 RSERFPITPSTAATNRLTITPNSR 29 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 19.6 bits (39), Expect = 6.8 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 3/48 (6%) Query: 96 SSKNQDSMAAQNVHQKSQMSVETCTTPSKSNLILESLGLQFFGREVTL 143 S ++ A +VH + + E TPSK+ S Q GR V++ Sbjct: 57 SGSGEEYEGAPHVHFQESTATEEPRTPSKTP---ASQTTQSIGRTVSI 101 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 843 0.0 Sequences not found previously or not previously below threshold: trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 25 0.10 swiss|P25233|NECD_MOUSE Necdin. 23 0.62 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 22 1.4 swiss|O85062|REP2_BUCAP Probable replication-associated protein... 20 4.7 CONVERGED! >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 843 bits (2177), Expect = 0.0 Identities = 402/402 (100%), Positives = 402/402 (100%) Query: 1 KSEQSSFSSSSSVRNELSVYHNSRQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQ 60 KSEQSSFSSSSSVRNELSVYHNSRQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQ Sbjct: 1 KSEQSSFSSSSSVRNELSVYHNSRQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQ 60 Query: 61 VLYAKDSGGAQNESLRSNEADLAKKKNPNPSGDTGSSKNQDSMAAQNVHQKSQMSVETCT 120 VLYAKDSGGAQNESLRSNEADLAKKKNPNPSGDTGSSKNQDSMAAQNVHQKSQMSVETCT Sbjct: 61 VLYAKDSGGAQNESLRSNEADLAKKKNPNPSGDTGSSKNQDSMAAQNVHQKSQMSVETCT 120 Query: 121 TPSKSNLILESLGLQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE 180 TPSKSNLILESLGLQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE Sbjct: 121 TPSKSNLILESLGLQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE 180 Query: 181 SKMHHKLVLPSYTPNVRRTMRTRSKPLEYWRGERIDYQARPSGGFVIGGILSPDTVSSKR 240 SKMHHKLVLPSYTPNVRRTMRTRSKPLEYWRGERIDYQARPSGGFVIGGILSPDTVSSKR Sbjct: 181 SKMHHKLVLPSYTPNVRRTMRTRSKPLEYWRGERIDYQARPSGGFVIGGILSPDTVSSKR 240 Query: 241 KAKGNLGRIITTANRKRICLENAPIKNKFMVNLNIPLGDPLQPTRVKDPETQETVLMDLI 300 KAKGNLGRIITTANRKRICLENAPIKNKFMVNLNIPLGDPLQPTRVKDPETQETVLMDLI Sbjct: 241 KAKGNLGRIITTANRKRICLENAPIKNKFMVNLNIPLGDPLQPTRVKDPETQETVLMDLI 300 Query: 301 RPRDTYQFCVEHDELKVYKTLDTPFFSSGKLIIGPLQEKGKQHVGLDTLVFYVNLGYLLC 360 RPRDTYQFCVEHDELKVYKTLDTPFFSSGKLIIGPLQEKGKQHVGLDTLVFYVNLGYLLC Sbjct: 301 RPRDTYQFCVEHDELKVYKTLDTPFFSSGKLIIGPLQEKGKQHVGLDTLVFYVNLGYLLC 360 Query: 361 TLHETPYIVTTGDSFYVPSGNYYNIKNLLNEERVLLFTQIKS 402 TLHETPYIVTTGDSFYVPSGNYYNIKNLLNEERVLLFTQIKS Sbjct: 361 TLHETPYIVTTGDSFYVPSGNYYNIKNLLNEERVLLFTQIKS 402 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 25.4 bits (54), Expect = 0.10 Identities = 37/169 (21%), Positives = 73/169 (42%), Gaps = 40/169 (23%) Query: 1 KSEQSSFSSSSSVRNELSVYHNSR--QKPPAEKT-----------NQSSKNIGKKA---- 43 +SE+ + S++ N L++ NSR + P E+ QS KN K A Sbjct: 6 RSERFPITPSTAATNRLTITPNSRVLKSPLTEEIMWKRLKDAGFDEQSIKNRDKAALIAY 65 Query: 44 -APFKKQKRANKGSSGAQVLYAKDSGGAQNESLRS--NEADL---------------AKK 85 A + + + + G +L K+ +Q E +++ +E+DL AKK Sbjct: 66 IAKLESEVYDYQHNMGL-LLLEKNELSSQYEEIKASVDESDLTHMREKSAYVSALAEAKK 124 Query: 86 KNPNPSGDTGSSKNQDSMAAQNVHQK----SQMSVETCTTPSKSNLILE 130 + + D G +K S + +H+ ++ V +T S++++++E Sbjct: 125 REESLKKDVGIAKECISSLEKTLHEMRAECAETKVSAGSTMSEAHVMIE 173 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 23.1 bits (48), Expect = 0.62 Identities = 9/25 (36%), Positives = 13/25 (52%) Query: 153 ERHSDVDDNVVQENLNDSRGEIPDS 177 ER DVD ++Q+ + R P S Sbjct: 64 ERIEDVDPKILQQAAEEGRAHQPQS 88 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 22.0 bits (45), Expect = 1.4 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 3/48 (6%) Query: 96 SSKNQDSMAAQNVHQKSQMSVETCTTPSKSNLILESLGLQFFGREVTL 143 S ++ A +VH + + E TPSK+ S Q GR V++ Sbjct: 57 SGSGEEYEGAPHVHFQESTATEEPRTPSKTP---ASQTTQSIGRTVSI 101 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 20.1 bits (40), Expect = 4.7 Identities = 9/16 (56%), Positives = 10/16 (62%) Query: 71 QNESLRSNEADLAKKK 86 +NE RSN D A KK Sbjct: 18 KNEKKRSNFIDYAMKK 33 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.312 0.130 0.367 Lambda K H 0.267 0.0399 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 29,435 Number of Sequences: 43 Number of extensions: 1320 Number of successful extensions: 27 Number of sequences better than 10.0: 18 Number of HSP's better than 10.0 without gapping: 18 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 24 length of query: 402 length of database: 17,163 effective HSP length: 57 effective length of query: 345 effective length of database: 14,712 effective search space: 5075640 effective search space used: 5075640 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 38 (20.0 bits) S2: 38 (19.3 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). (650 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 672 0.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 31 0.003 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 23 0.92 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 23 1.4 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 22 2.8 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 20 9.1 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 672 bits (1735), Expect = 0.0 Identities = 582/582 (100%), Positives = 582/582 (100%) Query: 1 VKLAKGAKTQAKPKKAAPPPPKDMEDSEEEEDMEEDDSSDEEVEVPVKKTPAKKTATPAK 60 VKLAKGAKTQAKPKKAAPPPPKDMEDSEEEEDMEEDDSSDEEVEVPVKKTPAKKTATPAK Sbjct: 1 VKLAKGAKTQAKPKKAAPPPPKDMEDSEEEEDMEEDDSSDEEVEVPVKKTPAKKTATPAK 60 Query: 61 ATPGKAATPGKKGATPAKNGKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEE 120 ATPGKAATPGKKGATPAKNGKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEE Sbjct: 61 ATPGKAATPGKKGATPAKNGKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEE 120 Query: 121 SEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEPMEVAPALKGKKTAQ 180 SEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEPMEVAPALKGKKTAQ Sbjct: 121 SEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEPMEVAPALKGKKTAQ 180 Query: 181 AAEEDDEEEDDDDEEDDDDEEEQQGSAKRKKEMPKTIPEAKKTKTDTASEGLSIFIGNLN 240 AAEEDDEEEDDDDEEDDDDEEEQQGSAKRKKEMPKTIPEAKKTKTDTASEGLSIFIGNLN Sbjct: 181 AAEEDDEEEDDDDEEDDDDEEEQQGSAKRKKEMPKTIPEAKKTKTDTASEGLSIFIGNLN 240 Query: 241 STKEFDELKDALREFFSKKNLTIQDIRIGNSKKFGYVDFSSEEEVEKALKLTGKKILGTE 300 STKEFDELKDALREFFSKKNLTIQDIRIGNSKKFGYVDFSSEEEVEKALKLTGKKILGTE Sbjct: 241 STKEFDELKDALREFFSKKNLTIQDIRIGNSKKFGYVDFSSEEEVEKALKLTGKKILGTE 300 Query: 301 VKIEKAMAFDKNKTAENKKERDSRTLFVKNIPYSTTVEELQEIFENAKDIRIPTGKDGSN 360 VKIEKAMAFDKNKTAENKKERDSRTLFVKNIPYSTTVEELQEIFENAKDIRIPTGKDGSN Sbjct: 301 VKIEKAMAFDKNKTAENKKERDSRTLFVKNIPYSTTVEELQEIFENAKDIRIPTGKDGSN 360 Query: 361 KGIAYVEFSNEDEANKALEEKQGAEIEGRSIFVDFTGEKSQNSGNKKGPEGDSKVLVVNN 420 KGIAYVEFSNEDEANKALEEKQGAEIEGRSIFVDFTGEKSQNSGNKKGPEGDSKVLVVNN Sbjct: 361 KGIAYVEFSNEDEANKALEEKQGAEIEGRSIFVDFTGEKSQNSGNKKGPEGDSKVLVVNN 420 Query: 421 LSYSATEDSLREVFEKATSIRIPQNQGRAKGFAFIEFSSAEDAKDAMDSCNNTEIEGRSI 480 LSYSATEDSLREVFEKATSIRIPQNQGRAKGFAFIEFSSAEDAKDAMDSCNNTEIEGRSI Sbjct: 421 LSYSATEDSLREVFEKATSIRIPQNQGRAKGFAFIEFSSAEDAKDAMDSCNNTEIEGRSI 480 Query: 481 RLEFSQGGGPQGGGRGGSAQSKTLFVRGLSEDTTEETLKEAFDGSVNARIVTDRDTGASK 540 RLEFSQGGGPQGGGRGGSAQSKTLFVRGLSEDTTEETLKEAFDGSVNARIVTDRDTGASK Sbjct: 481 RLEFSQGGGPQGGGRGGSAQSKTLFVRGLSEDTTEETLKEAFDGSVNARIVTDRDTGASK 540 Query: 541 GFGFVDFSTAEDAKAAKEAMEDGEIDGNKVTLDFAKPKGDSQ 582 GFGFVDFSTAEDAKAAKEAMEDGEIDGNKVTLDFAKPKGDSQ Sbjct: 541 GFGFVDFSTAEDAKAAKEAMEDGEIDGNKVTLDFAKPKGDSQ 582 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 31.2 bits (69), Expect = 0.003 Identities = 32/136 (23%), Positives = 55/136 (39%), Gaps = 19/136 (13%) Query: 101 DQKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDE 160 D IK KK V + E D D+ E ++E +AK+ + P +E D + Sbjct: 138 DTAQIKETNDCKKFVLVPSD-EIKDGDKQEEKKEPELAKRKSHQDLPR-LDTHVQEPDSK 195 Query: 161 ESEDEPMEVAPALKGKKTAQAAEEDD-------------EEEDDDDEEDDDDEEEQQGSA 207 ++ DEP P+ + +DD E DEED ++ S+ Sbjct: 196 DAPDEP----PSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRASS 251 Query: 208 KRKKEMPKTIPEAKKT 223 +R+ E+P + ++ T Sbjct: 252 RRRPEIPPDVQDSADT 267 Score = 22.3 bits (46), Expect = 1.5 Identities = 11/66 (16%), Positives = 27/66 (40%) Query: 82 QAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVAKKP 141 Q + +E SD ++ ++ P+ + + + DE + +KA K+ Sbjct: 190 QEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRA 249 Query: 142 TPAKKP 147 + ++P Sbjct: 250 SSRRRP 255 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 23.1 bits (48), Expect = 0.92 Identities = 19/70 (27%), Positives = 30/70 (42%), Gaps = 10/70 (14%) Query: 91 EEDDSDEEAEDQKPIKNKPVAKKAVAKKEESEEDDDDEDES------EEEKAVAKKPTPA 144 E DD ++AE P V++ E+E +E E +E A + TP+ Sbjct: 29 EMDDDSDDAEAVFPTSE----PSRVSEGSENESGSGEEYEGAPHVHFQESTATEEPRTPS 84 Query: 145 KKPAGKKQES 154 K PA + +S Sbjct: 85 KTPASQTTQS 94 Score = 21.6 bits (44), Expect = 2.7 Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 5/44 (11%) Query: 123 EDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEP 166 E DDD D++E A PT + E+E EE E P Sbjct: 29 EMDDDSDDAE-----AVFPTSEPSRVSEGSENESGSGEEYEGAP 67 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.7 bits (47), Expect = 1.4 Identities = 10/38 (26%), Positives = 23/38 (60%) Query: 102 QKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVAK 139 ++ IK + + ++ +A ++ +ED D D+ E+E + K Sbjct: 128 KEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREK 165 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 21.6 bits (44), Expect = 2.8 Identities = 7/15 (46%), Positives = 10/15 (66%) Query: 244 EFDELKDALREFFSK 258 EF +++DA EF K Sbjct: 128 EFQDIRDACAEFLEK 142 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 20.0 bits (40), Expect = 9.1 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 2/47 (4%) Query: 458 SSAEDAKDAMDSCNN-TEIEGRSIRL-EFSQGGGPQGGGRGGSAQSK 502 + A K+ + NN ++ +G ++ Q G P G GR GS K Sbjct: 5 NEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGK 51 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 739 0.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 205 1e-55 Sequences not found previously or not previously below threshold: trembl|Q15219|Q15219 Protein p84. 26 0.13 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 23 1.2 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 22 2.1 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 22 2.3 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 22 2.7 CONVERGED! >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 739 bits (1909), Expect = 0.0 Identities = 582/582 (100%), Positives = 582/582 (100%) Query: 1 VKLAKGAKTQAKPKKAAPPPPKDMEDSEEEEDMEEDDSSDEEVEVPVKKTPAKKTATPAK 60 VKLAKGAKTQAKPKKAAPPPPKDMEDSEEEEDMEEDDSSDEEVEVPVKKTPAKKTATPAK Sbjct: 1 VKLAKGAKTQAKPKKAAPPPPKDMEDSEEEEDMEEDDSSDEEVEVPVKKTPAKKTATPAK 60 Query: 61 ATPGKAATPGKKGATPAKNGKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEE 120 ATPGKAATPGKKGATPAKNGKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEE Sbjct: 61 ATPGKAATPGKKGATPAKNGKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEE 120 Query: 121 SEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEPMEVAPALKGKKTAQ 180 SEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEPMEVAPALKGKKTAQ Sbjct: 121 SEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDEESEDEPMEVAPALKGKKTAQ 180 Query: 181 AAEEDDEEEDDDDEEDDDDEEEQQGSAKRKKEMPKTIPEAKKTKTDTASEGLSIFIGNLN 240 AAEEDDEEEDDDDEEDDDDEEEQQGSAKRKKEMPKTIPEAKKTKTDTASEGLSIFIGNLN Sbjct: 181 AAEEDDEEEDDDDEEDDDDEEEQQGSAKRKKEMPKTIPEAKKTKTDTASEGLSIFIGNLN 240 Query: 241 STKEFDELKDALREFFSKKNLTIQDIRIGNSKKFGYVDFSSEEEVEKALKLTGKKILGTE 300 STKEFDELKDALREFFSKKNLTIQDIRIGNSKKFGYVDFSSEEEVEKALKLTGKKILGTE Sbjct: 241 STKEFDELKDALREFFSKKNLTIQDIRIGNSKKFGYVDFSSEEEVEKALKLTGKKILGTE 300 Query: 301 VKIEKAMAFDKNKTAENKKERDSRTLFVKNIPYSTTVEELQEIFENAKDIRIPTGKDGSN 360 VKIEKAMAFDKNKTAENKKERDSRTLFVKNIPYSTTVEELQEIFENAKDIRIPTGKDGSN Sbjct: 301 VKIEKAMAFDKNKTAENKKERDSRTLFVKNIPYSTTVEELQEIFENAKDIRIPTGKDGSN 360 Query: 361 KGIAYVEFSNEDEANKALEEKQGAEIEGRSIFVDFTGEKSQNSGNKKGPEGDSKVLVVNN 420 KGIAYVEFSNEDEANKALEEKQGAEIEGRSIFVDFTGEKSQNSGNKKGPEGDSKVLVVNN Sbjct: 361 KGIAYVEFSNEDEANKALEEKQGAEIEGRSIFVDFTGEKSQNSGNKKGPEGDSKVLVVNN 420 Query: 421 LSYSATEDSLREVFEKATSIRIPQNQGRAKGFAFIEFSSAEDAKDAMDSCNNTEIEGRSI 480 LSYSATEDSLREVFEKATSIRIPQNQGRAKGFAFIEFSSAEDAKDAMDSCNNTEIEGRSI Sbjct: 421 LSYSATEDSLREVFEKATSIRIPQNQGRAKGFAFIEFSSAEDAKDAMDSCNNTEIEGRSI 480 Query: 481 RLEFSQGGGPQGGGRGGSAQSKTLFVRGLSEDTTEETLKEAFDGSVNARIVTDRDTGASK 540 RLEFSQGGGPQGGGRGGSAQSKTLFVRGLSEDTTEETLKEAFDGSVNARIVTDRDTGASK Sbjct: 481 RLEFSQGGGPQGGGRGGSAQSKTLFVRGLSEDTTEETLKEAFDGSVNARIVTDRDTGASK 540 Query: 541 GFGFVDFSTAEDAKAAKEAMEDGEIDGNKVTLDFAKPKGDSQ 582 GFGFVDFSTAEDAKAAKEAMEDGEIDGNKVTLDFAKPKGDSQ Sbjct: 541 GFGFVDFSTAEDAKAAKEAMEDGEIDGNKVTLDFAKPKGDSQ 582 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 205 bits (521), Expect = 1e-55 Identities = 32/136 (23%), Positives = 55/136 (39%), Gaps = 19/136 (13%) Query: 101 DQKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVAKKPTPAKKPAGKKQESEEEDDE 160 D IK KK V + E D D+ E ++E +AK+ + P +E D + Sbjct: 138 DTAQIKETNDCKKFVLVPSD-EIKDGDKQEEKKEPELAKRKSHQDLPR-LDTHVQEPDSK 195 Query: 161 ESEDEPMEVAPALKGKKTAQAAEEDD-------------EEEDDDDEEDDDDEEEQQGSA 207 ++ DEP P+ + +DD E DEED ++ S+ Sbjct: 196 DAPDEP----PSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRASS 251 Query: 208 KRKKEMPKTIPEAKKT 223 +R+ E+P + ++ T Sbjct: 252 RRRPEIPPDVQDSADT 267 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 25.8 bits (55), Expect = 0.13 Identities = 11/21 (52%), Positives = 13/21 (61%) Query: 22 KDMEDSEEEEDMEEDDSSDEE 42 K ED EE D+EE + DEE Sbjct: 201 KHTEDREEGMDVEEGEMGDEE 221 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 22.8 bits (47), Expect = 1.2 Identities = 28/120 (23%), Positives = 42/120 (34%), Gaps = 12/120 (10%) Query: 29 EEEDMEEDDSSDEEVEVPVKKTPAKKTATPAKATPGKAATPGKKGATPAKNGKQAKKQES 88 + E SS E+ V +K PA+ T + GKK A P K K+A K S Sbjct: 1 KSEQSSFSSSSSVRNELSVYHNSRQK--PPAEKTNQSSKNIGKKAA-PFKKQKRANKGSS 57 Query: 89 EEE---EDDSDEEAEDQKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVAKKPTPAK 145 + DS + +AKK D S+ + ++A + K Sbjct: 58 GAQVLYAKDSGGAQNESLRSNEADLAKKKNPNPS------GDTGSSKNQDSMAAQNVHQK 111 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.0 bits (45), Expect = 2.1 Identities = 21/120 (17%), Positives = 43/120 (35%), Gaps = 38/120 (31%) Query: 82 QAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVA--- 138 + KK+E ++ + + ++ A +K E+ ++ D +E ES EK Sbjct: 125 ELKKEEIKQRQIEQEKLASLKKQ-------------DEDKDKRDKEEKESSREKRERSRS 171 Query: 139 ---------------------KKPTPAKKPAGK-KQESEEEDDEESEDEPMEVAPALKGK 176 +K + ++ P + K S E+ E P P+++ K Sbjct: 172 PRRRKSRSPSPRRRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPSVRMK 231 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 22.0 bits (45), Expect = 2.3 Identities = 11/24 (45%), Positives = 15/24 (61%) Query: 34 EEDDSSDEEVEVPVKKTPAKKTAT 57 EE + E+ V VK+T +KTAT Sbjct: 690 EEKVEAGEKAAVTVKRTRRRKTAT 713 Score = 20.1 bits (40), Expect = 7.8 Identities = 14/78 (17%), Positives = 32/78 (40%), Gaps = 3/78 (3%) Query: 80 GKQAKKQESEEEEDDSDEEAEDQKPIKNKPVAKKAVAKKEESEEDDDDEDESEEEKAVAK 139 G Q + +E++ EE +++ + K + +E D + +A+ Sbjct: 226 GLQNNLMQMKEDKKKLQEELKEKLDLIQVLQEKITLLT---TEIKDKEASLQSTTSKLAE 282 Query: 140 KPTPAKKPAGKKQESEEE 157 K + K + QES+++ Sbjct: 283 KESEVDKLSSMYQESQDQ 300 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 21.6 bits (44), Expect = 2.7 Identities = 7/15 (46%), Positives = 10/15 (66%) Query: 244 EFDELKDALREFFSK 258 EF +++DA EF K Sbjct: 128 EFQDIRDACAEFLEK 142 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.301 0.130 0.351 Lambda K H 0.267 0.0398 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 58,452 Number of Sequences: 43 Number of extensions: 4247 Number of successful extensions: 990 Number of sequences better than 10.0: 25 Number of HSP's better than 10.0 without gapping: 25 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 135 length of query: 650 length of database: 17,163 effective HSP length: 61 effective length of query: 589 effective length of database: 14,540 effective search space: 8564060 effective search space used: 8564060 T: 11 A: 40 X1: 16 ( 6.9 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.3 bits) S2: 40 (20.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.4.25.1) (20 (237 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.... 481 e-139 swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 22 0.70 >swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.4.25.1) (20 Length = 237 Score = 481 bits (1239), Expect = e-139 Identities = 237/237 (100%), Positives = 237/237 (100%) Query: 1 MISAQACGANGKMKDYHYSGPVEHKFSPYAFNGGTVLAVAGEDFALVASDTRLSEGYSIH 60 MISAQACGANGKMKDYHYSGPVEHKFSPYAFNGGTVLAVAGEDFALVASDTRLSEGYSIH Sbjct: 1 MISAQACGANGKMKDYHYSGPVEHKFSPYAFNGGTVLAVAGEDFALVASDTRLSEGYSIH 60 Query: 61 SRDSPKCYKLTDTTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKSMTSGAIAAMLSTIL 120 SRDSPKCYKLTDTTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKSMTSGAIAAMLSTIL Sbjct: 61 SRDSPKCYKLTDTTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKSMTSGAIAAMLSTIL 120 Query: 121 YGRRFFPYYVYNIIGGLDEEGRGAVYSFDPVGSYQRDTYKAGGSASAMLQPLLDNQIGFK 180 YGRRFFPYYVYNIIGGLDEEGRGAVYSFDPVGSYQRDTYKAGGSASAMLQPLLDNQIGFK Sbjct: 121 YGRRFFPYYVYNIIGGLDEEGRGAVYSFDPVGSYQRDTYKAGGSASAMLQPLLDNQIGFK 180 Query: 181 NMENVEQVPLSQEKAVQLVKDVFISAAERDVYTGDALKICIITKEGIREEIVPLRKD 237 NMENVEQVPLSQEKAVQLVKDVFISAAERDVYTGDALKICIITKEGIREEIVPLRKD Sbjct: 181 NMENVEQVPLSQEKAVQLVKDVFISAAERDVYTGDALKICIITKEGIREEIVPLRKD 237 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 21.9 bits (45), Expect = 0.70 Identities = 11/27 (40%), Positives = 15/27 (54%) Query: 116 LSTILYGRRFFPYYVYNIIGGLDEEGR 142 L+ +L G F P + +I LDEE R Sbjct: 19 LNNLLQGEYFSPVELSSIAHQLDEEER 45 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.... 504 e-146 Sequences not found previously or not previously below threshold: swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 22 0.71 CONVERGED! >swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.4.25.1) (20 Length = 237 Score = 504 bits (1299), Expect = e-146 Identities = 237/237 (100%), Positives = 237/237 (100%) Query: 1 MISAQACGANGKMKDYHYSGPVEHKFSPYAFNGGTVLAVAGEDFALVASDTRLSEGYSIH 60 MISAQACGANGKMKDYHYSGPVEHKFSPYAFNGGTVLAVAGEDFALVASDTRLSEGYSIH Sbjct: 1 MISAQACGANGKMKDYHYSGPVEHKFSPYAFNGGTVLAVAGEDFALVASDTRLSEGYSIH 60 Query: 61 SRDSPKCYKLTDTTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKSMTSGAIAAMLSTIL 120 SRDSPKCYKLTDTTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKSMTSGAIAAMLSTIL Sbjct: 61 SRDSPKCYKLTDTTVIGCSGFHGDCLTLTKIIEARLKMYKHSNNKSMTSGAIAAMLSTIL 120 Query: 121 YGRRFFPYYVYNIIGGLDEEGRGAVYSFDPVGSYQRDTYKAGGSASAMLQPLLDNQIGFK 180 YGRRFFPYYVYNIIGGLDEEGRGAVYSFDPVGSYQRDTYKAGGSASAMLQPLLDNQIGFK Sbjct: 121 YGRRFFPYYVYNIIGGLDEEGRGAVYSFDPVGSYQRDTYKAGGSASAMLQPLLDNQIGFK 180 Query: 181 NMENVEQVPLSQEKAVQLVKDVFISAAERDVYTGDALKICIITKEGIREEIVPLRKD 237 NMENVEQVPLSQEKAVQLVKDVFISAAERDVYTGDALKICIITKEGIREEIVPLRKD Sbjct: 181 NMENVEQVPLSQEKAVQLVKDVFISAAERDVYTGDALKICIITKEGIREEIVPLRKD 237 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 21.9 bits (45), Expect = 0.71 Identities = 11/27 (40%), Positives = 15/27 (54%) Query: 116 LSTILYGRRFFPYYVYNIIGGLDEEGR 142 L+ +L G F P + +I LDEE R Sbjct: 19 LNNLLQGEYFSPVELSSIAHQLDEEER 45 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.318 0.136 0.394 Lambda K H 0.267 0.0416 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 16,071 Number of Sequences: 43 Number of extensions: 616 Number of successful extensions: 4 Number of sequences better than 10.0: 4 Number of HSP's better than 10.0 without gapping: 4 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 4 length of query: 237 length of database: 17,163 effective HSP length: 53 effective length of query: 184 effective length of database: 14,884 effective search space: 2738656 effective search space used: 2738656 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (18.9 bits) S2: 35 (18.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. (360 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 607 e-177 swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.... 22 1.0 swiss|P14733|LAM1_MOUSE Lamin B1. 20 3.0 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 20 5.4 swiss|P33418|LOS1_YEAST LOS1 protein. 20 6.2 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 20 6.3 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 19 8.6 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 607 bits (1566), Expect = e-177 Identities = 360/360 (100%), Positives = 360/360 (100%) Query: 1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT 60 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT Sbjct: 1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT 60 Query: 61 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW 120 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW Sbjct: 61 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW 120 Query: 121 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM 180 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM Sbjct: 121 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM 180 Query: 181 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEI 240 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEI Sbjct: 181 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEI 240 Query: 241 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ 300 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ Sbjct: 241 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ 300 Query: 301 QQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGKACSPFIMFATFTLYLT 360 QQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGKACSPFIMFATFTLYLT Sbjct: 301 QQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGKACSPFIMFATFTLYLT 360 >swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.4.25.1) (20 Length = 237 Score = 21.9 bits (45), Expect = 1.0 Identities = 11/27 (40%), Positives = 15/27 (54%) Query: 19 LNNLLQGEYFSPVELSSIAHQLDEEER 45 L+ +L G F P + +I LDEE R Sbjct: 116 LSTILYGRRFFPYYVYNIIGGLDEEGR 142 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 20.4 bits (41), Expect = 3.0 Identities = 15/98 (15%), Positives = 45/98 (45%), Gaps = 2/98 (2%) Query: 222 LDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEEL 281 L+ D EDL+ +A + ++ +++ ++ +++++D+ N+ EE+ Sbjct: 159 LEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQSLTEDLEFRK--NMYEEEI 216 Query: 282 RKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 + R + + + +Q + + + Q + ++Q D Sbjct: 217 NETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHD 254 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 19.6 bits (39), Expect = 5.4 Identities = 7/40 (17%), Positives = 23/40 (57%) Query: 280 ELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 EL+K + +Q+K ++Q + + ++ +++++ R+ Sbjct: 125 ELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 Score = 19.2 bits (38), Expect = 6.7 Identities = 8/44 (18%), Positives = 26/44 (58%), Gaps = 5/44 (11%) Query: 276 LTSEELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 L EE+++R + +Q+K ++Q + + ++ +++++ R+ Sbjct: 126 LKKEEIKQR-----QIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 19.6 bits (39), Expect = 6.2 Identities = 6/18 (33%), Positives = 12/18 (66%) Query: 44 ERMRMAEGGVTSEDYRTF 61 +RM+ ++ VT +D+ F Sbjct: 426 KRMKFSDDEVTQDDFEEF 443 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 19.6 bits (39), Expect = 6.3 Identities = 10/19 (52%), Positives = 11/19 (57%) Query: 136 LLTGPELISDTYLALFLAQ 154 LL GP T LAL +AQ Sbjct: 67 LLAGPPGTGKTALALAIAQ 85 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 18.9 bits (37), Expect = 8.6 Identities = 10/33 (30%), Positives = 15/33 (45%) Query: 38 HQLDEEERMRMAEGGVTSEDYRTFLQQPSGNMD 70 + L+EEER E T+ + L S +D Sbjct: 228 NDLEEEERNASGEQTTTAREESEALDTTSNGLD 260 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 666 0.0 Sequences not found previously or not previously below threshold: swiss|P14733|LAM1_MOUSE Lamin B1. 22 0.74 swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.... 22 1.0 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 22 1.2 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 2.1 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 20 3.5 swiss|P33418|LOS1_YEAST LOS1 protein. 20 4.2 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 20 4.6 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 20 5.0 CONVERGED! >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 666 bits (1718), Expect = 0.0 Identities = 360/360 (100%), Positives = 360/360 (100%) Query: 1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT 60 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT Sbjct: 1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT 60 Query: 61 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW 120 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW Sbjct: 61 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW 120 Query: 121 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM 180 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM Sbjct: 121 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM 180 Query: 181 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEI 240 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEI Sbjct: 181 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEEDLQRALALSRQEI 240 Query: 241 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ 300 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ Sbjct: 241 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ 300 Query: 301 QQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGKACSPFIMFATFTLYLT 360 QQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGKACSPFIMFATFTLYLT Sbjct: 301 QQQQQQQQQQQQQQQQQRDLSGQSSHPCERPATSSGALGSDLGKACSPFIMFATFTLYLT 360 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 22.4 bits (46), Expect = 0.74 Identities = 15/98 (15%), Positives = 45/98 (45%), Gaps = 2/98 (2%) Query: 222 LDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEEL 281 L+ D EDL+ +A + ++ +++ ++ +++++D+ N+ EE+ Sbjct: 159 LEGDLEDLKDQIAQLEASLSAAKKQLADETLLKVDLENRCQSLTEDLEFRK--NMYEEEI 216 Query: 282 RKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 + R + + + +Q + + + Q + ++Q D Sbjct: 217 NETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHD 254 >swiss|Q9IB83|PS12_CARAU Proteasome subunit beta type 1-B (EC 3.4.25.1) (20 Length = 237 Score = 22.0 bits (45), Expect = 1.0 Identities = 11/27 (40%), Positives = 15/27 (54%) Query: 19 LNNLLQGEYFSPVELSSIAHQLDEEER 45 L+ +L G F P + +I LDEE R Sbjct: 116 LSTILYGRRFFPYYVYNIIGGLDEEGR 142 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.0 bits (45), Expect = 1.2 Identities = 20/104 (19%), Positives = 53/104 (50%), Gaps = 9/104 (8%) Query: 226 EEDLQRALALSRQEIDMEDEEADLRRAIQLSMQG--SSRNISQDMTQTSGTNLTSEE-LR 282 E+D+ ++ E+ D + +Q+++ G + +N + M + L+++E + Sbjct: 63 EDDVVIEFIFNQLEVKNPDS-----KMMQINLTGFLNGKNAREFMGELWPLLLSAQENIA 117 Query: 283 KRREAYFE-KQQQKQQQQQQQQQQQQQQQQQQQQQQRDLSGQSS 325 A+ E K+++ +Q+Q +Q++ ++Q + + +RD + S Sbjct: 118 GIPSAFLELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKES 161 Score = 21.2 bits (43), Expect = 1.6 Identities = 8/44 (18%), Positives = 26/44 (58%), Gaps = 5/44 (11%) Query: 276 LTSEELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 L EE+++R + +Q+K ++Q + + ++ +++++ R+ Sbjct: 126 LKKEEIKQR-----QIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 Score = 21.2 bits (43), Expect = 2.0 Identities = 7/40 (17%), Positives = 23/40 (57%) Query: 280 ELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 EL+K + +Q+K ++Q + + ++ +++++ R+ Sbjct: 125 ELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSRE 164 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 21.2 bits (43), Expect = 2.1 Identities = 11/42 (26%), Positives = 20/42 (47%) Query: 278 SEELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQQQQRD 319 SE K E E+Q ++++ QQ+ ++ Q QR+ Sbjct: 226 SECETKENEMVIERQTLNERRKSLQQEHERLLDAQVSLNQRE 267 Score = 18.9 bits (37), Expect = 8.5 Identities = 30/128 (23%), Positives = 54/128 (41%), Gaps = 8/128 (6%) Query: 169 LPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERMLEANDGSGMLDEDEED 228 L D EA + +R I E +LKE + DL R L + E+E Sbjct: 179 LADAEAKMRAAEALQAEANRYHRIAER--KLKEVESREDDLTRRLASFKSECETKENEMV 236 Query: 229 LQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNI---SQDMTQ-TSGTNLTSEELRKR 284 ++R L+ + ++ E L A Q+S+ +I SQ++ + G + + Sbjct: 237 IERQ-TLNERRKSLQQEHERLLDA-QVSLNQREDHIFARSQELAELEKGLDTAKTTFEEE 294 Query: 285 REAYFEKQ 292 R+A+ +K+ Sbjct: 295 RKAFEDKK 302 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 20.4 bits (41), Expect = 3.5 Identities = 10/19 (52%), Positives = 11/19 (57%) Query: 136 LLTGPELISDTYLALFLAQ 154 LL GP T LAL +AQ Sbjct: 67 LLAGPPGTGKTALALAIAQ 85 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 20.0 bits (40), Expect = 4.2 Identities = 6/18 (33%), Positives = 12/18 (66%) Query: 44 ERMRMAEGGVTSEDYRTF 61 +RM+ ++ VT +D+ F Sbjct: 426 KRMKFSDDEVTQDDFEEF 443 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 20.0 bits (40), Expect = 4.6 Identities = 10/33 (30%), Positives = 15/33 (45%) Query: 38 HQLDEEERMRMAEGGVTSEDYRTFLQQPSGNMD 70 + L+EEER E T+ + L S +D Sbjct: 228 NDLEEEERNASGEQTTTAREESEALDTTSNGLD 260 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 19.7 bits (39), Expect = 5.0 Identities = 22/91 (24%), Positives = 47/91 (51%), Gaps = 3/91 (3%) Query: 194 EELAQLK-EQRVHKTDLERML-EANDGSGMLDEDEEDLQRALA-LSRQEIDMEDEEADLR 250 E +++LK E +V + LE+ +A+D + L + + A +S+ ++++E+ LR Sbjct: 422 ENISKLKLEVQVTQETLEKSRSDASDIAQQLQQSRHLCSKLEAEVSKLQMELEETRTSLR 481 Query: 251 RAIQLSMQGSSRNISQDMTQTSGTNLTSEEL 281 R I + +G+ ++ T T+EE+ Sbjct: 482 RNIDETKRGAELLAAELTTTRELLKKTNEEM 512 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.316 0.131 0.368 Lambda K H 0.267 0.0401 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 27,787 Number of Sequences: 43 Number of extensions: 1288 Number of successful extensions: 226 Number of sequences better than 10.0: 32 Number of HSP's better than 10.0 without gapping: 32 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 56 length of query: 360 length of database: 17,163 effective HSP length: 56 effective length of query: 304 effective length of database: 14,755 effective search space: 4485520 effective search space used: 4485520 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 37 (19.8 bits) S2: 37 (18.9 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR (752 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 693 0.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 29 0.021 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 23 1.6 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 22 3.5 trembl|Q60952|Q60952 Intranuclear protein. 21 4.4 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 21 5.8 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 693 bits (1789), Expect = 0.0 Identities = 752/752 (100%), Positives = 752/752 (100%) Query: 1 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY 60 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY Sbjct: 1 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY 60 Query: 61 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN 120 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN Sbjct: 61 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN 120 Query: 121 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK 180 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK Sbjct: 121 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK 180 Query: 181 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK 240 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK Sbjct: 181 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK 240 Query: 241 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR 300 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR Sbjct: 241 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR 300 Query: 301 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW 360 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW Sbjct: 301 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW 360 Query: 361 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK 420 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK Sbjct: 361 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK 420 Query: 421 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE 480 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE Sbjct: 421 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE 480 Query: 481 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR 540 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR Sbjct: 481 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR 540 Query: 541 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR 600 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR Sbjct: 541 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR 600 Query: 601 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK 660 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK Sbjct: 601 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK 660 Query: 661 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN 720 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN Sbjct: 661 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN 720 Query: 721 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG 752 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG Sbjct: 721 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG 752 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 28.9 bits (63), Expect = 0.021 Identities = 24/107 (22%), Positives = 47/107 (43%), Gaps = 6/107 (5%) Query: 177 LVPKSKV----KKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDS--ETASEEKSRKRKK 230 LVP ++ K+EEKK + + S D DS D+ E S+ +S ++ Sbjct: 153 LVPSDEIKDGDKQEEKKEPELAKRKSHQDLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSP 212 Query: 231 KHRKNSRKHKKEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPP 277 H+ + + + ++ ++ ++ E + + S+ R EIPP Sbjct: 213 VHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRASSRRRPEIPP 259 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 22.7 bits (47), Expect = 1.6 Identities = 24/100 (24%), Positives = 43/100 (43%), Gaps = 8/100 (8%) Query: 624 RDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEKRMYSKSRDHSSSNNNREKKADID 683 R +E+ S+N K + K + S G + +Y+K + + + R +AD+ Sbjct: 24 RQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQVLYAKDSGGAQNESLRSNEADL- 82 Query: 684 QSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKENQKS 723 +K K + + SS KNQ D V +++Q S Sbjct: 83 ----AKKKNPNPSGDTGSS--KNQ-DSMAAQNVHQKSQMS 115 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 21.6 bits (44), Expect = 3.5 Identities = 15/69 (21%), Positives = 34/69 (48%), Gaps = 1/69 (1%) Query: 513 SRSKENSKQVESKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSR 572 +R ++ ++E ++ E +H+ + E+ ++RSR+ + + N+ T + R Sbjct: 221 NRLQQKKSELEGRT-EVNHANTNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKR 279 Query: 573 RVRSRSHDR 581 RV +R R Sbjct: 280 RVATRVRGR 288 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 21.2 bits (43), Expect = 4.4 Identities = 30/168 (17%), Positives = 72/168 (42%), Gaps = 8/168 (4%) Query: 495 TTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEK-DRRAQSRSRERDLTKS 553 T E+E++L + + + SR +E + ++++ + SK++ + QS + +L+ + Sbjct: 17 TLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHT 76 Query: 554 KHSYNS-------RTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRRGRSRSR 606 + + + R+R + + R+R + E ++ +QE ++ + Sbjct: 77 LEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQ 136 Query: 607 DRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKE 654 G RS++ + + S ER+E R E + + R+E Sbjct: 137 LLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREE 184 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 20.8 bits (42), Expect = 5.8 Identities = 25/139 (17%), Positives = 58/139 (40%), Gaps = 7/139 (5%) Query: 458 KSRSKSKERSKSKERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKE 517 +S + + R+ SK S+ ++ + V R ++E +Q ++ G S S Sbjct: 73 ESTATEEPRTPSKTPASQTTQSIGRTVSIAEATR--RASQESPRQTETAG-----SSSAP 125 Query: 518 NSKQVESKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSR 577 ++ + EH ++ + ++ ER ++ S + TRERS + + Sbjct: 126 DALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEASTQEAE 185 Query: 578 SHDRDRSRSKEYHRYREQE 596 + + ++ + ++ YR E Sbjct: 186 TAEEALNKEEFWNLYRSLE 204 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 782 0.0 Sequences not found previously or not previously below threshold: trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 37 8e-05 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 27 0.071 trembl|Q60952|Q60952 Intranuclear protein. 25 0.32 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 24 0.41 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 24 0.45 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 23 1.2 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 22 1.8 swiss|P14733|LAM1_MOUSE Lamin B1. 22 3.1 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 782 bits (2020), Expect = 0.0 Identities = 752/752 (100%), Positives = 752/752 (100%) Query: 1 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY 60 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY Sbjct: 1 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY 60 Query: 61 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN 120 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN Sbjct: 61 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN 120 Query: 121 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK 180 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK Sbjct: 121 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK 180 Query: 181 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK 240 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK Sbjct: 181 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK 240 Query: 241 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR 300 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR Sbjct: 241 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR 300 Query: 301 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW 360 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW Sbjct: 301 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW 360 Query: 361 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK 420 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK Sbjct: 361 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK 420 Query: 421 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE 480 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE Sbjct: 421 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE 480 Query: 481 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR 540 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR Sbjct: 481 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR 540 Query: 541 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR 600 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR Sbjct: 541 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR 600 Query: 601 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK 660 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK Sbjct: 601 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK 660 Query: 661 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN 720 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN Sbjct: 661 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN 720 Query: 721 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG 752 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG Sbjct: 721 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG 752 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 37.1 bits (84), Expect = 8e-05 Identities = 50/217 (23%), Positives = 82/217 (37%), Gaps = 39/217 (17%) Query: 177 LVPKSKV----KKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDS--ETASEEKSRKRKK 230 LVP ++ K+EEKK + + S D DS D+ E S+ +S ++ Sbjct: 153 LVPSDEIKDGDKQEEKKEPELAKRKSHQDLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSP 212 Query: 231 KHRKNSRKHKKEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENR-------- 282 H+ + + + ++ ++ ++ E + + S+ R EIPP ++ Sbjct: 213 VHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRASSRRRPEIPPDVQDSADTSIKRS 272 Query: 283 --FLMRKSPPKADDKE----RKNRERERER---------------ECNPPNSQ-PASYQR 320 R+ P D KE R N + R R + NP S+ PAS Sbjct: 273 SSRRYRERPADQDAKESPIKRSNSRKPRHRPLHDQDFEITADSPDKVNPGASEFPASI-- 330 Query: 321 RFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETP 357 + T GR+ + R R TP R R E P Sbjct: 331 -VIQTAGGRERRPRHERMVSTPDVPRIDSRSLELEGP 366 Score = 22.8 bits (47), Expect = 1.4 Identities = 59/307 (19%), Positives = 108/307 (34%), Gaps = 38/307 (12%) Query: 372 VSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEKGGKVKDHKLES 431 + G++ + + EL + K +Q P + D + + P + K H+ + Sbjct: 159 IKDGDKQEEKKEPELAKRKSHQDLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDG 218 Query: 432 KERDIRRNSGK-------DDKYNKNKVKKRGKSKSRS--------------KSKERSKSK 470 K I+R+ + D++ K KR S+ R K + + Sbjct: 219 KAARIKRSGSRRRPEACVDEEDTKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYR 278 Query: 471 ERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQV-------E 523 ER + E RS S++ H +++ ++ + D+ + E + Sbjct: 279 ERPADQDAKESPIKRSNSRKPRHRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGR 338 Query: 524 SKSNEHDHSKSKEKDRRAQSRSRERD----LTKSKHSYNSRTRERSRSRDRSRRVRSRSH 579 + H+ S R SRS E + S + YN S + V H Sbjct: 339 ERRPRHERMVSTPDVPRIDSRSLELEGPGIAPLSMYRYNDADDAMSF---MVQEVVIPDH 395 Query: 580 DRDRSRSKEYHRYREQEYRRRGRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNK 639 D R S R +E S + +R P +S + +RRRR + +R E + Sbjct: 396 DLTRKSSPP--RTSRKESPPPYPSVTERKRMPSQS-ATNRRRRNSMAAPRDRREYPIGDD 452 Query: 640 EKYRSQD 646 +Y+ D Sbjct: 453 PRYQQLD 459 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 27.1 bits (58), Expect = 0.071 Identities = 24/100 (24%), Positives = 43/100 (43%), Gaps = 8/100 (8%) Query: 624 RDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEKRMYSKSRDHSSSNNNREKKADID 683 R +E+ S+N K + K + S G + +Y+K + + + R +AD+ Sbjct: 24 RQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQVLYAKDSGGAQNESLRSNEADL- 82 Query: 684 QSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKENQKS 723 +K K + + SS KNQ D V +++Q S Sbjct: 83 ----AKKKNPNPSGDTGSS--KNQ-DSMAAQNVHQKSQMS 115 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 24.8 bits (52), Expect = 0.32 Identities = 30/168 (17%), Positives = 72/168 (42%), Gaps = 8/168 (4%) Query: 495 TTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEK-DRRAQSRSRERDLTKS 553 T E+E++L + + + SR +E + ++++ + SK++ + QS + +L+ + Sbjct: 17 TLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHT 76 Query: 554 KHSYNS-------RTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRRGRSRSR 606 + + + R+R + + R+R + E ++ +QE ++ + Sbjct: 77 LEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQ 136 Query: 607 DRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKE 654 G RS++ + + S ER+E R E + + R+E Sbjct: 137 LLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREE 184 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 24.4 bits (51), Expect = 0.41 Identities = 25/139 (17%), Positives = 58/139 (40%), Gaps = 7/139 (5%) Query: 458 KSRSKSKERSKSKERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKE 517 +S + + R+ SK S+ ++ + V R ++E +Q ++ G S S Sbjct: 73 ESTATEEPRTPSKTPASQTTQSIGRTVSIAEATR--RASQESPRQTETAG-----SSSAP 125 Query: 518 NSKQVESKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSR 577 ++ + EH ++ + ++ ER ++ S + TRERS + + Sbjct: 126 DALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEASTQEAE 185 Query: 578 SHDRDRSRSKEYHRYREQE 596 + + ++ + ++ YR E Sbjct: 186 TAEEALNKEEFWNLYRSLE 204 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 24.4 bits (51), Expect = 0.45 Identities = 15/69 (21%), Positives = 34/69 (48%), Gaps = 1/69 (1%) Query: 513 SRSKENSKQVESKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSR 572 +R ++ ++E ++ E +H+ + E+ ++RSR+ + + N+ T + R Sbjct: 221 NRLQQKKSELEGRT-EVNHANTNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKR 279 Query: 573 RVRSRSHDR 581 RV +R R Sbjct: 280 RVATRVRGR 288 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.8 bits (47), Expect = 1.2 Identities = 32/119 (26%), Positives = 51/119 (41%), Gaps = 4/119 (3%) Query: 533 KSKEKDRRAQSRSRERDLTKSKHSYNSRTR-ERSRSRDRSRRVRSRSHDRDRSRSKEYHR 591 K +E +R + + L K + R + E+ SR++ R RS + RS S Sbjct: 127 KKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRS 186 Query: 592 YREQEYRRRGRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSS 650 Y + R+R SRS RT RS S ++ +S E E R K+ + + +S Sbjct: 187 YPVRRERKRSHSRSPRHRTKSRSPSPAPEKKE---KSPELPEPSVRMKDSSVQEATSTS 242 Score = 21.3 bits (43), Expect = 4.1 Identities = 26/96 (27%), Positives = 44/96 (45%), Gaps = 1/96 (1%) Query: 562 RERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRRGRSRSRDRRTPGRSRSKDRRR 621 ++R +++ ++ + D+D+ +E RE+ R R R + R R RS RR Sbjct: 132 KQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRSYPVRR 191 Query: 622 RRRDSRS-SEREESQSRNKEKYRSQDSKSSHRKENS 656 R+ S S S R ++SR+ + KS E S Sbjct: 192 ERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPS 227 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 22.5 bits (46), Expect = 1.8 Identities = 75/402 (18%), Positives = 157/402 (38%), Gaps = 29/402 (7%) Query: 346 RSRDRFRRSETPPHWRQEMQRAQRMRVSSGERWIKGDKSEL-NEIKENQRSPVRVKEKKI 404 RS +F + R+ + + E I +SEL + NQ +R K+ Sbjct: 334 RSSKKFLGDKIAVSERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILR---KRK 390 Query: 405 TDHRHMSESPNRGIEKGGKVKDHKLESKERDIR-RNSGKDDKYNKNKVKKRGKSKSRSKS 463 +D E ++ +E + K E +E DI+ R +K + +V+ R ++ Sbjct: 391 SDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDI 450 Query: 464 KERSKSKERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVE 523 E+S + + K+ ++ + ++ + E KE+ ++LD + + Q + ++ K+V+ Sbjct: 451 TEKSFNLDEKEKNLVATEEDINRKTTMLEDE--KERLRKLDLELQ-QSLTSLEDKRKRVD 507 Query: 524 SKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSH---- 579 S + + + KS+ + + +L + E R + + + Sbjct: 508 SATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDV 567 Query: 580 DRDRSRSKEYHRYREQEYRRRGRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNK 639 R+ R + + R++E RD R +++ + +S + EREE ++ Sbjct: 568 KREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMV 627 Query: 640 EKYRSQDSKSSHRKENSEGEKRMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEV 699 E++ SK + + M + ++ N E + SS+D E Sbjct: 628 EEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREE------------LENSSRDREK 675 Query: 700 KSSTLKNQEDEKTRSPVEKENQKSKGQENDHVHDKNKKCDHE 741 K E+E+ +S E +E +HV + K+ D E Sbjct: 676 AFEQEKKLEEERIQSLKEMAE-----KELEHVQVELKRLDAE 712 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 21.7 bits (44), Expect = 3.1 Identities = 22/134 (16%), Positives = 61/134 (45%), Gaps = 4/134 (2%) Query: 423 KVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKE-RDSKHSRHED 481 ++ + + + + R + +SG+ +Y + + + + ++ R +E + H++ E+ Sbjct: 215 EINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQTYHAKLEN 274 Query: 482 KRVRSRSKERDHETTKEK--EKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDR 539 R+ S + +E+ E ++ + + S ++ S+ + E + +KE+D Sbjct: 275 ARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRACLERIQELEDMLAKERDN 334 Query: 540 -RAQSRSRERDLTK 552 R RER++ + Sbjct: 335 SRRMLSDREREMAE 348 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 758 0.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 245 1e-67 Sequences not found previously or not previously below threshold: swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 28 0.042 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 27 0.11 trembl|Q60952|Q60952 Intranuclear protein. 24 0.60 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 24 0.71 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 23 1.4 swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 22 2.9 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 4.9 swiss|P14733|LAM1_MOUSE Lamin B1. 21 5.0 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 20 7.9 CONVERGED! >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 758 bits (1957), Expect = 0.0 Identities = 752/752 (100%), Positives = 752/752 (100%) Query: 1 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY 60 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY Sbjct: 1 MGIKVQRPRCFFDIAINNQPAGRVVFELFSDVCPKTCENFRCLCTGEKGTGKSTQKPLHY 60 Query: 61 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN 120 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN Sbjct: 61 KSCLFHRVVKDFMVQGGDFSEGNGRGGESIYGGFFEDESFAVKHNKEFLLSMANRGKDTN 120 Query: 121 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK 180 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK Sbjct: 121 GSQFFITTKPTPHLDGHHVVFGQVISGQEVVREIENQKTDAASKPFAEVRILSCGELVPK 180 Query: 181 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK 240 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK Sbjct: 181 SKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHK 240 Query: 241 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR 300 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR Sbjct: 241 KEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNR 300 Query: 301 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW 360 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW Sbjct: 301 ERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETPPHW 360 Query: 361 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK 420 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK Sbjct: 361 RQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRVKEKKITDHRHMSESPNRGIEK 420 Query: 421 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE 480 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE Sbjct: 421 GGKVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKERDSKHSRHE 480 Query: 481 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR 540 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR Sbjct: 481 DKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDRR 540 Query: 541 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR 600 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR Sbjct: 541 AQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRR 600 Query: 601 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK 660 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK Sbjct: 601 GRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEK 660 Query: 661 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN 720 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN Sbjct: 661 RMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKEN 720 Query: 721 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG 752 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG Sbjct: 721 QKSKGQENDHVHDKNKKCDHESSPGTDEDKSG 752 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 245 bits (626), Expect = 1e-67 Identities = 50/217 (23%), Positives = 82/217 (37%), Gaps = 39/217 (17%) Query: 177 LVPKSKV----KKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDS--ETASEEKSRKRKK 230 LVP ++ K+EEKK + + S D DS D+ E S+ +S ++ Sbjct: 153 LVPSDEIKDGDKQEEKKEPELAKRKSHQDLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSP 212 Query: 231 KHRKNSRKHKKEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENR-------- 282 H+ + + + ++ ++ ++ E + + S+ R EIPP ++ Sbjct: 213 VHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRASSRRRPEIPPDVQDSADTSIKRS 272 Query: 283 --FLMRKSPPKADDKE----RKNRERERER---------------ECNPPNSQ-PASYQR 320 R+ P D KE R N + R R + NP S+ PAS Sbjct: 273 SSRRYRERPADQDAKESPIKRSNSRKPRHRPLHDQDFEITADSPDKVNPGASEFPASI-- 330 Query: 321 RFLVTRSGRKIKGRGPRRYRTPSRSRSRDRFRRSETP 357 + T GR+ + R R TP R R E P Sbjct: 331 -VIQTAGGRERRPRHERMVSTPDVPRIDSRSLELEGP 366 Score = 61.7 bits (148), Expect = 2e-12 Identities = 28/162 (17%), Positives = 53/162 (32%), Gaps = 45/162 (27%) Query: 186 EEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKHKKEKKK 245 EE + S +SS DS+D+ ++ Sbjct: 238 EEDTKKASLKRASSRRRPEIPPDVQ--DSADTS----------------------IKRSS 273 Query: 246 RKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKERKNRERERE 305 ++ ++ P+ + +A + ++ +P P ++ + SP Sbjct: 274 SRRYRERPADQ-DAKESPIKRSNSRKPRHRPLHDQDFEITADSP---------------- 316 Query: 306 RECNPPNSQ-PASYQRRFLVTRSGRKIKGRGPRRYRTPSRSR 346 + NP S+ PAS + T GR+ + R R TP R Sbjct: 317 DKVNPGASEFPASI---VIQTAGGRERRPRHERMVSTPDVPR 355 Score = 29.4 bits (64), Expect = 0.014 Identities = 22/141 (15%), Positives = 46/141 (32%), Gaps = 14/141 (9%) Query: 188 KKRHKSSSSSSSSDSDSSSDSQSSSDS-----------SDSETASEEKSRKRKKKHRKNS 236 ++R + S+ D + +D++ A ++ + Sbjct: 339 ERRPRHERMVSTPDVPRIDSRSLELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLT 398 Query: 237 RKHKKEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKE 296 RK + RK+S S +E + +Q + R P +R P D + Sbjct: 399 RKSSPPRTSRKESPPPYPSVTERKRMPSQSATNRRRRNSMAAPRDRREY---PIGDDPRY 455 Query: 297 RKNRERERERECNPPNSQPAS 317 ++ E+ + P + AS Sbjct: 456 QQLDHHEQPQSFRPAVEEDAS 476 Score = 22.4 bits (46), Expect = 2.0 Identities = 82/459 (17%), Positives = 156/459 (33%), Gaps = 68/459 (14%) Query: 240 KKEKKKRK----KSKKSPSSESEADNVDAQPQSTVRPEEI-PPIPENRFLMRKSPPKADD 294 + + K + ++ + SE D +P R +++ PP E+ + Sbjct: 17 RTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPTVEDE-------ADSLA 69 Query: 295 KERKNRERERERECNPPNS---------QPA---SYQRRFLVTRS---GRKIKGRGPRRY 339 KE + E PPN +P + +RRF+V G++ + Sbjct: 70 KEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKESEPAPSIVE 129 Query: 340 RTPSRSRSRDRFRRSETPPHWRQEMQRAQRMRVSSGERWIKGDKSELNEIKENQRSPVRV 399 TP + + + ++ + G++ + + EL + K +Q P Sbjct: 130 TTPPVTSTDTAQIKETNDC---KKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQDLPRLD 186 Query: 400 KEKKITDHRHMSESPNRGIEKGGKVKDHKLESKERDIRRNSGK-------DDKYNKNKVK 452 + D + + P + K H+ + K I+R+ + D++ K Sbjct: 187 THVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEEDTKKASL 246 Query: 453 KRGKSKSRS--------------KSKERSKSKERDSKHSRHEDKRVRSRSKERDHETTKE 498 KR S+ R K + +ER + E RS S++ H + Sbjct: 247 KRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPRHRPLHD 306 Query: 499 KEKQLDSKGKDQERSRSKENSKQV-------ESKSNEHDHSKSKEKDRRAQSRSRERD-- 549 ++ ++ + D+ + E + + H+ S R SRS E + Sbjct: 307 QDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRSLELEGP 366 Query: 550 --LTKSKHSYNSRTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRRGRSRSRD 607 S + YN S + V HD R S R +E S + Sbjct: 367 GIAPLSMYRYNDADDAMSF---MVQEVVIPDHDLTRKSSPP--RTSRKESPPPYPSVTER 421 Query: 608 RRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQD 646 +R P +S + +RRRR + +R E + +Y+ D Sbjct: 422 KRMPSQS-ATNRRRRNSMAAPRDRREYPIGDDPRYQQLD 459 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 27.8 bits (60), Expect = 0.042 Identities = 19/143 (13%), Positives = 44/143 (30%), Gaps = 10/143 (6%) Query: 180 KSKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSDSSDSETASEEKSRKRKKKHRKNSRKH 239 + E ++S S + +S+ +E + + R Sbjct: 44 SEPSRVSEGSENESGSGEEYEGAP----HVHFQESTATEEPRTPSKTPASQTTQSIGRTV 99 Query: 240 KKEKKKRKKSKKSPSSESEADNVDAQPQSTVRPEEIPPIPENRFLMRKSPPKADDKER-- 297 + R+ S++SP A + A + + + + + K P ER Sbjct: 100 SIAEATRRASQESPRQTETAGSSSAPDALSTQQPPLEHSTQQN-IPTKLPSTQQAVERVV 158 Query: 298 ---KNRERERERECNPPNSQPAS 317 RE +P +++ ++ Sbjct: 159 SSQTTSRAAETRERSPEHTEAST 181 Score = 23.6 bits (49), Expect = 0.77 Identities = 25/139 (17%), Positives = 58/139 (40%), Gaps = 7/139 (5%) Query: 458 KSRSKSKERSKSKERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKE 517 +S + + R+ SK S+ ++ + V R ++E +Q ++ G S S Sbjct: 73 ESTATEEPRTPSKTPASQTTQSIGRTVSIAEATR--RASQESPRQTETAG-----SSSAP 125 Query: 518 NSKQVESKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSR 577 ++ + EH ++ + ++ ER ++ S + TRERS + + Sbjct: 126 DALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEASTQEAE 185 Query: 578 SHDRDRSRSKEYHRYREQE 596 + + ++ + ++ YR E Sbjct: 186 TAEEALNKEEFWNLYRSLE 204 Score = 22.4 bits (46), Expect = 2.0 Identities = 15/72 (20%), Positives = 25/72 (33%), Gaps = 1/72 (1%) Query: 287 KSPPKADDKERKNRERERERECNPPNSQPASYQRRFLVTRSGRKIKGRGPRRYRTPSRSR 346 + P +E E R P + S R + + R+ PR+ T S Sbjct: 64 EGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAGSSS 123 Query: 347 SRDRFRRSETPP 358 + D ++ PP Sbjct: 124 APD-ALSTQQPP 134 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 26.7 bits (57), Expect = 0.11 Identities = 24/100 (24%), Positives = 43/100 (43%), Gaps = 8/100 (8%) Query: 624 RDSRSSEREESQSRNKEKYRSQDSKSSHRKENSEGEKRMYSKSRDHSSSNNNREKKADID 683 R +E+ S+N K + K + S G + +Y+K + + + R +AD+ Sbjct: 24 RQKPPAEKTNQSSKNIGKKAAPFKKQKRANKGSSGAQVLYAKDSGGAQNESLRSNEADL- 82 Query: 684 QSPVSKTKQSSQDNEVKSSTLKNQEDEKTRSPVEKENQKS 723 +K K + + SS KNQ D V +++Q S Sbjct: 83 ----AKKKNPNPSGDTGSS--KNQ-DSMAAQNVHQKSQMS 115 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 24.0 bits (50), Expect = 0.60 Identities = 30/168 (17%), Positives = 72/168 (42%), Gaps = 8/168 (4%) Query: 495 TTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEK-DRRAQSRSRERDLTKS 553 T E+E++L + + + SR +E + ++++ + SK++ + QS + +L+ + Sbjct: 17 TLTEQEQELAALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHT 76 Query: 554 KHSYNS-------RTRERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRRGRSRSR 606 + + + R+R + + R+R + E ++ +QE ++ + Sbjct: 77 LEASTAALQATLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKEEELRQ 136 Query: 607 DRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSSHRKE 654 G RS++ + + S ER+E R E + + R+E Sbjct: 137 LLEKAGARRSQENGIQEKQSLEQERQEETRRLLESLKELQLTVAQREE 184 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 23.6 bits (49), Expect = 0.71 Identities = 15/69 (21%), Positives = 34/69 (48%), Gaps = 1/69 (1%) Query: 513 SRSKENSKQVESKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSR 572 +R ++ ++E ++ E +H+ + E+ ++RSR+ + + N+ T + R Sbjct: 221 NRLQQKKSELEGRT-EVNHANTNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKR 279 Query: 573 RVRSRSHDR 581 RV +R R Sbjct: 280 RVATRVRGR 288 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.8 bits (47), Expect = 1.4 Identities = 26/120 (21%), Positives = 47/120 (38%), Gaps = 7/120 (5%) Query: 154 IENQKTDAASKPFAEVRILSCGELVPKSKVKKEEKKRHKSSSSSSSSDSDSSSDSQSSSD 213 +E +K + + + ++ S L + + K + K K SS S S +S S Sbjct: 124 LELKKEEIKQRQIEQEKLAS---LKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSP 180 Query: 214 SSDSETASEEKSRK----RKKKHRKNSRKHKKEKKKRKKSKKSPSSESEADNVDAQPQST 269 S + + RK R +HR SR +K++KS + P + Q ++ Sbjct: 181 SPRRRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATS 240 Score = 22.0 bits (45), Expect = 2.2 Identities = 32/119 (26%), Positives = 51/119 (41%), Gaps = 4/119 (3%) Query: 533 KSKEKDRRAQSRSRERDLTKSKHSYNSRTR-ERSRSRDRSRRVRSRSHDRDRSRSKEYHR 591 K +E +R + + L K + R + E+ SR++ R RS + RS S Sbjct: 127 KKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRS 186 Query: 592 YREQEYRRRGRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNKEKYRSQDSKSS 650 Y + R+R SRS RT RS S ++ +S E E R K+ + + +S Sbjct: 187 YPVRRERKRSHSRSPRHRTKSRSPSPAPEKKE---KSPELPEPSVRMKDSSVQEATSTS 242 Score = 20.5 bits (41), Expect = 6.5 Identities = 26/96 (27%), Positives = 44/96 (45%), Gaps = 1/96 (1%) Query: 562 RERSRSRDRSRRVRSRSHDRDRSRSKEYHRYREQEYRRRGRSRSRDRRTPGRSRSKDRRR 621 ++R +++ ++ + D+D+ +E RE+ R R R + R R RS RR Sbjct: 132 KQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRSYPVRR 191 Query: 622 RRRDSRS-SEREESQSRNKEKYRSQDSKSSHRKENS 656 R+ S S S R ++SR+ + KS E S Sbjct: 192 ERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPS 227 >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 21.6 bits (44), Expect = 2.9 Identities = 7/32 (21%), Positives = 11/32 (33%) Query: 228 RKKKHRKNSRKHKKEKKKRKKSKKSPSSESEA 259 R + H+ RK++ P ES Sbjct: 190 RPPVDIHSGNMHRTGISPRKRTLPEPFDESNT 221 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 20.9 bits (42), Expect = 4.9 Identities = 75/402 (18%), Positives = 157/402 (38%), Gaps = 29/402 (7%) Query: 346 RSRDRFRRSETPPHWRQEMQRAQRMRVSSGERWIKGDKSEL-NEIKENQRSPVRVKEKKI 404 RS +F + R+ + + E I +SEL + NQ +R K+ Sbjct: 334 RSSKKFLGDKIAVSERESSLLKKEQELLVAEEKIASKESELIQNVLANQEVILR---KRK 390 Query: 405 TDHRHMSESPNRGIEKGGKVKDHKLESKERDIR-RNSGKDDKYNKNKVKKRGKSKSRSKS 463 +D E ++ +E + K E +E DI+ R +K + +V+ R ++ Sbjct: 391 SDVEAELECKSKSVEVEIESKRRAWELREVDIKQREDLVGEKEHDLEVQSRALAEKEKDI 450 Query: 464 KERSKSKERDSKHSRHEDKRVRSRSKERDHETTKEKEKQLDSKGKDQERSRSKENSKQVE 523 E+S + + K+ ++ + ++ + E KE+ ++LD + + Q + ++ K+V+ Sbjct: 451 TEKSFNLDEKEKNLVATEEDINRKTTMLEDE--KERLRKLDLELQ-QSLTSLEDKRKRVD 507 Query: 524 SKSNEHDHSKSKEKDRRAQSRSRERDLTKSKHSYNSRTRERSRSRDRSRRVRSRSH---- 579 S + + + KS+ + + +L + E R + + + Sbjct: 508 SATQKLEALKSETSELSTLEMKLKEELDDLRAQKLEMLAEADRLKVEKAKFEAEWEHIDV 567 Query: 580 DRDRSRSKEYHRYREQEYRRRGRSRSRDRRTPGRSRSKDRRRRRRDSRSSEREESQSRNK 639 R+ R + + R++E RD R +++ + +S + EREE ++ Sbjct: 568 KREELRKEAEYITRQREAFSMYLKDERDNIKEERDALRNQHKNDVESLNREREEFMNKMV 627 Query: 640 EKYRSQDSKSSHRKENSEGEKRMYSKSRDHSSSNNNREKKADIDQSPVSKTKQSSQDNEV 699 E++ SK + + M + ++ N E + SS+D E Sbjct: 628 EEHSEWLSKIQRERADFLLGIEMQKRELEYCIENKREE------------LENSSRDREK 675 Query: 700 KSSTLKNQEDEKTRSPVEKENQKSKGQENDHVHDKNKKCDHE 741 K E+E+ +S E +E +HV + K+ D E Sbjct: 676 AFEQEKKLEEERIQSLKEMAE-----KELEHVQVELKRLDAE 712 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 20.9 bits (42), Expect = 5.0 Identities = 22/134 (16%), Positives = 61/134 (45%), Gaps = 4/134 (2%) Query: 423 KVKDHKLESKERDIRRNSGKDDKYNKNKVKKRGKSKSRSKSKERSKSKE-RDSKHSRHED 481 ++ + + + + R + +SG+ +Y + + + + ++ R +E + H++ E+ Sbjct: 215 EINETRRKHETRLVEVDSGRQIEYEYKLAQALHEMREQHDAQVRLYKEELEQTYHAKLEN 274 Query: 482 KRVRSRSKERDHETTKEK--EKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKSKEKDR 539 R+ S + +E+ E ++ + + S ++ S+ + E + +KE+D Sbjct: 275 ARLSSEMNTSTVNSAREELMESRMRIESLSSQLSNLQKESRACLERIQELEDMLAKERDN 334 Query: 540 -RAQSRSRERDLTK 552 R RER++ + Sbjct: 335 SRRMLSDREREMAE 348 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 20.1 bits (40), Expect = 7.9 Identities = 43/185 (23%), Positives = 75/185 (40%), Gaps = 21/185 (11%) Query: 433 ERDIRRNSGKDDKYNKNKVKKR---GKSKSRSKSKERSKSK-------------ERDSKH 476 ERDIR N K + + + KS + +E K K + + Sbjct: 174 ERDIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQKEKRIAEELIVQIEGLQNNLMQ 233 Query: 477 SRHEDKRVRSRSKERDH--ETTKEKEKQLDSKGKDQERSRSKENSKQVESKSNEHDHSKS 534 + + K+++ KE+ + +EK L ++ KD+E S SK E K +E D S Sbjct: 234 MKEDKKKLQEELKEKLDLIQVLQEKITLLTTEIKDKEASLQSTTSKLAE-KESEVDKLSS 292 Query: 535 KEKDRRAQSRSRERDLTKSKHSYNSRTRERS--RSRDRSRRVRSRSHDRDRSRSKEYHRY 592 ++ + Q + ++ + K R RE R + + VR S +R SK+ Sbjct: 293 MYQESQDQLMNLTSEIKELKVEVQKRERELELKRESEDNLNVRLNSLLVERDESKKELDA 352 Query: 593 REQEY 597 ++EY Sbjct: 353 IQKEY 357 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.303 0.126 0.343 Lambda K H 0.267 0.0385 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 96,682 Number of Sequences: 43 Number of extensions: 6227 Number of successful extensions: 1328 Number of sequences better than 10.0: 49 Number of HSP's better than 10.0 without gapping: 49 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 236 length of query: 752 length of database: 17,163 effective HSP length: 62 effective length of query: 690 effective length of database: 14,497 effective search space: 10002930 effective search space used: 10002930 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (20.5 bits) S2: 40 (20.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. (250 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|O85062|REP2_BUCAP Probable replication-associated protein... 438 e-126 swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). 20 3.1 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 18 8.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 18 8.0 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 18 8.1 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 438 bits (1127), Expect = e-126 Identities = 250/250 (100%), Positives = 250/250 (100%) Query: 1 MPRKNYICNLKPVFNPPKNEKKRSNFIDYAMKKASEIDVARSKLNYTLLAIDPKTGNILP 60 MPRKNYICNLKPVFNPPKNEKKRSNFIDYAMKKASEIDVARSKLNYTLLAIDPKTGNILP Sbjct: 1 MPRKNYICNLKPVFNPPKNEKKRSNFIDYAMKKASEIDVARSKLNYTLLAIDPKTGNILP 60 Query: 61 RFRRLNEHRACAMRAIVLAMLYYWNTNSNLVEASIEKLSDECGLSTFSNSGNKSITRASR 120 RFRRLNEHRACAMRAIVLAMLYYWNTNSNLVEASIEKLSDECGLSTFSNSGNKSITRASR Sbjct: 61 RFRRLNEHRACAMRAIVLAMLYYWNTNSNLVEASIEKLSDECGLSTFSNSGNKSITRASR 120 Query: 121 LITDFMEPMGFIKCKRKKTQSSSNYIPKKIFLTSNFFMLFHISQSTINQYLSEQKQSIKN 180 LITDFMEPMGFIKCKRKKTQSSSNYIPKKIFLTSNFFMLFHISQSTINQYLSEQKQSIKN Sbjct: 121 LITDFMEPMGFIKCKRKKTQSSSNYIPKKIFLTSNFFMLFHISQSTINQYLSEQKQSIKN 180 Query: 181 LKKEKKIFISFSDIRVISKLDEKAARNKILNALIKYYSASELTKIGPQGLKKKIDIEYSN 240 LKKEKKIFISFSDIRVISKLDEKAARNKILNALIKYYSASELTKIGPQGLKKKIDIEYSN Sbjct: 181 LKKEKKIFISFSDIRVISKLDEKAARNKILNALIKYYSASELTKIGPQGLKKKIDIEYSN 240 Query: 241 LCKLYKKNNK 250 LCKLYKKNNK Sbjct: 241 LCKLYKKNNK 250 >swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). Length = 224 Score = 20.0 bits (40), Expect = 3.1 Identities = 9/27 (33%), Positives = 19/27 (70%) Query: 87 NSNLVEASIEKLSDECGLSTFSNSGNK 113 N+N+ +++ + L LSTFS++G++ Sbjct: 187 NTNVQKSARQALKKYPLLSTFSSNGSR 213 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 18.5 bits (36), Expect = 8.0 Identities = 9/16 (56%), Positives = 10/16 (62%) Query: 18 KNEKKRSNFIDYAMKK 33 +NE RSN D A KK Sbjct: 71 QNESLRSNEADLAKKK 86 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 18.5 bits (36), Expect = 8.0 Identities = 9/28 (32%), Positives = 13/28 (46%) Query: 96 EKLSDECGLSTFSNSGNKSITRASRLIT 123 E+ + C L+ F K I R L+T Sbjct: 942 EEFREACRLNDFGYLDGKYIERLGSLLT 969 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 18.5 bits (36), Expect = 8.1 Identities = 6/22 (27%), Positives = 13/22 (58%) Query: 164 QSTINQYLSEQKQSIKNLKKEK 185 Q ++QY+ + + + LKK + Sbjct: 91 QHKLSQYVEKDPSTFEELKKRR 112 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|O85062|REP2_BUCAP Probable replication-associated protein... 476 e-138 Sequences not found previously or not previously below threshold: swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). 21 1.6 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 20 3.4 trembl|Q15219|Q15219 Protein p84. 19 4.8 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 19 5.5 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 19 6.2 trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 18 7.5 CONVERGED! >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 476 bits (1226), Expect = e-138 Identities = 250/250 (100%), Positives = 250/250 (100%) Query: 1 MPRKNYICNLKPVFNPPKNEKKRSNFIDYAMKKASEIDVARSKLNYTLLAIDPKTGNILP 60 MPRKNYICNLKPVFNPPKNEKKRSNFIDYAMKKASEIDVARSKLNYTLLAIDPKTGNILP Sbjct: 1 MPRKNYICNLKPVFNPPKNEKKRSNFIDYAMKKASEIDVARSKLNYTLLAIDPKTGNILP 60 Query: 61 RFRRLNEHRACAMRAIVLAMLYYWNTNSNLVEASIEKLSDECGLSTFSNSGNKSITRASR 120 RFRRLNEHRACAMRAIVLAMLYYWNTNSNLVEASIEKLSDECGLSTFSNSGNKSITRASR Sbjct: 61 RFRRLNEHRACAMRAIVLAMLYYWNTNSNLVEASIEKLSDECGLSTFSNSGNKSITRASR 120 Query: 121 LITDFMEPMGFIKCKRKKTQSSSNYIPKKIFLTSNFFMLFHISQSTINQYLSEQKQSIKN 180 LITDFMEPMGFIKCKRKKTQSSSNYIPKKIFLTSNFFMLFHISQSTINQYLSEQKQSIKN Sbjct: 121 LITDFMEPMGFIKCKRKKTQSSSNYIPKKIFLTSNFFMLFHISQSTINQYLSEQKQSIKN 180 Query: 181 LKKEKKIFISFSDIRVISKLDEKAARNKILNALIKYYSASELTKIGPQGLKKKIDIEYSN 240 LKKEKKIFISFSDIRVISKLDEKAARNKILNALIKYYSASELTKIGPQGLKKKIDIEYSN Sbjct: 181 LKKEKKIFISFSDIRVISKLDEKAARNKILNALIKYYSASELTKIGPQGLKKKIDIEYSN 240 Query: 241 LCKLYKKNNK 250 LCKLYKKNNK Sbjct: 241 LCKLYKKNNK 250 >swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). Length = 224 Score = 20.8 bits (42), Expect = 1.6 Identities = 9/27 (33%), Positives = 19/27 (70%) Query: 87 NSNLVEASIEKLSDECGLSTFSNSGNK 113 N+N+ +++ + L LSTFS++G++ Sbjct: 187 NTNVQKSARQALKKYPLLSTFSSNGSR 213 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 19.6 bits (39), Expect = 3.4 Identities = 9/16 (56%), Positives = 10/16 (62%) Query: 18 KNEKKRSNFIDYAMKK 33 +NE RSN D A KK Sbjct: 71 QNESLRSNEADLAKKK 86 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 19.3 bits (38), Expect = 4.8 Identities = 8/18 (44%), Positives = 11/18 (60%) Query: 110 SGNKSITRASRLITDFME 127 +GN+ +TR L D ME Sbjct: 433 TGNEELTRLWNLCPDNME 450 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 18.9 bits (37), Expect = 5.5 Identities = 6/22 (27%), Positives = 13/22 (58%) Query: 164 QSTINQYLSEQKQSIKNLKKEK 185 Q ++QY+ + + + LKK + Sbjct: 91 QHKLSQYVEKDPSTFEELKKRR 112 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 18.9 bits (37), Expect = 6.2 Identities = 9/28 (32%), Positives = 13/28 (46%) Query: 96 EKLSDECGLSTFSNSGNKSITRASRLIT 123 E+ + C L+ F K I R L+T Sbjct: 942 EEFREACRLNDFGYLDGKYIERLGSLLT 969 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 18.5 bits (36), Expect = 7.5 Identities = 10/27 (37%), Positives = 15/27 (55%) Query: 183 KEKKIFISFSDIRVISKLDEKAARNKI 209 K+ K+ DIR S D+K +NK+ Sbjct: 425 KDHKLESKERDIRRNSGKDDKYNKNKV 451 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.318 0.133 0.372 Lambda K H 0.267 0.0406 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 16,788 Number of Sequences: 43 Number of extensions: 642 Number of successful extensions: 14 Number of sequences better than 10.0: 14 Number of HSP's better than 10.0 without gapping: 14 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 14 length of query: 250 length of database: 17,163 effective HSP length: 53 effective length of query: 197 effective length of database: 14,884 effective search space: 2932148 effective search space used: 2932148 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 36 (19.4 bits) S2: 36 (18.5 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). (212 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 410 e-118 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 22 0.45 swiss|P33418|LOS1_YEAST LOS1 protein. 20 2.0 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 20 2.0 swiss|O08579|EMD_MOUSE Emerin. 18 8.8 >swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). Length = 212 Score = 410 bits (1054), Expect = e-118 Identities = 212/212 (100%), Positives = 212/212 (100%) Query: 1 MPDNCHFVRSVDLLSVILLRLSTEVGLTLTTLPRHSKELILYHCTYEEHAVELSKPGLRK 60 MPDNCHFVRSVDLLSVILLRLSTEVGLTLTTLPRHSKELILYHCTYEEHAVELSKPGLRK Sbjct: 1 MPDNCHFVRSVDLLSVILLRLSTEVGLTLTTLPRHSKELILYHCTYEEHAVELSKPGLRK 60 Query: 61 AGGKCSLFVDPEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPEFDLTDPDDFHKC 120 AGGKCSLFVDPEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPEFDLTDPDDFHKC Sbjct: 61 AGGKCSLFVDPEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPEFDLTDPDDFHKC 120 Query: 121 FSVTYSALSLMVPYLPRAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKETVNKIPAR 180 FSVTYSALSLMVPYLPRAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKETVNKIPAR Sbjct: 121 FSVTYSALSLMVPYLPRAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKETVNKIPAR 180 Query: 181 IPIEDMLLERGYVLPWVHGGTVKGGKLLTPND 212 IPIEDMLLERGYVLPWVHGGTVKGGKLLTPND Sbjct: 181 IPIEDMLLERGYVLPWVHGGTVKGGKLLTPND 212 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.3 bits (46), Expect = 0.45 Identities = 8/24 (33%), Positives = 14/24 (58%) Query: 71 PEERENSPSPPIPPYQISEMPLHE 94 PE++E SP P P ++ + + E Sbjct: 214 PEKKEKSPELPEPSVRMKDSSVQE 237 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 20.4 bits (41), Expect = 2.0 Identities = 10/25 (40%), Positives = 12/25 (48%) Query: 154 TTDMLDLNYLEELIEFSKETVNKIP 178 + D+L L LIE E VN P Sbjct: 51 SNDLLKFYGLSTLIELMTEGVNANP 75 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 20.4 bits (41), Expect = 2.0 Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 3/41 (7%) Query: 71 PEERENSPSPP---IPPYQISEMPLHELLESGNAKLVPNPE 108 P E + +PSPP +P + + +H L +PE Sbjct: 235 PTELQPTPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPE 275 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 18.1 bits (35), Expect = 8.8 Identities = 8/20 (40%), Positives = 9/20 (45%) Query: 99 GNAKLVPNPEFDLTDPDDFH 118 G +K P L D D FH Sbjct: 113 GMSKSFRQPGTSLVDADTFH 132 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 445 e-128 Sequences not found previously or not previously below threshold: trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 23 0.22 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 22 0.59 swiss|P33418|LOS1_YEAST LOS1 protein. 20 1.9 swiss|O08579|EMD_MOUSE Emerin. 19 5.9 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 18 7.9 CONVERGED! >swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). Length = 212 Score = 445 bits (1145), Expect = e-128 Identities = 212/212 (100%), Positives = 212/212 (100%) Query: 1 MPDNCHFVRSVDLLSVILLRLSTEVGLTLTTLPRHSKELILYHCTYEEHAVELSKPGLRK 60 MPDNCHFVRSVDLLSVILLRLSTEVGLTLTTLPRHSKELILYHCTYEEHAVELSKPGLRK Sbjct: 1 MPDNCHFVRSVDLLSVILLRLSTEVGLTLTTLPRHSKELILYHCTYEEHAVELSKPGLRK 60 Query: 61 AGGKCSLFVDPEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPEFDLTDPDDFHKC 120 AGGKCSLFVDPEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPEFDLTDPDDFHKC Sbjct: 61 AGGKCSLFVDPEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPEFDLTDPDDFHKC 120 Query: 121 FSVTYSALSLMVPYLPRAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKETVNKIPAR 180 FSVTYSALSLMVPYLPRAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKETVNKIPAR Sbjct: 121 FSVTYSALSLMVPYLPRAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKETVNKIPAR 180 Query: 181 IPIEDMLLERGYVLPWVHGGTVKGGKLLTPND 212 IPIEDMLLERGYVLPWVHGGTVKGGKLLTPND Sbjct: 181 IPIEDMLLERGYVLPWVHGGTVKGGKLLTPND 212 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 23.4 bits (49), Expect = 0.22 Identities = 8/24 (33%), Positives = 14/24 (58%) Query: 71 PEERENSPSPPIPPYQISEMPLHE 94 PE++E SP P P ++ + + E Sbjct: 214 PEKKEKSPELPEPSVRMKDSSVQE 237 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 21.9 bits (45), Expect = 0.59 Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 3/41 (7%) Query: 71 PEERENSPSPP---IPPYQISEMPLHELLESGNAKLVPNPE 108 P E + +PSPP +P + + +H L +PE Sbjct: 235 PTELQPTPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPE 275 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 20.3 bits (41), Expect = 1.9 Identities = 10/25 (40%), Positives = 12/25 (48%) Query: 154 TTDMLDLNYLEELIEFSKETVNKIP 178 + D+L L LIE E VN P Sbjct: 51 SNDLLKFYGLSTLIELMTEGVNANP 75 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 18.8 bits (37), Expect = 5.9 Identities = 8/20 (40%), Positives = 9/20 (45%) Query: 99 GNAKLVPNPEFDLTDPDDFH 118 G +K P L D D FH Sbjct: 113 GMSKSFRQPGTSLVDADTFH 132 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 18.0 bits (35), Expect = 7.9 Identities = 15/58 (25%), Positives = 29/58 (49%), Gaps = 2/58 (3%) Query: 118 HKCFSVTYSALSLMVPYLP--RAALKAARVFCKDHSILTTDMLDLNYLEELIEFSKET 173 H+ +VT + +L + + A +AA ++ +I+ T +L +LE I KE+ Sbjct: 517 HELAAVTENCDNLQTELVDVYKKAERAADELKQEKNIVVTLEKELTFLEAQITREKES 574 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.319 0.139 0.418 Lambda K H 0.267 0.0425 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 17,001 Number of Sequences: 43 Number of extensions: 819 Number of successful extensions: 22 Number of sequences better than 10.0: 16 Number of HSP's better than 10.0 without gapping: 16 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 22 length of query: 212 length of database: 17,163 effective HSP length: 52 effective length of query: 160 effective length of database: 14,927 effective search space: 2388320 effective search space used: 2388320 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (18.9 bits) S2: 35 (18.0 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. (219 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. 376 e-107 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 22 0.64 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 19 6.0 trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 18 8.6 >swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. Length = 219 Score = 376 bits (965), Expect = e-107 Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MHFADIPLSKPCLNNPPTYPWSSPILSSIANPSLCDIVSSPSSVSSFASSDDFAFMNAYC 60 MHFADIPLSKPCLNNPPTYPWSSPILSSIANPSLCDIVSSPSSVSSFASSDDFAFMNAYC Sbjct: 1 MHFADIPLSKPCLNNPPTYPWSSPILSSIANPSLCDIVSSPSSVSSFASSDDFAFMNAYC 60 Query: 61 LPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIAQLIRQQTDPPQIIHRK 120 LPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIAQLIRQQTDPPQIIHRK Sbjct: 61 LPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIAQLIRQQTDPPQIIHRK 120 Query: 121 QDKGLMARVLSRSKKQEERENHSSDARDAIKSALRRRMRRREGRVQKALRPTPNLICSKC 180 QDKGLMARVLSRSKKQEERENHSSDARDAIKSALRRRMRRREGRVQKALRPTPNLICSKC Sbjct: 121 QDKGLMARVLSRSKKQEERENHSSDARDAIKSALRRRMRRREGRVQKALRPTPNLICSKC 180 Query: 181 NTTFNHSTALMMHEATCLNQVPFKLSDFFVEDVIDDWLF 219 NTTFNHSTALMMHEATCLNQVPFKLSDFFVEDVIDDWLF Sbjct: 181 NTTFNHSTALMMHEATCLNQVPFKLSDFFVEDVIDDWLF 219 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 21.9 bits (45), Expect = 0.64 Identities = 23/125 (18%), Positives = 54/125 (42%), Gaps = 4/125 (3%) Query: 45 SSFASSDDFAFMNAYCLPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIA 104 ++FA + + N++ ++ + + S ++Q + ++R N+ A ++ KLG Sbjct: 595 NAFALAKELELANSHISSLEDEREV---LQKSVSEQKQISQESRENLEDAHSLVMKLGKE 651 Query: 105 QLIRQQTDPPQIIHRKQDKGLMARVLSRSKKQEERENHSSDARDAIKSALR-RRMRRREG 163 + ++ KG + R+ ++ + N+ K+A+ +R RRR+ Sbjct: 652 RESLEKRAKKLEDEMASAKGEILRLRTQVNSVKAPVNNEEKVEAGEKAAVTVKRTRRRKT 711 Query: 164 RVQKA 168 Q A Sbjct: 712 ATQPA 716 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 18.9 bits (37), Expect = 6.0 Identities = 8/19 (42%), Positives = 11/19 (57%) Query: 16 PPTYPWSSPILSSIANPSL 34 PP P + ILS ++ P L Sbjct: 170 PPLRPTAPTILSPLSQPRL 188 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 18.1 bits (35), Expect = 8.6 Identities = 8/28 (28%), Positives = 10/28 (35%) Query: 177 CSKCNTTFNHSTALMMHEATCLNQVPFK 204 CS C F + L H P+K Sbjct: 332 CSYCQKAFRQLSHLQQHTRIHTGDRPYK 359 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. 407 e-117 Sequences not found previously or not previously below threshold: swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 24 0.18 swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 20 2.6 trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 19 3.6 CONVERGED! >swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. Length = 219 Score = 407 bits (1045), Expect = e-117 Identities = 219/219 (100%), Positives = 219/219 (100%) Query: 1 MHFADIPLSKPCLNNPPTYPWSSPILSSIANPSLCDIVSSPSSVSSFASSDDFAFMNAYC 60 MHFADIPLSKPCLNNPPTYPWSSPILSSIANPSLCDIVSSPSSVSSFASSDDFAFMNAYC Sbjct: 1 MHFADIPLSKPCLNNPPTYPWSSPILSSIANPSLCDIVSSPSSVSSFASSDDFAFMNAYC 60 Query: 61 LPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIAQLIRQQTDPPQIIHRK 120 LPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIAQLIRQQTDPPQIIHRK Sbjct: 61 LPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIAQLIRQQTDPPQIIHRK 120 Query: 121 QDKGLMARVLSRSKKQEERENHSSDARDAIKSALRRRMRRREGRVQKALRPTPNLICSKC 180 QDKGLMARVLSRSKKQEERENHSSDARDAIKSALRRRMRRREGRVQKALRPTPNLICSKC Sbjct: 121 QDKGLMARVLSRSKKQEERENHSSDARDAIKSALRRRMRRREGRVQKALRPTPNLICSKC 180 Query: 181 NTTFNHSTALMMHEATCLNQVPFKLSDFFVEDVIDDWLF 219 NTTFNHSTALMMHEATCLNQVPFKLSDFFVEDVIDDWLF Sbjct: 181 NTTFNHSTALMMHEATCLNQVPFKLSDFFVEDVIDDWLF 219 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 23.9 bits (50), Expect = 0.18 Identities = 23/125 (18%), Positives = 54/125 (42%), Gaps = 4/125 (3%) Query: 45 SSFASSDDFAFMNAYCLPIQQNHQFGSPVAASPNQQPLVESQNRRNVTYASLVIGKLGIA 104 ++FA + + N++ ++ + + S ++Q + ++R N+ A ++ KLG Sbjct: 595 NAFALAKELELANSHISSLEDEREV---LQKSVSEQKQISQESRENLEDAHSLVMKLGKE 651 Query: 105 QLIRQQTDPPQIIHRKQDKGLMARVLSRSKKQEERENHSSDARDAIKSALR-RRMRRREG 163 + ++ KG + R+ ++ + N+ K+A+ +R RRR+ Sbjct: 652 RESLEKRAKKLEDEMASAKGEILRLRTQVNSVKAPVNNEEKVEAGEKAAVTVKRTRRRKT 711 Query: 164 RVQKA 168 Q A Sbjct: 712 ATQPA 716 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 20.0 bits (40), Expect = 2.6 Identities = 8/19 (42%), Positives = 11/19 (57%) Query: 16 PPTYPWSSPILSSIANPSL 34 PP P + ILS ++ P L Sbjct: 170 PPLRPTAPTILSPLSQPRL 188 Score = 18.1 bits (35), Expect = 7.8 Identities = 8/14 (57%), Positives = 10/14 (71%) Query: 28 SIANPSLCDIVSSP 41 S+ NPSL I S+P Sbjct: 22 SLGNPSLSVIPSNP 35 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 19.3 bits (38), Expect = 3.6 Identities = 8/28 (28%), Positives = 10/28 (35%) Query: 177 CSKCNTTFNHSTALMMHEATCLNQVPFK 204 CS C F + L H P+K Sbjct: 332 CSYCQKAFRQLSHLQQHTRIHTGDRPYK 359 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.319 0.131 0.393 Lambda K H 0.267 0.0402 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 17,240 Number of Sequences: 43 Number of extensions: 728 Number of successful extensions: 31 Number of sequences better than 10.0: 14 Number of HSP's better than 10.0 without gapping: 14 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 31 length of query: 219 length of database: 17,163 effective HSP length: 52 effective length of query: 167 effective length of database: 14,927 effective search space: 2492809 effective search space used: 2492809 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (19.1 bits) S2: 35 (18.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). (296 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 509 e-147 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 21 1.8 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 21 2.0 swiss|P24836|VE6_HPV56 E6 protein. 21 2.1 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 4.3 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 20 5.0 swiss|P14733|LAM1_MOUSE Lamin B1. 19 6.5 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 509 bits (1311), Expect = e-147 Identities = 296/296 (100%), Positives = 296/296 (100%) Query: 1 MDDIETAKNLTVKARTAYSVWDVCRLFIEMIAPDVDIDIESKRKSDELLFPGYVIRPMES 60 MDDIETAKNLTVKARTAYSVWDVCRLFIEMIAPDVDIDIESKRKSDELLFPGYVIRPMES Sbjct: 1 MDDIETAKNLTVKARTAYSVWDVCRLFIEMIAPDVDIDIESKRKSDELLFPGYVIRPMES 60 Query: 61 LTTGRPYGLDSSAEDSSVSSDSSAEVILPAAKMVKERFDSIGNGMLSSQEASQAAIDLML 120 LTTGRPYGLDSSAEDSSVSSDSSAEVILPAAKMVKERFDSIGNGMLSSQEASQAAIDLML Sbjct: 61 LTTGRPYGLDSSAEDSSVSSDSSAEVILPAAKMVKERFDSIGNGMLSSQEASQAAIDLML 120 Query: 121 QNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPPAPTEEDVMKLVS 180 QNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPPAPTEEDVMKLVS Sbjct: 121 QNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPPAPTEEDVMKLVS 180 Query: 181 VVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQKKSELEGRTEVNHAN 240 VVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQKKSELEGRTEVNHAN Sbjct: 181 VVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQKKSELEGRTEVNHAN 240 Query: 241 TNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSRV 296 TNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSRV Sbjct: 241 TNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSRV 296 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 20.8 bits (42), Expect = 1.8 Identities = 8/28 (28%), Positives = 17/28 (60%) Query: 168 PAPTEEDVMKLVSVVTQLLTLVPPDRQA 195 P+P + + +S++ + L+L P+R A Sbjct: 538 PSPENRTLWENLSMIRRFLSLPQPERDA 565 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 20.8 bits (42), Expect = 2.0 Identities = 30/129 (23%), Positives = 58/129 (44%), Gaps = 19/129 (14%) Query: 107 SSQEASQAAIDLMLQNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLV 166 SS+E + A L L K+L ++ K++A + E K+R + L K + Sbjct: 280 SSEEEVEKA--LKLTGKKILGTEVKIEKAMAFDKNKTAENKKERDSRTLFVKN------I 331 Query: 167 PPAPTEEDVMKLVSVVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQK 226 P + T E++ ++ + +P + + G ++ S NE A L++K Sbjct: 332 PYSTTVEELQEIFENAKDI--RIPTGKDGSNKGIAYVEFS--------NEDEANKALEEK 381 Query: 227 K-SELEGRT 234 + +E+EGR+ Sbjct: 382 QGAEIEGRS 390 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 20.8 bits (42), Expect = 2.1 Identities = 11/39 (28%), Positives = 17/39 (43%) Query: 129 RKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVP 167 RK Y ++ L KK+ ++L+R C L P Sbjct: 77 RKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTP 115 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 19.6 bits (39), Expect = 4.3 Identities = 9/21 (42%), Positives = 16/21 (75%) Query: 28 IEMIAPDVDIDIESKRKSDEL 48 +E + V+++IESKR++ EL Sbjct: 397 LECKSKSVEVEIESKRRAWEL 417 Score = 19.2 bits (38), Expect = 6.0 Identities = 8/27 (29%), Positives = 16/27 (58%) Query: 223 LQQKKSELEGRTEVNHANTNEEVPSRR 249 L+++KS++E E + E+ S+R Sbjct: 386 LRKRKSDVEAELECKSKSVEVEIESKR 412 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 19.6 bits (39), Expect = 5.0 Identities = 16/57 (28%), Positives = 28/57 (49%), Gaps = 12/57 (21%) Query: 109 QEASQAAIDLMLQNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLL 165 Q+ + A +L++Q L +N Q+ ++DKK+ E L K+D Q+L Sbjct: 211 QKEKRIAEELIVQIEGLQNNLMQM------------KEDKKKLQEELKEKLDLIQVL 255 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 19.2 bits (38), Expect = 6.5 Identities = 11/32 (34%), Positives = 13/32 (40%) Query: 220 ENRLQQKKSELEGRTEVNHANTNEEVPSRRTR 251 ENR Q +LE R + NE TR Sbjct: 195 ENRCQSLTEDLEFRKNMYEEEINETRRKHETR 226 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 561 e-163 Sequences not found previously or not previously below threshold: swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 25 0.089 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 23 0.44 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 22 0.62 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 1.4 swiss|P24836|VE6_HPV56 E6 protein. 21 1.9 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 21 2.1 swiss|P14733|LAM1_MOUSE Lamin B1. 20 2.3 swiss|P25233|NECD_MOUSE Necdin. 19 5.4 trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 19 6.6 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 19 7.1 CONVERGED! >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 561 bits (1445), Expect = e-163 Identities = 296/296 (100%), Positives = 296/296 (100%) Query: 1 MDDIETAKNLTVKARTAYSVWDVCRLFIEMIAPDVDIDIESKRKSDELLFPGYVIRPMES 60 MDDIETAKNLTVKARTAYSVWDVCRLFIEMIAPDVDIDIESKRKSDELLFPGYVIRPMES Sbjct: 1 MDDIETAKNLTVKARTAYSVWDVCRLFIEMIAPDVDIDIESKRKSDELLFPGYVIRPMES 60 Query: 61 LTTGRPYGLDSSAEDSSVSSDSSAEVILPAAKMVKERFDSIGNGMLSSQEASQAAIDLML 120 LTTGRPYGLDSSAEDSSVSSDSSAEVILPAAKMVKERFDSIGNGMLSSQEASQAAIDLML Sbjct: 61 LTTGRPYGLDSSAEDSSVSSDSSAEVILPAAKMVKERFDSIGNGMLSSQEASQAAIDLML 120 Query: 121 QNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPPAPTEEDVMKLVS 180 QNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPPAPTEEDVMKLVS Sbjct: 121 QNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPPAPTEEDVMKLVS 180 Query: 181 VVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQKKSELEGRTEVNHAN 240 VVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQKKSELEGRTEVNHAN Sbjct: 181 VVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQKKSELEGRTEVNHAN 240 Query: 241 TNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSRV 296 TNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSRV Sbjct: 241 TNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSRV 296 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 25.5 bits (54), Expect = 0.089 Identities = 30/129 (23%), Positives = 58/129 (44%), Gaps = 19/129 (14%) Query: 107 SSQEASQAAIDLMLQNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLV 166 SS+E + A L L K+L ++ K++A + E K+R + L K + Sbjct: 280 SSEEEVEKA--LKLTGKKILGTEVKIEKAMAFDKNKTAENKKERDSRTLFVKN------I 331 Query: 167 PPAPTEEDVMKLVSVVTQLLTLVPPDRQAALIGDLFIPESLKDIFNSFNELAAENRLQQK 226 P + T E++ ++ + +P + + G ++ S NE A L++K Sbjct: 332 PYSTTVEELQEIFENAKDI--RIPTGKDGSNKGIAYVEFS--------NEDEANKALEEK 381 Query: 227 K-SELEGRT 234 + +E+EGR+ Sbjct: 382 QGAEIEGRS 390 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 22.8 bits (47), Expect = 0.44 Identities = 29/133 (21%), Positives = 59/133 (43%), Gaps = 16/133 (12%) Query: 109 QEASQAAIDLMLQNNKLLDNRKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVPP 168 Q+ + A +L++Q L +N Q+ ++DKK+ E L K+D Q+L Sbjct: 211 QKEKRIAEELIVQIEGLQNNLMQM------------KEDKKKLQEELKEKLDLIQVLQEK 258 Query: 169 AP--TEEDVMKLVSVVTQLLTLVPPDRQAALIGDLF--IPESLKDIFNSFNELAAENRLQ 224 T E K S+ + L + + + ++ + L ++ + EL E + + Sbjct: 259 ITLLTTEIKDKEASLQSTTSKLAEKESEVDKLSSMYQESQDQLMNLTSEIKELKVEVQKR 318 Query: 225 QKKSELEGRTEVN 237 +++ EL+ +E N Sbjct: 319 ERELELKRESEDN 331 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 22.4 bits (46), Expect = 0.62 Identities = 8/28 (28%), Positives = 17/28 (60%) Query: 168 PAPTEEDVMKLVSVVTQLLTLVPPDRQA 195 P+P + + +S++ + L+L P+R A Sbjct: 538 PSPENRTLWENLSMIRRFLSLPQPERDA 565 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 21.2 bits (43), Expect = 1.4 Identities = 9/21 (42%), Positives = 16/21 (75%) Query: 28 IEMIAPDVDIDIESKRKSDEL 48 +E + V+++IESKR++ EL Sbjct: 397 LECKSKSVEVEIESKRRAWEL 417 Score = 20.8 bits (42), Expect = 2.1 Identities = 8/27 (29%), Positives = 16/27 (58%) Query: 223 LQQKKSELEGRTEVNHANTNEEVPSRR 249 L+++KS++E E + E+ S+R Sbjct: 386 LRKRKSDVEAELECKSKSVEVEIESKR 412 >swiss|P24836|VE6_HPV56 E6 protein. Length = 155 Score = 20.8 bits (42), Expect = 1.9 Identities = 11/39 (28%), Positives = 17/39 (43%) Query: 129 RKQLYKSIAIIIGRLPEKDKKRATEMLMRKMDCTQLLVP 167 RK Y ++ L KK+ ++L+R C L P Sbjct: 77 RKYRYYDYSVYGATLESITKKQLCDLLIRCYRCQSPLTP 115 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 20.8 bits (42), Expect = 2.1 Identities = 14/57 (24%), Positives = 23/57 (39%), Gaps = 13/57 (22%) Query: 228 SELEGRTEVNHANTNEEVPSRRTRSRDTNARGAYKLQNTITEGPKAVPTKKRRVATR 284 S + V H N+ ++ P+ +T N G KA P KK++ A + Sbjct: 11 SSVRNELSVYH-NSRQKPPAEKTNQSSKNI------------GKKAAPFKKQKRANK 54 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 20.4 bits (41), Expect = 2.3 Identities = 11/32 (34%), Positives = 13/32 (40%) Query: 220 ENRLQQKKSELEGRTEVNHANTNEEVPSRRTR 251 ENR Q +LE R + NE TR Sbjct: 195 ENRCQSLTEDLEFRKNMYEEEINETRRKHETR 226 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 19.3 bits (38), Expect = 5.4 Identities = 12/38 (31%), Positives = 15/38 (38%) Query: 44 KSDELLFPGYVIRPMESLTTGRPYGLDSSAEDSSVSSD 81 K D +P +E R L + A SSVS D Sbjct: 288 KKDPQAWPSRYREALEQARALREANLAAQAPRSSVSED 325 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 18.9 bits (37), Expect = 6.6 Identities = 8/37 (21%), Positives = 23/37 (61%), Gaps = 1/37 (2%) Query: 221 NRLQQKKSELEGRT-EVNHANTNEEVPSRRTRSRDTN 256 +R ++ ++E ++ E +H+ + E+ ++RSR+ + Sbjct: 513 SRSKENSKQVESKSNEHDHSKSKEKDRRAQSRSRERD 549 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 18.9 bits (37), Expect = 7.1 Identities = 8/17 (47%), Positives = 10/17 (58%) Query: 262 KLQNTITEGPKAVPTKK 278 +LQN I G K +KK Sbjct: 112 RLQNAIASGIKIPESKK 128 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.314 0.130 0.350 Lambda K H 0.267 0.0398 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 20,422 Number of Sequences: 43 Number of extensions: 878 Number of successful extensions: 28 Number of sequences better than 10.0: 26 Number of HSP's better than 10.0 without gapping: 26 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 28 length of query: 296 length of database: 17,163 effective HSP length: 55 effective length of query: 241 effective length of database: 14,798 effective search space: 3566318 effective search space used: 3566318 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 36 (19.2 bits) S2: 36 (18.5 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). (178 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). 360 e-103 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 24 0.14 trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 20 2.3 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 18 4.8 >swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). Length = 178 Score = 360 bits (925), Expect = e-103 Identities = 178/178 (100%), Positives = 178/178 (100%) Query: 1 MPPRRTKKLKTIPDTLLIGSIFTIFENLEAHAYLLPEDDKILWDQYFSEFDALKYLPTRD 60 MPPRRTKKLKTIPDTLLIGSIFTIFENLEAHAYLLPEDDKILWDQYFSEFDALKYLPTRD Sbjct: 1 MPPRRTKKLKTIPDTLLIGSIFTIFENLEAHAYLLPEDDKILWDQYFSEFDALKYLPTRD 60 Query: 61 GETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPHPSQTTNKERYMMLMDGIKYVLGSVTR 120 GETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPHPSQTTNKERYMMLMDGIKYVLGSVTR Sbjct: 61 GETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPHPSQTTNKERYMMLMDGIKYVLGSVTR 120 Query: 121 VNAEKAGDIIKDYNVPVNLTEREMNRAKAFCNGWQSVXLNESGRKLVMHSAGGEFQLK 178 VNAEKAGDIIKDYNVPVNLTEREMNRAKAFCNGWQSV LNESGRKLVMHSAGGEFQLK Sbjct: 121 VNAEKAGDIIKDYNVPVNLTEREMNRAKAFCNGWQSVXLNESGRKLVMHSAGGEFQLK 178 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 23.9 bits (50), Expect = 0.14 Identities = 12/40 (30%), Positives = 21/40 (52%) Query: 55 YLPTRDGETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPH 94 Y PT + T+ L EG + +V+ ++++L F V H Sbjct: 417 YDPTINKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSH 456 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 19.6 bits (39), Expect = 2.3 Identities = 8/28 (28%), Positives = 13/28 (45%) Query: 78 LHSEMELSIFDQVQVPHPSQTTNKERYM 105 +HS+M + PH S+T Y+ Sbjct: 290 IHSKMHTETIKPHKCPHCSKTFANTSYL 317 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 18.5 bits (36), Expect = 4.8 Identities = 9/23 (39%), Positives = 13/23 (56%) Query: 5 RTKKLKTIPDTLLIGSIFTIFEN 27 RT +K IP + + + IFEN Sbjct: 324 RTLFVKNIPYSTTVEELQEIFEN 346 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). 382 e-109 Sequences not found previously or not previously below threshold: swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 24 0.12 trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. 20 1.6 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 20 1.9 CONVERGED! >swiss|P13781|REP2_ZYGBI Trans-acting factor C (REP2). Length = 178 Score = 382 bits (981), Expect = e-109 Identities = 178/178 (100%), Positives = 178/178 (100%) Query: 1 MPPRRTKKLKTIPDTLLIGSIFTIFENLEAHAYLLPEDDKILWDQYFSEFDALKYLPTRD 60 MPPRRTKKLKTIPDTLLIGSIFTIFENLEAHAYLLPEDDKILWDQYFSEFDALKYLPTRD Sbjct: 1 MPPRRTKKLKTIPDTLLIGSIFTIFENLEAHAYLLPEDDKILWDQYFSEFDALKYLPTRD 60 Query: 61 GETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPHPSQTTNKERYMMLMDGIKYVLGSVTR 120 GETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPHPSQTTNKERYMMLMDGIKYVLGSVTR Sbjct: 61 GETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPHPSQTTNKERYMMLMDGIKYVLGSVTR 120 Query: 121 VNAEKAGDIIKDYNVPVNLTEREMNRAKAFCNGWQSVXLNESGRKLVMHSAGGEFQLK 178 VNAEKAGDIIKDYNVPVNLTEREMNRAKAFCNGWQSV LNESGRKLVMHSAGGEFQLK Sbjct: 121 VNAEKAGDIIKDYNVPVNLTEREMNRAKAFCNGWQSVXLNESGRKLVMHSAGGEFQLK 178 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 23.9 bits (50), Expect = 0.12 Identities = 12/40 (30%), Positives = 21/40 (52%) Query: 55 YLPTRDGETLQEMLSEGAPSTSVLHSEMELSIFDQVQVPH 94 Y PT + T+ L EG + +V+ ++++L F V H Sbjct: 417 YDPTINKWTMVAPLREGVSNAAVVSAKLKLFAFGGTSVSH 456 >trembl|Q9EQJ3|Q9EQJ3 Nuclear matrix transcription factor. Length = 563 Score = 20.4 bits (41), Expect = 1.6 Identities = 8/28 (28%), Positives = 13/28 (45%) Query: 78 LHSEMELSIFDQVQVPHPSQTTNKERYM 105 +HS+M + PH S+T Y+ Sbjct: 290 IHSKMHTETIKPHKCPHCSKTFANTSYL 317 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 20.0 bits (40), Expect = 1.9 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 7/66 (10%) Query: 5 RTKKLKTIPDTLLIGSIFTIFENLEAHAYLLP-----EDDKILWDQYFSEFDALKYLPTR 59 RT +K IP + + + IFEN A +P + I + ++ +E +A K L + Sbjct: 324 RTLFVKNIPYSTTVEELQEIFEN--AKDIRIPTGKDGSNKGIAYVEFSNEDEANKALEEK 381 Query: 60 DGETLQ 65 G ++ Sbjct: 382 QGAEIE 387 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.317 0.134 0.387 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 12,568 Number of Sequences: 43 Number of extensions: 497 Number of successful extensions: 11 Number of sequences better than 10.0: 11 Number of HSP's better than 10.0 without gapping: 11 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 11 length of query: 178 length of database: 17,163 effective HSP length: 50 effective length of query: 128 effective length of database: 15,013 effective search space: 1921664 effective search space used: 1921664 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 34 (18.4 bits) S2: 34 (17.7 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). (200 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 383 e-109 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 45 9e-08 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 44 1e-07 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 36 3e-05 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 27 0.023 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 25 0.055 swiss|P33418|LOS1_YEAST LOS1 protein. 20 1.7 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 20 2.9 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 19 5.1 swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 19 5.2 trembl|Q15219|Q15219 Protein p84. 19 5.4 swiss|O85062|REP2_BUCAP Probable replication-associated protein... 18 6.3 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 383 bits (983), Expect = e-109 Identities = 200/200 (100%), Positives = 200/200 (100%) Query: 1 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS 60 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS Sbjct: 1 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS 60 Query: 61 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY 120 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY Sbjct: 61 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY 120 Query: 121 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI 180 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI Sbjct: 121 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI 180 Query: 181 EFTFNCSLNLKSSKAISAGM 200 EFTFNCSLNLKSSKAISAGM Sbjct: 181 EFTFNCSLNLKSSKAISAGM 200 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 44.7 bits (104), Expect = 9e-08 Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 3/89 (3%) Query: 94 LSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFY 153 L Y+ D +L + R + ++ VR ++ G ++GVRC NG ++VS+ Sbjct: 307 LGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSIRL 366 Query: 154 GPCKWVTSMWVTLVPITLRENGSLEDIEF 182 + +T+ W+ L PI ENG ++D+ F Sbjct: 367 KH-RQLTATWIELEPII--ENGEVKDVMF 392 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 43.9 bits (102), Expect = 1e-07 Identities = 34/157 (21%), Positives = 73/157 (45%), Gaps = 7/157 (4%) Query: 33 KGNLSEESTSTSES---EIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEVSWVPHSGEDG 89 KG++ ++ E E+ N + + ++ + +D W ++ L W + + Sbjct: 212 KGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADE-GLWGEIDSLCEKWQSEAEDQT 270 Query: 90 VQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKV 149 ++ + + LK RR K+ +Y +++++ S+G V++YRG + + K+ Sbjct: 271 EAEIIADRIIGNSQRMANLKIRRTKFKSVLYHILKELIQSQGTVKVYRGSSFSHDSI-KI 329 Query: 150 SLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFNC 186 SL Y + +T++WV L + E+ D+E F C Sbjct: 330 SLHYEE-QHITAVWVYLT-VKFEEHWKPVDVEVEFRC 364 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 35.8 bits (81), Expect = 3e-05 Identities = 34/158 (21%), Positives = 68/158 (42%), Gaps = 4/158 (2%) Query: 29 EKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEV-SWVPHSGE 87 E++ + + T E +T + ++E + +FW + LE S++ Sbjct: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 Query: 88 DGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVL 147 ++ L ++ + E +RR+ KN +Y ++ ++ ++R YRG R L Sbjct: 215 PDIR-LLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTL 273 Query: 148 KVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 VS Y + + V L P L+E S+ +I +T + Sbjct: 274 SVSFKYKNRPQFSGVKVDLEPKFLQE--SVVNIYYTIS 309 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 26.6 bits (57), Expect = 0.023 Identities = 11/28 (39%), Positives = 19/28 (67%) Query: 93 KLSQYVEKDPSTFEELKKRRLAAKNDVY 120 K ++ ++ DPS FE L+K R+ A + +Y Sbjct: 165 KGTKQLKLDPSIFESLQKERVEAGDVIY 192 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 25.4 bits (54), Expect = 0.055 Identities = 17/85 (20%), Positives = 34/85 (40%), Gaps = 14/85 (16%) Query: 7 ELERFGKAIRKIWLYETI---MLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEH 63 EL+ +RK+W+ E + + I +L EE + S + + + + ++ + Sbjct: 198 ELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSN 257 Query: 64 VSDSLK-----------FWEVLNFL 77 D+L FWE + L Sbjct: 258 GLDALNTQINAIETEESFWEAIRAL 282 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 20.4 bits (41), Expect = 1.7 Identities = 15/54 (27%), Positives = 27/54 (49%), Gaps = 7/54 (12%) Query: 65 SDSLKFWEVLNFL----EVSWVPHSGEDGVQ---HKLSQYVEKDPSTFEELKKR 111 S LK WE+++FL ++ + H+ D Q L+ + K S EE+ ++ Sbjct: 817 STDLKSWEMIDFLGFLSQLIHMFHTDTDCYQLFNQLLTPLINKVHSIIEEIDEQ 870 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 19.6 bits (39), Expect = 2.9 Identities = 10/47 (21%), Positives = 22/47 (46%) Query: 8 LERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQ 54 L+ FG+ + L + N+ +E+ + S EIP+++ + Sbjct: 134 LQFFGREVTLFRLKNYLTSERHSDVDDNVVQENLNDSRGEIPDSTPE 180 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 18.9 bits (37), Expect = 5.1 Identities = 9/24 (37%), Positives = 14/24 (57%) Query: 88 DGVQHKLSQYVEKDPSTFEELKKR 111 +G+Q+ L Q E EELK++ Sbjct: 225 EGLQNNLMQMKEDKKKLQEELKEK 248 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 18.9 bits (37), Expect = 5.2 Identities = 8/11 (72%), Positives = 9/11 (81%) Query: 104 TFEELKKRRLA 114 T EEL+KRR A Sbjct: 277 TSEELRKRREA 287 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 18.9 bits (37), Expect = 5.4 Identities = 7/18 (38%), Positives = 10/18 (54%) Query: 1 MEANPGELERFGKAIRKI 18 + NP + ERF K + I Sbjct: 364 LSENPPDGERFSKMVEHI 381 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 18.5 bits (36), Expect = 6.3 Identities = 6/22 (27%), Positives = 13/22 (58%) Query: 91 QHKLSQYVEKDPSTFEELKKRR 112 Q ++QY+ + + + LKK + Sbjct: 164 QSTINQYLSEQKQSIKNLKKEK 185 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 315 2e-89 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 212 3e-58 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 206 2e-56 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 129 2e-33 Sequences not found previously or not previously below threshold: swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 39 4e-06 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 21 0.91 trembl|Q60952|Q60952 Intranuclear protein. 21 1.1 swiss|O85062|REP2_BUCAP Probable replication-associated protein... 19 3.5 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 19 5.0 trembl|Q15219|Q15219 Protein p84. 19 5.4 swiss|P33418|LOS1_YEAST LOS1 protein. 18 8.0 swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 18 9.2 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 315 bits (808), Expect = 2e-89 Identities = 200/200 (100%), Positives = 200/200 (100%) Query: 1 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS 60 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS Sbjct: 1 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS 60 Query: 61 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY 120 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY Sbjct: 61 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY 120 Query: 121 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI 180 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI Sbjct: 121 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI 180 Query: 181 EFTFNCSLNLKSSKAISAGM 200 EFTFNCSLNLKSSKAISAGM Sbjct: 181 EFTFNCSLNLKSSKAISAGM 200 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 212 bits (540), Expect = 3e-58 Identities = 34/157 (21%), Positives = 73/157 (45%), Gaps = 7/157 (4%) Query: 33 KGNLSEESTSTSES---EIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEVSWVPHSGEDG 89 KG++ ++ E E+ N + + ++ + +D W ++ L W + + Sbjct: 212 KGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADE-GLWGEIDSLCEKWQSEAEDQT 270 Query: 90 VQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKV 149 ++ + + LK RR K+ +Y +++++ S+G V++YRG + + K+ Sbjct: 271 EAEIIADRIIGNSQRMANLKIRRTKFKSVLYHILKELIQSQGTVKVYRGSSFSHDSI-KI 329 Query: 150 SLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFNC 186 SL Y + +T++WV L + E+ D+E F C Sbjct: 330 SLHYEE-QHITAVWVYLT-VKFEEHWKPVDVEVEFRC 364 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 206 bits (524), Expect = 2e-56 Identities = 34/158 (21%), Positives = 68/158 (42%), Gaps = 4/158 (2%) Query: 29 EKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEV-SWVPHSGE 87 E++ + + T E +T + ++E + +FW + LE S++ Sbjct: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 Query: 88 DGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVL 147 ++ L ++ + E +RR+ KN +Y ++ ++ ++R YRG R L Sbjct: 215 PDIR-LLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTL 273 Query: 148 KVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 VS Y + + V L P L+E S+ +I +T + Sbjct: 274 SVSFKYKNRPQFSGVKVDLEPKFLQE--SVVNIYYTIS 309 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 129 bits (324), Expect = 2e-33 Identities = 26/94 (27%), Positives = 47/94 (49%), Gaps = 3/94 (3%) Query: 92 HKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKVSL 151 L Y+ D +L + R + ++ VR ++ G ++GVRC NG ++VS+ Sbjct: 305 EYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSI 364 Query: 152 FYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 + +T+ W+ L PI ENG ++D+ F + Sbjct: 365 RLKH-RQLTATWIELEPII--ENGEVKDVMFKLS 395 Score = 21.8 bits (45), Expect = 0.66 Identities = 11/45 (24%), Positives = 14/45 (30%), Gaps = 1/45 (2%) Query: 32 AKGNLSEESTS-TSESEIPNTSSQQDVISSSEHVSDSLKFWEVLN 75 A SE + S+ Q V + DS KF N Sbjct: 3 APSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYN 47 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 39.1 bits (90), Expect = 4e-06 Identities = 22/142 (15%), Positives = 52/142 (36%), Gaps = 16/142 (11%) Query: 7 ELERFGKAIRKIWLYETI--------------MLGAEKIAKGNLSEESTSTSESEIPNTS 52 EL+ +RK+W+ E + + E+ A G + + SE + TS Sbjct: 198 ELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGEQTTTAREESE-ALDTTS 256 Query: 53 SQQDVISSS-EHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKR 111 + D +++ + FWE + L + K + ++ + K+ Sbjct: 257 NGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQ 316 Query: 112 RLAAKNDVYGYVRQIMYSKGQV 133 + A+ Y +++ + + + Sbjct: 317 KDTARALFYINLKECLGTSWNL 338 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 21.4 bits (44), Expect = 0.91 Identities = 6/41 (14%), Positives = 15/41 (35%), Gaps = 3/41 (7%) Query: 99 EKDPSTFEELKKRRLAAKNDVYGYVRQIMYS---KGQVRIY 136 + LK + V+ + +++S +RI+ Sbjct: 251 KSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIW 291 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 21.0 bits (43), Expect = 1.1 Identities = 15/66 (22%), Positives = 24/66 (35%), Gaps = 12/66 (18%) Query: 28 AEKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEVSWVPHSGE 87 A +I +G + ++ E T Q + E L LE + S E Sbjct: 101 ALRIREGEIQAQALQHHEV----TQHLQQELCQKEEELRQL--------LEKAGARRSQE 148 Query: 88 DGVQHK 93 +G+Q K Sbjct: 149 NGIQEK 154 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 19.5 bits (39), Expect = 3.5 Identities = 22/105 (20%), Positives = 40/105 (37%), Gaps = 19/105 (18%) Query: 91 QHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQI-MYSKGQVRIYRGVRCKPNGVLKV 149 Q ++QY+ + + + LKK + ++ I + SK + R N +L Sbjct: 164 QSTINQYLSEQKQSIKNLKKEKK-----IFISFSDIRVISKLDEKAAR------NKILNA 212 Query: 150 SLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFNCSLNLKSSK 194 + Y L I + DIE++ C L K++K Sbjct: 213 LIKYYSAS-------ELTKIGPQGLKKKIDIEYSNLCKLYKKNNK 250 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 18.7 bits (37), Expect = 5.0 Identities = 9/23 (39%), Positives = 12/23 (52%) Query: 28 AEKIAKGNLSEESTSTSESEIPN 50 A+ +AK S S+ E E PN Sbjct: 65 ADSLAKEYGSPSVISSYEDEPPN 87 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 18.7 bits (37), Expect = 5.4 Identities = 7/18 (38%), Positives = 10/18 (54%) Query: 1 MEANPGELERFGKAIRKI 18 + NP + ERF K + I Sbjct: 364 LSENPPDGERFSKMVEHI 381 Score = 18.3 bits (36), Expect = 6.3 Identities = 7/29 (24%), Positives = 12/29 (41%), Gaps = 1/29 (3%) Query: 62 EHVSDSLKFWEVL-NFLEVSWVPHSGEDG 89 EH+ ++ + W N S+V D Sbjct: 379 EHILNTEENWNSWKNEGCPSFVKERTSDT 407 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 17.9 bits (35), Expect = 8.0 Identities = 7/13 (53%), Positives = 10/13 (76%) Query: 65 SDSLKFWEVLNFL 77 S LK WE+++FL Sbjct: 817 STDLKSWEMIDFL 829 >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 17.9 bits (35), Expect = 9.2 Identities = 7/29 (24%), Positives = 12/29 (41%) Query: 31 IAKGNLSEESTSTSESEIPNTSSQQDVIS 59 I GN+ S + +P + + IS Sbjct: 195 IHSGNMHRTGISPRKRTLPEPFDESNTIS 223 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 301 4e-85 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 199 2e-54 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 193 9e-53 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 165 4e-44 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 129 3e-33 Sequences not found previously or not previously below threshold: trembl|Q60952|Q60952 Intranuclear protein. 23 0.20 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 23 0.37 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 1.8 swiss|P33418|LOS1_YEAST LOS1 protein. 19 3.1 swiss|O85062|REP2_BUCAP Probable replication-associated protein... 19 4.2 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 19 4.3 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 19 4.3 CONVERGED! >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 301 bits (771), Expect = 4e-85 Identities = 200/200 (100%), Positives = 200/200 (100%) Query: 1 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS 60 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS Sbjct: 1 MEANPGELERFGKAIRKIWLYETIMLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISS 60 Query: 61 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY 120 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY Sbjct: 61 SEHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVY 120 Query: 121 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI 180 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI Sbjct: 121 GYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDI 180 Query: 181 EFTFNCSLNLKSSKAISAGM 200 EFTFNCSLNLKSSKAISAGM Sbjct: 181 EFTFNCSLNLKSSKAISAGM 200 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 199 bits (506), Expect = 2e-54 Identities = 39/176 (22%), Positives = 81/176 (45%), Gaps = 8/176 (4%) Query: 15 IRKIWLYETIMLGAEKIA-KGNLSEESTSTSES---EIPNTSSQQDVISSSEHVSDSLKF 70 IR ++E I E+ KG++ ++ E E+ N + + ++ + +D Sbjct: 193 IRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADE-GL 251 Query: 71 WEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSK 130 W ++ L W + + ++ + + LK RR K+ +Y +++++ S+ Sbjct: 252 WGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLYHILKELIQSQ 311 Query: 131 GQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFNC 186 G V++YRG + + K+SL Y + +T++WV L + E+ D+E F C Sbjct: 312 GTVKVYRGSSFSHDSI-KISLHYEE-QHITAVWVYLT-VKFEEHWKPVDVEVEFRC 364 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 193 bits (492), Expect = 9e-53 Identities = 34/158 (21%), Positives = 68/158 (42%), Gaps = 4/158 (2%) Query: 29 EKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEV-SWVPHSGE 87 E++ + + T E +T + ++E + +FW + LE S++ Sbjct: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 Query: 88 DGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVL 147 ++ L ++ + E +RR+ KN +Y ++ ++ ++R YRG R L Sbjct: 215 PDIR-LLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTL 273 Query: 148 KVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 VS Y + + V L P L+E S+ +I +T + Sbjct: 274 SVSFKYKNRPQFSGVKVDLEPKFLQE--SVVNIYYTIS 309 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 165 bits (418), Expect = 4e-44 Identities = 22/142 (15%), Positives = 52/142 (36%), Gaps = 16/142 (11%) Query: 7 ELERFGKAIRKIWLYETI--------------MLGAEKIAKGNLSEESTSTSESEIPNTS 52 EL+ +RK+W+ E + + E+ A G + + SE + TS Sbjct: 198 ELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGEQTTTAREESE-ALDTTS 256 Query: 53 SQQDVISSS-EHVSDSLKFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKR 111 + D +++ + FWE + L + K + ++ + K+ Sbjct: 257 NGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQ 316 Query: 112 RLAAKNDVYGYVRQIMYSKGQV 133 + A+ Y +++ + + + Sbjct: 317 KDTARALFYINLKECLGTSWNL 338 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 129 bits (324), Expect = 3e-33 Identities = 26/94 (27%), Positives = 47/94 (49%), Gaps = 3/94 (3%) Query: 92 HKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKVSL 151 L Y+ D +L + R + ++ VR ++ G ++GVRC NG ++VS+ Sbjct: 305 EYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSI 364 Query: 152 FYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 + +T+ W+ L PI ENG ++D+ F + Sbjct: 365 RLKH-RQLTATWIELEPII--ENGEVKDVMFKLS 395 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 23.3 bits (49), Expect = 0.20 Identities = 9/83 (10%), Positives = 25/83 (29%), Gaps = 5/83 (6%) Query: 25 MLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDS-LKFWEVLNFLEVSWVP 83 + A +I +G + ++ E T Q + E L+ E Sbjct: 98 LEEALRIREGEIQAQALQHHEV----TQHLQQELCQKEEELRQLLEKAGARRSQENGIQE 153 Query: 84 HSGEDGVQHKLSQYVEKDPSTFE 106 + + + ++ + + + Sbjct: 154 KQSLEQERQEETRRLLESLKELQ 176 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 22.5 bits (47), Expect = 0.37 Identities = 10/24 (41%), Positives = 16/24 (66%) Query: 98 VEKDPSTFEELKKRRLAAKNDVYG 121 ++ DPS FE L+K R+ A + +Y Sbjct: 170 LKLDPSIFESLQKERVEAGDVIYI 193 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 20.2 bits (41), Expect = 1.8 Identities = 11/77 (14%), Positives = 20/77 (25%), Gaps = 8/77 (10%) Query: 96 QYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQV--------RIYRGVRCKPNGVL 147 ++ EL+K AK + K + + N +L Sbjct: 268 DHIFARSQELAELEKGLDTAKTTFEEERKAFEDKKSNLEIALALCAKREEVCFYSHNSLL 327 Query: 148 KVSLFYGPCKWVTSMWV 164 + L Y K + Sbjct: 328 FLVLHYRSSKKFLGDKI 344 Score = 18.3 bits (36), Expect = 6.9 Identities = 15/91 (16%), Positives = 32/91 (34%), Gaps = 12/91 (13%) Query: 26 LGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISSS------EHVSDSLKFWEVLNFLEV 79 L E+I +E E+ + + + + + ++ E+ + +E Sbjct: 683 LEEERIQS------LKEMAEKELEHVQVELKRLDAERLEIKLDRERREREWAELKDSVEE 736 Query: 80 SWVPHSGEDGVQHKLSQYVEKDPSTFEELKK 110 V + +H L ++ EELKK Sbjct: 737 LKVQREKLETQRHMLRAERDEIRHEIEELKK 767 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 19.4 bits (39), Expect = 3.1 Identities = 7/13 (53%), Positives = 10/13 (76%) Query: 65 SDSLKFWEVLNFL 77 S LK WE+++FL Sbjct: 817 STDLKSWEMIDFL 829 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 19.0 bits (38), Expect = 4.2 Identities = 21/102 (20%), Positives = 39/102 (37%), Gaps = 19/102 (18%) Query: 94 LSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQI-MYSKGQVRIYRGVRCKPNGVLKVSLF 152 ++QY+ + + + LKK + ++ I + SK + R N +L + Sbjct: 167 INQYLSEQKQSIKNLKKEKK-----IFISFSDIRVISKLDEKAAR------NKILNALIK 215 Query: 153 YGPCKWVTSMWVTLVPITLRENGSLEDIEFTFNCSLNLKSSK 194 Y L I + DIE++ C L K++K Sbjct: 216 YYSAS-------ELTKIGPQGLKKKIDIEYSNLCKLYKKNNK 250 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 19.0 bits (38), Expect = 4.3 Identities = 10/28 (35%), Positives = 14/28 (49%) Query: 23 TIMLGAEKIAKGNLSEESTSTSESEIPN 50 T+ A+ +AK S S+ E E PN Sbjct: 60 TVEDEADSLAKEYGSPSVISSYEDEPPN 87 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 19.0 bits (38), Expect = 4.3 Identities = 5/17 (29%), Positives = 10/17 (58%) Query: 16 RKIWLYETIMLGAEKIA 32 + +W+Y+T+ K A Sbjct: 356 KDVWVYDTLHEEWSKAA 372 Score = 18.3 bits (36), Expect = 7.5 Identities = 3/9 (33%), Positives = 4/9 (44%) Query: 73 VLNFLEVSW 81 V + L W Sbjct: 360 VYDTLHEEW 368 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.311 0.154 0.405 Lambda K H 0.267 0.0471 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 21,472 Number of Sequences: 43 Number of extensions: 985 Number of successful extensions: 107 Number of sequences better than 10.0: 45 Number of HSP's better than 10.0 without gapping: 45 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 68 length of query: 200 length of database: 17,163 effective HSP length: 51 effective length of query: 149 effective length of database: 14,970 effective search space: 2230530 effective search space used: 2230530 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (18.4 bits) S2: 35 (17.9 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). (233 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 455 e-131 swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). 23 0.32 swiss|P14733|LAM1_MOUSE Lamin B1. 20 3.4 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 19 4.9 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 19 5.3 swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. 19 6.1 trembl|Q15219|Q15219 Protein p84. 18 6.7 >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 455 bits (1170), Expect = e-131 Identities = 233/233 (100%), Positives = 233/233 (100%) Query: 1 MQIQNSIRATLESNHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISL 60 MQIQNSIRATLESNHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISL Sbjct: 1 MQIQNSIRATLESNHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISL 60 Query: 61 WPQETCSFRNFDGNPEDDPNVPWLVRRENSSGPFTQPGSETSSLEQLLNGLGCIARILRE 120 WPQETCSFRNFDGNPEDDPNVPWLVRRENSSGPFTQPGSETSSLEQLLNGLGCIARILRE Sbjct: 61 WPQETCSFRNFDGNPEDDPNVPWLVRRENSSGPFTQPGSETSSLEQLLNGLGCIARILRE 120 Query: 121 NTNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIEGLEELFFQARSLG 180 NTNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIEGLEELFFQARSLG Sbjct: 121 NTNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIEGLEELFFQARSLG 180 Query: 181 VPSHLVESVRPPVDIHSGNMHRTGISPRKRTLPEPFDESNTISHRRTRRNTKQ 233 VPSHLVESVRPPVDIHSGNMHRTGISPRKRTLPEPFDESNTISHRRTRRNTKQ Sbjct: 181 VPSHLVESVRPPVDIHSGNMHRTGISPRKRTLPEPFDESNTISHRRTRRNTKQ 233 >swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). Length = 224 Score = 23.1 bits (48), Expect = 0.32 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 1/66 (1%) Query: 14 NHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISLWPQETCSFRNFDG 73 + G+ +VD L E+ R + + + ++L H +S++ W ET S+ N Sbjct: 82 SEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHYRSMQE-RQWELETRSYLNDLI 140 Query: 74 NPEDDP 79 N +DP Sbjct: 141 NLVEDP 146 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 19.6 bits (39), Expect = 3.4 Identities = 15/46 (32%), Positives = 24/46 (51%), Gaps = 4/46 (8%) Query: 122 TNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIE 167 T +ARRA+DD ++ AKL + + K +LL+N + E Sbjct: 84 TELADARRALDD---TARERAKL-QIELGKFKAEHDQLLLNYAKKE 125 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 19.2 bits (38), Expect = 4.9 Identities = 6/18 (33%), Positives = 13/18 (71%) Query: 166 IEGLEELFFQARSLGVPS 183 +E L+E+F A+ + +P+ Sbjct: 337 VEELQEIFENAKDIRIPT 354 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 18.9 bits (37), Expect = 5.3 Identities = 7/13 (53%), Positives = 9/13 (68%) Query: 213 PEPFDESNTISHR 225 PEP+ NTI+ R Sbjct: 82 PEPYVVVNTITKR 94 >swiss|P04489|E411_ADE05 Probable early E4 11 kDa protein. Length = 116 Score = 18.9 bits (37), Expect = 6.1 Identities = 5/13 (38%), Positives = 10/13 (76%) Query: 23 VANLLENLLRTWK 35 + +LL ++LR W+ Sbjct: 24 IRDLLRDILRRWR 36 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 18.5 bits (36), Expect = 6.7 Identities = 7/9 (77%), Positives = 7/9 (77%) Query: 208 RKRTLPEPF 216 RKRT PE F Sbjct: 414 RKRTAPEDF 422 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 498 e-144 Sequences not found previously or not previously below threshold: swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). 25 0.10 swiss|P14733|LAM1_MOUSE Lamin B1. 22 0.74 swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). 20 2.6 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 20 3.3 trembl|Q15219|Q15219 Protein p84. 20 3.6 CONVERGED! >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 498 bits (1282), Expect = e-144 Identities = 233/233 (100%), Positives = 233/233 (100%) Query: 1 MQIQNSIRATLESNHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISL 60 MQIQNSIRATLESNHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISL Sbjct: 1 MQIQNSIRATLESNHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISL 60 Query: 61 WPQETCSFRNFDGNPEDDPNVPWLVRRENSSGPFTQPGSETSSLEQLLNGLGCIARILRE 120 WPQETCSFRNFDGNPEDDPNVPWLVRRENSSGPFTQPGSETSSLEQLLNGLGCIARILRE Sbjct: 61 WPQETCSFRNFDGNPEDDPNVPWLVRRENSSGPFTQPGSETSSLEQLLNGLGCIARILRE 120 Query: 121 NTNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIEGLEELFFQARSLG 180 NTNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIEGLEELFFQARSLG Sbjct: 121 NTNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIEGLEELFFQARSLG 180 Query: 181 VPSHLVESVRPPVDIHSGNMHRTGISPRKRTLPEPFDESNTISHRRTRRNTKQ 233 VPSHLVESVRPPVDIHSGNMHRTGISPRKRTLPEPFDESNTISHRRTRRNTKQ Sbjct: 181 VPSHLVESVRPPVDIHSGNMHRTGISPRKRTLPEPFDESNTISHRRTRRNTKQ 233 >swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). Length = 224 Score = 24.6 bits (52), Expect = 0.10 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 1/66 (1%) Query: 14 NHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISLWPQETCSFRNFDG 73 + G+ +VD L E+ R + + + ++L H +S++ W ET S+ N Sbjct: 82 SEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHYRSMQE-RQWELETRSYLNDLI 140 Query: 74 NPEDDP 79 N +DP Sbjct: 141 NLVEDP 146 >swiss|P14733|LAM1_MOUSE Lamin B1. Length = 587 Score = 21.9 bits (45), Expect = 0.74 Identities = 15/46 (32%), Positives = 24/46 (51%), Gaps = 4/46 (8%) Query: 122 TNTVEARRAIDDHFCKIKKPAKLTMVGIQNIKKSMKRLLVNMSPIE 167 T +ARRA+DD ++ AKL + + K +LL+N + E Sbjct: 84 TELADARRALDD---TARERAKL-QIELGKFKAEHDQLLLNYAKKE 125 >swiss|P20397|NUCL_XENLA Nucleolin (Protein C23). Length = 650 Score = 20.0 bits (40), Expect = 2.6 Identities = 6/18 (33%), Positives = 13/18 (71%) Query: 166 IEGLEELFFQARSLGVPS 183 +E L+E+F A+ + +P+ Sbjct: 337 VEELQEIFENAKDIRIPT 354 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 19.6 bits (39), Expect = 3.3 Identities = 7/13 (53%), Positives = 9/13 (68%) Query: 213 PEPFDESNTISHR 225 PEP+ NTI+ R Sbjct: 82 PEPYVVVNTITKR 94 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 19.6 bits (39), Expect = 3.6 Identities = 7/9 (77%), Positives = 7/9 (77%) Query: 208 RKRTLPEPF 216 RKRT PE F Sbjct: 414 RKRTAPEDF 422 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.317 0.134 0.392 Lambda K H 0.267 0.0409 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 16,807 Number of Sequences: 43 Number of extensions: 704 Number of successful extensions: 13 Number of sequences better than 10.0: 13 Number of HSP's better than 10.0 without gapping: 13 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 13 length of query: 233 length of database: 17,163 effective HSP length: 53 effective length of query: 180 effective length of database: 14,884 effective search space: 2679120 effective search space used: 2679120 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (18.9 bits) S2: 35 (18.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q15219|Q15219 Protein p84. (657 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q15219|Q15219 Protein p84. 1268 0.0 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 25 0.25 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 22 2.8 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 1268 bits (3280), Expect = 0.0 Identities = 657/657 (100%), Positives = 657/657 (100%) Query: 1 MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILE 60 MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILE Sbjct: 1 MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILE 60 Query: 61 EEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNV 120 EEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNV Sbjct: 61 EEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNV 120 Query: 121 ATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ 180 ATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ Sbjct: 121 ATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ 180 Query: 181 FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWS 240 FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWS Sbjct: 181 FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWS 240 Query: 241 LQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAK 300 LQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAK Sbjct: 241 LQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAK 300 Query: 301 FLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSV 360 FLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSV Sbjct: 301 FLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSV 360 Query: 361 YQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPE 420 YQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPE Sbjct: 361 YQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPE 420 Query: 421 DFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQADPENMA 480 DFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQADPENMA Sbjct: 421 DFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQADPENMA 480 Query: 481 ENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLENMVIKLAKELPPPSEEI 540 ENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLENMVIKLAKELPPPSEEI Sbjct: 481 ENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLENMVIKLAKELPPPSEEI 540 Query: 541 KTGEDEDEEDNDALLKENESPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIR 600 KTGEDEDEEDNDALLKENESPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIR Sbjct: 541 KTGEDEDEEDNDALLKENESPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIR 600 Query: 601 QIECDSEDMKMRAKQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNS 657 QIECDSEDMKMRAKQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNS Sbjct: 601 QIECDSEDMKMRAKQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNS 657 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 25.0 bits (53), Expect = 0.25 Identities = 11/30 (36%), Positives = 17/30 (56%) Query: 536 PSEEIKTGEDEDEEDNDALLKENESPDVRR 565 PS+EIK G+ ++E+ L K D+ R Sbjct: 155 PSDEIKDGDKQEEKKEPELAKRKSHQDLPR 184 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 21.6 bits (44), Expect = 2.8 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 11/46 (23%) Query: 581 LGEQWKIL-----------APYLEMKDSEIRQIECDSEDMKMRAKQ 615 +GE W +L + +LE+K EI+Q + + E + KQ Sbjct: 102 MGELWPLLLSAQENIAGIPSAFLELKKEEIKQRQIEQEKLASLKKQ 147 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q15219|Q15219 Protein p84. 1398 0.0 Sequences not found previously or not previously below threshold: trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 27 0.093 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 24 0.44 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 22 2.0 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 5.3 CONVERGED! >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 1398 bits (3619), Expect = 0.0 Identities = 657/657 (100%), Positives = 657/657 (100%) Query: 1 MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILE 60 MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILE Sbjct: 1 MSPTPPLFSLPEARTRFTKSTREALNNKNIKPLLSTFSQVPGSENEKKCTLDQAFRGILE 60 Query: 61 EEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNV 120 EEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNV Sbjct: 61 EEIINHSSCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNV 120 Query: 121 ATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ 180 ATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ Sbjct: 121 ATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQ 180 Query: 181 FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWS 240 FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWS Sbjct: 181 FNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWS 240 Query: 241 LQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAK 300 LQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAK Sbjct: 241 LQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAK 300 Query: 301 FLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSV 360 FLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSV Sbjct: 301 FLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSV 360 Query: 361 YQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPE 420 YQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPE Sbjct: 361 YQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPE 420 Query: 421 DFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQADPENMA 480 DFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQADPENMA Sbjct: 421 DFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQADPENMA 480 Query: 481 ENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLENMVIKLAKELPPPSEEI 540 ENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLENMVIKLAKELPPPSEEI Sbjct: 481 ENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLENMVIKLAKELPPPSEEI 540 Query: 541 KTGEDEDEEDNDALLKENESPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIR 600 KTGEDEDEEDNDALLKENESPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIR Sbjct: 541 KTGEDEDEEDNDALLKENESPDVRRDKPVTGEQIEVFANKLGEQWKILAPYLEMKDSEIR 600 Query: 601 QIECDSEDMKMRAKQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNS 657 QIECDSEDMKMRAKQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNS Sbjct: 601 QIECDSEDMKMRAKQLLVAWQDQEGVHATPENLINALNKSGLSDLAESLTNDNETNS 657 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 26.6 bits (57), Expect = 0.093 Identities = 11/30 (36%), Positives = 17/30 (56%) Query: 536 PSEEIKTGEDEDEEDNDALLKENESPDVRR 565 PS+EIK G+ ++E+ L K D+ R Sbjct: 155 PSDEIKDGDKQEEKKEPELAKRKSHQDLPR 184 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 24.3 bits (51), Expect = 0.44 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 11/46 (23%) Query: 581 LGEQWKIL-----------APYLEMKDSEIRQIECDSEDMKMRAKQ 615 +GE W +L + +LE+K EI+Q + + E + KQ Sbjct: 102 MGELWPLLLSAQENIAGIPSAFLELKKEEIKQRQIEQEKLASLKKQ 147 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 22.0 bits (45), Expect = 2.0 Identities = 46/198 (23%), Positives = 70/198 (35%), Gaps = 23/198 (11%) Query: 355 DTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIR 414 D TK +LL+ K ++T + + E C + T+ + + Sbjct: 485 DETKRGAELLAAELTTTRELLKKTNEEMHTMSHELAAVTENCDNL------QTELVDVYK 538 Query: 415 KRTAPEDFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSETREHMPTLEEFFEEAIEQA 474 K D L K ++T +ELT L EA + +E LEE E A E Sbjct: 539 KAERAADEL-KQEKNIVVTLEKELTFL--------EAQITREKESRKNLEEELERATESL 589 Query: 475 DPENMAENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFKSLQEYLEN------MVIK 528 D M N + + L + ++Q + QE EN +V+K Sbjct: 590 D--EMNRNAFALAKELELANSHISSLEDEREVLQKSVSEQKQISQESRENLEDAHSLVMK 647 Query: 529 LAKELPPPSEEIKTGEDE 546 L KE + K EDE Sbjct: 648 LGKERESLEKRAKKLEDE 665 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 20.8 bits (42), Expect = 5.3 Identities = 9/24 (37%), Positives = 14/24 (57%) Query: 186 VTVFNTNEQESTLGQKHTEDREEG 209 VT + +E S G+K E+ E+G Sbjct: 989 VTTLDCSESPSEAGRKMGEETEDG 1012 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.314 0.132 0.382 Lambda K H 0.267 0.0403 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 52,015 Number of Sequences: 43 Number of extensions: 2542 Number of successful extensions: 54 Number of sequences better than 10.0: 14 Number of HSP's better than 10.0 without gapping: 14 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 48 length of query: 657 length of database: 17,163 effective HSP length: 61 effective length of query: 596 effective length of database: 14,540 effective search space: 8665840 effective search space used: 8665840 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.0 bits) S2: 40 (20.0 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. (487 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 406 e-116 swiss|P25233|NECD_MOUSE Necdin. 22 2.2 swiss|O08579|EMD_MOUSE Emerin. 20 7.0 >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 406 bits (1043), Expect = e-116 Identities = 487/487 (100%), Positives = 487/487 (100%) Query: 1 MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGVPP 60 MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGVPP Sbjct: 1 MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGVPP 60 Query: 61 PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQRRDAWTQEPSPLDRDPLGY 120 PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQRRDAWTQEPSPLDRDPLGY Sbjct: 61 PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQRRDAWTQEPSPLDRDPLGY 120 Query: 121 DVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAPQARLVQPHVPPLRPTAPTIL 180 DVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAPQARLVQPHVPPLRPTAPTIL Sbjct: 121 DVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAPQARLVQPHVPPLRPTAPTIL 180 Query: 181 SPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPTRPTTLPPTPLLTVLQRPTELQP 240 SPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPTRPTTLPPTPLLTVLQRPTELQP Sbjct: 181 SPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPTRPTTLPPTPLLTVLQRPTELQP 240 Query: 241 TPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEPRSPTVFYNIPPMPLPPSQLPPPAA 300 TPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEPRSPTVFYNIPPMPLPPSQLPPPAA Sbjct: 241 TPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEPRSPTVFYNIPPMPLPPSQLPPPAA 300 Query: 301 PAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPSKTQGQSRGQSRGRGRGRGRGRGKGKS 360 PAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPSKTQGQSRGQSRGRGRGRGRGRGKGKS Sbjct: 301 PAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPSKTQGQSRGQSRGRGRGRGRGRGKGKS 360 Query: 361 RDKQRKPGGPWRPEPNTSSPSMPELSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATP 420 RDKQRKPGGPWRPEPNTSSPSMPELSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATP Sbjct: 361 RDKQRKPGGPWRPEPNTSSPSMPELSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATP 420 Query: 421 NVSPIHEPESHNSPEAPILFPDDWYPPSIDPADLDESWDYIFETTESPSSDEDYVEGPSK 480 NVSPIHEPESHNSPEAPILFPDDWYPPSIDPADLDESWDYIFETTESPSSDEDYVEGPSK Sbjct: 421 NVSPIHEPESHNSPEAPILFPDDWYPPSIDPADLDESWDYIFETTESPSSDEDYVEGPSK 480 Query: 481 RPRPSIQ 487 RPRPSIQ Sbjct: 481 RPRPSIQ 487 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 21.6 bits (44), Expect = 2.2 Identities = 13/50 (26%), Positives = 19/50 (38%), Gaps = 8/50 (16%) Query: 39 QLSDTPLIPLTIFVGENTGVPPPLPPPPPPPPPPPPPPPPPPPPPPPPPP 88 ++ D+ +P+ G+PPP PP P P PPP Sbjct: 21 EMQDSDAVPV--------GIPPPASLAANLAGPPCAPEGPMAAQQASPPP 62 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 19.6 bits (39), Expect = 7.0 Identities = 6/16 (37%), Positives = 11/16 (68%) Query: 120 YDVGHGPLASAMRMLW 135 Y++ HGP+ + R L+ Sbjct: 19 YNIPHGPIVGSTRKLY 34 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. 450 e-129 Sequences not found previously or not previously below threshold: swiss|P25233|NECD_MOUSE Necdin. 24 0.37 swiss|O08579|EMD_MOUSE Emerin. 22 2.3 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 21 4.1 trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 20 6.6 swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. 20 7.4 CONVERGED! >swiss|P12978|EBN2_EBV EBNA-2 nuclear protein. Length = 487 Score = 450 bits (1158), Expect = e-129 Identities = 487/487 (100%), Positives = 487/487 (100%) Query: 1 MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGVPP 60 MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGVPP Sbjct: 1 MPTFYLALHGGQTYHLIVDTDSLGNPSLSVIPSNPYQEQLSDTPLIPLTIFVGENTGVPP 60 Query: 61 PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQRRDAWTQEPSPLDRDPLGY 120 PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQRRDAWTQEPSPLDRDPLGY Sbjct: 61 PLPPPPPPPPPPPPPPPPPPPPPPPPPPSPPPPPPPPPPPQRRDAWTQEPSPLDRDPLGY 120 Query: 121 DVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAPQARLVQPHVPPLRPTAPTIL 180 DVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAPQARLVQPHVPPLRPTAPTIL Sbjct: 121 DVGHGPLASAMRMLWMANYIVRQSRGDRGLILPQGPQTAPQARLVQPHVPPLRPTAPTIL 180 Query: 181 SPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPTRPTTLPPTPLLTVLQRPTELQP 240 SPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPTRPTTLPPTPLLTVLQRPTELQP Sbjct: 181 SPLSQPRLTPPQPLMMPPRPTPPTPLPPATLTVPPRPTRPTTLPPTPLLTVLQRPTELQP 240 Query: 241 TPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEPRSPTVFYNIPPMPLPPSQLPPPAA 300 TPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEPRSPTVFYNIPPMPLPPSQLPPPAA Sbjct: 241 TPSPPRMHLPVLHVPDQSMHPLTHQSTPNDPDSPEPRSPTVFYNIPPMPLPPSQLPPPAA 300 Query: 301 PAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPSKTQGQSRGQSRGRGRGRGRGRGKGKS 360 PAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPSKTQGQSRGQSRGRGRGRGRGRGKGKS Sbjct: 301 PAQPPPGVINDQQLHHLPSGPPWWPPICDPPQPSKTQGQSRGQSRGRGRGRGRGRGKGKS 360 Query: 361 RDKQRKPGGPWRPEPNTSSPSMPELSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATP 420 RDKQRKPGGPWRPEPNTSSPSMPELSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATP Sbjct: 361 RDKQRKPGGPWRPEPNTSSPSMPELSPVLGLHQGQGAGDSPTPGPSNAAPVCRNSHTATP 420 Query: 421 NVSPIHEPESHNSPEAPILFPDDWYPPSIDPADLDESWDYIFETTESPSSDEDYVEGPSK 480 NVSPIHEPESHNSPEAPILFPDDWYPPSIDPADLDESWDYIFETTESPSSDEDYVEGPSK Sbjct: 421 NVSPIHEPESHNSPEAPILFPDDWYPPSIDPADLDESWDYIFETTESPSSDEDYVEGPSK 480 Query: 481 RPRPSIQ 487 RPRPSIQ Sbjct: 481 RPRPSIQ 487 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 24.2 bits (51), Expect = 0.37 Identities = 13/50 (26%), Positives = 19/50 (38%), Gaps = 8/50 (16%) Query: 39 QLSDTPLIPLTIFVGENTGVPPPLPPPPPPPPPPPPPPPPPPPPPPPPPP 88 ++ D+ +P+ G+PPP PP P P PPP Sbjct: 21 EMQDSDAVPV--------GIPPPASLAANLAGPPCAPEGPMAAQQASPPP 62 >swiss|O08579|EMD_MOUSE Emerin. Length = 259 Score = 21.5 bits (44), Expect = 2.3 Identities = 6/16 (37%), Positives = 11/16 (68%) Query: 120 YDVGHGPLASAMRMLW 135 Y++ HGP+ + R L+ Sbjct: 19 YNIPHGPIVGSTRKLY 34 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 20.7 bits (42), Expect = 4.1 Identities = 9/21 (42%), Positives = 13/21 (61%) Query: 214 PPRPTRPTTLPPTPLLTVLQR 234 PPR +R + PP P +T +R Sbjct: 403 PPRTSRKESPPPYPSVTERKR 423 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 20.0 bits (40), Expect = 6.6 Identities = 10/28 (35%), Positives = 16/28 (56%), Gaps = 5/28 (17%) Query: 328 CDPP--QPSKTQGQ---SRGQSRGRGRG 350 C+PP QP+ Q + +R + +GRG Sbjct: 308 CNPPNSQPASYQRRFLVTRSGRKIKGRG 335 >swiss|Q09824|REP2_SCHPO Transcriptional activator protein rep2. Length = 219 Score = 19.6 bits (39), Expect = 7.4 Identities = 8/19 (42%), Positives = 11/19 (57%) Query: 170 PPLRPTAPTILSPLSQPRL 188 PP P + ILS ++ P L Sbjct: 16 PPTYPWSSPILSSIANPSL 34 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.312 0.138 0.457 Lambda K H 0.267 0.0423 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 71,691 Number of Sequences: 43 Number of extensions: 6456 Number of successful extensions: 1053 Number of sequences better than 10.0: 20 Number of HSP's better than 10.0 without gapping: 20 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 149 length of query: 487 length of database: 17,163 effective HSP length: 59 effective length of query: 428 effective length of database: 14,626 effective search space: 6259928 effective search space used: 6259928 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (20.4 bits) S2: 39 (19.6 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). (415 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 834 0.0 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 47 4e-08 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 45 2e-07 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 42 1e-06 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 19 9.6 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 834 bits (2155), Expect = 0.0 Identities = 415/415 (100%), Positives = 415/415 (100%) Query: 1 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH 60 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH Sbjct: 1 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH 60 Query: 61 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR 120 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR Sbjct: 61 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR 120 Query: 121 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF 180 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF Sbjct: 121 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF 180 Query: 181 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK 240 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK Sbjct: 181 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK 240 Query: 241 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH 300 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH Sbjct: 241 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH 300 Query: 301 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV 360 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV Sbjct: 301 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV 360 Query: 361 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE 415 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE Sbjct: 361 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE 415 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 47.0 bits (110), Expect = 4e-08 Identities = 29/103 (28%), Positives = 45/103 (43%), Gaps = 1/103 (0%) Query: 19 SPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQHGSYEVPEAFAQCTYLAQI 78 S EE + + +E+ + + D + + P N +IT G+ E+PE + LA Sbjct: 3 SREEVRASRPTKEM-KMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAIF 61 Query: 79 YVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLRK 121 YV Y + PY + W P V I +R Q +K F K Sbjct: 62 YVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANK 104 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 44.7 bits (104), Expect = 2e-07 Identities = 26/89 (29%), Positives = 46/89 (51%), Gaps = 3/89 (3%) Query: 307 LGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSIRL 366 L Y+ D +L + R + ++ VR ++ G ++GVRC NG ++VS+ Sbjct: 94 LSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFY 153 Query: 367 KH-RQLTATWIELEPII--ENGEVKDVMF 392 + +T+ W+ L PI ENG ++D+ F Sbjct: 154 GPCKWVTSMWVTLVPITLRENGSLEDIEF 182 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 42.0 bits (97), Expect = 1e-06 Identities = 74/346 (21%), Positives = 141/346 (40%), Gaps = 39/346 (11%) Query: 49 YDPLNNTIITQHGSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLI 108 YD I T G++ VP+ + + LA +V + +N+ K+L+W PA VY I Sbjct: 23 YDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNTI 82 Query: 109 MERLQKNKYFLRKQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQ--RHPEKV-Y 165 ++R+ + + + I V + +D N+K + ++ + R P+ V + Sbjct: 83 VDRIINHP-------ELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFF 135 Query: 166 DLMTYQEIADGIDDFFSLLEMPN-LSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIR 224 M + + FFS L N ++ + N T L SC A G IR Sbjct: 136 RYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQ-AYWASGPNFSALKNVIR 194 Query: 225 HYWIKNAILEFADYKK-------KTVPYPFVGPPRDPGLFEHIVDDCS-TGAPIN---WD 273 I I +F + ++ + +P P P R+ +H V+ + A + W Sbjct: 195 CSIIHEYISKFVEREQDKGHIGDQELP-PEEDPSRELNNVQHEVNSLTEQDAEADEGLWG 253 Query: 274 DFSIDSYAS-YVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTL 332 + IDS + + E E + DR I + L R ++ L Sbjct: 254 E--IDSLCEKWQSEAEDQTEAEIIADR-----------IIGNSQRMANLKIRRTKFKSVL 300 Query: 333 FHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSIRLKHRQLTATWIEL 378 +H +++++ G+ ++G + S+++S+ + + +TA W+ L Sbjct: 301 YHILKELIQSQGTVKVYRG-SSFSHDSIKISLHYEEQHITAVWVYL 345 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 19.2 bits (38), Expect = 9.6 Identities = 12/43 (27%), Positives = 19/43 (43%) Query: 295 ELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVR 337 E +++ LGM + SD H+ KL + F T+R Sbjct: 138 EFLEKNLHPTNCLGMLLLSDAHQCTKLYELSWRMCLSNFQTIR 180 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 701 0.0 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 454 e-130 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 179 5e-48 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 142 6e-37 Sequences not found previously or not previously below threshold: swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 47 5e-08 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 20 6.8 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 701 bits (1810), Expect = 0.0 Identities = 415/415 (100%), Positives = 415/415 (100%) Query: 1 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH 60 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH Sbjct: 1 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH 60 Query: 61 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR 120 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR Sbjct: 61 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR 120 Query: 121 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF 180 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF Sbjct: 121 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF 180 Query: 181 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK 240 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK Sbjct: 181 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK 240 Query: 241 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH 300 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH Sbjct: 241 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH 300 Query: 301 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV 360 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV Sbjct: 301 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV 360 Query: 361 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE 415 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE Sbjct: 361 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE 415 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 454 bits (1167), Expect = e-130 Identities = 74/347 (21%), Positives = 141/347 (40%), Gaps = 39/347 (11%) Query: 48 AYDPLNNTIITQHGSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKL 107 YD I T G++ VP+ + + LA +V + +N+ K+L+W PA VY Sbjct: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 Query: 108 IMERLQKNKYFLRKQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQ--RHPEKV- 164 I++R+ + + + I V + +D N+K + ++ + R P+ V Sbjct: 82 IVDRIINHP-------ELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVF 134 Query: 165 YDLMTYQEIADGIDDFFSLLEMPN-LSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTI 223 + M + + FFS L N ++ + N T L SC A G I Sbjct: 135 FRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQ-AYWASGPNFSALKNVI 193 Query: 224 RHYWIKNAILEFADYKK-------KTVPYPFVGPPRDPGLFEHIVDDCS-TGAPIN---W 272 R I I +F + ++ + +P P P R+ +H V+ + A + W Sbjct: 194 RCSIIHEYISKFVEREQDKGHIGDQELP-PEEDPSRELNNVQHEVNSLTEQDAEADEGLW 252 Query: 273 DDFSIDSYAS-YVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTT 331 + IDS + + E E + DR I + L R ++ Sbjct: 253 GE--IDSLCEKWQSEAEDQTEAEIIADR-----------IIGNSQRMANLKIRRTKFKSV 299 Query: 332 LFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSIRLKHRQLTATWIEL 378 L+H +++++ G+ ++G + S+++S+ + + +TA W+ L Sbjct: 300 LYHILKELIQSQGTVKVYRG-SSFSHDSIKISLHYEEQHITAVWVYL 345 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 179 bits (454), Expect = 5e-48 Identities = 45/228 (19%), Positives = 80/228 (34%), Gaps = 15/228 (6%) Query: 19 SPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQHGSYEVPEAFAQCTYLAQI 78 S EE + + +E+ + + D + + P N +IT G+ E+PE + LA Sbjct: 3 SREEVRASRPTKEM-KMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAIF 61 Query: 79 YVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLRKQVQAMEKIKVLLCPSPE 138 YV Y + PY + W P V I +R Q +K F K + + Sbjct: 62 YVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNAD-------VYVERLQ 114 Query: 139 LLENNYVDDNLKISSHKVTQ-RHPEKVYDLMTYQEIA-----DGIDDFFSLLEMPNLSLA 192 + + + ++ Q + + V + I +D++F L+ L+ Sbjct: 115 NAIASGIKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGD 174 Query: 193 FVNK-TSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYK 239 +K ISC + A +R WI + K Sbjct: 175 LEGGLCKVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNK 222 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 142 bits (359), Expect = 6e-37 Identities = 41/178 (23%), Positives = 72/178 (40%), Gaps = 8/178 (4%) Query: 218 RTARTIRHYWIKNAILEFADYKKKTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSI 277 R + IR W+ I+ A+ + + I + S I+ + Sbjct: 10 RFGKAIRKIWLYETIMLGAEK----IAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVS 65 Query: 278 DSYASYVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVR 337 DS + V+ LE V L Y+ D +L + R + ++ VR Sbjct: 66 DSLKFWE-VLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVR 124 Query: 338 DVLLHDGSYTRHKGVRCLDNGSVRVSIRLKH-RQLTATWIELEPII--ENGEVKDVMF 392 ++ G ++GVRC NG ++VS+ + +T+ W+ L PI ENG ++D+ F Sbjct: 125 QIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEF 182 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 46.5 bits (109), Expect = 5e-08 Identities = 32/177 (18%), Positives = 69/177 (38%), Gaps = 20/177 (11%) Query: 240 KKTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVE---- 295 ++ +P P E +V +T + S + + E EE + Sbjct: 140 QQNIPTKL---PSTQQAVERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEF 196 Query: 296 ------------LTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHD 343 ++ + I L M + ++L +G+ R R + L+ ++D++ Sbjct: 197 WNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEP 256 Query: 344 GSYTRHKGVRCLDNGSVRVSIRLKHR-QLTATWIELEPIIENGEVKDVMFKLSTRVQ 399 ++G R D ++ VS + K+R Q + ++LEP V ++ + +S V+ Sbjct: 257 WELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQESVVNIYYTISFTVR 313 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 19.5 bits (39), Expect = 6.8 Identities = 3/20 (15%), Positives = 6/20 (30%) Query: 314 DLHEEGKLTRARKLDRTTLF 333 D L + R ++ Sbjct: 173 DPSIFESLQKERVEAGDVIY 192 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 647 0.0 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 415 e-119 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 319 4e-90 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 224 1e-61 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 195 1e-52 Sequences not found previously or not previously below threshold: trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 20 4.0 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 19 6.3 CONVERGED! >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 647 bits (1670), Expect = 0.0 Identities = 415/415 (100%), Positives = 415/415 (100%) Query: 1 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH 60 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH Sbjct: 1 MSAPSESSETAAPLLSDLSPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQH 60 Query: 61 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR 120 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR Sbjct: 61 GSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLR 120 Query: 121 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF 180 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF Sbjct: 121 KQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQRHPEKVYDLMTYQEIADGIDDF 180 Query: 181 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK 240 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK Sbjct: 181 FSLLEMPNLSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKK 240 Query: 241 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH 300 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH Sbjct: 241 KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTDRH 300 Query: 301 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV 360 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV Sbjct: 301 RRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSV 360 Query: 361 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE 415 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE Sbjct: 361 RVSIRLKHRQLTATWIELEPIIENGEVKDVMFKLSTRVQYTEENENETRPESSSE 415 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 415 bits (1068), Expect = e-119 Identities = 79/371 (21%), Positives = 150/371 (40%), Gaps = 40/371 (10%) Query: 48 AYDPLNNTIITQHGSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKL 107 YD I T G++ VP+ + + LA +V + +N+ K+L+W PA VY Sbjct: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 Query: 108 IMERLQKNKYFLRKQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQ--RHPEKV- 164 I++R+ + + + I V + +D N+K + ++ + R P+ V Sbjct: 82 IVDRIINHP-------ELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVF 134 Query: 165 YDLMTYQEIADGIDDFFSLLEMPN-LSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTI 223 + M + + FFS L N ++ + N T L SC A G I Sbjct: 135 FRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQ-AYWASGPNFSALKNVI 193 Query: 224 RHYWIKNAILEFADYKK-------KTVPYPFVGPPRDPGLFEHIVDDCS-TGAPIN---W 272 R I I +F + ++ + +P P P R+ +H V+ + A + W Sbjct: 194 RCSIIHEYISKFVEREQDKGHIGDQELP-PEEDPSRELNNVQHEVNSLTEQDAEADEGLW 252 Query: 273 DDFSIDSYAS-YVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTT 331 + IDS + + E E + DR I + L R ++ Sbjct: 253 GE--IDSLCEKWQSEAEDQTEAEIIADR-----------IIGNSQRMANLKIRRTKFKSV 299 Query: 332 LFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVSIRLKHRQLTATWIELEPIIENGEVKDVM 391 L+H +++++ G+ ++G + S+++S+ + + +TA W+ L E K V Sbjct: 300 LYHILKELIQSQGTVKVYRG-SSFSHDSIKISLHYEEQHITAVWVYLTVKFE-EHWKPVD 357 Query: 392 FKLSTRVQYTE 402 ++ R ++ E Sbjct: 358 VEVEFRCKFKE 368 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 319 bits (817), Expect = 4e-90 Identities = 54/345 (15%), Positives = 108/345 (30%), Gaps = 26/345 (7%) Query: 19 SPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQHGSYEVPEAFAQCTYLAQI 78 S EE + + +E+ + + D + + P N +IT G+ E+PE + LA Sbjct: 3 SREEVRASRPTKEM-KMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAIF 61 Query: 79 YVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLRKQVQAMEKIKVLLCPSPE 138 YV Y + PY + W P V I +R Q +K F K + + Sbjct: 62 YVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNAD-------VYVERLQ 114 Query: 139 LLENNYVDDNLKISSHKVTQ-RHPEKVYDLMTYQEIA-----DGIDDFFSLLEMPNLSLA 192 + + + ++ Q + + V + I +D++F L+ L+ Sbjct: 115 NAIASGIKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGD 174 Query: 193 FVNK-TSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKKKTVPYPFVGPP 251 +K ISC + A +R WI + K Sbjct: 175 LEGGLCKVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEE 234 Query: 252 RDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTD------RHRRVI- 304 R+ + ++ +D+ + + +E E E R Sbjct: 235 RNASGEQ-TTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQL 293 Query: 305 ---EYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSY 346 +++ T+ + R + +++ L + Sbjct: 294 EECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNL 338 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 224 bits (572), Expect = 1e-61 Identities = 41/182 (22%), Positives = 73/182 (39%), Gaps = 8/182 (4%) Query: 215 FIGRTARTIRHYWIKNAILEFADYKKKTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDD 274 + R + IR W+ I+ A+ + + I + S I+ + Sbjct: 7 ELERFGKAIRKIWLYETIMLGAEK----IAKGNLSEESTSTSESEIPNTSSQQDVISSSE 62 Query: 275 FSIDSYASYVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFH 334 DS + V+ LE V L Y+ D +L + R + ++ Sbjct: 63 HVSDSLKFWE-VLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYG 121 Query: 335 TVRDVLLHDGSYTRHKGVRCLDNGSVRVSIRLKH-RQLTATWIELEPII--ENGEVKDVM 391 VR ++ G ++GVRC NG ++VS+ + +T+ W+ L PI ENG ++D+ Sbjct: 122 YVRQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIE 181 Query: 392 FK 393 F Sbjct: 182 FT 183 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 195 bits (495), Expect = 1e-52 Identities = 32/177 (18%), Positives = 69/177 (38%), Gaps = 20/177 (11%) Query: 240 KKTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVE---- 295 ++ +P P E +V +T + S + + E EE + Sbjct: 140 QQNIPTKL---PSTQQAVERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEF 196 Query: 296 ------------LTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHD 343 ++ + I L M + ++L +G+ R R + L+ ++D++ Sbjct: 197 WNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEP 256 Query: 344 GSYTRHKGVRCLDNGSVRVSIRLKHR-QLTATWIELEPIIENGEVKDVMFKLSTRVQ 399 ++G R D ++ VS + K+R Q + ++LEP V ++ + +S V+ Sbjct: 257 WELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQESVVNIYYTISFTVR 313 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 20.3 bits (41), Expect = 4.0 Identities = 7/41 (17%), Positives = 13/41 (31%), Gaps = 3/41 (7%) Query: 312 ASDLHEEGKLTRARKLDRTTLFHTVRDVLLH---DGSYTRH 349 S L K + +FH +D++ D + Sbjct: 251 KSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIW 291 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 19.5 bits (39), Expect = 6.3 Identities = 6/20 (30%), Positives = 12/20 (60%) Query: 393 KLSTRVQYTEENENETRPES 412 +++ + EEN +TRP + Sbjct: 631 TVASSAESDEENRQKTRPRT 650 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.308 0.148 0.388 Lambda K H 0.267 0.0452 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 46,277 Number of Sequences: 43 Number of extensions: 2113 Number of successful extensions: 56 Number of sequences better than 10.0: 25 Number of HSP's better than 10.0 without gapping: 25 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 26 length of query: 415 length of database: 17,163 effective HSP length: 57 effective length of query: 358 effective length of database: 14,712 effective search space: 5266896 effective search space used: 5266896 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 38 (19.6 bits) S2: 38 (19.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). (373 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 768 0.0 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 65 9e-14 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 42 2e-06 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 37 3e-05 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 33 6e-04 swiss|P33418|LOS1_YEAST LOS1 protein. 21 2.4 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 6.4 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 768 bits (1982), Expect = 0.0 Identities = 373/373 (100%), Positives = 373/373 (100%) Query: 1 MNGERLLACIKQCIMQHFQPMVYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNT 60 MNGERLLACIKQCIMQHFQPMVYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNT Sbjct: 1 MNGERLLACIKQCIMQHFQPMVYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNT 60 Query: 61 DDTPVIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTL 120 DDTPVIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTL Sbjct: 61 DDTPVIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTL 120 Query: 121 NRRGKGIRRPKGVFFRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQAYW 180 NRRGKGIRRPKGVFFRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQAYW Sbjct: 121 NRRGKGIRRPKGVFFRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQAYW 180 Query: 181 ASGPNFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSL 240 ASGPNFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSL Sbjct: 181 ASGPNFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSL 240 Query: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL Sbjct: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 Query: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEV 360 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEV Sbjct: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEV 360 Query: 361 EFRCKFKERKVDG 373 EFRCKFKERKVDG Sbjct: 361 EFRCKFKERKVDG 373 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 65.5 bits (158), Expect = 9e-14 Identities = 67/327 (20%), Positives = 138/327 (41%), Gaps = 25/327 (7%) Query: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 V+ S+ ++ T +GT +P+NY+ Y LA +V +++ + ++ ++WP+P +V NT Sbjct: 31 VHVPSKNILITPKGTVEIPENYQNYPILAIFYVKYLMKKNPYDLLPSTVNWPEPYVVVNT 90 Query: 82 IVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFFRYMESP 141 I R +H + + ++ +L+ I G+ I R G+ ++ K V + +E Sbjct: 91 ITKRFQDHKLFANKNADVYVERLQNAIASGIKIPESKKNERLGQP-KKTKNV-TKEIEET 148 Query: 142 FVNT-----KVTAFFSYLRDYNKIASEYHNNTKFILTFSCQA-YWASGPNFSALKNVIRC 195 F++ ++ +F L+D K SC+A + + +R Sbjct: 149 FIDATNARKELDEYFRKLQDGTLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRK 208 Query: 196 SIIHEYISKFVEREQDKGHIGDQELPPEE------------DPSRELNNVQHEVNSLTEQ 243 I E + V +K I D +L EE + S L+ + +++L Q Sbjct: 209 VWIGEIVCGMV---SNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQ 265 Query: 244 --DAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLY 301 E +E W I +L + ++ E A ++G+ + + ++ +++ Y Sbjct: 266 INAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQKDTARALFY 325 Query: 302 HILKELIQSQGTVKVYRGSSFSHDSIK 328 LKE + + ++ S +IK Sbjct: 326 INLKECLGTSWNLEYTEASDARKMAIK 352 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 41.6 bits (96), Expect = 2e-06 Identities = 70/343 (20%), Positives = 138/343 (39%), Gaps = 33/343 (9%) Query: 23 YDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNTI 82 YD I T G++ VP+ + + LA +V + +N+ K+L+W PA VY I Sbjct: 49 YDPLNNTIITQHGSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLI 108 Query: 83 VDRIINHP-------ELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFF 135 ++R+ + + + I V + +D N+K + ++ + R P+ V + Sbjct: 109 MERLQKNKYFLRKQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQ--RHPEKV-Y 165 Query: 136 RYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQ-AYWASGPNFSALKNVIR 194 M + + FFS L N ++ + N T L SC A G IR Sbjct: 166 DLMTYQEIADGIDDFFSLLEMPN-LSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIR 224 Query: 195 CSIIHEYISKFVEREQDKGHIGDQELP-PEEDPSRELNNVQHEVNSLTE------QDAEA 247 I I +F + ++ + +P P P R+ +H V+ + D Sbjct: 225 HYWIKNAILEFADYKK-------KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSI 277 Query: 248 DEGLWGEIDSLCEKWQSEAE----DQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLYHI 303 D + ++ E + E + E + I + L R ++ L+H Sbjct: 278 DS--YASYVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHT 335 Query: 304 LKELIQSQGTVKVYRG-SSFSHDSIKISLHYEEQHITAVWVYL 345 +++++ G+ ++G + S+++S+ + + +TA W+ L Sbjct: 336 VRDVLLHDGSYTRHKGVRCLDNGSVRVSIRLKHRQLTATWIEL 378 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 37.0 bits (84), Expect = 3e-05 Identities = 35/156 (22%), Positives = 75/156 (47%), Gaps = 5/156 (3%) Query: 212 KGHIGDQELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWGEIDSLCEKWQSEAEDQTE 271 KG++ ++ E ++ Q +V S +E +++ + W ++ L W + + Sbjct: 33 KGNLSEESTSTSESEIPNTSS-QQDVISSSEHVSDSLK-FWEVLNFLEVSWVPHSGEDGV 90 Query: 272 AEIIADRIIGNSQRMANLKIRRTKFKSVLYHILKELIQSQGTVKVYRGSSFSHDSI-KIS 330 ++ + + LK RR K+ +Y +++++ S+G V++YRG + + K+S Sbjct: 91 QHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVLKVS 150 Query: 331 LHYEE-QHITAVWVYLT-VKFEEHWKPVDVEVEFRC 364 L Y + +T++WV L + E+ D+E F C Sbjct: 151 LFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFNC 186 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 33.1 bits (74), Expect = 6e-04 Identities = 29/112 (25%), Positives = 51/112 (44%), Gaps = 4/112 (3%) Query: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 T ++A E W SL K E+ + ++ + N + RR K+ L Sbjct: 186 TAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVGLKNRL 245 Query: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQH---ITAVWVYLTVKF 349 Y ILK+L++ ++ YRG S D + +S+ ++ ++ + V V L KF Sbjct: 246 YIILKDLVEEPWELRAYRGCR-SADGLTLSVSFKYKNRPQFSGVKVDLEPKF 296 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 20.8 bits (42), Expect = 2.4 Identities = 9/22 (40%), Positives = 14/22 (62%) Query: 218 QELPPEEDPSRELNNVQHEVNS 239 QE+ DPS LNN+ E+++ Sbjct: 455 QEIIVVIDPSLFLNNISQEISA 476 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 19.6 bits (39), Expect = 6.4 Identities = 7/14 (50%), Positives = 10/14 (71%) Query: 348 KFEEHWKPVDVEVE 361 KFE W+ +DV+ E Sbjct: 557 KFEAEWEHIDVKRE 570 Score = 19.2 bits (38), Expect = 7.9 Identities = 8/16 (50%), Positives = 10/16 (62%) Query: 357 DVEVEFRCKFKERKVD 372 DVE E CK K +V+ Sbjct: 392 DVEAELECKSKSVEVE 407 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 590 e-171 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 411 e-118 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 399 e-114 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 188 6e-51 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 138 1e-35 Sequences not found previously or not previously below threshold: trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 23 0.86 swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1... 22 0.97 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 19 7.8 CONVERGED! >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 590 bits (1521), Expect = e-171 Identities = 373/373 (100%), Positives = 373/373 (100%) Query: 1 MNGERLLACIKQCIMQHFQPMVYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNT 60 MNGERLLACIKQCIMQHFQPMVYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNT Sbjct: 1 MNGERLLACIKQCIMQHFQPMVYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNT 60 Query: 61 DDTPVIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTL 120 DDTPVIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTL Sbjct: 61 DDTPVIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTL 120 Query: 121 NRRGKGIRRPKGVFFRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQAYW 180 NRRGKGIRRPKGVFFRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQAYW Sbjct: 121 NRRGKGIRRPKGVFFRYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQAYW 180 Query: 181 ASGPNFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSL 240 ASGPNFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSL Sbjct: 181 ASGPNFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSL 240 Query: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL Sbjct: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 Query: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEV 360 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEV Sbjct: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEV 360 Query: 361 EFRCKFKERKVDG 373 EFRCKFKERKVDG Sbjct: 361 EFRCKFKERKVDG 373 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 411 bits (1057), Expect = e-118 Identities = 73/363 (20%), Positives = 145/363 (39%), Gaps = 34/363 (9%) Query: 23 YDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNTI 82 YD I T G++ VP+ + + LA +V + +N+ K+L+W PA VY I Sbjct: 49 YDPLNNTIITQHGSYEVPEAFAQCTYLAQIYVSYKINSLPYQTYIKDLEWISPAEVYKLI 108 Query: 83 VDRIINHP-------ELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFF 135 ++R+ + + + I V + +D N+K + ++ + R P+ V + Sbjct: 109 MERLQKNKYFLRKQVQAMEKIKVLLCPSPELLENNYVDDNLKISSHKVTQ--RHPEKV-Y 165 Query: 136 RYMESPFVNTKVTAFFSYLRDYNKIASEYHNNTKFILTFSCQ-AYWASGPNFSALKNVIR 194 M + + FFS L N ++ + N T L SC A G IR Sbjct: 166 DLMTYQEIADGIDDFFSLLEMPN-LSLAFVNKTSIKLNISCSGANNHIGSFIGRTARTIR 224 Query: 195 CSIIHEYISKFVEREQDKGHIGDQELP-PEEDPSRELNNVQHEVNSLTE------QDAEA 247 I I +F + ++ + +P P P R+ +H V+ + D Sbjct: 225 HYWIKNAILEFADYKK-------KTVPYPFVGPPRDPGLFEHIVDDCSTGAPINWDDFSI 277 Query: 248 DEGLWGEIDSLCEKWQSEAE----DQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLYHI 303 D + ++ E + E + E + I + L R ++ L+H Sbjct: 278 DS--YASYVTVMEHLEEVVELTDRHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHT 335 Query: 304 LKELIQSQGTVKVYRGSSF-SHDSIKISLHYEEQHITAVWVYLTVKFEEHWKPVDVEVEF 362 +++++ G+ ++G + S+++S+ + + +TA W+ L E+ + DV + Sbjct: 336 VRDVLLHDGSYTRHKGVRCLDNGSVRVSIRLKHRQLTATWIELEPII-ENGEVKDVMFKL 394 Query: 363 RCK 365 + Sbjct: 395 STR 397 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 399 bits (1027), Expect = e-114 Identities = 67/327 (20%), Positives = 138/327 (41%), Gaps = 25/327 (7%) Query: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 V+ S+ ++ T +GT +P+NY+ Y LA +V +++ + ++ ++WP+P +V NT Sbjct: 31 VHVPSKNILITPKGTVEIPENYQNYPILAIFYVKYLMKKNPYDLLPSTVNWPEPYVVVNT 90 Query: 82 IVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFFRYMESP 141 I R +H + + ++ +L+ I G+ I R G+ ++ K V + +E Sbjct: 91 ITKRFQDHKLFANKNADVYVERLQNAIASGIKIPESKKNERLGQP-KKTKNV-TKEIEET 148 Query: 142 FVNT-----KVTAFFSYLRDYNKIASEYHNNTKFILTFSCQA-YWASGPNFSALKNVIRC 195 F++ ++ +F L+D K SC+A + + +R Sbjct: 149 FIDATNARKELDEYFRKLQDGTLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRK 208 Query: 196 SIIHEYISKFVEREQDKGHIGDQELPPEE------------DPSRELNNVQHEVNSLTEQ 243 I E + V +K I D +L EE + S L+ + +++L Q Sbjct: 209 VWIGEIVCGMV---SNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQ 265 Query: 244 --DAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLY 301 E +E W I +L + ++ E A ++G+ + + ++ +++ Y Sbjct: 266 INAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQKDTARALFY 325 Query: 302 HILKELIQSQGTVKVYRGSSFSHDSIK 328 LKE + + ++ S +IK Sbjct: 326 INLKECLGTSWNLEYTEASDARKMAIK 352 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 188 bits (479), Expect = 6e-51 Identities = 40/183 (21%), Positives = 82/183 (43%), Gaps = 6/183 (3%) Query: 185 NFSALKNVIRCSIIHEYISKFVEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQD 244 IR ++E I E + KG++ ++ E ++ Q +V S +E Sbjct: 7 ELERFGKAIRKIWLYETIMLGAE-KIAKGNLSEESTSTSESEIPNTSS-QQDVISSSEHV 64 Query: 245 AEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLYHIL 304 +++ + W ++ L W + + ++ + + LK RR K+ +Y + Sbjct: 65 SDSLK-FWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYV 123 Query: 305 KELIQSQGTVKVYRGSSFSHDSI-KISLHYEE-QHITAVWVYLTV-KFEEHWKPVDVEVE 361 ++++ S+G V++YRG + + K+SL Y + +T++WV L E+ D+E Sbjct: 124 RQIMYSKGQVRIYRGVRCKPNGVLKVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFT 183 Query: 362 FRC 364 F C Sbjct: 184 FNC 186 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 138 bits (348), Expect = 1e-35 Identities = 29/112 (25%), Positives = 50/112 (43%), Gaps = 4/112 (3%) Query: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 T ++A E W SL K E+ + ++ + N + RR K+ L Sbjct: 186 TAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVGLKNRL 245 Query: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQ---HITAVWVYLTVKF 349 Y ILK+L++ ++ YRG S D + +S+ ++ + + V V L KF Sbjct: 246 YIILKDLVEEPWELRAYRGCR-SADGLTLSVSFKYKNRPQFSGVKVDLEPKF 296 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 22.5 bits (47), Expect = 0.86 Identities = 4/24 (16%), Positives = 10/24 (41%) Query: 279 IIGNSQRMANLKIRRTKFKSVLYH 302 + + +L+ R + V+Y Sbjct: 170 LKLDPSIFESLQKERVEAGDVIYI 193 >swiss|O14682|ENC1_HUMAN Ectoderm-neural cortex-1 protein (ENC-1) (P53-indu Length = 589 Score = 22.1 bits (46), Expect = 0.97 Identities = 14/63 (22%), Positives = 24/63 (37%), Gaps = 7/63 (11%) Query: 65 VIEKELDWPDPALVYNTIVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTLNRRG 124 V E ++W Y+ R PEL Q + +A + + + + T R+ Sbjct: 210 VYESAINWIS----YDL-KKRYCYLPELLQTVRLALLPAIYLMEN--VAMEELITKQRKS 262 Query: 125 KGI 127 K I Sbjct: 263 KEI 265 Score = 19.0 bits (38), Expect = 8.3 Identities = 5/14 (35%), Positives = 9/14 (63%), Gaps = 1/14 (7%) Query: 249 EGLWGEIDSLCEKW 262 + +W D+L E+W Sbjct: 356 KDVW-VYDTLHEEW 368 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 19.0 bits (38), Expect = 7.8 Identities = 7/38 (18%), Positives = 14/38 (36%) Query: 206 VEREQDKGHIGDQELPPEEDPSRELNNVQHEVNSLTEQ 243 R++ GD+ P + R+ + Q + Q Sbjct: 903 AGRKRRGNTSGDETSEPSNNKKRKHDVTQKYSDEADTQ 940 Score = 19.0 bits (38), Expect = 8.6 Identities = 7/24 (29%), Positives = 10/24 (41%) Query: 277 DRIIGNSQRMANLKIRRTKFKSVL 300 D I SQ +A L+ K+ Sbjct: 268 DHIFARSQELAELEKGLDTAKTTF 291 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.312 0.155 0.417 Lambda K H 0.267 0.0474 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 28,667 Number of Sequences: 43 Number of extensions: 1345 Number of successful extensions: 48 Number of sequences better than 10.0: 19 Number of HSP's better than 10.0 without gapping: 19 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 29 length of query: 373 length of database: 17,163 effective HSP length: 57 effective length of query: 316 effective length of database: 14,712 effective search space: 4648992 effective search space used: 4648992 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 37 (19.3 bits) S2: 38 (19.0 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). (357 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 699 0.0 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 67 4e-14 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 47 3e-08 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 30 0.004 swiss|P33418|LOS1_YEAST LOS1 protein. 21 2.0 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 20 5.5 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 19 6.8 swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. 19 7.1 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 699 bits (1804), Expect = 0.0 Identities = 357/357 (100%), Positives = 357/357 (100%) Query: 1 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI 60 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI Sbjct: 1 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI 60 Query: 61 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG 120 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG Sbjct: 61 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG 120 Query: 121 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC 180 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC Sbjct: 121 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC 180 Query: 181 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE 240 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE Sbjct: 181 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE 240 Query: 241 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA 300 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA Sbjct: 241 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA 300 Query: 301 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN 357 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN Sbjct: 301 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN 357 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 66.6 bits (161), Expect = 4e-14 Identities = 67/327 (20%), Positives = 138/327 (41%), Gaps = 25/327 (7%) Query: 31 VHVPSKNILITPKGTVEIPENYQNYPILAIFYVKYLMKKNPYDLLPSTVNWPEPYVVVNT 90 V+ S+ ++ T +GT +P+NY+ Y LA +V +++ + ++ ++WP+P +V NT Sbjct: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 Query: 91 ITKRFQDHKLFANKNADVYVERLQNAIASGIKIPESKKNERLGQP-KKTKNV-TKEIEET 148 I R +H + + ++ +L+ I G+ I R G+ ++ K V + +E Sbjct: 82 IVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFFRYMESP 141 Query: 149 FIDATNARKELDEYFRKLQDGTLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRK 208 F++ ++ +F L+D K SC+A + + +R Sbjct: 142 FVNT-----KVTAFFSYLRDYNKIASEYHNNTKFILTFSCQA-YWASGPNFSALKNVIRC 195 Query: 209 VWIGEIVCGMV---SNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQ 265 I E + V +K I D +L EE + S L+ + +++L Q Sbjct: 196 SIIHEYISKFVEREQDKGHIGDQELPPEE------------DPSRELNNVQHEVNSLTEQ 243 Query: 266 INAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQKDTARALFY 325 E +E W I +L + ++ E A ++G+ + + ++ +++ Y Sbjct: 244 --DAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLY 301 Query: 326 INLKECLGTSWNLEYTEASDARKMAIK 352 LKE + + ++ S +IK Sbjct: 302 HILKELIQSQGTVKVYRGSSFSHDSIK 328 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 47.4 bits (111), Expect = 3e-08 Identities = 29/103 (28%), Positives = 45/103 (43%), Gaps = 1/103 (0%) Query: 3 SREEVRASRPTKEM-KMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAIF 61 S EE + + +E+ + + D + + P N +IT G+ E+PE + LA Sbjct: 19 SPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQHGSYEVPEAFAQCTYLAQI 78 Query: 62 YVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANK 104 YV Y + PY + W P V I +R Q +K F K Sbjct: 79 YVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLRK 121 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 30.4 bits (67), Expect = 0.004 Identities = 26/106 (24%), Positives = 46/106 (42%), Gaps = 14/106 (13%) Query: 233 EERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQ 292 E R S E T + +E+E + +ALN +E FW R+L ++ S Sbjct: 168 ETRERSPEHTEASTQEAETAE------EALN--------KEEFWNLYRSLESKSYISREN 213 Query: 293 LEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNL 338 + + R + L + +I F +++ + YI LK+ + W L Sbjct: 214 VPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWEL 259 >swiss|P33418|LOS1_YEAST LOS1 protein. Length = 1100 Score = 21.2 bits (43), Expect = 2.0 Identities = 20/89 (22%), Positives = 35/89 (38%), Gaps = 19/89 (21%) Query: 249 SEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKK 308 S ++ + G++A +N ++ E + W + L N+ P FL+ K Sbjct: 61 STLIELMTEGVNANPNGLNLVKFEITKWLKFQVLGNKQTKLPD----------FLM--NK 108 Query: 309 ILQTFTKQKDTARALFYINLKECLGTSWN 337 I + T LF + +C G WN Sbjct: 109 ISEVLT-------TLFMLMYSDCNGNQWN 130 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 19.6 bits (39), Expect = 5.5 Identities = 6/18 (33%), Positives = 11/18 (60%) Query: 198 ELQFMATNVRKVWIGEIV 215 EL+ +RK+W+ E + Sbjct: 7 ELERFGKAIRKIWLYETI 24 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 19.2 bits (38), Expect = 6.8 Identities = 11/31 (35%), Positives = 18/31 (57%) Query: 202 MATNVRKVWIGEIVCGMVSNKNAIDDNDLEE 232 M TN+RK + + C + +NAI+ + EE Sbjct: 64 MKTNLRKGTMLPVFCVVEHYENAIEYDCKEE 94 >swiss|P54252|MJD1_HUMAN Machado-Joseph disease protein 1. Length = 360 Score = 19.2 bits (38), Expect = 7.1 Identities = 10/33 (30%), Positives = 15/33 (45%) Query: 228 NDLEEEERNASGEQTTTAREESEALDTTSNGLD 260 + L+EEER E T+ + L S +D Sbjct: 38 HQLDEEERMRMAEGGVTSEDYRTFLQQPSGNMD 70 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 575 e-167 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 466 e-134 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 177 2e-47 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 134 2e-34 Sequences not found previously or not previously below threshold: swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 30 0.003 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 22 0.82 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 22 1.2 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 575 bits (1482), Expect = e-167 Identities = 357/357 (100%), Positives = 357/357 (100%) Query: 1 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI 60 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI Sbjct: 1 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI 60 Query: 61 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG 120 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG Sbjct: 61 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG 120 Query: 121 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC 180 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC Sbjct: 121 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC 180 Query: 181 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE 240 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE Sbjct: 181 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE 240 Query: 241 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA 300 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA Sbjct: 241 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA 300 Query: 301 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN 357 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN Sbjct: 301 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN 357 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 466 bits (1199), Expect = e-134 Identities = 67/327 (20%), Positives = 138/327 (41%), Gaps = 25/327 (7%) Query: 31 VHVPSKNILITPKGTVEIPENYQNYPILAIFYVKYLMKKNPYDLLPSTVNWPEPYVVVNT 90 V+ S+ ++ T +GT +P+NY+ Y LA +V +++ + ++ ++WP+P +V NT Sbjct: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 Query: 91 ITKRFQDHKLFANKNADVYVERLQNAIASGIKIPESKKNERLGQP-KKTKNV-TKEIEET 148 I R +H + + ++ +L+ I G+ I R G+ ++ K V + +E Sbjct: 82 IVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFFRYMESP 141 Query: 149 FIDATNARKELDEYFRKLQDGTLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRK 208 F++ ++ +F L+D K SC+A + + +R Sbjct: 142 FVNT-----KVTAFFSYLRDYNKIASEYHNNTKFILTFSCQA-YWASGPNFSALKNVIRC 195 Query: 209 VWIGEIVCGMV---SNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQ 265 I E + V +K I D +L EE + S L+ + +++L Q Sbjct: 196 SIIHEYISKFVEREQDKGHIGDQELPPEE------------DPSRELNNVQHEVNSLTEQ 243 Query: 266 INAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQKDTARALFY 325 E +E W I +L + ++ E A ++G+ + + ++ +++ Y Sbjct: 244 --DAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLY 301 Query: 326 INLKECLGTSWNLEYTEASDARKMAIK 352 LKE + + ++ S +IK Sbjct: 302 HILKELIQSQGTVKVYRGSSFSHDSIK 328 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 177 bits (449), Expect = 2e-47 Identities = 51/233 (21%), Positives = 86/233 (36%), Gaps = 25/233 (10%) Query: 3 SREEVRASRPTKEM-KMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAIF 61 S EE + + +E+ + + D + + P N +IT G+ E+PE + LA Sbjct: 19 SPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQHGSYEVPEAFAQCTYLAQI 78 Query: 62 YVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQ------- 114 YV Y + PY + W P V I +R Q +K F K +E+++ Sbjct: 79 YVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLRKQVQA-MEKIKVLLCPSP 137 Query: 115 -----NAIASGIKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDG 169 N + +KI K +R P+K ++ E +D++F L+ Sbjct: 138 ELLENNYVDDNLKISSHKVTQR--HPEKVYDLMTYQE--------IADGIDDFFSLLEMP 187 Query: 170 TLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNK 222 L+ +K ISC + A +R WI + K Sbjct: 188 NLSLA-FVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYK 239 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 134 bits (337), Expect = 2e-34 Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 14/111 (12%) Query: 233 EERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQ 292 E R S E T + +E+E + +ALN +E FW R+L ++ S Sbjct: 168 ETRERSPEHTEASTQEAETAE------EALN--------KEEFWNLYRSLESKSYISREN 213 Query: 293 LEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEA 343 + + R + L + +I F +++ + YI LK+ + W L Sbjct: 214 VPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRG 264 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 30.3 bits (67), Expect = 0.003 Identities = 21/146 (14%), Positives = 52/146 (35%), Gaps = 14/146 (9%) Query: 198 ELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSN 257 EL+ +RK+W+ E + + I +L EE + S + + + + ++ + Sbjct: 7 ELERFGKAIRKIWLYETI---MLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEH 63 Query: 258 GLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQK 317 D+L FWE + L + K + ++ + K++ Sbjct: 64 VSDSL-----------KFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRR 112 Query: 318 DTARALFYINLKECLGTSWNLEYTEA 343 A+ Y +++ + + + Sbjct: 113 LAAKNDVYGYVRQIMYSKGQVRIYRG 138 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 22.2 bits (46), Expect = 0.82 Identities = 10/34 (29%), Positives = 19/34 (55%), Gaps = 2/34 (5%) Query: 112 RLQNAIASGIK-IPESK-KNERLGQPKKTKNVTK 143 +LQN I G K +P K + + +K++N ++ Sbjct: 262 KLQNTITEGPKAVPTKKRRVATRVRGRKSRNTSR 295 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 21.8 bits (45), Expect = 1.2 Identities = 5/34 (14%), Positives = 13/34 (37%) Query: 299 AAVFLLGHKKILQTFTKQKDTARALFYINLKECL 332 VF ++IL T ++ + ++ + Sbjct: 246 VVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLV 279 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 544 e-158 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 431 e-124 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 304 7e-86 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 163 3e-43 swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 125 1e-31 Sequences not found previously or not previously below threshold: trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 25 0.15 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 19 7.6 CONVERGED! >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 544 bits (1402), Expect = e-158 Identities = 357/357 (100%), Positives = 357/357 (100%) Query: 1 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI 60 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI Sbjct: 1 MFSREEVRASRPTKEMKMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAI 60 Query: 61 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG 120 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG Sbjct: 61 FYVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQNAIASG 120 Query: 121 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC 180 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC Sbjct: 121 IKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDGTLTGDLEGGLC 180 Query: 181 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE 240 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE Sbjct: 181 KVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGE 240 Query: 241 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA 300 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA Sbjct: 241 QTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAA 300 Query: 301 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN 357 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN Sbjct: 301 VFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARKMAIKGELQN 357 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 431 bits (1109), Expect = e-124 Identities = 67/327 (20%), Positives = 138/327 (41%), Gaps = 25/327 (7%) Query: 31 VHVPSKNILITPKGTVEIPENYQNYPILAIFYVKYLMKKNPYDLLPSTVNWPEPYVVVNT 90 V+ S+ ++ T +GT +P+NY+ Y LA +V +++ + ++ ++WP+P +V NT Sbjct: 22 VYDESRCVIETTRGTFPVPDNYKKYKTLAFAFVGHVLNTDDTPVIEKELDWPDPALVYNT 81 Query: 91 ITKRFQDHKLFANKNADVYVERLQNAIASGIKIPESKKNERLGQP-KKTKNV-TKEIEET 148 I R +H + + ++ +L+ I G+ I R G+ ++ K V + +E Sbjct: 82 IVDRIINHPELSQFISVAFISQLKATIGEGLDINVKGTLNRRGKGIRRPKGVFFRYMESP 141 Query: 149 FIDATNARKELDEYFRKLQDGTLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRK 208 F++ ++ +F L+D K SC+A + + +R Sbjct: 142 FVNT-----KVTAFFSYLRDYNKIASEYHNNTKFILTFSCQA-YWASGPNFSALKNVIRC 195 Query: 209 VWIGEIVCGMV---SNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQ 265 I E + V +K I D +L EE + S L+ + +++L Q Sbjct: 196 SIIHEYISKFVEREQDKGHIGDQELPPEE------------DPSRELNNVQHEVNSLTEQ 243 Query: 266 INAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQKDTARALFY 325 E +E W I +L + ++ E A ++G+ + + ++ +++ Y Sbjct: 244 --DAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVLY 301 Query: 326 INLKECLGTSWNLEYTEASDARKMAIK 352 LKE + + ++ S +IK Sbjct: 302 HILKELIQSQGTVKVYRGSSFSHDSIK 328 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 304 bits (780), Expect = 7e-86 Identities = 61/364 (16%), Positives = 117/364 (31%), Gaps = 43/364 (11%) Query: 3 SREEVRASRPTKEM-KMIFDVLMTFPYFAVHVPSKNILITPKGTVEIPENYQNYPILAIF 61 S EE + + +E+ + + D + + P N +IT G+ E+PE + LA Sbjct: 19 SPEEQKGVQQVREIYRALLDSFKFKNEYNAYDPLNNTIITQHGSYEVPEAFAQCTYLAQI 78 Query: 62 YVKYLMKKNPYDLLPSTVNWPEPYVVVNTITKRFQDHKLFANKNADVYVERLQ------- 114 YV Y + PY + W P V I +R Q +K F K +E+++ Sbjct: 79 YVSYKINSLPYQTYIKDLEWISPAEVYKLIMERLQKNKYFLRKQVQA-MEKIKVLLCPSP 137 Query: 115 -----NAIASGIKIPESKKNERLGQPKKTKNVTKEIEETFIDATNARKELDEYFRKLQDG 169 N + +KI K +R P+K ++ E +D++F L+ Sbjct: 138 ELLENNYVDDNLKISSHKVTQR--HPEKVYDLMTYQE--------IADGIDDFFSLLEMP 187 Query: 170 TLTGDLEGGLCKVKTLISCKALFGGHTQELQFMATNVRKVWIGEIVCGMVSNKNA-IDDN 228 L+ +K ISC + A +R WI + K + Sbjct: 188 NLSLA-FVNKTSIKLNISCSGANNHIGSFIGRTARTIRHYWIKNAILEFADYKKKTVPYP 246 Query: 229 DLEEEERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRT 288 + E ++ +D+ + + +E E E Sbjct: 247 FVGPPRDPGLFEHIVDDCSTGAPINWDDFSIDSYASYVTVMEHLEEVVELTD-------- 298 Query: 289 SPTQLEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEASDARK 348 +++ T+ + R + +++ L + S R Sbjct: 299 --RHRRVIEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVL-------LHDGSYTRH 349 Query: 349 MAIK 352 ++ Sbjct: 350 KGVR 353 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 163 bits (413), Expect = 3e-43 Identities = 21/146 (14%), Positives = 52/146 (35%), Gaps = 14/146 (9%) Query: 198 ELQFMATNVRKVWIGEIVCGMVSNKNAIDDNDLEEEERNASGEQTTTAREESEALDTTSN 257 EL+ +RK+W+ E + + I +L EE + S + + + + ++ + Sbjct: 7 ELERFGKAIRKIWLYETI---MLGAEKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEH 63 Query: 258 GLDALNTQINAIETEESFWEAIRALHNELRTSPTQLEECRKAAVFLLGHKKILQTFTKQK 317 D+L FWE + L + K + ++ + K++ Sbjct: 64 VSDSL-----------KFWEVLNFLEVSWVPHSGEDGVQHKLSQYVEKDPSTFEELKKRR 112 Query: 318 DTARALFYINLKECLGTSWNLEYTEA 343 A+ Y +++ + + + Sbjct: 113 LAAKNDVYGYVRQIMYSKGQVRIYRG 138 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 125 bits (313), Expect = 1e-31 Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 14/111 (12%) Query: 233 EERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQ 292 E R S E T + +E+E + +ALN +E FW R+L ++ S Sbjct: 168 ETRERSPEHTEASTQEAETAE------EALN--------KEEFWNLYRSLESKSYISREN 213 Query: 293 LEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNLEYTEA 343 + + R + L + +I F +++ + YI LK+ + W L Sbjct: 214 VPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRG 264 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 24.9 bits (53), Expect = 0.15 Identities = 6/24 (25%), Positives = 11/24 (45%) Query: 303 LLGHKKILQTFTKQKDTARALFYI 326 L I ++ K++ A + YI Sbjct: 170 LKLDPSIFESLQKERVEAGDVIYI 193 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 19.1 bits (38), Expect = 7.6 Identities = 9/37 (24%), Positives = 19/37 (51%) Query: 294 EECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKE 330 E+ + A+ LL + + +Q F + + R Y + +E Sbjct: 413 EDPKTASQSLLVNLRAMQNFLQLPEAERDRIYQDERE 449 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.309 0.148 0.389 Lambda K H 0.267 0.0453 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 39,618 Number of Sequences: 43 Number of extensions: 1739 Number of successful extensions: 63 Number of sequences better than 10.0: 31 Number of HSP's better than 10.0 without gapping: 33 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 38 length of query: 357 length of database: 17,163 effective HSP length: 56 effective length of query: 301 effective length of database: 14,755 effective search space: 4441255 effective search space used: 4441255 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 37 (19.2 bits) S2: 38 (19.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). (322 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 555 e-161 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 34 2e-04 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 31 0.002 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 31 0.002 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 30 0.003 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 20 4.0 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 18 9.8 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 555 bits (1430), Expect = e-161 Identities = 322/322 (100%), Positives = 322/322 (100%) Query: 1 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG 60 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG Sbjct: 1 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG 60 Query: 61 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG 120 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG Sbjct: 61 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG 120 Query: 121 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS 180 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS Sbjct: 121 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS 180 Query: 181 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG 240 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG Sbjct: 181 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG 240 Query: 241 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES 300 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES Sbjct: 241 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES 300 Query: 301 VVNIYYTISFTVRNVRPLTHSV 322 VVNIYYTISFTVRNVRPLTHSV Sbjct: 301 VVNIYYTISFTVRNVRPLTHSV 322 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 33.9 bits (76), Expect = 2e-04 Identities = 29/112 (25%), Positives = 51/112 (44%), Gaps = 4/112 (3%) Query: 186 TAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVGLKNRL 245 T ++A E W SL K E+ + ++ + N + RR K+ L Sbjct: 241 TEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADRIIGNSQRMANLKIRRTKFKSVL 300 Query: 246 YIILKDLVEEPWELRAYRGCR-SADGLTLSVSFKYKNRPQFSGVKVDLEPKF 296 Y ILK+L++ ++ YRG S D + +S+ ++ ++ + V V L KF Sbjct: 301 YHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQH---ITAVWVYLTVKF 349 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 30.8 bits (68), Expect = 0.002 Identities = 26/106 (24%), Positives = 46/106 (42%), Gaps = 14/106 (13%) Query: 168 ETRERSPEHTEASTQEAETAE------EALN--------KEEFWNLYRSLESKSYISREN 213 E R S E T + +E+E + +ALN +E FW R+L ++ S Sbjct: 233 EERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQ 292 Query: 214 VPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWEL 259 + + R + L + +I F +++ + YI LK+ + W L Sbjct: 293 LEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNL 338 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 30.8 bits (68), Expect = 0.002 Identities = 26/106 (24%), Positives = 45/106 (41%), Gaps = 4/106 (3%) Query: 122 SSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAST 181 ++A +AL+T +P Q +P+ PS A E + + E ++ + T Sbjct: 125 TAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLT 184 Query: 182 QE----AETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMC 223 E +T E L K E + YR + S + ++P I L +C Sbjct: 185 TERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLC 230 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 30.4 bits (67), Expect = 0.003 Identities = 34/158 (21%), Positives = 68/158 (42%), Gaps = 4/158 (2%) Query: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 E++ + + T E +T + ++E + +FW + LE S++ Sbjct: 29 EKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEV-SWVPHSGE 87 Query: 215 PDIR-LLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTL 273 ++ L ++ + E +RR+ KN +Y ++ ++ ++R YRG R L Sbjct: 88 DGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVL 147 Query: 274 SVSFKYKNRPQFSGVKVDLEPKFLQE--SVVNIYYTIS 309 VS Y + + V L P L+E S+ +I +T + Sbjct: 148 KVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 20.0 bits (40), Expect = 4.0 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 3/48 (6%) Query: 57 SGSGEEYEGAPHVHFQESTATEEPRTPSKTP---ASQTTQSIGRTVSI 101 S ++ A +VH + + E TPSK+ S Q GR V++ Sbjct: 96 SSKNQDSMAAQNVHQKSQMSVETCTTPSKSNLILESLGLQFFGREVTL 143 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 18.5 bits (36), Expect = 9.8 Identities = 22/80 (27%), Positives = 37/80 (45%), Gaps = 5/80 (6%) Query: 150 TQQAVERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALN--KEEFWNLYRSLESKS 207 T++ R + RA E+ + + A +E E A ++ ++E L +S+ + Sbjct: 569 TREKESRKNLEEELERATESLDEMNRNAFALAKELELANSHISSLEDEREVLQKSVSEQK 628 Query: 208 YI---SRENVPDIRLLVMCL 224 I SREN+ D LVM L Sbjct: 629 QISQESRENLEDAHSLVMKL 648 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 516 e-149 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 215 4e-59 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 163 2e-43 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 152 7e-40 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 120 2e-30 Sequences not found previously or not previously below threshold: swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 57 2e-11 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 26 0.056 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 22 1.3 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 21 2.0 trembl|Q60952|Q60952 Intranuclear protein. 21 2.2 swiss|P25233|NECD_MOUSE Necdin. 20 3.8 trembl|Q15219|Q15219 Protein p84. 20 4.0 >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 516 bits (1328), Expect = e-149 Identities = 322/322 (100%), Positives = 322/322 (100%) Query: 1 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG 60 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG Sbjct: 1 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG 60 Query: 61 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG 120 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG Sbjct: 61 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG 120 Query: 121 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS 180 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS Sbjct: 121 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS 180 Query: 181 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG 240 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG Sbjct: 181 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG 240 Query: 241 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES 300 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES Sbjct: 241 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES 300 Query: 301 VVNIYYTISFTVRNVRPLTHSV 322 VVNIYYTISFTVRNVRPLTHSV Sbjct: 301 VVNIYYTISFTVRNVRPLTHSV 322 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 215 bits (548), Expect = 4e-59 Identities = 34/158 (21%), Positives = 68/158 (42%), Gaps = 4/158 (2%) Query: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 E++ + + T E +T + ++E + +FW + LE S++ Sbjct: 29 EKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEV-SWVPHSGE 87 Query: 215 PDIR-LLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTL 273 ++ L ++ + E +RR+ KN +Y ++ ++ ++R YRG R L Sbjct: 88 DGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVL 147 Query: 274 SVSFKYKNRPQFSGVKVDLEPKFLQE--SVVNIYYTIS 309 VS Y + + V L P L+E S+ +I +T + Sbjct: 148 KVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 163 bits (413), Expect = 2e-43 Identities = 35/156 (22%), Positives = 62/156 (39%), Gaps = 8/156 (5%) Query: 165 RAAETRERSPEHTEASTQEAE-TAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMC 223 + S E + T ++A E W SL K E+ + ++ Sbjct: 219 ELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADR 278 Query: 224 LKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCR-SADGLTLSVSFKYKNR 282 + N + RR K+ LY ILK+L++ ++ YRG S D + +S+ ++ ++ Sbjct: 279 IIGNSQRMANLKIRRTKFKSVLYHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQH- 337 Query: 283 PQFSGVKVDLEPKF---LQESVVNIYYTISFTVRNV 315 + V V L KF + V + + F R V Sbjct: 338 --ITAVWVYLTVKFEEHWKPVDVEVEFRCKFKERKV 371 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 152 bits (383), Expect = 7e-40 Identities = 39/214 (18%), Positives = 76/214 (35%), Gaps = 31/214 (14%) Query: 122 SSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAST 181 ++A +AL+T +P Q +P+ PS A E + + E ++ + T Sbjct: 125 TAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLT 184 Query: 182 QE----AETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCL------------- 224 E +T E L K E + YR + S + ++P I L +C Sbjct: 185 TERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTGGADK 244 Query: 225 ------KANLEIQGEFDRRRVGLKNRLYIILKDLVEEP---WELRAYRGCRSADGLTLSV 275 K++ +I + + ++ +DLV +R + S + V Sbjct: 245 NVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIW----SVPNASC-V 299 Query: 276 SFKYKNRPQFSGVKVDLEPKFLQESVVNIYYTIS 309 + +G+ + +L S + Y+ S Sbjct: 300 QVVRAHESAVTGLSLHATGDYLLSSSDDQYWAFS 333 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 120 bits (300), Expect = 2e-30 Identities = 26/106 (24%), Positives = 46/106 (42%), Gaps = 14/106 (13%) Query: 168 ETRERSPEHTEASTQEAETAE------EALN--------KEEFWNLYRSLESKSYISREN 213 E R S E T + +E+E + +ALN +E FW R+L ++ S Sbjct: 233 EERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQ 292 Query: 214 VPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWEL 259 + + R + L + +I F +++ + YI LK+ + W L Sbjct: 293 LEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNL 338 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 57.3 bits (137), Expect = 2e-11 Identities = 23/97 (23%), Positives = 48/97 (48%), Gaps = 1/97 (1%) Query: 217 IRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVS 276 I L M + ++L +G+ R R + L+ ++D++ ++G R D ++ VS Sbjct: 304 IEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVS 363 Query: 277 FKYKNRPQFSGVKVDLEPKFLQESVVNIYYTISFTVR 313 + K+R Q + ++LEP V ++ + +S V+ Sbjct: 364 IRLKHR-QLTATWIELEPIIENGEVKDVMFKLSTRVQ 399 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 26.1 bits (56), Expect = 0.056 Identities = 7/24 (29%), Positives = 11/24 (45%) Query: 224 LKANLEIQGEFDRRRVGLKNRLYI 247 LK + I + RV + +YI Sbjct: 170 LKLDPSIFESLQKERVEAGDVIYI 193 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 21.5 bits (44), Expect = 1.3 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 3/48 (6%) Query: 57 SGSGEEYEGAPHVHFQESTATEEPRTPSKTP---ASQTTQSIGRTVSI 101 S ++ A +VH + + E TPSK+ S Q GR V++ Sbjct: 96 SSKNQDSMAAQNVHQKSQMSVETCTTPSKSNLILESLGLQFFGREVTL 143 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 20.7 bits (42), Expect = 2.0 Identities = 14/91 (15%), Positives = 34/91 (36%), Gaps = 9/91 (9%) Query: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 ER + + RA E R+ ++ + L KE+ + + + + Sbjct: 174 ERDIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQKEK------RIAEELIV---QI 224 Query: 215 PDIRLLVMCLKANLEIQGEFDRRRVGLKNRL 245 ++ +M +K + + E + ++ L L Sbjct: 225 EGLQNNLMQMKEDKKKLQEELKEKLDLIQVL 255 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 20.7 bits (42), Expect = 2.2 Identities = 27/128 (21%), Positives = 47/128 (36%), Gaps = 15/128 (11%) Query: 104 ATRRASQESPRQTETAGSSSA--PDALSTQQPPLE-----HSTQQNIPTKLPSTQQAVER 156 A R +Q S RQ E A S + +ALS Q L+ Q + L ++ A++ Sbjct: 27 ALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHTLEASTAALQA 86 Query: 157 V-----VSSQTTSRAAETRERSPE-HTEASTQEAETAEEALNKEEFWNLYRSLESKSYIS 210 ++ A RE + + + ++ L ++E R L K+ Sbjct: 87 TLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKE--EELRQLLEKAGAR 144 Query: 211 RENVPDIR 218 R I+ Sbjct: 145 RSQENGIQ 152 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 20.0 bits (40), Expect = 3.8 Identities = 6/24 (25%), Positives = 11/24 (45%) Query: 197 WNLYRSLESKSYISRENVPDIRLL 220 WN+ R L + + D+R + Sbjct: 215 WNVLRILGLRPWKKHSTFGDVRKI 238 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 20.0 bits (40), Expect = 4.0 Identities = 7/15 (46%), Positives = 9/15 (59%) Query: 184 AETAEEALNKEEFWN 198 ++ E LN EE WN Sbjct: 375 SKMVEHILNTEENWN 389 Searching.done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). 494 e-143 trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. 226 3e-62 swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). 197 1e-53 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 196 3e-53 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 136 3e-35 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 114 2e-28 Sequences not found previously or not previously below threshold: trembl|Q60952|Q60952 Intranuclear protein. 21 1.4 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 21 1.4 trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. 20 3.0 trembl|Q15219|Q15219 Protein p84. 20 4.2 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 19 7.1 CONVERGED! >swiss|P13777|REP1_ZYGBI Trans-acting factor B (REP1). Length = 322 Score = 494 bits (1273), Expect = e-143 Identities = 322/322 (100%), Positives = 322/322 (100%) Query: 1 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG 60 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG Sbjct: 1 MPGTAVHFNELELFFRQTRFAELVQYFVEMDDDSDDAEAVFPTSEPSRVSEGSENESGSG 60 Query: 61 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG 120 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG Sbjct: 61 EEYEGAPHVHFQESTATEEPRTPSKTPASQTTQSIGRTVSIAEATRRASQESPRQTETAG 120 Query: 121 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS 180 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS Sbjct: 121 SSSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAS 180 Query: 181 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG 240 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG Sbjct: 181 TQEAETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCLKANLEIQGEFDRRRVG 240 Query: 241 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES 300 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES Sbjct: 241 LKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVSFKYKNRPQFSGVKVDLEPKFLQES 300 Query: 301 VVNIYYTISFTVRNVRPLTHSV 322 VVNIYYTISFTVRNVRPLTHSV Sbjct: 301 VVNIYYTISFTVRNVRPLTHSV 322 >trembl|Q9UMS4|Q9UMS4 Nuclear matrix protein NMP200. Length = 504 Score = 226 bits (576), Expect = 3e-62 Identities = 39/214 (18%), Positives = 76/214 (35%), Gaps = 31/214 (14%) Query: 122 SSAPDALSTQQPPLEHSTQQNIPTKLPSTQQAVERVVSSQTTSRAAETRERSPEHTEAST 181 ++A +AL+T +P Q +P+ PS A E + + E ++ + T Sbjct: 125 TAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKATVLT 184 Query: 182 QE----AETAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMCL------------- 224 E +T E L K E + YR + S + ++P I L +C Sbjct: 185 TERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILALDLCPSDTNKILTGGADK 244 Query: 225 ------KANLEIQGEFDRRRVGLKNRLYIILKDLVEE---PWELRAYRGCRSADGLTLSV 275 K++ +I + + ++ +DLV +R + S + V Sbjct: 245 NVVVFDKSSEQILATLKGHTKKVTSVVFHPSQDLVFSASPDATIRIW----SVPNASC-V 299 Query: 276 SFKYKNRPQFSGVKVDLEPKFLQESVVNIYYTIS 309 + +G+ + +L S + Y+ S Sbjct: 300 QVVRAHESAVTGLSLHATGDYLLSSSDDQYWAFS 333 >swiss|P13742|REP2_ZYGFE Trans-acting factor C (REP2). Length = 200 Score = 197 bits (501), Expect = 1e-53 Identities = 34/158 (21%), Positives = 68/158 (42%), Gaps = 4/158 (2%) Query: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEEFWNLYRSLESKSYISRENV 214 E++ + + T E +T + ++E + +FW + LE S++ Sbjct: 29 EKIAKGNLSEESTSTSESEIPNTSSQQDVISSSEHVSDSLKFWEVLNFLEV-SWVPHSGE 87 Query: 215 PDIR-LLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTL 273 ++ L ++ + E +RR+ KN +Y ++ ++ ++R YRG R L Sbjct: 88 DGVQHKLSQYVEKDPSTFEELKKRRLAAKNDVYGYVRQIMYSKGQVRIYRGVRCKPNGVL 147 Query: 274 SVSFKYKNRPQFSGVKVDLEPKFLQE--SVVNIYYTIS 309 VS Y + + V L P L+E S+ +I +T + Sbjct: 148 KVSLFYGPCKWVTSMWVTLVPITLRENGSLEDIEFTFN 185 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 196 bits (498), Expect = 3e-53 Identities = 35/156 (22%), Positives = 62/156 (39%), Gaps = 8/156 (5%) Query: 165 RAAETRERSPEHTEASTQEAE-TAEEALNKEEFWNLYRSLESKSYISRENVPDIRLLVMC 223 + S E + T ++A E W SL K E+ + ++ Sbjct: 219 ELPPEEDPSRELNNVQHEVNSLTEQDAEADEGLWGEIDSLCEKWQSEAEDQTEAEIIADR 278 Query: 224 LKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCR-SADGLTLSVSFKYKNR 282 + N + RR K+ LY ILK+L++ ++ YRG S D + +S+ ++ ++ Sbjct: 279 IIGNSQRMANLKIRRTKFKSVLYHILKELIQSQGTVKVYRGSSFSHDSIKISLHYEEQH- 337 Query: 283 PQFSGVKVDLEPKF---LQESVVNIYYTISFTVRNV 315 + V V L KF + V + + F R V Sbjct: 338 --ITAVWVYLTVKFEEHWKPVDVEVEFRCKFKERKV 371 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 136 bits (342), Expect = 3e-35 Identities = 23/97 (23%), Positives = 48/97 (48%), Gaps = 1/97 (1%) Query: 217 IRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWELRAYRGCRSADGLTLSVS 276 I L M + ++L +G+ R R + L+ ++D++ ++G R D ++ VS Sbjct: 304 IEYLGMYIASDLHEEGKLTRARKLDRTTLFHTVRDVLLHDGSYTRHKGVRCLDNGSVRVS 363 Query: 277 FKYKNRPQFSGVKVDLEPKFLQESVVNIYYTISFTVR 313 + K+R Q + ++LEP V ++ + +S V+ Sbjct: 364 IRLKHR-QLTATWIELEPIIENGEVKDVMFKLSTRVQ 399 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 114 bits (285), Expect = 2e-28 Identities = 26/106 (24%), Positives = 46/106 (42%), Gaps = 14/106 (13%) Query: 168 ETRERSPEHTEASTQEAETAE------EALN--------KEEFWNLYRSLESKSYISREN 213 E R S E T + +E+E + +ALN +E FW R+L ++ S Sbjct: 233 EERNASGEQTTTAREESEALDTTSNGLDALNTQINAIETEESFWEAIRALHNELRTSPTQ 292 Query: 214 VPDIRLLVMCLKANLEIQGEFDRRRVGLKNRLYIILKDLVEEPWEL 259 + + R + L + +I F +++ + YI LK+ + W L Sbjct: 293 LEECRKAAVFLLGHKKILQTFTKQKDTARALFYINLKECLGTSWNL 338 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 21.5 bits (44), Expect = 1.4 Identities = 27/128 (21%), Positives = 47/128 (36%), Gaps = 15/128 (11%) Query: 104 ATRRASQESPRQTETAGSSSA--PDALSTQQPPLE-----HSTQQNIPTKLPSTQQAVER 156 A R +Q S RQ E A S + +ALS Q L+ Q + L ++ A++ Sbjct: 27 ALRAENQYSRRQEEAAVSQAEALQEALSKAQAALQEKEQSLLEQAELSHTLEASTAALQA 86 Query: 157 V-----VSSQTTSRAAETRERSPEHTEAS-TQEAETAEEALNKEEFWNLYRSLESKSYIS 210 ++ A RE + + + ++ L ++E R L K+ Sbjct: 87 TLDTCQARARQLEEALRIREGEIQAQALQHHEVTQHLQQELCQKE--EELRQLLEKAGAR 144 Query: 211 RENVPDIR 218 R I+ Sbjct: 145 RSQENGIQ 152 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 21.5 bits (44), Expect = 1.4 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 3/48 (6%) Query: 57 SGSGEEYEGAPHVHFQESTATEEPRTPSKTP---ASQTTQSIGRTVSI 101 S ++ A +VH + + E TPSK+ S Q GR V++ Sbjct: 96 SSKNQDSMAAQNVHQKSQMSVETCTTPSKSNLILESLGLQFFGREVTL 143 >trembl|Q9Y265|Q9Y265 RUVB-LIKE 1. Length = 456 Score = 20.3 bits (41), Expect = 3.0 Identities = 7/30 (23%), Positives = 10/30 (33%) Query: 167 AETRERSPEHTEASTQEAETAEEALNKEEF 196 + A T+ +EEALN Sbjct: 367 TPQEMKQIIKIRAQTEGINISEEALNHLGE 396 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 20.0 bits (40), Expect = 4.2 Identities = 7/15 (46%), Positives = 9/15 (59%) Query: 184 AETAEEALNKEEFWN 198 ++ E LN EE WN Sbjct: 375 SKMVEHILNTEENWN 389 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 19.2 bits (38), Expect = 7.1 Identities = 9/41 (21%), Positives = 16/41 (38%) Query: 155 ERVVSSQTTSRAAETRERSPEHTEASTQEAETAEEALNKEE 195 ER + + RA E R+ ++ + L KE+ Sbjct: 174 ERDIRNNQLKRAGEERQALVNQLNSAKSTVTNLGQELQKEK 214 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.305 0.140 0.356 Lambda K H 0.267 0.0428 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 32,568 Number of Sequences: 43 Number of extensions: 1341 Number of successful extensions: 93 Number of sequences better than 10.0: 43 Number of HSP's better than 10.0 without gapping: 43 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 64 length of query: 322 length of database: 17,163 effective HSP length: 55 effective length of query: 267 effective length of database: 14,798 effective search space: 3951066 effective search space used: 3951066 T: 11 A: 40 X1: 16 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 37 (19.1 bits) S2: 37 (18.8 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). (224 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). 403 e-116 swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). 23 0.29 swiss|O85062|REP2_BUCAP Probable replication-associated protein... 20 2.5 swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein B... 19 4.5 >swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). Length = 224 Score = 403 bits (1036), Expect = e-116 Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MGALYNEDYDATDMTETRLLAIPLQAYVYRTFSNVMRSASEHLDDKYAQEAHEESETIRA 60 MGALYNEDYDATDMTETRLLAIPLQAYVYRTFSNVMRSASEHLDDKYAQEAHEESETIRA Sbjct: 1 MGALYNEDYDATDMTETRLLAIPLQAYVYRTFSNVMRSASEHLDDKYAQEAHEESETIRA 60 Query: 61 HWEALETRYEVPDWSKEDFDVSEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHY 120 HWEALETRYEVPDWSKEDFDVSEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHY Sbjct: 61 HWEALETRYEVPDWSKEDFDVSEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHY 120 Query: 121 RSMQERQWELETRSYLNDLINLVEDPADQKRIDDVLMPWFANKPFMVELTTEELTQLRKK 180 RSMQERQWELETRSYLNDLINLVEDPADQKRIDDVLMPWFANKPFMVELTTEELTQLRKK Sbjct: 121 RSMQERQWELETRSYLNDLINLVEDPADQKRIDDVLMPWFANKPFMVELTTEELTQLRKK 180 Query: 181 IEDAIKNTNVQKSARQALKKYPLLSTFSSNGSRRGRSNKHRVVH 224 IEDAIKNTNVQKSARQALKKYPLLSTFSSNGSRRGRSNKHRVVH Sbjct: 181 IEDAIKNTNVQKSARQALKKYPLLSTFSSNGSRRGRSNKHRVVH 224 >swiss|P13782|REP2_ZYGRO Trans-acting factor C (REP2). Length = 233 Score = 23.1 bits (48), Expect = 0.29 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 1/66 (1%) Query: 82 SEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHYRSMQE-RQWELETRSYLNDLI 140 + G+ +VD L E+ R + + + ++L H +S++ W ET S+ N Sbjct: 14 NHGLLDVDYVANLLENLLRTWKHGKPTIKVREAIQLAHAKSIKVISLWPQETCSFRNFDG 73 Query: 141 NLVEDP 146 N +DP Sbjct: 74 NPEDDP 79 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 20.0 bits (40), Expect = 2.5 Identities = 9/27 (33%), Positives = 19/27 (70%) Query: 187 NTNVQKSARQALKKYPLLSTFSSNGSR 213 N+N+ +++ + L LSTFS++G++ Sbjct: 87 NSNLVEASIEKLSDECGLSTFSNSGNK 113 >swiss|P03871|REP1_YEAST Trans-acting factor B (REP1) (Protein Baker). Length = 373 Score = 19.2 bits (38), Expect = 4.5 Identities = 10/30 (33%), Positives = 14/30 (46%), Gaps = 1/30 (3%) Query: 52 HEESETIRAHWEALETRYEVPDWSKEDFDV 81 H E + I A W L ++E W D +V Sbjct: 332 HYEEQHITAVWVYLTVKFE-EHWKPVDVEV 360 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). 444 e-128 Sequences not found previously or not previously below threshold: swiss|O85062|REP2_BUCAP Probable replication-associated protein... 21 1.2 swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). 19 4.3 CONVERGED! >swiss|P13741|REP1_ZYGFE Trans-acting factor B (REP1). Length = 224 Score = 444 bits (1142), Expect = e-128 Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MGALYNEDYDATDMTETRLLAIPLQAYVYRTFSNVMRSASEHLDDKYAQEAHEESETIRA 60 MGALYNEDYDATDMTETRLLAIPLQAYVYRTFSNVMRSASEHLDDKYAQEAHEESETIRA Sbjct: 1 MGALYNEDYDATDMTETRLLAIPLQAYVYRTFSNVMRSASEHLDDKYAQEAHEESETIRA 60 Query: 61 HWEALETRYEVPDWSKEDFDVSEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHY 120 HWEALETRYEVPDWSKEDFDVSEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHY Sbjct: 61 HWEALETRYEVPDWSKEDFDVSEGITNVDTAVRLAESAERFVRQTNLELSANQLLRLLHY 120 Query: 121 RSMQERQWELETRSYLNDLINLVEDPADQKRIDDVLMPWFANKPFMVELTTEELTQLRKK 180 RSMQERQWELETRSYLNDLINLVEDPADQKRIDDVLMPWFANKPFMVELTTEELTQLRKK Sbjct: 121 RSMQERQWELETRSYLNDLINLVEDPADQKRIDDVLMPWFANKPFMVELTTEELTQLRKK 180 Query: 181 IEDAIKNTNVQKSARQALKKYPLLSTFSSNGSRRGRSNKHRVVH 224 IEDAIKNTNVQKSARQALKKYPLLSTFSSNGSRRGRSNKHRVVH Sbjct: 181 IEDAIKNTNVQKSARQALKKYPLLSTFSSNGSRRGRSNKHRVVH 224 >swiss|O85062|REP2_BUCAP Probable replication-associated protein repA2. Length = 250 Score = 21.2 bits (43), Expect = 1.2 Identities = 9/27 (33%), Positives = 19/27 (70%) Query: 187 NTNVQKSARQALKKYPLLSTFSSNGSR 213 N+N+ +++ + L LSTFS++G++ Sbjct: 87 NSNLVEASIEKLSDECGLSTFSNSGNK 113 >swiss|P13775|REP1_KLULA Trans-acting factor B (REP1). Length = 415 Score = 19.3 bits (38), Expect = 4.3 Identities = 9/24 (37%), Positives = 12/24 (49%) Query: 37 RSASEHLDDKYAQEAHEESETIRA 60 R E+L A + HEE + RA Sbjct: 301 RRVIEYLGMYIASDLHEEGKLTRA 324 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.315 0.129 0.362 Lambda K H 0.267 0.0394 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 14,104 Number of Sequences: 43 Number of extensions: 535 Number of successful extensions: 8 Number of sequences better than 10.0: 8 Number of HSP's better than 10.0 without gapping: 8 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 8 length of query: 224 length of database: 17,163 effective HSP length: 52 effective length of query: 172 effective length of database: 14,927 effective search space: 2567444 effective search space used: 2567444 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 35 (18.8 bits) S2: 35 (18.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc (764 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 1363 0.0 swiss|P25233|NECD_MOUSE Necdin. 24 0.53 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 21 5.2 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 1363 bits (3529), Expect = 0.0 Identities = 764/764 (100%), Positives = 764/764 (100%) Query: 1 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGKMQGVPLKHS 60 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGKMQGVPLKHS Sbjct: 1 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGKMQGVPLKHS 60 Query: 61 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY 120 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY Sbjct: 61 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY 120 Query: 121 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVANMLQDVYHVVTLKIQLHSCPKLEDLP 180 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVANMLQDVYHVVTLKIQLHSCPKLEDLP Sbjct: 121 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVANMLQDVYHVVTLKIQLHSCPKLEDLP 180 Query: 181 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY 240 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY Sbjct: 181 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY 240 Query: 241 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL 300 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL Sbjct: 241 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL 300 Query: 301 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP 360 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP Sbjct: 301 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP 360 Query: 361 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ 420 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ Sbjct: 361 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ 420 Query: 421 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLLSTPPSRPPQVKTATLA 480 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLLSTPPSRPPQVKTATLA Sbjct: 421 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLLSTPPSRPPQVKTATLA 480 Query: 481 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSP 540 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSP Sbjct: 481 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSP 540 Query: 541 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQ 600 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQ Sbjct: 541 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQ 600 Query: 601 QQQQQQQPPPPPPQPQPQPQAGPRLPPRQPTVASSAESDEENRQKTRPRTKISVEALGIL 660 QQQQQQQPPPPPPQPQPQPQAGPRLPPRQPTVASSAESDEENRQKTRPRTKISVEALGIL Sbjct: 601 QQQQQQQPPPPPPQPQPQPQAGPRLPPRQPTVASSAESDEENRQKTRPRTKISVEALGIL 660 Query: 661 QSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEY 720 QSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEY Sbjct: 661 QSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEY 720 Query: 721 KDEELLKDLEESVQDKNANTLFSVKLEEELSVEGSTDVNADLKD 764 KDEELLKDLEESVQDKNANTLFSVKLEEELSVEGSTDVNADLKD Sbjct: 721 KDEELLKDLEESVQDKNANTLFSVKLEEELSVEGSTDVNADLKD 764 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 24.3 bits (51), Expect = 0.53 Identities = 12/33 (36%), Positives = 12/33 (36%) Query: 611 PPPQPQPQPQAGPRLPPRQPTVASSAESDEENR 643 PPP AGP P P A A E R Sbjct: 33 PPPASLAANLAGPPCAPEGPMAAQQASPPPEER 65 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 20.8 bits (42), Expect = 5.2 Identities = 8/28 (28%), Positives = 17/28 (60%) Query: 538 PSPENRTLWENLSMIRRFLSLPQPERDA 565 P+P + + +S++ + L+L P+R A Sbjct: 168 PAPTEEDVMKLVSVVTQLLTLVPPDRQA 195 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 1513 0.0 Sequences not found previously or not previously below threshold: swiss|P25233|NECD_MOUSE Necdin. 26 0.12 swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein c... 23 1.6 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 21 3.9 swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). 20 7.2 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 20 8.8 CONVERGED! >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 1513 bits (3918), Expect = 0.0 Identities = 764/764 (100%), Positives = 764/764 (100%) Query: 1 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGKMQGVPLKHS 60 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGKMQGVPLKHS Sbjct: 1 MDHLNEATQGKEHSEMSNNVSDPKGPPAKIARLEQNGSPLGRGRLGSTGGKMQGVPLKHS 60 Query: 61 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY 120 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY Sbjct: 61 GHLMKTNLRKGTMLPVFCVVEHYENAIEYDCKEEHAEFVLVRKDMLFNQLIEMALLSLGY 120 Query: 121 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVANMLQDVYHVVTLKIQLHSCPKLEDLP 180 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVANMLQDVYHVVTLKIQLHSCPKLEDLP Sbjct: 121 SHSSAAQAKGLIQVGKWNPVPLSYVTDAPDATVANMLQDVYHVVTLKIQLHSCPKLEDLP 180 Query: 181 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY 240 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY Sbjct: 181 PEQWSHTTVRNALKDLLKDMNQSSLAKECPLSQSMISSIVNSTYYANVSAAKCQEFGRWY 240 Query: 241 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL 300 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL Sbjct: 241 KHFKKTKDMMVEMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQLSHGSQPSVRTPL 300 Query: 301 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP 360 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP Sbjct: 301 PNLHPGLVSTPISPQLVNQQLVMAQLLNQQYAVNRLLAQQSLNQQYLNHPPPVSRSMNKP 360 Query: 361 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ 420 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ Sbjct: 361 LEQQVSTNTEVSSEIYQWVRDELKRAGISQAVFARVAFNRTQGLLSEILRKEEDPKTASQ 420 Query: 421 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLLSTPPSRPPQVKTATLA 480 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLLSTPPSRPPQVKTATLA Sbjct: 421 SLLVNLRAMQNFLQLPEAERDRIYQDERERSLNAASAMGPAPLLSTPPSRPPQVKTATLA 480 Query: 481 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSP 540 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSP Sbjct: 481 TERNGKPENNTMNINASIYDEIQQEMKRAKVSQAPFAKVAATKSQGWLCELLRWKEDPSP 540 Query: 541 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQ 600 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQ Sbjct: 541 ENRTLWENLSMIRRFLSLPQPERDAIYEQESNAVHHHGDRPPHIIHVPAEQIQQQQQQQQ 600 Query: 601 QQQQQQQPPPPPPQPQPQPQAGPRLPPRQPTVASSAESDEENRQKTRPRTKISVEALGIL 660 QQQQQQQPPPPPPQPQPQPQAGPRLPPRQPTVASSAESDEENRQKTRPRTKISVEALGIL Sbjct: 601 QQQQQQQPPPPPPQPQPQPQAGPRLPPRQPTVASSAESDEENRQKTRPRTKISVEALGIL 660 Query: 661 QSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEY 720 QSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEY Sbjct: 661 QSFIQDVGLYPDEEAIQTLSAQLDLPKYTIIKFFQNQRYYLKHHGKLKDNSGLEVDVAEY 720 Query: 721 KDEELLKDLEESVQDKNANTLFSVKLEEELSVEGSTDVNADLKD 764 KDEELLKDLEESVQDKNANTLFSVKLEEELSVEGSTDVNADLKD Sbjct: 721 KDEELLKDLEESVQDKNANTLFSVKLEEELSVEGSTDVNADLKD 764 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 26.2 bits (56), Expect = 0.12 Identities = 12/33 (36%), Positives = 12/33 (36%) Query: 611 PPPQPQPQPQAGPRLPPRQPTVASSAESDEENR 643 PPP AGP P P A A E R Sbjct: 33 PPPASLAANLAGPPCAPEGPMAAQQASPPPEER 65 >swiss|P03872|REP2_YEAST Trans-acting factor C (REP2) (Protein charlie). Length = 296 Score = 22.8 bits (47), Expect = 1.6 Identities = 8/28 (28%), Positives = 17/28 (60%) Query: 538 PSPENRTLWENLSMIRRFLSLPQPERDA 565 P+P + + +S++ + L+L P+R A Sbjct: 168 PAPTEEDVMKLVSVVTQLLTLVPPDRQA 195 >swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. Length = 722 Score = 21.2 bits (43), Expect = 3.9 Identities = 12/25 (48%), Positives = 18/25 (72%), Gaps = 2/25 (8%) Query: 197 LKDMNQS--SLAKECPLSQSMISSI 219 L +MN++ +LAKE L+ S ISS+ Sbjct: 589 LDEMNRNAFALAKELELANSHISSL 613 >swiss|P13776|REP1_ZYGBA Trans-acting factor B (REP1). Length = 357 Score = 20.5 bits (41), Expect = 7.2 Identities = 11/31 (35%), Positives = 18/31 (57%) Query: 64 MKTNLRKGTMLPVFCVVEHYENAIEYDCKEE 94 M TN+RK + + C + +NAI+ + EE Sbjct: 202 MATNVRKVWIGEIVCGMVSNKNAIDDNDLEE 232 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 20.1 bits (40), Expect = 8.8 Identities = 12/31 (38%), Positives = 16/31 (50%) Query: 719 EYKDEELLKDLEESVQDKNANTLFSVKLEEE 749 EY E ++LE S +D+ KLEEE Sbjct: 656 EYCIENKREELENSSRDREKAFEQEKKLEEE 686 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.312 0.129 0.367 Lambda K H 0.267 0.0393 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 63,225 Number of Sequences: 43 Number of extensions: 3557 Number of successful extensions: 273 Number of sequences better than 10.0: 22 Number of HSP's better than 10.0 without gapping: 22 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 91 length of query: 764 length of database: 17,163 effective HSP length: 62 effective length of query: 702 effective length of database: 14,497 effective search space: 10176894 effective search space used: 10176894 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.0 bits) S2: 40 (20.1 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= swiss|P25233|NECD_MOUSE Necdin. (325 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value swiss|P25233|NECD_MOUSE Necdin. 585 e-170 swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 22 0.83 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 22 1.2 trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 21 2.0 >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 585 bits (1509), Expect = e-170 Identities = 325/325 (100%), Positives = 325/325 (100%) Query: 1 MSEQSKDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASP 60 MSEQSKDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASP Sbjct: 1 MSEQSKDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASP 60 Query: 61 PPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQLVQKAHELMWYVLVKDQKR 120 PPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQLVQKAHELMWYVLVKDQKR Sbjct: 61 PPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQLVQKAHELMWYVLVKDQKR 120 Query: 121 MVLWFPDMVKEVMGSYKKWCRSILRRTSVILARVFGLHLRLTNLHTMEFALVKALSPEEL 180 MVLWFPDMVKEVMGSYKKWCRSILRRTSVILARVFGLHLRLTNLHTMEFALVKALSPEEL Sbjct: 121 MVLWFPDMVKEVMGSYKKWCRSILRRTSVILARVFGLHLRLTNLHTMEFALVKALSPEEL 180 Query: 181 DRVALNNRMPMTGLLLMILSLIYVKGRGAREGAVWNVLRILGLRPWKKHSTFGDVRKIIT 240 DRVALNNRMPMTGLLLMILSLIYVKGRGAREGAVWNVLRILGLRPWKKHSTFGDVRKIIT Sbjct: 181 DRVALNNRMPMTGLLLMILSLIYVKGRGAREGAVWNVLRILGLRPWKKHSTFGDVRKIIT 240 Query: 241 EEFVQQNYLKYQRVPHIEPPEYEFFWGSRANREITKMQIMEFLARVFKKDPQAWPSRYRE 300 EEFVQQNYLKYQRVPHIEPPEYEFFWGSRANREITKMQIMEFLARVFKKDPQAWPSRYRE Sbjct: 241 EEFVQQNYLKYQRVPHIEPPEYEFFWGSRANREITKMQIMEFLARVFKKDPQAWPSRYRE 300 Query: 301 ALEQARALREANLAAQAPRSSVSED 325 ALEQARALREANLAAQAPRSSVSED Sbjct: 301 ALEQARALREANLAAQAPRSSVSED 325 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 22.3 bits (46), Expect = 0.83 Identities = 12/33 (36%), Positives = 12/33 (36%) Query: 33 PPPASLAANLAGPPCAPEGPMAAQQASPPPEER 65 PPP AGP P P A A E R Sbjct: 611 PPPQPQPQPQAGPRLPPRQPTVASSAESDEENR 643 Score = 21.9 bits (45), Expect = 0.99 Identities = 23/97 (23%), Positives = 34/97 (34%), Gaps = 6/97 (6%) Query: 6 KDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASPPPEER 65 KD++ + A E P + S V ++ AN++ C G Sbjct: 198 KDMNQSSLAKECPLSQSMISSIV------NSTYYANVSAAKCQEFGRWYKHFKKTKDMMV 251 Query: 66 IEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQL 102 D ++ QQ A Q P PP+PAQL Sbjct: 252 EMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQL 288 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 21.6 bits (44), Expect = 1.2 Identities = 9/25 (36%), Positives = 13/25 (52%) Query: 64 ERIEDVDPKILQQAAEEGRAHQPQS 88 ER DVD ++Q+ + R P S Sbjct: 153 ERHSDVDDNVVQENLNDSRGEIPDS 177 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 20.8 bits (42), Expect = 2.0 Identities = 14/41 (34%), Positives = 16/41 (38%), Gaps = 13/41 (31%) Query: 59 SPPPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAP 99 SP E ++VD QPQS RP PP P Sbjct: 252 SPSSESEADNVDA-------------QPQSTVRPEEIPPIP 279 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: swiss|P25233|NECD_MOUSE Necdin. 635 0.0 Sequences not found previously or not previously below threshold: swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-r... 24 0.26 trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.... 23 0.51 swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centrome... 23 0.65 CONVERGED! >swiss|P25233|NECD_MOUSE Necdin. Length = 325 Score = 635 bits (1638), Expect = 0.0 Identities = 325/325 (100%), Positives = 325/325 (100%) Query: 1 MSEQSKDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASP 60 MSEQSKDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASP Sbjct: 1 MSEQSKDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASP 60 Query: 61 PPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQLVQKAHELMWYVLVKDQKR 120 PPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQLVQKAHELMWYVLVKDQKR Sbjct: 61 PPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQLVQKAHELMWYVLVKDQKR 120 Query: 121 MVLWFPDMVKEVMGSYKKWCRSILRRTSVILARVFGLHLRLTNLHTMEFALVKALSPEEL 180 MVLWFPDMVKEVMGSYKKWCRSILRRTSVILARVFGLHLRLTNLHTMEFALVKALSPEEL Sbjct: 121 MVLWFPDMVKEVMGSYKKWCRSILRRTSVILARVFGLHLRLTNLHTMEFALVKALSPEEL 180 Query: 181 DRVALNNRMPMTGLLLMILSLIYVKGRGAREGAVWNVLRILGLRPWKKHSTFGDVRKIIT 240 DRVALNNRMPMTGLLLMILSLIYVKGRGAREGAVWNVLRILGLRPWKKHSTFGDVRKIIT Sbjct: 181 DRVALNNRMPMTGLLLMILSLIYVKGRGAREGAVWNVLRILGLRPWKKHSTFGDVRKIIT 240 Query: 241 EEFVQQNYLKYQRVPHIEPPEYEFFWGSRANREITKMQIMEFLARVFKKDPQAWPSRYRE 300 EEFVQQNYLKYQRVPHIEPPEYEFFWGSRANREITKMQIMEFLARVFKKDPQAWPSRYRE Sbjct: 241 EEFVQQNYLKYQRVPHIEPPEYEFFWGSRANREITKMQIMEFLARVFKKDPQAWPSRYRE 300 Query: 301 ALEQARALREANLAAQAPRSSVSED 325 ALEQARALREANLAAQAPRSSVSED Sbjct: 301 ALEQARALREANLAAQAPRSSVSED 325 >swiss|Q60611|SAT1_MOUSE DNA-binding protein SATB1 (Special AT-rich sequenc Length = 764 Score = 23.9 bits (50), Expect = 0.26 Identities = 23/97 (23%), Positives = 34/97 (34%), Gaps = 6/97 (6%) Query: 6 KDLSDPNFAAEVPDCEMQDSDAVPVGIPPPASLAANLAGPPCAPEGPMAAQQASPPPEER 65 KD++ + A E P + S V ++ AN++ C G Sbjct: 198 KDMNQSSLAKECPLSQSMISSIV------NSTYYANVSAAKCQEFGRWYKHFKKTKDMMV 251 Query: 66 IEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAPAQL 102 D ++ QQ A Q P PP+PAQL Sbjct: 252 EMDSLSELSQQGANHVNFGQQPVPGNTAEQPPSPAQL 288 Score = 23.9 bits (50), Expect = 0.29 Identities = 12/33 (36%), Positives = 12/33 (36%) Query: 33 PPPASLAANLAGPPCAPEGPMAAQQASPPPEER 65 PPP AGP P P A A E R Sbjct: 611 PPPQPQPQPQAGPRLPPRQPTVASSAESDEENR 643 >trembl|O55035|O55035 PEPTIDYLPROLYL ISOMERASE MATRIN CYP (EC 5.2.1.8) (PEPTIDYLPR Length = 752 Score = 23.1 bits (48), Expect = 0.51 Identities = 14/41 (34%), Positives = 16/41 (38%), Gaps = 13/41 (31%) Query: 59 SPPPEERIEDVDPKILQQAAEEGRAHQPQSPARPIPAPPAP 99 SP E ++VD QPQS RP PP P Sbjct: 252 SPSSESEADNVDA-------------QPQSTVRPEEIPPIP 279 >swiss|P49453|CENC_SHEEP Centromere protein C (CENP-C) (Centromere autoanti Length = 402 Score = 22.7 bits (47), Expect = 0.65 Identities = 9/25 (36%), Positives = 13/25 (52%) Query: 64 ERIEDVDPKILQQAAEEGRAHQPQS 88 ER DVD ++Q+ + R P S Sbjct: 153 ERHSDVDDNVVQENLNDSRGEIPDS 177 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.319 0.134 0.406 Lambda K H 0.267 0.0411 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 25,460 Number of Sequences: 43 Number of extensions: 1299 Number of successful extensions: 98 Number of sequences better than 10.0: 18 Number of HSP's better than 10.0 without gapping: 18 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 40 length of query: 325 length of database: 17,163 effective HSP length: 55 effective length of query: 270 effective length of database: 14,798 effective search space: 3995460 effective search space used: 3995460 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 37 (19.9 bits) S2: 37 (18.9 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. (976 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 1727 0.0 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 20 9.0 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 1727 bits (4473), Expect = 0.0 Identities = 976/976 (100%), Positives = 976/976 (100%) Query: 1 MVGMDAISALLQTGSWRTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPT 60 MVGMDAISALLQTGSWRTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPT Sbjct: 1 MVGMDAISALLQTGSWRTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPT 60 Query: 61 VEDEADSLAKEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKE 120 VEDEADSLAKEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKE Sbjct: 61 VEDEADSLAKEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKE 120 Query: 121 SEPAPSIVETTPPVTSTDTAQIKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQ 180 SEPAPSIVETTPPVTSTDTAQIKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQ Sbjct: 121 SEPAPSIVETTPPVTSTDTAQIKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQ 180 Query: 181 DLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEED 240 DLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEED Sbjct: 181 DLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEED 240 Query: 241 TKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPR 300 TKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPR Sbjct: 241 TKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPR 300 Query: 301 HRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRS 360 HRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRS Sbjct: 301 HRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRS 360 Query: 361 LELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLTRKSSPPRTSRKESPPPYPSVTE 420 LELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLTRKSSPPRTSRKESPPPYPSVTE Sbjct: 361 LELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLTRKSSPPRTSRKESPPPYPSVTE 420 Query: 421 RKRMPSQSATNRRRRNSMAAPRDRREYPIGDDPRYQQLDHHEQPQSFRPAVEEDASLMAL 480 RKRMPSQSATNRRRRNSMAAPRDRREYPIGDDPRYQQLDHHEQPQSFRPAVEEDASLMAL Sbjct: 421 RKRMPSQSATNRRRRNSMAAPRDRREYPIGDDPRYQQLDHHEQPQSFRPAVEEDASLMAL 480 Query: 481 KHRLGAVDIGLESQDVVPHPAALALRQQDQFSEASDPGSSRSATFSAEGDHRRGRAGTYS 540 KHRLGAVDIGLESQDVVPHPAALALRQQDQFSEASDPGSSRSATFSAEGDHRRGRAGTYS Sbjct: 481 KHRLGAVDIGLESQDVVPHPAALALRQQDQFSEASDPGSSRSATFSAEGDHRRGRAGTYS 540 Query: 541 LTSSPPNGRLGPGREDESSPRTRRRARSRTMGTNPYSNSSPILSVPGHHSSNGSNALAKV 600 LTSSPPNGRLGPGREDESSPRTRRRARSRTMGTNPYSNSSPILSVPGHHSSNGSNALAKV Sbjct: 541 LTSSPPNGRLGPGREDESSPRTRRRARSRTMGTNPYSNSSPILSVPGHHSSNGSNALAKV 600 Query: 601 SSPLASPSLTRQELVSTRPSASFWDQNPFEPAGRRAPGARPILSFRNYSEEVLQGLTPPL 660 SSPLASPSLTRQELVSTRPSASFWDQNPFEPAGRRAPGARPILSFRNYSEEVLQGLTPPL Sbjct: 601 SSPLASPSLTRQELVSTRPSASFWDQNPFEPAGRRAPGARPILSFRNYSEEVLQGLTPPL 660 Query: 661 PDCPWQIPGLPYSRNPGDRFWILRGAENFLVCPHCHSELFANSQFRLEFVATSLRPDQPV 720 PDCPWQIPGLPYSRNPGDRFWILRGAENFLVCPHCHSELFANSQFRLEFVATSLRPDQPV Sbjct: 661 PDCPWQIPGLPYSRNPGDRFWILRGAENFLVCPHCHSELFANSQFRLEFVATSLRPDQPV 720 Query: 721 LCHLGSSFWYRMAYLMTLKQRLPDLQLIRNVATVSGRHPPCSGPQETARKWYGVWDSYKG 780 LCHLGSSFWYRMAYLMTLKQRLPDLQLIRNVATVSGRHPPCSGPQETARKWYGVWDSYKG Sbjct: 721 LCHLGSSFWYRMAYLMTLKQRLPDLQLIRNVATVSGRHPPCSGPQETARKWYGVWDSYKG 780 Query: 781 DFVPLFEVCVGCVKMLEAVLPNLANTLYPISPEPFIAYCQLHYAPGRKRFIEYFDLFEMT 840 DFVPLFEVCVGCVKMLEAVLPNLANTLYPISPEPFIAYCQLHYAPGRKRFIEYFDLFEMT Sbjct: 781 DFVPLFEVCVGCVKMLEAVLPNLANTLYPISPEPFIAYCQLHYAPGRKRFIEYFDLFEMT 840 Query: 841 SDYALSRRAAPDIGDLANRLRRIARVKECPPCPKDQALVGAYWHTMNFLPEFTVCEECYR 900 SDYALSRRAAPDIGDLANRLRRIARVKECPPCPKDQALVGAYWHTMNFLPEFTVCEECYR Sbjct: 841 SDYALSRRAAPDIGDLANRLRRIARVKECPPCPKDQALVGAYWHTMNFLPEFTVCEECYR 900 Query: 901 EVIKPMLDANDRRSDIPWNFLPERLLSKSIASCQLSSERMREEFREACRLNDFGYLDGKY 960 EVIKPMLDANDRRSDIPWNFLPERLLSKSIASCQLSSERMREEFREACRLNDFGYLDGKY Sbjct: 901 EVIKPMLDANDRRSDIPWNFLPERLLSKSIASCQLSSERMREEFREACRLNDFGYLDGKY 960 Query: 961 IERLGSLLTPVSRGGW 976 IERLGSLLTPVSRGGW Sbjct: 961 IERLGSLLTPVSRGGW 976 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 20.4 bits (41), Expect = 9.0 Identities = 21/68 (30%), Positives = 31/68 (44%), Gaps = 4/68 (5%) Query: 239 EDTKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPI----KRS 294 E K ASLK+ + + + S + + S RR + R + P+ KRS Sbjct: 137 EQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRSYPVRRERKRS 196 Query: 295 NSRKPRHR 302 +SR PRHR Sbjct: 197 HSRSPRHR 204 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 1898 0.0 Sequences not found previously or not previously below threshold: trembl|Q60952|Q60952 Intranuclear protein. 23 1.6 trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 23 1.9 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 8.0 CONVERGED! >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 1898 bits (4916), Expect = 0.0 Identities = 976/976 (100%), Positives = 976/976 (100%) Query: 1 MVGMDAISALLQTGSWRTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPT 60 MVGMDAISALLQTGSWRTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPT Sbjct: 1 MVGMDAISALLQTGSWRTSRSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPT 60 Query: 61 VEDEADSLAKEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKE 120 VEDEADSLAKEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKE Sbjct: 61 VEDEADSLAKEYGSPSVISSYEDEPPNRGEIDQLILMEPVLENNPERRFVVVTGPGAGKE 120 Query: 121 SEPAPSIVETTPPVTSTDTAQIKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQ 180 SEPAPSIVETTPPVTSTDTAQIKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQ Sbjct: 121 SEPAPSIVETTPPVTSTDTAQIKETNDCKKFVLVPSDEIKDGDKQEEKKEPELAKRKSHQ 180 Query: 181 DLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEED 240 DLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEED Sbjct: 181 DLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDGKAARIKRSGSRRRPEACVDEED 240 Query: 241 TKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPR 300 TKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPR Sbjct: 241 TKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPIKRSNSRKPR 300 Query: 301 HRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRS 360 HRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRS Sbjct: 301 HRPLHDQDFEITADSPDKVNPGASEFPASIVIQTAGGRERRPRHERMVSTPDVPRIDSRS 360 Query: 361 LELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLTRKSSPPRTSRKESPPPYPSVTE 420 LELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLTRKSSPPRTSRKESPPPYPSVTE Sbjct: 361 LELEGPGIAPLSMYRYNDADDAMSFMVQEVVIPDHDLTRKSSPPRTSRKESPPPYPSVTE 420 Query: 421 RKRMPSQSATNRRRRNSMAAPRDRREYPIGDDPRYQQLDHHEQPQSFRPAVEEDASLMAL 480 RKRMPSQSATNRRRRNSMAAPRDRREYPIGDDPRYQQLDHHEQPQSFRPAVEEDASLMAL Sbjct: 421 RKRMPSQSATNRRRRNSMAAPRDRREYPIGDDPRYQQLDHHEQPQSFRPAVEEDASLMAL 480 Query: 481 KHRLGAVDIGLESQDVVPHPAALALRQQDQFSEASDPGSSRSATFSAEGDHRRGRAGTYS 540 KHRLGAVDIGLESQDVVPHPAALALRQQDQFSEASDPGSSRSATFSAEGDHRRGRAGTYS Sbjct: 481 KHRLGAVDIGLESQDVVPHPAALALRQQDQFSEASDPGSSRSATFSAEGDHRRGRAGTYS 540 Query: 541 LTSSPPNGRLGPGREDESSPRTRRRARSRTMGTNPYSNSSPILSVPGHHSSNGSNALAKV 600 LTSSPPNGRLGPGREDESSPRTRRRARSRTMGTNPYSNSSPILSVPGHHSSNGSNALAKV Sbjct: 541 LTSSPPNGRLGPGREDESSPRTRRRARSRTMGTNPYSNSSPILSVPGHHSSNGSNALAKV 600 Query: 601 SSPLASPSLTRQELVSTRPSASFWDQNPFEPAGRRAPGARPILSFRNYSEEVLQGLTPPL 660 SSPLASPSLTRQELVSTRPSASFWDQNPFEPAGRRAPGARPILSFRNYSEEVLQGLTPPL Sbjct: 601 SSPLASPSLTRQELVSTRPSASFWDQNPFEPAGRRAPGARPILSFRNYSEEVLQGLTPPL 660 Query: 661 PDCPWQIPGLPYSRNPGDRFWILRGAENFLVCPHCHSELFANSQFRLEFVATSLRPDQPV 720 PDCPWQIPGLPYSRNPGDRFWILRGAENFLVCPHCHSELFANSQFRLEFVATSLRPDQPV Sbjct: 661 PDCPWQIPGLPYSRNPGDRFWILRGAENFLVCPHCHSELFANSQFRLEFVATSLRPDQPV 720 Query: 721 LCHLGSSFWYRMAYLMTLKQRLPDLQLIRNVATVSGRHPPCSGPQETARKWYGVWDSYKG 780 LCHLGSSFWYRMAYLMTLKQRLPDLQLIRNVATVSGRHPPCSGPQETARKWYGVWDSYKG Sbjct: 721 LCHLGSSFWYRMAYLMTLKQRLPDLQLIRNVATVSGRHPPCSGPQETARKWYGVWDSYKG 780 Query: 781 DFVPLFEVCVGCVKMLEAVLPNLANTLYPISPEPFIAYCQLHYAPGRKRFIEYFDLFEMT 840 DFVPLFEVCVGCVKMLEAVLPNLANTLYPISPEPFIAYCQLHYAPGRKRFIEYFDLFEMT Sbjct: 781 DFVPLFEVCVGCVKMLEAVLPNLANTLYPISPEPFIAYCQLHYAPGRKRFIEYFDLFEMT 840 Query: 841 SDYALSRRAAPDIGDLANRLRRIARVKECPPCPKDQALVGAYWHTMNFLPEFTVCEECYR 900 SDYALSRRAAPDIGDLANRLRRIARVKECPPCPKDQALVGAYWHTMNFLPEFTVCEECYR Sbjct: 841 SDYALSRRAAPDIGDLANRLRRIARVKECPPCPKDQALVGAYWHTMNFLPEFTVCEECYR 900 Query: 901 EVIKPMLDANDRRSDIPWNFLPERLLSKSIASCQLSSERMREEFREACRLNDFGYLDGKY 960 EVIKPMLDANDRRSDIPWNFLPERLLSKSIASCQLSSERMREEFREACRLNDFGYLDGKY Sbjct: 901 EVIKPMLDANDRRSDIPWNFLPERLLSKSIASCQLSSERMREEFREACRLNDFGYLDGKY 960 Query: 961 IERLGSLLTPVSRGGW 976 IERLGSLLTPVSRGGW Sbjct: 961 IERLGSLLTPVSRGGW 976 >trembl|Q60952|Q60952 Intranuclear protein. Length = 446 Score = 23.1 bits (48), Expect = 1.6 Identities = 24/86 (27%), Positives = 34/86 (38%), Gaps = 13/86 (15%) Query: 62 EDEADSLAKEYGSPSVI---------SSYEDEPPNRGEIDQLILMEPVLENN----PERR 108 +D A SLA+ S S + S E E + ++L+L LE P Sbjct: 238 QDLALSLAQSKASISSLQEITMFLQASVLERESEQQRLQEELVLSSQALEEQQSGGPHST 297 Query: 109 FVVVTGPGAGKESEPAPSIVETTPPV 134 GP G+ES+P E +P V Sbjct: 298 SRADQGPKVGQESQPGEVETEPSPGV 323 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 22.7 bits (47), Expect = 1.9 Identities = 21/68 (30%), Positives = 31/68 (44%), Gaps = 4/68 (5%) Query: 239 EDTKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPI----KRS 294 E K ASLK+ + + + S + + S RR + R + P+ KRS Sbjct: 137 EQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRSYPVRRERKRS 196 Query: 295 NSRKPRHR 302 +SR PRHR Sbjct: 197 HSRSPRHR 204 >trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP1)-like. Length = 1042 Score = 20.8 bits (42), Expect = 8.0 Identities = 10/53 (18%), Positives = 16/53 (29%) Query: 166 EEKKEPELAKRKSHQDLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSPVHQDDG 218 +E EP K++ H + +P P D+ S G Sbjct: 914 DETSEPSNNKKRKHDVTQKYSDEADTQSVISSPQNVPEDKHELPSSQTQTPSG 966 Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique .fasta Posted date: Mar 7, 2003 12:04 PM Number of letters in database: 17,163 Number of sequences in database: 43 Lambda K H 0.315 0.133 0.399 Lambda K H 0.267 0.0407 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 80,487 Number of Sequences: 43 Number of extensions: 4314 Number of successful extensions: 87 Number of sequences better than 10.0: 18 Number of HSP's better than 10.0 without gapping: 18 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 43 length of query: 976 length of database: 17,163 effective HSP length: 64 effective length of query: 912 effective length of database: 14,411 effective search space: 13142832 effective search space used: 13142832 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.5 bits) S2: 41 (20.4 bits) BLASTP 2.2.2 [Dec-14-2001] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). (265 letters) Database: /home/mika/Theses/Databases/unique_set/NMP_Database_unique.f asta 43 sequences; 17,163 total letters Searching.done Results from round 1 Score E Sequences producing significant alignments: (bits) Value trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 323 9e-92 trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 29 0.007 swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 27 0.020 trembl|Q15219|Q15219 Protein p84. 22 0.90 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 323 bits (829), Expect = 9e-92 Identities = 247/247 (100%), Positives = 247/247 (100%) Query: 1 MDAGFFRGTSAEQDNRFSNKQKKLLKQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEIL 60 MDAGFFRGTSAEQDNRFSNKQKKLLKQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEIL Sbjct: 1 MDAGFFRGTSAEQDNRFSNKQKKLLKQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEIL 60 Query: 61 GFEDDVVIEFIFNQLEVKNPDSKMMQINLTGFLNGKNAREFMGELWPLLLSAQENIAGIP 120 GFEDDVVIEFIFNQLEVKNPDSKMMQINLTGFLNGKNAREFMGELWPLLLSAQENIAGIP Sbjct: 61 GFEDDVVIEFIFNQLEVKNPDSKMMQINLTGFLNGKNAREFMGELWPLLLSAQENIAGIP 120 Query: 121 SAFLELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSP 180 SAFLELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSP Sbjct: 121 SAFLELKKEEIKQRQIEQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSP 180 Query: 181 SPRRRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATS 240 SPRRRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATS Sbjct: 181 SPRRRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATS 240 Query: 241 TSDILKA 247 TSDILKA Sbjct: 241 TSDILKA 247 >trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix protein. Length = 976 Score = 28.9 bits (63), Expect = 0.007 Identities = 22/69 (31%), Positives = 32/69 (45%), Gaps = 5/69 (7%) Query: 168 RSRSPRRRKSRSPSPRRRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPS 227 +S PR + SP P YP ERKR S+S +R + R+ AP + + P +P Sbjct: 400 KSSPPRTSRKESPPP----YPSVTERKRMPSQSATNR-RRRNSMAAPRDRREYPIGDDPR 454 Query: 228 VRMKDSSVQ 236 + D Q Sbjct: 455 YQQLDHHEQ 463 Score = 23.1 bits (48), Expect = 0.33 Identities = 24/99 (24%), Positives = 40/99 (40%), Gaps = 2/99 (2%) Query: 144 LKKQDEDKD-KRDKEEKESSREKRERSRSPRRRKSRSPSPRRRSYPVRRERKR-SHSRSP 201 L DE KD + +E+KE KR+ + R + P + P R S +SP Sbjct: 153 LVPSDEIKDGDKQEEKKEPELAKRKSHQDLPRLDTHVQEPDSKDAPDEPPSDRQSPEKSP 212 Query: 202 RHRTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATS 240 H+ ++ + PE K +S++ A+S Sbjct: 213 VHQDDGKAARIKRSGSRRRPEACVDEEDTKKASLKRASS 251 Score = 23.1 bits (48), Expect = 0.38 Identities = 13/46 (28%), Positives = 19/46 (41%) Query: 184 RRSYPVRRERKRSHSRSPRHRTKSRSPSPAPEKKEKSPELPEPSVR 229 R+S P R RK S P + R PS + + + + P R Sbjct: 399 RKSSPPRTSRKESPPPYPSVTERKRMPSQSATNRRRRNSMAAPRDR 444 Score = 21.9 bits (45), Expect = 0.72 Identities = 15/66 (22%), Positives = 32/66 (47%), Gaps = 8/66 (12%) Query: 195 RSHSRSPRHRTKSRSPSPA--------PEKKEKSPELPEPSVRMKDSSVQEATSTSDILK 246 RS S++P RT R+ + P ++ + +LP P+V + S+ + + ++ Sbjct: 20 RSKSQNPYSRTPRRTDPLSELDIHIDKPLERRRIQQLPPPTVEDEADSLAKEYGSPSVIS 79 Query: 247 APKPEP 252 + + EP Sbjct: 80 SYEDEP 85 Score = 21.2 bits (43), Expect = 1.4 Identities = 13/46 (28%), Positives = 20/46 (43%) Query: 159 KESSREKRERSRSPRRRKSRSPSPRRRSYPVRRERKRSHSRSPRHR 204 ++SS + R SP S + R S R+R+ +PR R Sbjct: 399 RKSSPPRTSRKESPPPYPSVTERKRMPSQSATNRRRRNSMAAPRDR 444 Score = 20.8 bits (42), Expect = 1.9 Identities = 20/72 (27%), Positives = 31/72 (42%), Gaps = 17/72 (23%) Query: 148 DEDKDKRDKEEKESSREK-------RERSRSP------RRRKSRSPSPR-RRSYPV---R 190 D D ++ + S +E ER R P RRR++ +PR RR YP+ Sbjct: 394 DHDLTRKSSPPRTSRKESPPPYPSVTERKRMPSQSATNRRRRNSMAAPRDRREYPIGDDP 453 Query: 191 RERKRSHSRSPR 202 R ++ H P+ Sbjct: 454 RYQQLDHHEQPQ 465 Score = 20.4 bits (41), Expect = 2.1 Identities = 21/68 (30%), Positives = 31/68 (44%), Gaps = 4/68 (5%) Query: 137 EQEKLASLKKQDEDKDKRDKEEKESSREKRERSRSPRRRKSRSPSPRRRSYPVRRERKRS 196 E K ASLK+ + + + S + + S RR + R + P+ KRS Sbjct: 239 EDTKKASLKRASSRRRPEIPPDVQDSADTSIKRSSSRRYRERPADQDAKESPI----KRS 294 Query: 197 HSRSPRHR 204 +SR PRHR Sbjct: 295 NSRKPRHR 302 Score = 20.0 bits (40), Expect = 2.8 Identities = 13/40 (32%), Positives = 18/40 (44%), Gaps = 1/40 (2%) Query: 201 PRHRTKSRSPSPAPEKKEKSPELPEPSVRMKDSSVQEATS 240 P H +S P +KE P P + R + S Q AT+ Sbjct: 393 PDHDLTRKSSPPRTSRKESPPPYPSVTERKRMPS-QSATN 431 >swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). Length = 212 Score = 27.3 bits (59), Expect = 0.020 Identities = 11/38 (28%), Positives = 19/38 (49%) Query: 214 PEKKEKSPELPEPSVRMKDSSVQEATSTSDILKAPKPE 251 PE++E SP P P ++ + + E + + P PE Sbjct: 71 PEERENSPSPPIPPYQISEMPLHELLESGNAKLVPNPE 108 >trembl|Q15219|Q15219 Protein p84. Length = 657 Score = 21.6 bits (44), Expect = 0.90 Identities = 13/46 (28%), Positives = 22/46 (47%), Gaps = 11/46 (23%) Query: 102 MGELWPLLLSAQENIAGIPSAFLELKKEEIKQRQIEQEKLASLKKQ 147 +GE W +L + +LE+K EI+Q + + E + KQ Sbjct: 581 LGEQWKIL-----------APYLEMKDSEIRQIECDSEDMKMRAKQ 615 Searching.done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fr... 363 e-103 Sequences not found previously or not previously below threshold: trembl|Q9C2H8|Q9C2H8 Related to ser/arg-related nuclear matrix ... 33 4e-04 swiss|P13779|REP2_KLULA Trans-acting factor C (REP2). 29 0.004 trembl|Q15219|Q15219 Protein p84. 25 0.093 swiss|Q9M7J4|MFP1_TOBAC MAR binding filament-like protein 1-1. 21 1.2 trembl|Q9FLH0|Q9FLH0 Nuclear matrix constituent protein 1 (NMCP... 21 1.7 >trembl|Q9CVG5|Q9CVG5 Ser/Arg-related nuclear matrix protein (Fragment). Length = 265 Score = 363 bits (931), Expect = e-103 Identities = 247/247 (100%), Positives = 247/247 (100%) Query: 1 MDAGFFRGTSAEQDNRFSNKQKKLLKQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEIL 60 MDAGFFRGTSAEQDNRFSNKQKKLLKQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEIL Sbjct: 1 MDAGFFRGTSAEQDNRFSNKQKKLLKQLKFAECLEKKVDMSKVNLEVIKPWITKRVTEIL 60 Query: 61 GFEDDVVIEFIFNQLEVKNPDSKMMQINLTGFLNGKNAREFMGELWPLLLSAQENIAGIP 120 GFEDDVVIEFIFNQLEVK