...........ATGGCTGCAGCTTCATATGATCAGTTGTTAAAGCAAGTTGAGGCACTGA
50 AGATGGAGAACTCAAATCTTCGACAAGAGCTAGAAGATAATTCCAATCATCTTACAAAAC
110 TGGAAACTGAGGCATCTAATATGAAGGAAGTACTTAAGCAGCTACAGGGAAGTATTGAAG
170 ATGAGACTATGACTTCTGGACAGATTGACTTACTAGAGCGTCTTAAAGAATTTAACTTAG
230 ATAGTAATTTCCCCGGAGTGAAACTACGCTCAAAAATGTCCCTTCGCTCCTACGGAAGTC
290 GGGAAGGATCTGTATCCAGCCGTTCAGGAGAATGCAGTCCTGTCCCCATGGGGTCATTCC
350 CAAGAAGAACATTTGTAAATGGAAGCAGAGAGAGTACTGGGTATCTAGAAGAGCTTGAAA
410 AAGAAAGATCATTACTCCTTGCTGATCTTGACAAAGAAGAGAAGGAAAAGGACTGGTATT
470 ATGCTCAACTTCAGAACCTCACAAAAAGAATAGATAGCCTGCCTTTAACTGAAAATTTTT
530 CCTTACAGACAGACATGACAAGACGGCAGCTGGAGTATGAAGCAAGGCAGATCAGGGCTG
590 CAATGGAGGAGCAGCTTGGCACCTGCCAGGACATGGAGAAGCGTGCACAGCGAAGAATAG
650 CCAGGATCCAGCAAATAGAAAAGGACATACTGCGCGTGCGCCAGCTTTTACAGTCCCAGG
710 CGGCGGAAGCGGAGAGGTCATCTCAGAGCAGGCATGATGCTGCCTCCCATGAAGCTGGCC
770 GGCAGCACGAAGGCCACGGAGTGGCAGAAAGCAACACCGCAGCCTCCAGTAGTGGTCAGA
830 GTCCAGCTACACGTGTGGATCACGAAACAGCCAGTGTTTTGAGTTCTAGCGGCACGCACT
890 CTGCTCCTCGAAGGTTGACAAGTCATCTGGGGACAAAGGTGGAAATGGTGTATTCCTTGT
950 TGTCAATGCTTGGTACTCATGATAAGGACGATATGTCACGAACTTTGCTAGCTATGTCCA
1010 GCTCCCAAGACAGCTGTATATCCATGCGGCAGTCTGGATGTCTTCCTCTCCTCATCCAGC
1070 TTTTACATGGCAATGACAAAGACTCTGTATTGTTGGGAAATTCCCGGGGCAGTAAAGAGG
1130 CTCGGGCCAGGGCCAGTGCAGCACTCCACAACATCATTCACTCACAGCCTGATGACAAGA
1190 GAGGCAGGCGTGAAATCCGAGTCCTTCATCTTTTGGAACAGATACGAGCTTACTGTGAAA
1250 CCTGTTGGGAGTGGCAGGAAGCCCACGAACAAGGCATGGACCAGGACAAAAACCCAATGC
1310 CAGCTCCTGTTGAGCATCAGATCTGTCCTGCTGTGTGTGTTCTAATGAAGCTTTCATTTG
1370 ATGAAGAGCATAGGCATGCAATGAATGAACTTGGGGGACTGCAGGCCATTGCAGAGTTAT
1430 TGCAGGTGGACTGTGAGATGTATGGGCTTACTAATGACCACTACAGTGTTACTTTAAGAC
1490 GGTATGCTGGAATGGCTTTGACAAACTTGACCTTTGGAGATGTTGCCAACAAGGCTACGC
1550 TGTGTTCTATGAAAGGCTGCATGAGAGCACTTGTGGCCCAGTTAAAATCTGAGAGTGAAG
1610 ACTTACAGCAGGTTATTGCAAGTGTTTTGAGGAATTTGTCTTGGCGAGCAGATGTAAATA
1670 GCAAAAAGACGTTGAGAGAAGTTGGAAGTGTGAAAGCATTGATGGAATGTGCTTTGGAAG
1730 TTAAAAAGGAATCAACCCTCAAAAGCGTTTTGAGTGCCTTATGGAACCTGTCTGCACACT
1790 GCACTGAGAATAAGGCTGACATCTGTGCTGTGGATGGAGCACTGGCATTTCTGGTTGGCA
1850 CCCTCACTTACCGGAGCCAGACAAATACTTTAGCCATTATTGAAAGTGGAGGTGGGATAT
1910 TACGGAATGTGTCCAGCTTGATAGCTACAAACGAAGACCACAGGCAAATCCTAAGAGAGA
1970 ACAATTGCCTACAAACTTTATTACAGCACTTGAAATCTCACAGCTTGACAATAGTCAGTA
2030 ATGCATGTGGAACTTTGTGGAATCTCTCAGCAAGAAATCCTAAAGACCAGGAAGCCTTGT
2090 GGGACATGGGGGCAGTGAGCATGCTCAAGAACCTCATTCATTCCAAGCACAAAATGATTG
2150 CCATGGGAAGTGCAGCAGCTTTAAGGAATCTCATGGCAAACAGACCTGCAAAGTATAAGG
2210 ATGCCAATATCATGTCTCCCGGCTCAAGTCTGCCATCCCTTCACGTTAGGAAACAGAAAG
2270 CTCTAGAAGCTGAGCTAGATGCTCAGCATTTATCAGAAACCTTCGACAACATTGACAACC
2330 TAAGTCCCAAGGCCTCTCACCGGAGTAAGCAGAGACACAAGCAGAATCTTTATGGTGACT
2390 ATGCTTTTGACGCCAATCGACATGATGATAGTAGGTCAGACAATTTCAATACTGGAAACA
2450 TGACTGTTCTTTCACCATATTTAAATACTACGGTATTGCCCAGCTCTTCTTCCTCAAGGG
2510 GAAGTTTAGACAGTTCTCGTTCTGAGAAAGACAGAAGTTTGGAGAGAGAGCGAGGTATTG
2570 GCCTCAGTGCTTACCATCCAACAACAGAAAATGCAGGAACCTCATCAAAACGAGGTCTGC
2630 AGATCACTACCACTGCAGCCCAGATAGCCAAAGTTATGGAAGAAGTATCAGCCATTCATA
2690 CCTCCCAGGACGACAGAAGTTCTGCTTCTACCACCGAGTTCCATTGTGTGGCAGACGACA
2750 GGAGTGCGGCACGAAGAAGCTCTGCCTCCCACACACACTCAAACACATACAACTTCACTA
2810 AGTCGGAAAATTCAAATAGGACATGCTCTATGCCTTATGCCAAAGTGGAATATAAACGAT
2870 CTTCAAATGACAGTTTAAATAGTGTCACTAGTAGTGATGGATATGGTAAAAGAGGCCAAA
2930 TGAAACCCTCAGTTGAATCCTATTCTGAAGATGATGAAAGTAAATTTTGCAGTTATGGTC
2990 AGTATCCAGCTGACCTAGCCCATAAGATACACAGTGCAAATCATATGGATGATAATGATG
3050 GAGAACTGGATACACCAATAAATTACAGTCTTAAATATTCAGATGAGCAGTTGAACTCAG
3110 GAAGGCAGAGTCCCTCACAGAATGAAAGGTGGGCAAGACCAAAGCATGTGATAGAAGATG
3170 AAATAAAGCAAAACGAGCAAAGACAAGCAAGAAGCCAGAACACCAGTTATCCTGTCTATT
3230 CTGAGAATACCGATGACAAACACCTCAAATTCCAACCACATTTTGGACAACAAGAATGTG
3290 TTTCCCCATATAGGTCAAGGGGAACCAGTGGTTCAGAAACAAATCGAATGGGTTCTAGTC
3350 ATGCAATTAATCAAAATGTAAACCAGTCTCTGTGTCAGGAAGATGATTATGAAGATGATA
3410 AACCTACCAACTACAGTGAACGTTATTCTGAGGAAGAACAACATGAAGAAGAAGAAGAGA
3470 GACCGACAAATTATAGCATAAAATATAATGAAGAGAAACATCATGTGGATCAGCCTATTG
3530 ATTATAGTTTAAAATATGCCACTGACATTTCTTCCTCACAAAAACCATCATTTTCATTCT
3590 CAAAGAATTCATCAGCACAAAGCACTAAACCTGAACATCTCTCTCCAAGCAGCGAGAATA
3650 CAGCTGTACCTCCATCTAATGCCAAAAGGCAGAATCAGCTGCGTCCAAGTTCAGCACAAA
3710 GAAATGGCCAGACTCAAAAAGGCACTACTTGCAAAGTCCCCTCCATCAACCAAGAAACAA
3770 TACAGACTTACTGCGTAGAAGACACCCCAATATGTTTTTCAAGGTGCAGTTCATTATCAT
3830 CACTGTCATCAGCTGACGATGAAATAGGATGTGATCAGACAACACAGGAAGCAGATTCTG
3890 CTAATACTCTGCAGACAGCAGAAGTAAAAGAGAATGATGTAACTCGGTCAGCTGAAGATC
3950 CTGCAACTGAAGTTCCAGCAGTGTCCCAGAATGCTAGAGCCAAACCCAGCCGACTCCAGG
4010 CTTCTGGCTTATCTTCAGAATCAACCAGGCATAATAAAGCTGTTGAGTTTTCTTCAGGAG
4070 CCAAGTCTCCCTCCAAAAGTGGTGCTCAGACACCCAAAAGTCCCCCAGAACACTATGTCC
4130 AGGAGACTCCGCTCGTATTCAGCAGGTGTACTTCTGTCAGCTCCCTTGACAGTTTTGAGA
4190 GTCGCTCCATTGCCAGCTCTGTTCAGAGTGAGCCATGTAGTGGAATGGTGAGTGGCATCA
4250 TAAGCCCCAGTGACCTTCCAGATAGTCCTGGGCAGACCATGCCACCAAGCAGAAGCAAAA
4310 CCCCTCCACCTCCTCCACAGACAGTGCAGGCCAAGAGAGAGGTGCCAAAAAGTAAAGTCC
4370 CTGCTGCTGAGAAGAGAGAGAGTGGGCCTAAGCAGACTGCTGTAAATGCTGCCGTGCAGA
4430 GGGTGCAGGTCCTTCCAGACGTGGATACTTTGTTACACTTCGCCACAGAAAGTACTCCAG
4490 ACGGGTTTTCTTGTTCCTCCAGCCTAAGTGCTCTGAGCCTGGATGAGCCATTTATACAGA
4550 AAGATGTAGAATTAAGAATCATGCCTCCAGTTCAGGAAAACGACAATGGGAATGAAACTG
4610 AATCAGAACAGCCTGAGGAATCAAATGAAAACCAGGATAAAGAGGTAGAAAAGCCTGACT
4670 CTGAAAAAGACTTATTAGATGATTCTGATGACGATGATATTGAAATATTAGAAGAATGTA
4730 TTATTTCAGCCATGCCAACAAAGTCATCACGCAAAGCCAAAAAACTAGCCCAGACTGCTT
4790 CAAAATTACCTCCACCTGTGGCAAGGAAACCAAGTCAGCTACCTGTGTATAAACTTCTGC
4850 CAGCACAGAATAGGCTGCAGGCACAAAAACATGTTAGCTTTACACCAGGGGATGATGTGC
4910 CCCGGGTGTACTGTGTAGAAGGGACACCTATAAACTTTTCCACAGCAACGTCTCTAAGTG
4970 ATCTGACAATAGAGTCCCCTCCAAATGAATTGGCTACTGGAGATGGGGTCAGAGCGGGTA
5030 TACAGTCAGGTGAATTTGAAAAACGAGATACCATTCCTACAGAAGGCAGAAGTACAGATG
5090 ATGCTCAGCGAGGAAAAATCTCATCTATAGTTACACCAGACCTGGATGACAACAAAGCAG
5150 AGGAAGGAGATATTCTTGCAGAATGTATCAATTCTGCTATGCCCAAAGGAAAAAGCCACA
5210 AGCCTTTCCGAGTGAAAAAGATAATGGACCAAGTCCAACAAGCATCCTCGACTTCATCTG
5270 GAGCTAACAAAAATCAAGTAGACACTAAGAAAAAGAAGCCTACTTCACCAGTAAAGCCCA
5330 TGCCACAAAATACTGAATATAGAACGCGTGTGAGAAAGAATACAGACTCAAAAGTTAATG
5390 TAAATACTGAAGAAACTTTCTCAGACAACAAAGACTCAAAGAAACCAAGCTTACAAACCA
5450 ATGCCAAGGCCTTCAATGAAAAGCTACCTAACAATGAAGACAGAGTGCGGGGGAGCTTCG
5510 CCTTGGACTCACCGCATCACTACACCCCTATTGAGGGGACGCCGTACTGCTTTTCCCGAA
5570 ATGACTCCTTGAGTTCTCTGGATTTTGATGATGACGATGTTGACCTTTCCAGGGAAAAGG
5630 CCGAGTTAAGAAAGGGCAAAGAAAGCAAGGATTCCGAAGCCAAAGTTACCTGCCGCCCAG
5690 AACCAAACTCAAGCCAGCAGGCAGCTAGTAAGTCACAAGCCAGTATAAAACATCCAGCAA
5750 ACAGAGCACAGTCCAAACCAGTGCTGCAGAAACAGCCCACTTTCCCCCAGTCCTCCAAAG
5810 ACGGACCAGATAGAGGGGCAGCAACTGACGAAAAACTGCAGAATTTTGCTATTGAAAATA
5870 CTCCAGTTTGCTTTTCTCGAAATTCCTCTCTGAGTTCCCTTAGTGACATTGACCAGGAAA
5930 ACAACAATAACAAAGAAAGTGAACCAATCAAAGAAGCTGAACCTGCCAACTCACAAGGAG
5990 AGCCCAGTAAGCCTCAGGCATCCGGGTATGCTCCCAAGTCCTTCCACGTCGAAGACACCC
6050 CTGTCTGTTTCTCAAGAAACAGCTCTCTCAGTTCTCTTAGCATTGACTCTGAGGACGACC
6110 TGTTACAGGAGTGTATAAGTTCTGCCATGCCAAAAAAGAAAAGGCCTTCAAGACTCAAGA
6170 GTGAGAGCGAAAAGCAGAGCCCTAGAAAAGTGGGTGGCATATTAGCTGAAGACCTGACGC
6230 TTGATTTGAAAGATCTACAGAGGCCAGATTCAGAACACGCTTTCTCCCCCGACTCAGAAA
6290 ATTTTGACTGGAAAGCTATTCAGGAAGGCGCAAACTCCATAGTAAGTAGTTTGCACCAAG
6350 CTGCTGCAGCCGCCGCGTGCTTATCTAGACAAGCGTCATCCGACTCAGATTCCATTCTGT
6410 CACTAAAGTCCGGCATTTCTCTGGGATCGCCTTTTCATCTTACACCTGATCAAGAGGAAA
6470 AGCCATTCACAAGCAATAAAGGCCCAAGAATTCTCAAACCTGGAGAGAAAAGCACATTAG
6530 AAGCAAAAAAAATAGAATCTGAAAACAAAGGAATCAAAGGCGGGAAAAAGGTTTATAAAA
6590 GCTTGATTACGGGAAAGATTCGCTCCAATTCAGAAATTTCCAGCCAAATGAAACAACCCC
6650 TCCCGACAAACATGCCTTCAATCTCAAGAGGCAGGACGATGATTCACATCCCAGGGCTTC
6710 GGAATAGCTCCTCTAGTACAAGCCCTGTCTCTAAGAAAGGCCCACCCCTCAAGACTCCAG
6770 CCTCTAAAAGCCCCAGTGAAGGGCCGGGAGCTACCACTTCTCCTCGAGGAACTAAGCCAG
6830 CAGGAAAGTCAGAGCTTAGCCCTATCACCAGGCAAACTTCCCAAATCAGTGGGTCAAATA
6890 AGGGGTCTTCTAGATCAGGATCTAGAGACTCCACTCCCTCAAGACCTACACAGCAACCAT
6950 TAAGTAGGCCAATGCAGTCTCCAGGGCGAAACTCAATTTCCCCTGGTAGAAATGGAATAA
7010 GCCCTCCTAACAAACTGTCTCAGCTGCCCAGAACATCATCTCCCAGTACTGCTTCAACTA
7070 AGTCCTCCGGTTCTGGGAAAATGTCATATACATCCCCAGGTAGACAGCTGAGCCAACAAA
7130 ATCTTACCAAACAAGCAAGTTTATCCAAGAATGCCAGCAGTATCCCCAGAAGTGAGTCGG
7190 CATCTAAAGGACTGAATCAGATGAGTAACGGCAATGGGTCAAATAAAAAGGTAGAACTTT
7250 CTAGAATGTCTTCAACTAAATCAAGTGGAAGTGAATCAGACAGATCAGAAAGGCCTGCAT
7310 TAGTACGCCAGTCTACTTTCATCAAAGAAGCCCCAAGCCCAACCCTGAGGAGGAAACTGG
7370 AGGAATCTGCCTCATTTGAATCCCTTTCTCCATCTTCTAGACCAGATTCTCCCACCAGGT
7430 CGCAGGCACAGACCCCAGTTTTAAGCCCTTCCCTTCCTGATATGTCTCTGTCCACACATC
7490 CATCTGTTCAGGCAGGTGGGTGGCGAAAGCTCCCGCCTAATCTCAGCCCCACTATCGAGT
7550 ATAATGACGGAAGGCCCACAAAACGGCATGATATTGCACGCTCCCATTCTGAAAGTCCTT
7610 CCAGACTACCAATCAACCGGGCGGGAACCTGGAAGCGTGAACACAGCAAACATTCCTCGT
7670 CCCTTCCTCGAGTGAGTACTTGGAGAAGAACTGGAAGCTCATCTTCTATTCTTTCTGCTT
7730 CATCAGAGTCCAGTGAAAAAGCAAAAAGTGAGGATGAAAGGCATGTGAGCTCCATGCCAG
7790 CACCCAGACAGATGAAGGAAAACCAGGTGCCCACCAAAGGAACATGGAGGAAAATCAAGG
7850 AAAGTGACATTTCTCCCACAGGCATGGCTTCTCAGAGCGCTTCCTCAGGTGCTGCCAGTG
7910 GTGCTGAATCCAAGCCTCTGATCTATCAGATGGCACCTCCTGTCTCTAAAACAGAGGATG
7970 TTTGGGTGAGAATTGAGGACTGCCCCATTAACAACCCTAGATCTGGACGGTCCCCCACAG
8030 GCAACACCCCCCCAGTGATTGACAGTGTTTCAGAGAAGGGAAGTTCAAGCATTAAAGATT
8090 CAAAAGACACCCATGGGAAACAGAGTGTGGGCAGTGGCAGTCCTGTGCAAACCGTGGGTC
8150 TGGAAACCCGCCTCAACTCCTTTGTTCAGGTAGAGGCCCCAGAACAGAAAGGAACTGAGG
8210 CAAAACCAGGACAGAGTAACCCAGTCTCTATAGCAGAGACTGCTGAGACGTGTATAGCAG
8270 AGCGTACCCCTTTCAGTTCCAGTAGCTCCAGCAAGCACAGCTCACCTAGCGGGACTGTTG
8330 CTGCCAGAGTGACACCTTTTAATTACAACCCTAGCCCTAGGAAGAGCAGCGCAGACAGCA
8390 CTTCAGCCCGGCCGTCTCAGATCCCTACGCCAGTGAGCACCAACACGAAGAAGAGAGATT
8450 CGAAGACTGACAGCACAGAATCCAGTGGAGCCCAAAGTCCTAAACGCCATTCCGGGTCTT
8510 ACCTCGTGACGTCTGTTTAA...........................
MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRT FVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSLQT DMTRRQLEYEARQIRAAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEA ERSSQSRHDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPSSSSSRGSLD SSRSEKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD DRSSASTTEFHCVADDRSAARRSSASHTHSNTYNFTKSENSNRTCSMPYAKVEYKRSSND SLNSVTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN YSERYSEEEQHEEEEERPTNYSIKYNEEKHHVDQPIDYSLKYATDISSSQKPSFSFSKNS SAQSTKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY CVEDTPICFSRCSSLSSLSSADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE VPAVSQNARAKPSRLQASGLSSESTRHNKAVEFSSGAKSPSKSGAQTPKSPPEHYVQETP LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPPP PPQTVQAKREVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGFS CSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETESEQPEESNENQDKEVEKPDSEKD LLDDSDDDDIEILEECIISAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR VKKIMDQVQQASSTSSGANKNQVDTKKKKPTSPVKPMPQNTEYRTRVRKNTDSKVNVNTE ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRNDSL SSLDFDDDDVDLSREKAELRKGKESKDSEAKVTCRPEPNSSQQAASKSQASIKHPANRAQ SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNSSLSSLSDIDQENNNN KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRNSSLSSLSIDSEDDLLQE CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW KAIQEGANSIVSSLHQAAAAAACLSRQASSDSDSILSLKSGISLGSPFHLTPDQEEKPFT SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN MPSISRGRTMIHIPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS ELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRNGISPPN KLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSESASKG LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLEESA SFESLSPSSRPDSPTRSQAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSES SEKAKSEDERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGMASQSASSGAASGAES KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP FSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD STESSGAQSPKRHSGSYLVTSV
tr|B2RUG9|B2RUG9_MOUSE Adenomatosis polyposis coli OS=Mus musculus GN=Apc PE=2
SV=1
MGI:1923206 Srrm2 serine/arginine repetitive matrix 2 (Chr 17)
Length = 2842
Score = 12127 (4274.0 bits), Expect = 0., P = 0.
Identities = 2393/2842 (84%), Positives = 2393/2842 (84%)
Query: 1 MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM 60
MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM
Sbjct: 1 MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM 60
Query: 61 TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLXXXXXXXXXXXXXXXXXXPVPMGSFPRRT 120
TSGQIDLLERLKEFNLDSNFPGVKLRSKMSL PVPMGSFPRRT
Sbjct: 61 TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRT 120
Query: 121 FVNGSRESTGYXXXXXXXXXXXXXXXXXXXXXXXWYYAQLQNLTKRIDSLPLTENFSLQT 180
FVNGSRESTGY WYYAQLQNLTKRIDSLPLTENFSLQT
Sbjct: 121 FVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSLQT 180
Query: 181 DMTRRQLEYEARQIRAAMEEQLGTCQDMEKXXXXXXXXXXXXEKDILRVRQLLXXXXXXX 240
DMTRRQLEYEARQIRAAMEEQLGTCQDMEK EKDILRVRQLL
Sbjct: 181 DMTRRQLEYEARQIRAAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEA 240
Query: 241 XXXXXXXHDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR 300
HDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR
Sbjct: 241 ERSSQSRHDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR 300
Query: 301 RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG 360
RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG
Sbjct: 301 RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG 360
Query: 361 NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE 420
NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE
Sbjct: 361 NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE 420
Query: 421 WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD 480
WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD
Sbjct: 421 WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD 480
Query: 481 CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ 540
CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ
Sbjct: 481 CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ 540
Query: 541 VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN 600
VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN
Sbjct: 541 VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN 600
Query: 601 KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL 660
KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL
Sbjct: 601 KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL 660
Query: 661 QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS 720
QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS
Sbjct: 661 QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS 720
Query: 721 AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK 780
AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK
Sbjct: 721 AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK 780
Query: 781 ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPXXXXXXXXXX 840
ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLP
Sbjct: 781 ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPSSSSSRGSLD 840
Query: 841 XXXXEKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD 900
EKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD
Sbjct: 841 SSRSEKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD 900
Query: 901 DRSSASTTEFHCVXXXXXXXXXXXXXHTHSNTYNFTKSENSNRTCSMPYAKVEYKRXXXX 960
DRSSASTTEFHCV HTHSNTYNFTKSENSNRTCSMPYAKVEYKR
Sbjct: 901 DRSSASTTEFHCVADDRSAARRSSASHTHSNTYNFTKSENSNRTCSMPYAKVEYKRSSND 960
Query: 961 XXXXXXXXXGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD 1020
GYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD
Sbjct: 961 SLNSVTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD 1020
Query: 1021 TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT 1080
TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT
Sbjct: 1021 TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT 1080
Query: 1081 DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN 1140
DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN
Sbjct: 1081 DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN 1140
Query: 1141 XXXXXXXXXXXXXXXXXPTNYSIKYNEEKHHVDQPIDYSLKYATDIXXXXXXXXXXXXXX 1200
PTNYSIKYNEEKHHVDQPIDYSLKYATDI
Sbjct: 1141 YSERYSEEEQHEEEEERPTNYSIKYNEEKHHVDQPIDYSLKYATDISSSQKPSFSFSKNS 1200
Query: 1201 XXXXTKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY 1260
TKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY
Sbjct: 1201 SAQSTKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY 1260
Query: 1261 CVEDTPICFSRCXXXXXXXXADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE 1320
CVEDTPICFSRC ADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE
Sbjct: 1261 CVEDTPICFSRCSSLSSLSSADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE 1320
Query: 1321 VPAVSQNARAKPSRLQASGLSSESTRHNKAVEFXXXXXXXXXXXXQTPKSPPEHYVQETP 1380
VPAVSQNARAKPSRLQASGLSSESTRHNKAVEF QTPKSPPEHYVQETP
Sbjct: 1321 VPAVSQNARAKPSRLQASGLSSESTRHNKAVEFSSGAKSPSKSGAQTPKSPPEHYVQETP 1380
Query: 1381 LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMXXXXXXXXXX 1440
LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTM
Sbjct: 1381 LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPPP 1440
Query: 1441 XXXXXXXXXEVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGFX 1500
EVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGF
Sbjct: 1441 PPQTVQAKREVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGFS 1500
Query: 1501 XXXXXXXXXXDEPFIQKDVELRIMPPVQENDXXXXXXXXXXXXXXXXXDKEVEKPXXXXX 1560
DEPFIQKDVELRIMPPVQEND DKEVEKP
Sbjct: 1501 CSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETESEQPEESNENQDKEVEKPDSEKD 1560
Query: 1561 XXXXXXXXXXXXXXXXXXXAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN 1620
AMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN
Sbjct: 1561 LLDDSDDDDIEILEECIISAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN 1620
Query: 1621 RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG 1680
RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG
Sbjct: 1621 RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG 1680
Query: 1681 EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR 1740
EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR
Sbjct: 1681 EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR 1740
Query: 1741 VKKIMDQVQQASSTSSGANKNQVDTXXXXXXXXXXXXXQNTEYRTRVRKNTDSKVNVNTE 1800
VKKIMDQVQQASSTSSGANKNQVDT QNTEYRTRVRKNTDSKVNVNTE
Sbjct: 1741 VKKIMDQVQQASSTSSGANKNQVDTKKKKPTSPVKPMPQNTEYRTRVRKNTDSKVNVNTE 1800
Query: 1801 ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRNXXX 1860
ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRN
Sbjct: 1801 ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRNDSL 1860
Query: 1861 XXXXXXXXXXXXXREKAELRKGKESKDSEAKVTCRPEPNXXXXXXXXXXXXIKHPANRAQ 1920
REKAELRKGKESKDSEAKVTCRPEPN IKHPANRAQ
Sbjct: 1861 SSLDFDDDDVDLSREKAELRKGKESKDSEAKVTCRPEPNSSQQAASKSQASIKHPANRAQ 1920
Query: 1921 SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNXXXXXXXDIDQENNNN 1980
SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRN DIDQENNNN
Sbjct: 1921 SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNSSLSSLSDIDQENNNN 1980
Query: 1981 KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRNXXXXXXXXXXXXXXXQE 2040
KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRN QE
Sbjct: 1981 KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRNSSLSSLSIDSEDDLLQE 2040
Query: 2041 CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW 2100
CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW
Sbjct: 2041 CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW 2100
Query: 2101 KAIQEGANSIVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPFHLTPDQEEKPFT 2160
KAIQEGANSIV PFHLTPDQEEKPFT
Sbjct: 2101 KAIQEGANSIVSSLHQAAAAAACLSRQASSDSDSILSLKSGISLGSPFHLTPDQEEKPFT 2160
Query: 2161 SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN 2220
SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN
Sbjct: 2161 SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN 2220
Query: 2221 MPSISRGRTMIHIPGLRNXXXXXXXXXKKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS 2280
MPSISRGRTMIHIPGLRN KKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS
Sbjct: 2221 MPSISRGRTMIHIPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS 2280
Query: 2281 ELSPITRQTXXXXXXXXXXXXXXXXXXXXXXPTQQPLSRPMQSPGRNSISPGRNGISPPN 2340
ELSPITRQT PTQQPLSRPMQSPGRNSISPGRNGISPPN
Sbjct: 2281 ELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRNGISPPN 2340
Query: 2341 KLSQLPRXXXXXXXXXXXXXXXXXXXXXXGRQLSQQNLTKQASLSKNASSIPRSESASKG 2400
KLSQLPR GRQLSQQNLTKQASLSKNASSIPRSESASKG
Sbjct: 2341 KLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSESASKG 2400
Query: 2401 LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLXXXX 2460
LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKL
Sbjct: 2401 LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLEESA 2460
Query: 2461 XXXXXXXXXXXXXXXXXQAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG 2520
QAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG
Sbjct: 2461 SFESLSPSSRPDSPTRSQAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG 2520
Query: 2521 RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGXXXXXXXXXXXX 2580
RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTG
Sbjct: 2521 RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSES 2580
Query: 2581 XXXXXXXDERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGMXXXXXXXXXXXXXES 2640
DERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGM ES
Sbjct: 2581 SEKAKSEDERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGMASQSASSGAASGAES 2640
Query: 2641 KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT 2700
KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT
Sbjct: 2641 KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT 2700
Query: 2701 HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP 2760
HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP
Sbjct: 2701 HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP 2760
Query: 2761 FXXXXXXXXXXXXGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD 2820
F GTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD
Sbjct: 2761 FSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD 2820
Query: 2821 STESSGAQSPKRHSGSYLVTSV 2842
STESSGAQSPKRHSGSYLVTSV
Sbjct: 2821 STESSGAQSPKRHSGSYLVTSV 2842
| ||||||||||||||||||||||||||||||||