Home Search Reports Help

Gene Model: Apc

NomenclatureGenomic Location
SymbolApcChromosome1
NameAdenomatous polyposis coli proteinLinkage mapunknown
SpeciesDracomimus familiarisGenome CoordinatesChr1: 88 Mbp

Molecular Function

Tumor suppressor. Promotes rapid degradation of CTNNB1 and participates in Wnt signaling as a negative regulator. APC activity is correlated with its phosphorylation state. Activates the GEF activity of SPATA13 and ARHGEF4. Plays a role in hepatocyte growth factor (HGF)-induced cell migration. Required for MMP9 up-regulation via the JNK signaling pathway in colorectal tumor cells.

Molecular Function Terms:

binding
   protein binding
      beta-catenin binding
      cytoskeletal protein binding
         tubulin binding
      enzyme binding
      gamma-catenin binding

enzyme regulator activity
   kinase regulator activity

Human Disease Association

Defects in APC are a cause of familial adenomatous polyposis (FAP) [MIM:175100]; which includes also Gardner syndrome (GS). FAP and GS contribute to tumor development in patients with uninherited forms of colorectal cancer. FAP is characterized by adenomatous polyps of the colon and rectum, but also of upper gastrointestinal tract (ampullary, duodenal and gastric adenomas). This is a viciously premalignant disease with one or more polyps progressing through dysplasia to malignancy in untreated gene carriers with a median age at diagnosis of 40 years.

Defects in APC are a cause of hereditary desmoid disease (HDD) [MIM:135290]; also known as familial infiltrative fibromatosis (FIF). HDD is an autosomal dominant trait with 100% penetrance and possible variable expression among affected relatives. HDD patients show multifocal fibromatosis of the paraspinal muscles, breast, occiput, arms, lower ribs, abdominal wall, and mesentery. Desmoid tumors appears also as a complication of familial adenomatous polyposis.

Defects in APC are a cause of medulloblastoma (MDB) [MIM:155255]. MDB is a malignant, invasive embryonal tumor of the cerebellum with a preferential manifestation in children. Although the majority of medulloblastomas occur sporadically, some manifest within familial cancer syndromes such as Turcot syndrome and basal cell nevus syndrome (Gorlin syndrome).

Defects in APC are a cause of mismatch repair cancer syndrome (MMRCS) [MIM:276300]; also known as Turcot syndrome or brain tumor-polyposis syndrome 1 (BTPS1). MMRCS is an autosomal dominant disorder characterized by malignant tumors of the brain associated with multiple colorectal adenomas. Skin features include sebaceous cysts, hyperpigmented and cafe au lait spots.

Defects in APC are a cause of gastric cancer (GASC) [MIM:613659]; also called gastric cancer intestinal or stomach cancer. Gastric cancer is a malignant disease which starts in the stomach, can spread to the esophagus or the small intestine, and can extend through the stomach wall to nearby lymph nodes and organs. It also can metastasize to other parts of the body. The term gastric cancer or gastric carcinoma refers to adenocarcinoma of the stomach that accounts for most of all gastric malignant tumors. Two main histologic types are recognized, diffuse type and intestinal type carcinomas. Diffuse tumors are poorly differentiated infiltrating lesions, resulting in thickening of the stomach. In contrast, intestinal tumors are usually exophytic, often ulcerating, and associated with intestinal metaplasia of the stomach, most often observed in sporadic disease.

Defects in APC are a cause of hepatocellular carcinoma (HCC) [MIM:114550]. This defect includes also the disease entity termed hepatoblastoma.

Predicted Transcript
       ...........ATGGCTGCAGCTTCATATGATCAGTTGTTAAAGCAAGTTGAGGCACTGA
    50 AGATGGAGAACTCAAATCTTCGACAAGAGCTAGAAGATAATTCCAATCATCTTACAAAAC
   110 TGGAAACTGAGGCATCTAATATGAAGGAAGTACTTAAGCAGCTACAGGGAAGTATTGAAG
   170 ATGAGACTATGACTTCTGGACAGATTGACTTACTAGAGCGTCTTAAAGAATTTAACTTAG
   230 ATAGTAATTTCCCCGGAGTGAAACTACGCTCAAAAATGTCCCTTCGCTCCTACGGAAGTC
   290 GGGAAGGATCTGTATCCAGCCGTTCAGGAGAATGCAGTCCTGTCCCCATGGGGTCATTCC
   350 CAAGAAGAACATTTGTAAATGGAAGCAGAGAGAGTACTGGGTATCTAGAAGAGCTTGAAA
   410 AAGAAAGATCATTACTCCTTGCTGATCTTGACAAAGAAGAGAAGGAAAAGGACTGGTATT
   470 ATGCTCAACTTCAGAACCTCACAAAAAGAATAGATAGCCTGCCTTTAACTGAAAATTTTT
   530 CCTTACAGACAGACATGACAAGACGGCAGCTGGAGTATGAAGCAAGGCAGATCAGGGCTG
   590 CAATGGAGGAGCAGCTTGGCACCTGCCAGGACATGGAGAAGCGTGCACAGCGAAGAATAG
   650 CCAGGATCCAGCAAATAGAAAAGGACATACTGCGCGTGCGCCAGCTTTTACAGTCCCAGG
   710 CGGCGGAAGCGGAGAGGTCATCTCAGAGCAGGCATGATGCTGCCTCCCATGAAGCTGGCC
   770 GGCAGCACGAAGGCCACGGAGTGGCAGAAAGCAACACCGCAGCCTCCAGTAGTGGTCAGA
   830 GTCCAGCTACACGTGTGGATCACGAAACAGCCAGTGTTTTGAGTTCTAGCGGCACGCACT
   890 CTGCTCCTCGAAGGTTGACAAGTCATCTGGGGACAAAGGTGGAAATGGTGTATTCCTTGT
   950 TGTCAATGCTTGGTACTCATGATAAGGACGATATGTCACGAACTTTGCTAGCTATGTCCA
  1010 GCTCCCAAGACAGCTGTATATCCATGCGGCAGTCTGGATGTCTTCCTCTCCTCATCCAGC
  1070 TTTTACATGGCAATGACAAAGACTCTGTATTGTTGGGAAATTCCCGGGGCAGTAAAGAGG
  1130 CTCGGGCCAGGGCCAGTGCAGCACTCCACAACATCATTCACTCACAGCCTGATGACAAGA
  1190 GAGGCAGGCGTGAAATCCGAGTCCTTCATCTTTTGGAACAGATACGAGCTTACTGTGAAA
  1250 CCTGTTGGGAGTGGCAGGAAGCCCACGAACAAGGCATGGACCAGGACAAAAACCCAATGC
  1310 CAGCTCCTGTTGAGCATCAGATCTGTCCTGCTGTGTGTGTTCTAATGAAGCTTTCATTTG
  1370 ATGAAGAGCATAGGCATGCAATGAATGAACTTGGGGGACTGCAGGCCATTGCAGAGTTAT
  1430 TGCAGGTGGACTGTGAGATGTATGGGCTTACTAATGACCACTACAGTGTTACTTTAAGAC
  1490 GGTATGCTGGAATGGCTTTGACAAACTTGACCTTTGGAGATGTTGCCAACAAGGCTACGC
  1550 TGTGTTCTATGAAAGGCTGCATGAGAGCACTTGTGGCCCAGTTAAAATCTGAGAGTGAAG
  1610 ACTTACAGCAGGTTATTGCAAGTGTTTTGAGGAATTTGTCTTGGCGAGCAGATGTAAATA
  1670 GCAAAAAGACGTTGAGAGAAGTTGGAAGTGTGAAAGCATTGATGGAATGTGCTTTGGAAG
  1730 TTAAAAAGGAATCAACCCTCAAAAGCGTTTTGAGTGCCTTATGGAACCTGTCTGCACACT
  1790 GCACTGAGAATAAGGCTGACATCTGTGCTGTGGATGGAGCACTGGCATTTCTGGTTGGCA
  1850 CCCTCACTTACCGGAGCCAGACAAATACTTTAGCCATTATTGAAAGTGGAGGTGGGATAT
  1910 TACGGAATGTGTCCAGCTTGATAGCTACAAACGAAGACCACAGGCAAATCCTAAGAGAGA
  1970 ACAATTGCCTACAAACTTTATTACAGCACTTGAAATCTCACAGCTTGACAATAGTCAGTA
  2030 ATGCATGTGGAACTTTGTGGAATCTCTCAGCAAGAAATCCTAAAGACCAGGAAGCCTTGT
  2090 GGGACATGGGGGCAGTGAGCATGCTCAAGAACCTCATTCATTCCAAGCACAAAATGATTG
  2150 CCATGGGAAGTGCAGCAGCTTTAAGGAATCTCATGGCAAACAGACCTGCAAAGTATAAGG
  2210 ATGCCAATATCATGTCTCCCGGCTCAAGTCTGCCATCCCTTCACGTTAGGAAACAGAAAG
  2270 CTCTAGAAGCTGAGCTAGATGCTCAGCATTTATCAGAAACCTTCGACAACATTGACAACC
  2330 TAAGTCCCAAGGCCTCTCACCGGAGTAAGCAGAGACACAAGCAGAATCTTTATGGTGACT
  2390 ATGCTTTTGACGCCAATCGACATGATGATAGTAGGTCAGACAATTTCAATACTGGAAACA
  2450 TGACTGTTCTTTCACCATATTTAAATACTACGGTATTGCCCAGCTCTTCTTCCTCAAGGG
  2510 GAAGTTTAGACAGTTCTCGTTCTGAGAAAGACAGAAGTTTGGAGAGAGAGCGAGGTATTG
  2570 GCCTCAGTGCTTACCATCCAACAACAGAAAATGCAGGAACCTCATCAAAACGAGGTCTGC
  2630 AGATCACTACCACTGCAGCCCAGATAGCCAAAGTTATGGAAGAAGTATCAGCCATTCATA
  2690 CCTCCCAGGACGACAGAAGTTCTGCTTCTACCACCGAGTTCCATTGTGTGGCAGACGACA
  2750 GGAGTGCGGCACGAAGAAGCTCTGCCTCCCACACACACTCAAACACATACAACTTCACTA
  2810 AGTCGGAAAATTCAAATAGGACATGCTCTATGCCTTATGCCAAAGTGGAATATAAACGAT
  2870 CTTCAAATGACAGTTTAAATAGTGTCACTAGTAGTGATGGATATGGTAAAAGAGGCCAAA
  2930 TGAAACCCTCAGTTGAATCCTATTCTGAAGATGATGAAAGTAAATTTTGCAGTTATGGTC
  2990 AGTATCCAGCTGACCTAGCCCATAAGATACACAGTGCAAATCATATGGATGATAATGATG
  3050 GAGAACTGGATACACCAATAAATTACAGTCTTAAATATTCAGATGAGCAGTTGAACTCAG
  3110 GAAGGCAGAGTCCCTCACAGAATGAAAGGTGGGCAAGACCAAAGCATGTGATAGAAGATG
  3170 AAATAAAGCAAAACGAGCAAAGACAAGCAAGAAGCCAGAACACCAGTTATCCTGTCTATT
  3230 CTGAGAATACCGATGACAAACACCTCAAATTCCAACCACATTTTGGACAACAAGAATGTG
  3290 TTTCCCCATATAGGTCAAGGGGAACCAGTGGTTCAGAAACAAATCGAATGGGTTCTAGTC
  3350 ATGCAATTAATCAAAATGTAAACCAGTCTCTGTGTCAGGAAGATGATTATGAAGATGATA
  3410 AACCTACCAACTACAGTGAACGTTATTCTGAGGAAGAACAACATGAAGAAGAAGAAGAGA
  3470 GACCGACAAATTATAGCATAAAATATAATGAAGAGAAACATCATGTGGATCAGCCTATTG
  3530 ATTATAGTTTAAAATATGCCACTGACATTTCTTCCTCACAAAAACCATCATTTTCATTCT
  3590 CAAAGAATTCATCAGCACAAAGCACTAAACCTGAACATCTCTCTCCAAGCAGCGAGAATA
  3650 CAGCTGTACCTCCATCTAATGCCAAAAGGCAGAATCAGCTGCGTCCAAGTTCAGCACAAA
  3710 GAAATGGCCAGACTCAAAAAGGCACTACTTGCAAAGTCCCCTCCATCAACCAAGAAACAA
  3770 TACAGACTTACTGCGTAGAAGACACCCCAATATGTTTTTCAAGGTGCAGTTCATTATCAT
  3830 CACTGTCATCAGCTGACGATGAAATAGGATGTGATCAGACAACACAGGAAGCAGATTCTG
  3890 CTAATACTCTGCAGACAGCAGAAGTAAAAGAGAATGATGTAACTCGGTCAGCTGAAGATC
  3950 CTGCAACTGAAGTTCCAGCAGTGTCCCAGAATGCTAGAGCCAAACCCAGCCGACTCCAGG
  4010 CTTCTGGCTTATCTTCAGAATCAACCAGGCATAATAAAGCTGTTGAGTTTTCTTCAGGAG
  4070 CCAAGTCTCCCTCCAAAAGTGGTGCTCAGACACCCAAAAGTCCCCCAGAACACTATGTCC
  4130 AGGAGACTCCGCTCGTATTCAGCAGGTGTACTTCTGTCAGCTCCCTTGACAGTTTTGAGA
  4190 GTCGCTCCATTGCCAGCTCTGTTCAGAGTGAGCCATGTAGTGGAATGGTGAGTGGCATCA
  4250 TAAGCCCCAGTGACCTTCCAGATAGTCCTGGGCAGACCATGCCACCAAGCAGAAGCAAAA
  4310 CCCCTCCACCTCCTCCACAGACAGTGCAGGCCAAGAGAGAGGTGCCAAAAAGTAAAGTCC
  4370 CTGCTGCTGAGAAGAGAGAGAGTGGGCCTAAGCAGACTGCTGTAAATGCTGCCGTGCAGA
  4430 GGGTGCAGGTCCTTCCAGACGTGGATACTTTGTTACACTTCGCCACAGAAAGTACTCCAG
  4490 ACGGGTTTTCTTGTTCCTCCAGCCTAAGTGCTCTGAGCCTGGATGAGCCATTTATACAGA
  4550 AAGATGTAGAATTAAGAATCATGCCTCCAGTTCAGGAAAACGACAATGGGAATGAAACTG
  4610 AATCAGAACAGCCTGAGGAATCAAATGAAAACCAGGATAAAGAGGTAGAAAAGCCTGACT
  4670 CTGAAAAAGACTTATTAGATGATTCTGATGACGATGATATTGAAATATTAGAAGAATGTA
  4730 TTATTTCAGCCATGCCAACAAAGTCATCACGCAAAGCCAAAAAACTAGCCCAGACTGCTT
  4790 CAAAATTACCTCCACCTGTGGCAAGGAAACCAAGTCAGCTACCTGTGTATAAACTTCTGC
  4850 CAGCACAGAATAGGCTGCAGGCACAAAAACATGTTAGCTTTACACCAGGGGATGATGTGC
  4910 CCCGGGTGTACTGTGTAGAAGGGACACCTATAAACTTTTCCACAGCAACGTCTCTAAGTG
  4970 ATCTGACAATAGAGTCCCCTCCAAATGAATTGGCTACTGGAGATGGGGTCAGAGCGGGTA
  5030 TACAGTCAGGTGAATTTGAAAAACGAGATACCATTCCTACAGAAGGCAGAAGTACAGATG
  5090 ATGCTCAGCGAGGAAAAATCTCATCTATAGTTACACCAGACCTGGATGACAACAAAGCAG
  5150 AGGAAGGAGATATTCTTGCAGAATGTATCAATTCTGCTATGCCCAAAGGAAAAAGCCACA
  5210 AGCCTTTCCGAGTGAAAAAGATAATGGACCAAGTCCAACAAGCATCCTCGACTTCATCTG
  5270 GAGCTAACAAAAATCAAGTAGACACTAAGAAAAAGAAGCCTACTTCACCAGTAAAGCCCA
  5330 TGCCACAAAATACTGAATATAGAACGCGTGTGAGAAAGAATACAGACTCAAAAGTTAATG
  5390 TAAATACTGAAGAAACTTTCTCAGACAACAAAGACTCAAAGAAACCAAGCTTACAAACCA
  5450 ATGCCAAGGCCTTCAATGAAAAGCTACCTAACAATGAAGACAGAGTGCGGGGGAGCTTCG
  5510 CCTTGGACTCACCGCATCACTACACCCCTATTGAGGGGACGCCGTACTGCTTTTCCCGAA
  5570 ATGACTCCTTGAGTTCTCTGGATTTTGATGATGACGATGTTGACCTTTCCAGGGAAAAGG
  5630 CCGAGTTAAGAAAGGGCAAAGAAAGCAAGGATTCCGAAGCCAAAGTTACCTGCCGCCCAG
  5690 AACCAAACTCAAGCCAGCAGGCAGCTAGTAAGTCACAAGCCAGTATAAAACATCCAGCAA
  5750 ACAGAGCACAGTCCAAACCAGTGCTGCAGAAACAGCCCACTTTCCCCCAGTCCTCCAAAG
  5810 ACGGACCAGATAGAGGGGCAGCAACTGACGAAAAACTGCAGAATTTTGCTATTGAAAATA
  5870 CTCCAGTTTGCTTTTCTCGAAATTCCTCTCTGAGTTCCCTTAGTGACATTGACCAGGAAA
  5930 ACAACAATAACAAAGAAAGTGAACCAATCAAAGAAGCTGAACCTGCCAACTCACAAGGAG
  5990 AGCCCAGTAAGCCTCAGGCATCCGGGTATGCTCCCAAGTCCTTCCACGTCGAAGACACCC
  6050 CTGTCTGTTTCTCAAGAAACAGCTCTCTCAGTTCTCTTAGCATTGACTCTGAGGACGACC
  6110 TGTTACAGGAGTGTATAAGTTCTGCCATGCCAAAAAAGAAAAGGCCTTCAAGACTCAAGA
  6170 GTGAGAGCGAAAAGCAGAGCCCTAGAAAAGTGGGTGGCATATTAGCTGAAGACCTGACGC
  6230 TTGATTTGAAAGATCTACAGAGGCCAGATTCAGAACACGCTTTCTCCCCCGACTCAGAAA
  6290 ATTTTGACTGGAAAGCTATTCAGGAAGGCGCAAACTCCATAGTAAGTAGTTTGCACCAAG
  6350 CTGCTGCAGCCGCCGCGTGCTTATCTAGACAAGCGTCATCCGACTCAGATTCCATTCTGT
  6410 CACTAAAGTCCGGCATTTCTCTGGGATCGCCTTTTCATCTTACACCTGATCAAGAGGAAA
  6470 AGCCATTCACAAGCAATAAAGGCCCAAGAATTCTCAAACCTGGAGAGAAAAGCACATTAG
  6530 AAGCAAAAAAAATAGAATCTGAAAACAAAGGAATCAAAGGCGGGAAAAAGGTTTATAAAA
  6590 GCTTGATTACGGGAAAGATTCGCTCCAATTCAGAAATTTCCAGCCAAATGAAACAACCCC
  6650 TCCCGACAAACATGCCTTCAATCTCAAGAGGCAGGACGATGATTCACATCCCAGGGCTTC
  6710 GGAATAGCTCCTCTAGTACAAGCCCTGTCTCTAAGAAAGGCCCACCCCTCAAGACTCCAG
  6770 CCTCTAAAAGCCCCAGTGAAGGGCCGGGAGCTACCACTTCTCCTCGAGGAACTAAGCCAG
  6830 CAGGAAAGTCAGAGCTTAGCCCTATCACCAGGCAAACTTCCCAAATCAGTGGGTCAAATA
  6890 AGGGGTCTTCTAGATCAGGATCTAGAGACTCCACTCCCTCAAGACCTACACAGCAACCAT
  6950 TAAGTAGGCCAATGCAGTCTCCAGGGCGAAACTCAATTTCCCCTGGTAGAAATGGAATAA
  7010 GCCCTCCTAACAAACTGTCTCAGCTGCCCAGAACATCATCTCCCAGTACTGCTTCAACTA
  7070 AGTCCTCCGGTTCTGGGAAAATGTCATATACATCCCCAGGTAGACAGCTGAGCCAACAAA
  7130 ATCTTACCAAACAAGCAAGTTTATCCAAGAATGCCAGCAGTATCCCCAGAAGTGAGTCGG
  7190 CATCTAAAGGACTGAATCAGATGAGTAACGGCAATGGGTCAAATAAAAAGGTAGAACTTT
  7250 CTAGAATGTCTTCAACTAAATCAAGTGGAAGTGAATCAGACAGATCAGAAAGGCCTGCAT
  7310 TAGTACGCCAGTCTACTTTCATCAAAGAAGCCCCAAGCCCAACCCTGAGGAGGAAACTGG
  7370 AGGAATCTGCCTCATTTGAATCCCTTTCTCCATCTTCTAGACCAGATTCTCCCACCAGGT
  7430 CGCAGGCACAGACCCCAGTTTTAAGCCCTTCCCTTCCTGATATGTCTCTGTCCACACATC
  7490 CATCTGTTCAGGCAGGTGGGTGGCGAAAGCTCCCGCCTAATCTCAGCCCCACTATCGAGT
  7550 ATAATGACGGAAGGCCCACAAAACGGCATGATATTGCACGCTCCCATTCTGAAAGTCCTT
  7610 CCAGACTACCAATCAACCGGGCGGGAACCTGGAAGCGTGAACACAGCAAACATTCCTCGT
  7670 CCCTTCCTCGAGTGAGTACTTGGAGAAGAACTGGAAGCTCATCTTCTATTCTTTCTGCTT
  7730 CATCAGAGTCCAGTGAAAAAGCAAAAAGTGAGGATGAAAGGCATGTGAGCTCCATGCCAG
  7790 CACCCAGACAGATGAAGGAAAACCAGGTGCCCACCAAAGGAACATGGAGGAAAATCAAGG
  7850 AAAGTGACATTTCTCCCACAGGCATGGCTTCTCAGAGCGCTTCCTCAGGTGCTGCCAGTG
  7910 GTGCTGAATCCAAGCCTCTGATCTATCAGATGGCACCTCCTGTCTCTAAAACAGAGGATG
  7970 TTTGGGTGAGAATTGAGGACTGCCCCATTAACAACCCTAGATCTGGACGGTCCCCCACAG
  8030 GCAACACCCCCCCAGTGATTGACAGTGTTTCAGAGAAGGGAAGTTCAAGCATTAAAGATT
  8090 CAAAAGACACCCATGGGAAACAGAGTGTGGGCAGTGGCAGTCCTGTGCAAACCGTGGGTC
  8150 TGGAAACCCGCCTCAACTCCTTTGTTCAGGTAGAGGCCCCAGAACAGAAAGGAACTGAGG
  8210 CAAAACCAGGACAGAGTAACCCAGTCTCTATAGCAGAGACTGCTGAGACGTGTATAGCAG
  8270 AGCGTACCCCTTTCAGTTCCAGTAGCTCCAGCAAGCACAGCTCACCTAGCGGGACTGTTG
  8330 CTGCCAGAGTGACACCTTTTAATTACAACCCTAGCCCTAGGAAGAGCAGCGCAGACAGCA
  8390 CTTCAGCCCGGCCGTCTCAGATCCCTACGCCAGTGAGCACCAACACGAAGAAGAGAGATT
  8450 CGAAGACTGACAGCACAGAATCCAGTGGAGCCCAAAGTCCTAAACGCCATTCCGGGTCTT
  8510 ACCTCGTGACGTCTGTTTAA...........................


Predicted Protein Product
MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM
TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRT
FVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSLQT
DMTRRQLEYEARQIRAAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEA
ERSSQSRHDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR
RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG
NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE
WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD
CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ
VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN
KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL
QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS
AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK
ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPSSSSSRGSLD
SSRSEKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD
DRSSASTTEFHCVADDRSAARRSSASHTHSNTYNFTKSENSNRTCSMPYAKVEYKRSSND
SLNSVTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD
TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT
DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN
YSERYSEEEQHEEEEERPTNYSIKYNEEKHHVDQPIDYSLKYATDISSSQKPSFSFSKNS
SAQSTKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY
CVEDTPICFSRCSSLSSLSSADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE
VPAVSQNARAKPSRLQASGLSSESTRHNKAVEFSSGAKSPSKSGAQTPKSPPEHYVQETP
LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPPP
PPQTVQAKREVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGFS
CSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETESEQPEESNENQDKEVEKPDSEKD
LLDDSDDDDIEILEECIISAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN
RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG
EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR
VKKIMDQVQQASSTSSGANKNQVDTKKKKPTSPVKPMPQNTEYRTRVRKNTDSKVNVNTE
ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRNDSL
SSLDFDDDDVDLSREKAELRKGKESKDSEAKVTCRPEPNSSQQAASKSQASIKHPANRAQ
SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNSSLSSLSDIDQENNNN
KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRNSSLSSLSIDSEDDLLQE
CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW
KAIQEGANSIVSSLHQAAAAAACLSRQASSDSDSILSLKSGISLGSPFHLTPDQEEKPFT
SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN
MPSISRGRTMIHIPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS
ELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRNGISPPN
KLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSESASKG
LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLEESA
SFESLSPSSRPDSPTRSQAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG
RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSES
SEKAKSEDERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGMASQSASSGAASGAES
KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT
HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP
FSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD
STESSGAQSPKRHSGSYLVTSV
Protein Alignment to Mouse
tr|B2RUG9|B2RUG9_MOUSE Adenomatosis polyposis coli OS=Mus musculus GN=Apc PE=2
            SV=1
      MGI:1923206 Srrm2 serine/arginine repetitive matrix 2 (Chr 17)
        Length = 2842

 Score = 12127 (4274.0 bits), Expect = 0., P = 0.
 Identities = 2393/2842 (84%), Positives = 2393/2842 (84%)

Query:     1 MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM 60
             MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM
Sbjct:     1 MAAASYDQLLKQVEALKMENSNLRQELEDNSNHLTKLETEASNMKEVLKQLQGSIEDETM 60

Query:    61 TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLXXXXXXXXXXXXXXXXXXPVPMGSFPRRT 120
             TSGQIDLLERLKEFNLDSNFPGVKLRSKMSL                  PVPMGSFPRRT
Sbjct:    61 TSGQIDLLERLKEFNLDSNFPGVKLRSKMSLRSYGSREGSVSSRSGECSPVPMGSFPRRT 120

Query:   121 FVNGSRESTGYXXXXXXXXXXXXXXXXXXXXXXXWYYAQLQNLTKRIDSLPLTENFSLQT 180
             FVNGSRESTGY                       WYYAQLQNLTKRIDSLPLTENFSLQT
Sbjct:   121 FVNGSRESTGYLEELEKERSLLLADLDKEEKEKDWYYAQLQNLTKRIDSLPLTENFSLQT 180

Query:   181 DMTRRQLEYEARQIRAAMEEQLGTCQDMEKXXXXXXXXXXXXEKDILRVRQLLXXXXXXX 240
             DMTRRQLEYEARQIRAAMEEQLGTCQDMEK            EKDILRVRQLL       
Sbjct:   181 DMTRRQLEYEARQIRAAMEEQLGTCQDMEKRAQRRIARIQQIEKDILRVRQLLQSQAAEA 240

Query:   241 XXXXXXXHDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR 300
                    HDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR
Sbjct:   241 ERSSQSRHDAASHEAGRQHEGHGVAESNTAASSSGQSPATRVDHETASVLSSSGTHSAPR 300

Query:   301 RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG 360
             RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG
Sbjct:   301 RLTSHLGTKVEMVYSLLSMLGTHDKDDMSRTLLAMSSSQDSCISMRQSGCLPLLIQLLHG 360

Query:   361 NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE 420
             NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE
Sbjct:   361 NDKDSVLLGNSRGSKEARARASAALHNIIHSQPDDKRGRREIRVLHLLEQIRAYCETCWE 420

Query:   421 WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD 480
             WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD
Sbjct:   421 WQEAHEQGMDQDKNPMPAPVEHQICPAVCVLMKLSFDEEHRHAMNELGGLQAIAELLQVD 480

Query:   481 CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ 540
             CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ
Sbjct:   481 CEMYGLTNDHYSVTLRRYAGMALTNLTFGDVANKATLCSMKGCMRALVAQLKSESEDLQQ 540

Query:   541 VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN 600
             VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN
Sbjct:   541 VIASVLRNLSWRADVNSKKTLREVGSVKALMECALEVKKESTLKSVLSALWNLSAHCTEN 600

Query:   601 KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL 660
             KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL
Sbjct:   601 KADICAVDGALAFLVGTLTYRSQTNTLAIIESGGGILRNVSSLIATNEDHRQILRENNCL 660

Query:   661 QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS 720
             QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS
Sbjct:   661 QTLLQHLKSHSLTIVSNACGTLWNLSARNPKDQEALWDMGAVSMLKNLIHSKHKMIAMGS 720

Query:   721 AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK 780
             AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK
Sbjct:   721 AAALRNLMANRPAKYKDANIMSPGSSLPSLHVRKQKALEAELDAQHLSETFDNIDNLSPK 780

Query:   781 ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPXXXXXXXXXX 840
             ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLP          
Sbjct:   781 ASHRSKQRHKQNLYGDYAFDANRHDDSRSDNFNTGNMTVLSPYLNTTVLPSSSSSRGSLD 840

Query:   841 XXXXEKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD 900
                 EKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD
Sbjct:   841 SSRSEKDRSLERERGIGLSAYHPTTENAGTSSKRGLQITTTAAQIAKVMEEVSAIHTSQD 900

Query:   901 DRSSASTTEFHCVXXXXXXXXXXXXXHTHSNTYNFTKSENSNRTCSMPYAKVEYKRXXXX 960
             DRSSASTTEFHCV             HTHSNTYNFTKSENSNRTCSMPYAKVEYKR    
Sbjct:   901 DRSSASTTEFHCVADDRSAARRSSASHTHSNTYNFTKSENSNRTCSMPYAKVEYKRSSND 960

Query:   961 XXXXXXXXXGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD 1020
                      GYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD
Sbjct:   961 SLNSVTSSDGYGKRGQMKPSVESYSEDDESKFCSYGQYPADLAHKIHSANHMDDNDGELD 1020

Query:  1021 TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT 1080
             TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT
Sbjct:  1021 TPINYSLKYSDEQLNSGRQSPSQNERWARPKHVIEDEIKQNEQRQARSQNTSYPVYSENT 1080

Query:  1081 DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN 1140
             DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN
Sbjct:  1081 DDKHLKFQPHFGQQECVSPYRSRGTSGSETNRMGSSHAINQNVNQSLCQEDDYEDDKPTN 1140

Query:  1141 XXXXXXXXXXXXXXXXXPTNYSIKYNEEKHHVDQPIDYSLKYATDIXXXXXXXXXXXXXX 1200
                              PTNYSIKYNEEKHHVDQPIDYSLKYATDI              
Sbjct:  1141 YSERYSEEEQHEEEEERPTNYSIKYNEEKHHVDQPIDYSLKYATDISSSQKPSFSFSKNS 1200

Query:  1201 XXXXTKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY 1260
                 TKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY
Sbjct:  1201 SAQSTKPEHLSPSSENTAVPPSNAKRQNQLRPSSAQRNGQTQKGTTCKVPSINQETIQTY 1260

Query:  1261 CVEDTPICFSRCXXXXXXXXADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE 1320
             CVEDTPICFSRC        ADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE
Sbjct:  1261 CVEDTPICFSRCSSLSSLSSADDEIGCDQTTQEADSANTLQTAEVKENDVTRSAEDPATE 1320

Query:  1321 VPAVSQNARAKPSRLQASGLSSESTRHNKAVEFXXXXXXXXXXXXQTPKSPPEHYVQETP 1380
             VPAVSQNARAKPSRLQASGLSSESTRHNKAVEF            QTPKSPPEHYVQETP
Sbjct:  1321 VPAVSQNARAKPSRLQASGLSSESTRHNKAVEFSSGAKSPSKSGAQTPKSPPEHYVQETP 1380

Query:  1381 LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMXXXXXXXXXX 1440
             LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTM          
Sbjct:  1381 LVFSRCTSVSSLDSFESRSIASSVQSEPCSGMVSGIISPSDLPDSPGQTMPPSRSKTPPP 1440

Query:  1441 XXXXXXXXXEVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGFX 1500
                      EVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGF 
Sbjct:  1441 PPQTVQAKREVPKSKVPAAEKRESGPKQTAVNAAVQRVQVLPDVDTLLHFATESTPDGFS 1500

Query:  1501 XXXXXXXXXXDEPFIQKDVELRIMPPVQENDXXXXXXXXXXXXXXXXXDKEVEKPXXXXX 1560
                       DEPFIQKDVELRIMPPVQEND                 DKEVEKP     
Sbjct:  1501 CSSSLSALSLDEPFIQKDVELRIMPPVQENDNGNETESEQPEESNENQDKEVEKPDSEKD 1560

Query:  1561 XXXXXXXXXXXXXXXXXXXAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN 1620
                                AMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN
Sbjct:  1561 LLDDSDDDDIEILEECIISAMPTKSSRKAKKLAQTASKLPPPVARKPSQLPVYKLLPAQN 1620

Query:  1621 RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG 1680
             RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG
Sbjct:  1621 RLQAQKHVSFTPGDDVPRVYCVEGTPINFSTATSLSDLTIESPPNELATGDGVRAGIQSG 1680

Query:  1681 EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR 1740
             EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR
Sbjct:  1681 EFEKRDTIPTEGRSTDDAQRGKISSIVTPDLDDNKAEEGDILAECINSAMPKGKSHKPFR 1740

Query:  1741 VKKIMDQVQQASSTSSGANKNQVDTXXXXXXXXXXXXXQNTEYRTRVRKNTDSKVNVNTE 1800
             VKKIMDQVQQASSTSSGANKNQVDT             QNTEYRTRVRKNTDSKVNVNTE
Sbjct:  1741 VKKIMDQVQQASSTSSGANKNQVDTKKKKPTSPVKPMPQNTEYRTRVRKNTDSKVNVNTE 1800

Query:  1801 ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRNXXX 1860
             ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRN   
Sbjct:  1801 ETFSDNKDSKKPSLQTNAKAFNEKLPNNEDRVRGSFALDSPHHYTPIEGTPYCFSRNDSL 1860

Query:  1861 XXXXXXXXXXXXXREKAELRKGKESKDSEAKVTCRPEPNXXXXXXXXXXXXIKHPANRAQ 1920
                          REKAELRKGKESKDSEAKVTCRPEPN            IKHPANRAQ
Sbjct:  1861 SSLDFDDDDVDLSREKAELRKGKESKDSEAKVTCRPEPNSSQQAASKSQASIKHPANRAQ 1920

Query:  1921 SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNXXXXXXXDIDQENNNN 1980
             SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRN       DIDQENNNN
Sbjct:  1921 SKPVLQKQPTFPQSSKDGPDRGAATDEKLQNFAIENTPVCFSRNSSLSSLSDIDQENNNN 1980

Query:  1981 KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRNXXXXXXXXXXXXXXXQE 2040
             KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRN               QE
Sbjct:  1981 KESEPIKEAEPANSQGEPSKPQASGYAPKSFHVEDTPVCFSRNSSLSSLSIDSEDDLLQE 2040

Query:  2041 CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW 2100
             CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW
Sbjct:  2041 CISSAMPKKKRPSRLKSESEKQSPRKVGGILAEDLTLDLKDLQRPDSEHAFSPDSENFDW 2100

Query:  2101 KAIQEGANSIVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPFHLTPDQEEKPFT 2160
             KAIQEGANSIV                                   PFHLTPDQEEKPFT
Sbjct:  2101 KAIQEGANSIVSSLHQAAAAAACLSRQASSDSDSILSLKSGISLGSPFHLTPDQEEKPFT 2160

Query:  2161 SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN 2220
             SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN
Sbjct:  2161 SNKGPRILKPGEKSTLEAKKIESENKGIKGGKKVYKSLITGKIRSNSEISSQMKQPLPTN 2220

Query:  2221 MPSISRGRTMIHIPGLRNXXXXXXXXXKKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS 2280
             MPSISRGRTMIHIPGLRN         KKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS
Sbjct:  2221 MPSISRGRTMIHIPGLRNSSSSTSPVSKKGPPLKTPASKSPSEGPGATTSPRGTKPAGKS 2280

Query:  2281 ELSPITRQTXXXXXXXXXXXXXXXXXXXXXXPTQQPLSRPMQSPGRNSISPGRNGISPPN 2340
             ELSPITRQT                      PTQQPLSRPMQSPGRNSISPGRNGISPPN
Sbjct:  2281 ELSPITRQTSQISGSNKGSSRSGSRDSTPSRPTQQPLSRPMQSPGRNSISPGRNGISPPN 2340

Query:  2341 KLSQLPRXXXXXXXXXXXXXXXXXXXXXXGRQLSQQNLTKQASLSKNASSIPRSESASKG 2400
             KLSQLPR                      GRQLSQQNLTKQASLSKNASSIPRSESASKG
Sbjct:  2341 KLSQLPRTSSPSTASTKSSGSGKMSYTSPGRQLSQQNLTKQASLSKNASSIPRSESASKG 2400

Query:  2401 LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLXXXX 2460
             LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKL    
Sbjct:  2401 LNQMSNGNGSNKKVELSRMSSTKSSGSESDRSERPALVRQSTFIKEAPSPTLRRKLEESA 2460

Query:  2461 XXXXXXXXXXXXXXXXXQAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG 2520
                              QAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG
Sbjct:  2461 SFESLSPSSRPDSPTRSQAQTPVLSPSLPDMSLSTHPSVQAGGWRKLPPNLSPTIEYNDG 2520

Query:  2521 RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGXXXXXXXXXXXX 2580
             RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTG            
Sbjct:  2521 RPTKRHDIARSHSESPSRLPINRAGTWKREHSKHSSSLPRVSTWRRTGSSSSILSASSES 2580

Query:  2581 XXXXXXXDERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGMXXXXXXXXXXXXXES 2640
                    DERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGM             ES
Sbjct:  2581 SEKAKSEDERHVSSMPAPRQMKENQVPTKGTWRKIKESDISPTGMASQSASSGAASGAES 2640

Query:  2641 KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT 2700
             KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT
Sbjct:  2641 KPLIYQMAPPVSKTEDVWVRIEDCPINNPRSGRSPTGNTPPVIDSVSEKGSSSIKDSKDT 2700

Query:  2701 HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP 2760
             HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP
Sbjct:  2701 HGKQSVGSGSPVQTVGLETRLNSFVQVEAPEQKGTEAKPGQSNPVSIAETAETCIAERTP 2760

Query:  2761 FXXXXXXXXXXXXGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD 2820
             F            GTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD
Sbjct:  2761 FSSSSSSKHSSPSGTVAARVTPFNYNPSPRKSSADSTSARPSQIPTPVSTNTKKRDSKTD 2820

Query:  2821 STESSGAQSPKRHSGSYLVTSV 2842
             STESSGAQSPKRHSGSYLVTSV
Sbjct:  2821 STESSGAQSPKRHSGSYLVTSV 2842