number of amino acids

0 downloads 0 Views 465KB Size Report
U L LL LLL LLL LLL LLL LLL LLLL. OYL LLL ... TK L LL LLLL LLLL LLLL LLLL ...... QQ AA. AA. ET SH. ET SH. ET. QRE. E E GRE R. E E ORE R. QRE. Ε Ε QRE.
Fig. S_2: List of Csl subfamily genes, their protein sizes (number of amino acids), and multiple protein sequence alignments. The conserved motifs (D, D, DXD, QXXRW) diagnostic of CSL proteins are highlighted with red boxes for each of the subfamilies. Figure S_2A: CslA subfamily S.No 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32

Gene name with number of splice variants (CslA) No. of amino acids (aa) TRIAE_CS42_2BS_TGACv1_146583_AA0468630.1 581 aa TRIAE_CS42_2AS_TGACv1_113418_AA0355820.2 580 aa TRIAE_CS42_2DS_TGACv1_177473_AA0578070.1 581 aa TRIAE_CS42_2AS_TGACv1_113300_AA0354190.1 579 aa TRIAE_CS42_2DS_TGACv1_177798_AA0584795.1 881 aa TRIAE_CS42_6BS_TGACv1_513375_AA1639370.1_2_SPLICE 518 aa TRIAE_CS42_6AS_TGACv1_485966_AA1554960.1 518 aa TRIAE_CS42_U_TGACv1_642146_AA2112270.1 522 aa TRIAE_CS42_7BL_TGACv1_579090_AA1903960.1 375 aa TRIAE_CS42_7AL_TGACv1_558725_AA1795700.1 518 aa TRIAE_CS42_6DS_TGACv1_543811_AA1744360.1 531 aa TRIAE_CS42_6AS_TGACv1_487286_AA1569690.1 528 aa TRIAE_CS42_6BS_TGACv1_513376_AA1639390.2_2_SPLICE 528 aa TRIAE_CS42_U_TGACv1_642146_AA2112290.1 512 aa TRIAE_CS42_7BS_TGACv1_592860_AA1945380.1 547 aa TRIAE_CS42_7DS_TGACv1_623146_AA2050070.1 545 aa TRIAE_CS42_7AS_TGACv1_569190_AA1809650.1_3_SPLICE 551 aa TRIAE_CS42_7DL_TGACv1_602617_AA1962870.1_2_SPLICE 555 aa TRIAE_CS42_7AL_TGACv1_557254_AA1778850.1 515 aa TRIAE_CS42_7BL_TGACv1_578444_AA1895100.1 515 aa TRIAE_CS42_3DL_TGACv1_249033_AA0835410.1_2_SPLICE 572 aa TRIAE_CS42_3B_TGACv1_221079_AA0729630.1_2_SPLICE 571 aa TRIAE_CS42_3AL_TGACv1_197519_AA0666560.1 573 aa TRIAE_CS42_3B_TGACv1_220828_AA0720500.1 570 aa TRIAE_CS42_3DS_TGACv1_273022_AA0927600.1 568 aa TRIAE_CS42_2AL_TGACv1_093375_AA0278800.1 527 aa TRIAE_CS42_2BL_TGACv1_129747_AA0394630.1 528 aa TRIAE_CS42_2DL_TGACv1_160461_AA0550770.1 548 aa TRIAE_CS42_1AS_TGACv1_019142_AA0061550.1 515 aa TRIAE_CS42_3AS_TGACv1_210508_AA0674280.1 566 aa TRIAE_CS42_3DS_TGACv1_272005_AA0912960.1 570 aa TRIAE_CS42_3B_TGACv1_223332_AA0780350.1 925 aa

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv

-------------------------------------------------------------------------------MLLLKIDIAKAFDTVSWEYILELLQRMNFPAHWRDRIALLLSSVSSAYLLKGDPGPAILHQRGLRQGDPLSAILFILVIV -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

0 80 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

TRIAE_CS42_2AS_TGACv -------------------------------------------------------------------------------- 0 TRIAE_CS42_2DS_TGACv -------------------------------------------------------------------------------- 0 TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

-------------------------------------------------------------------------------PLHRMLEAAQQAGTIAPLPAGAARLRVTLYADDAIFFANPVRQEIDTIMQLLQGFGEAAGLRGNPQKSSAATLNYGSIDL -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

0 160 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

-------------------------------------------------------------------------------IDVLKNFSGTRVGFPIRYLGLPLCIGRLPLCTRVGFPIRYLGWLLGKANSCIAPPLAVASHVLVRCVLSALPAFAMAVLR -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MAAWNPETHGSGAIIVGADDCETTVEDEMAAGRDANTKLFHRVANGRK

0 240 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 48

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv

-------------------------------------------------------------------------------IPKRFYKDVDKARWRFLWVHDHEVTGGRCKVNWRLVTSPVDHGGLGIPSMERFARALCLRWLWLAWTDPARPWARMGTPC -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

0 320 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

------------------------------------------------------------------------------------------------------------------------------------------------------------MEA -----------------------------------------------------------------------------MEA -----------------------------------------------------------------------------MEA -----------------------------------------------------------------------------MEA LKNFIPAISVEGITITDQAAKEEAFFEAYSELLGRCGSREHTLDLDYLGIESINLEDQDLVFQEEEVWKVVRDMPSDRAL

0 3 3 3 3 128

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

---------------------------------------------------------------------------MAMAA DDKDRALFASATTVTVGDGNRVLFWHCSWLGEQPVRQDYPNLFRRSTRKNRMMADAIRDDRWIMDLRRSGAGEEVMAMAA ---------------------------------------------------------------------------MAMAA -----------------------------------------------------------------------------MAA ---------------------------------------------------------------------------MAATA ---------------------------------------------------------------------MAGAGEEFMA---------------------------------------------------------------------MAGAGEEFMA---------------------------------------------------------------------MAGAGEEFMAS -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MEKKKRRSSIS ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MAGDGEGAAAFAAAKAEW --------------------------------------------------------MAGDGEGAGDGEGAAAFAAAKAEW --------------------------------------------------------------MAGDGEGAAAFAVAKAEW ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------AEIGGALLFALAAAAALFSAVSTGAVDFSHPLAVGGRVDFQETISWFIG------------------------------AEIGGALLFALAAAAALFAAVSTGAVDFSHPPAVGGRVDFQEAISWFIG------------------------------AEIGGALLFALAAAAALFAAVSTGAIDFSRPLAVGGRVDFQEAISWFIG------------------------------GEIGGALLFVLAAAAAVLAAVSTGAVDFSHPPAVGGQLDFQETISWFTG------------------------------GPNGFIGVFFQKAWAIVKRDVMAALNKLFLNNGRGFGRLNQALITLIPKNHEACQIKDFRPICLVHSIPKLASKLLATRL

5 400 5 3 5 10 10 11 0 0 0 0 0 0 0 0 0 0 11 0 0 18 24 18 0 0 0 52 52 52 52 208

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

TAWLWVEVPVRVDWPAVAAQCAWAGEQARAFLVVPAVRLLVLLSLAMTVMILLEKLFVAAV-CYAAKAFGHRPESRYQWR TAGLWAEVPVRLDWATVAAQCALAGEQARAFLVVPAVRLLVLLSLAMTVMILLEKLFVAAV-CYAAKALGHRPERRYKWG TAGLWAEVPVRLDWATVAAQCSLAGEQARAFLVVPALRLLVLLSLAMTVMILLEKLFVAAV-CYSAKAFRHRPESRYRWR TVGLREEVPVRLDWATVAAQCAWAGEQTRSFLVVPAVRLLVLLSLAMTVMILLEKLFVAAV-CYAAKAFGHRPESRYKWG WLWAEVPVPVRVDWAAVAAQCAWAGQQARALLVVPTVRLLVLLSLAMTVMILLEKLFVAAV-CYAAKAFGHRPESRYRWR --GVWAEVPVRVDWAAVAAQCAWAGAQARAFLVVPAIRLLVVLSLAMTVMILLEKIFVAAV-CFAAKAFGHRPERRYQWR --AVWAGLPVRVDWAAVAAQCAWAGMQARAFLVVPAIRLLVLLSLAMTVMILLEKVFVAAV-CFAAKAFGHRPERRYQWR AAGVWAELPVRVDWAAVAAQCAWAGAQARAFLVVPAIRLLLVLSLTMTVMILLEKIFVAAV-CFAAKAFGHKPERRYQWR --------------MDAAVGLPDAWSQVRAPVIVPLLKLAVAVCLLMSVLLFLERLYMAVV-IVGVKLLGRRPERRYKCD --------------MDAAVGLPDAWSQVRAPVIVPLLKLAVALCLLMSVLLFLERLYMAVV-IVGVKLLGRRPERRYKCD -----------MSTLPGVWQIAAAWEQVRGPVIVPLLRASVLLCLAMSAMLFAEKVYMAVV-VLAVRLLGRRPERQWKWE -----------MSTLPGAWHVAAAWEQVRGPVIVPLLRASVLLCLAMSAMLFAEKVYMAVV-VLAVRLLGRRPERQYQWE --------MAAALLPGTRITFSGAWQQVRGPVIVPLLRASVLLCVAMSAMLLAEKVYMAVV-VLALRLLGRRPELQYRWE -----------MSTLPRVWQIAAAWEQVRGPVIVPLLRVSVLLCLAMSAMLFAEKVYMAVV-VLAVRLLGRRPEQQYRWE --------------MEAAEQIAVVWKQVRGPVIAPLLRASVMVCLAMCVILFVEKVYMAVV-IVAMRLIGRHPERQWRWE --------------MEAAEQIAVVWKQVRGPVIVPLLRASVMVCLAMCVILFVEKVYMAVV-IVAMRLIGRRPERQWRWE --------------MEAAEQIAVVWKQVRGPVIVPLLRASVMVCLAMCVILFVEKVYMAVV-IVAMRLIGRRPERQWRWE -----------MKGVSMLTMARAAWAAVRHAVVVPLLQLAVYLCAAMSLMLFAERLYMGLV-VAALWLRRRRRQRRNPGR FLLSFGGGRRRMKGVSMLTMARAAWAVVRYAVVVPLLQLAVYLCAAMSLMLFAERLYMGLV-VAALWLRRRRRQRRSPSR -----------MRGVSMLTMARAAWAAVRYAVVVPLLQLAVYLCAAMSLMLFAERLYMGLV-VAALWLRRRRRQRRNPSR --------------MSMLPMARAAWLVLRYAVVVPLLQLAIYLCVVMSLMLFADRLYMGLV-VAVLWLYRRCRNRNQRNK LDGSGGLPLLRWWRASGGGELLGRWDAVRAGAVAPALAAVSGACLAMSAMLLAEAVFMAAA-SLVR----RRPERRYSAG LGGSGGLPLLRWWRASGGGELLRGWDAVRAGAVAPALAAVSGACLAMSAMLLAEAVFMAAA-SLVR----RRPERRYSAG LAGSGGLPLLRWWRASGGGELLRGWDAVRAGAVAPALAAVSGACLAMSAMLLAEAVFMAAA-SLVR----RRPERRYSAG ------------MAPLGADAAAAAWAAVRARAVAPALTAAVWACLAMSAMLLLEAACMSLVSLVAVRLLRLRPERRFKWE ------------MAPLGADAAAAAWAAVRARAVSPALTAAVWACLAMSAMLLLEAACMSLVSLVAVRLLRLRPQRRFKWE ------------MAPLSAGAAAAAWAAVRARAVAPALTAAVWACLAMSAMLLLEAACMSLVSLVAVRLLRLRPERRFKWE -IFDGSSSSSSAAGGVSLAEVYELWVRVRGRVIAPALQVAVWACMVMSVMLVVEALYNCVV-SLGVKAVGWRPEWRFKWE -VFDGSSSSSSAAGGVSLAEVYELWVRVRGRVIAPALQVAVWACMVMSVMLVVEALYNCVV-SLGVKAVGWRPEWRFKWE -VFDGSSSSS-AAGGVSLAEVYELWVRVRGRVIAPALQVAVWACMVMSVMLVVEALYNCVV-SLGVKAVGWRPEWRFKWE -VYNG-ASYSSGAGAVSLAEVHELWVRVRGRVIAPALQVTVWACMVMSVMLAVEALYNCVV-SLGVKAIGWRPEWRFKWE CPRMGELVHAKQSAFIKGRNIHDNFLQVRQLARKLYKRKTKSVMLKLDISRAFDSLSWPFL-FEVLRVKGFSRTWRFWIA

84 479 84 82 84 87 87 90 65 65 68 68 71 68 65 65 65 68 90 68 65 93 99 93 68 68 68 130 130 129 129 287

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv

PIAASACKTGGDDEEDGIVVVG----SAAFPVVLVQIPMYNEREVYKVSIGAACALEWPSDRMVIQVLDDSTDPVVKDLV PVAASACKTGGDDEEDGIVGVGSGSGSAAFPVVLVQIHMYNER---------------------------------EDLV PITASACKTGGDDEEDGIVVVGSGSGRAAFPVVLVQIPMYNEREVYKVSIGAACALEWPSDRMVIQVLDDSTDPVVKELV PIAASACKTGGDDEEDGIVVVGSGSGSGAFPVVLVQIPMYNEREVYKVSIGAACALEWPSDRMVIQVLDDSTDPVVKELV PIAASACKAGGGDEEDGIVIVGSSSGSAAFPVVLVQIPMYNEREVYKVSIGAACALEWPSDRMVIQVLDDSTDPVVKDLV PIAAGAAAAARGDEE--AGLVGGGGGSAAFPVVLVQIPMYNEREVYKLSIGAACALEWPSDRVVIQVLDDSTDPAVKDLV PIAAGAAAAARGDEE--AGVGGGG--SAAFPVVLVQIPMYNEREVYKLSIGAACALEWPAERVVIQVLDDSTDPVVKDLV PIAASACKTGGVDEE--ASVGGGS---SAFPVVLVQIPMYNEREVYKLSIGAACALEWPSDRVVIQVLDDSTDPAVKDLV PICEDDDPE---------------LGSAAFPIVLVQIPMFNEREVYQLSIGAVCGLSWPSDRLVVQVLDDSTDPLIKEMV PICEDDDPE---------------LGSAAFPVVLVQIPMFNEREVYQLSIGAVCGLSWPSDRLVVQVLDDSTDPLVKEMV PVGE-DDPE---------------LGSAAYPMVLVQIPMYNEREVYQLSIGAACGLSWPSDRIVVQVLDDSTDPVIKELV PMGD-DDPE---------------LGSAAYPMVLVQIPMYNEREVYQLSIGAACGLSWPSDRIVVQVLDDSTDPVIKELV PMRDGDDPE---------------LGSAAYPMVLVQIPMYNEREVYQLSIGAACGLSWPSDRIIVQVLDDSTDPVVKELV PVGDGNDPE---------------LGSAAYPMVLVQIPMYNEREVYQLSIGAACGLSWPSDRIIVQVLDDSTDPVIKELV PLRD-DDPE---------------LGNAAYPMVLVQIPMYNEREVYKKSIGAACGLSWPSDRIVIQVLDDSTDPAIKELV PLRD-DDPE---------------LGNAAYPMVLVQIPMYNEREVYKKSIGAVCGLSWPSDRIVIQVLDDSTEPAIKELV PLRD-DDPE---------------LGNAAYPMVLVQIPMYNEREVYKKSIGAACGLSWPSDRIVIQVLDDSTDPAIKELV NKGGDDDVG-----------DLESGAAEDLPVVLVQIPMFNEKQVYRLSIGAACGLWWPADKLVIQVLDDSTDAGIRAMV NKGGDDDD-------------LESGAAEDLPLVLVQIPMFNEKQVYRLSIGAACGLWWPADKLVIQVLDDSTDAGIRAMV NKGDDDGGAG----------DLESGGGEDLPMVLVQIPMFNEKQVYRLSIGAACGLWWPADKLVIQVLDDSTDAGIRALV GDDDNLESD-----------------DADRPMVLVQIPMFNEKQVFRLSIGAACGLWWPADKLVIQVLDDSTDAGIRSLV PLGAQDGEDE-------------ERGLLGYPMVLVQIPMYNEREVYKLSIGAACGLSWPSDRVIVQVLDDSTDPTIKDLV

160 526 164 162 164 165 163 165 130 130 132 132 136 133 129 129 129 137 157 138 128 160

TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

PLGAQDGEDE-------------ERGLLGYPMVLVQIPMYNEREVYKLSIGAACGLSWPSDRVIVQVLDDSTDPTIKDLV PLGAQDGEDED-----------EERGLLGYPMVLVQIPMYNEREVYKLSIGAACGLSWPSDRVIVQVLDDSTDPTIKDLV PMAGALEGGEADVED-----PPASAGRREFPMVLVQIPMYNEKEVYKLSIGAVCALTWPPDRIIIQVLDDSTDPIIKELV PMPGALPGAEADAED-----PPG---RREFPMVLVQIPMYNEKEVYKLSIGAVCALTWPPDRIIIQVLDDSTDPIIKELV PMTGALEGGEADVED-----PAG---RREFPMVLVQIPMYNEKEVYKLSIGAVCALTWPPDRIIIQVLDDSTDPIIKELV PLAGDDEEKGG----------------AHYPMVLVQIPMYNELEVYKLSIGAACELQWPKDRIIVQVLDDSTDPFIKNLV PLAGDDEEKGG----------------AHYPMVLVQIPMYNELEVYKLSIGAACELQWPKDRIIVQVLDDSTDPFIKNLV PLAGDDEEKGG----------------AHYPVVLVQIPMYNELEVYKLSIGAACELQWPKDRIIVQVLDDSTDPFIKNLV PLAGD-EEKGS----------------AHYPMVLVQIPMYNELEVYKLSIGAACELKWPKDRMIVQVLDNSTDPLIKNLV TLLTTASSRVV----------------VNGCVGKKFMHACGLRQGDSISPLLFVIAMDVLSAMILKARETNAVSKIPGCA

166 162 143 140 140 194 194 193 192 351

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

KIECQRWKSKGVNIRYEVRQNRKGYKAGALKEGLMRD---------------------------------------YVRE KIECQRWKSKGVNIRYEVRQNRKGYKAGALKEGLIRD---------------------------------------YVRE KTECQRWKGKGVNIRYEVRGNRKGYKAGALKQGLMRD---------------------------------------YVRE KTECQRWKGKGVNIRYEVRGNRKGYKAGALKQGLMRD---------------------------------------YVRE KIECQRWKSKGVNIRYEVRENRKGYKAGALKQGLMRD---------------------------------------YVRE EIECQRWKGKGVNIKYEVRGNRKGYKAGALKEGLKHD---------------------------------------YVQE EIECQRWKGKGVNIKYEVRGNRKGYKAGALKEGLKHD---------------------------------------YVQE EIECQRWKGKGVNIKYEVRGNRKGYKAGALKEGLKHD---------------------------------------YVQE RMECERWAHKGINITYQIREDRKGYKAGALKAGMKHG---------------------------------------YVRE RMECERWAHKGINITYQIREDRKGYKAGALKAGMKHG---------------------------------------YVRE QVECRRWARKGVNIKYEIRDNRRGYKAGALKEGMKHG---------------------------------------YVKD RVECRRWARKGVNIKYEIRDNRRGYKAGALKEGMKHG---------------------------------------YVKD QVECQRWARKGVNIKYETRNNRRGYKAGALKEAMKHG---------------------------------------YVKD QVECRRWARKGVNIKYEIRDNRRGYKAGALKEGMKHG---------------------------------------YVKD QAECHRWANKGVNIKYEIRDNRRGYKAGALKEGMKHG---------------------------------------YVKD QVECQRWANKGVNIKYEIRDNRRGYKAGALKEGMKHG---------------------------------------YVKD QVECQRWANKGVNIKYEIRDNRRGYKAGALKEGMKHG---------------------------------------YVKD EAECRRWAGKGVHIRYENRSNRSGYKAGAMREGLKKG---------------------------------------YAKD EAECRRWAGKGVQIRYENRSNRSGYKAGAMREGLKKG---------------------------------------YAKD EAECRRWAGKGVQIRYENRSNRSGYKAGAMREGLKKG---------------------------------------YARD EAECRRWAGKGVHIRYENRSNRSGYKAGAMRDGLKKQ---------------------------------------YVKD ELECKIWAKKGKNVKYEVRNNREGYKAGALKEGMLHA---------------------------------------YVQQ ELECKIWAKKGKNVKYEVRNNREGYKAGALKEGMLHA---------------------------------------YVQQ ELECKIWAKKGKNVKYEVRNNREGYKAGALKEGMLHA---------------------------------------YVQQ ELECQEWASKKIDIKYEVRNNRKGYKAGALKKGMEHV---------------------------------------YAQQ ELECQEWASKKIDIKYEVRNNRKGYKAGALKKGMEHV---------------------------------------YAQQ ELECQEWASKKIDIKYEVRNNRKGYKAGALKKGMEHV---------------------------------------YAQQ ELECESWAVKGLNIKYATRSSRKGFKAGALKKGMECD---------------------------------------YAKQ ELECESWAVKGLNIKYATRSSRKGFKAGALKKGMECD---------------------------------------YAKQ ELECESWSVKGLNIKYATRSSRKGFKAGALKKGMEYD---------------------------------------YAKQ ELECETWVTKGLNIKYAPRSGQKGFKAGALKKGMECD---------------------------------------YARQ PIQRLSLYVDDVVMFIKPSWTDLWFVQEALRVFGEASGLKVNFSKSSAVMIRSEEEEEVLVRKAMPWKMETFPIKYLGLQ

201 567 205 203 205 206 204 206 171 171 173 173 177 174 170 170 170 178 198 179 169 201 207 203 184 181 181 235 235 234 233 431

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CKFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEFIAMFDADFQPESDFLLRTVPFLVHN---------------------------------------------------CEYMVIFDADFQPDPDFLHRTIPYLHHN---------------------------------------------------CEYMVIFDADFQPDPDFLHRTIPYLHHN---------------------------------------------------CDLVAIFDADFQPEPDFLWRAVPFLVHN---------------------------------------------------CDLVAIFDADF------LWRAVPFLVHN---------------------------------------------------CDLVAIFDADFQPEPDFLSRSVPFLVHN---------------------------------------------------CDLVAIFDADFQPEPDFLSRSVPFLVHN---------------------------------------------------CDYVVIFDADFQPEPDYLSRAMPFLVHN---------------------------------------------------CDFVVIFDADFQPEPDYLSRAMPFLIHN---------------------------------------------------CDFVVIFDADFQPEPDYLSRAMPFLIHN---------------------------------------------------CELVAVFDADFQPDADFLRRTVPVLQAD---------------------------------------------------CELVAVFDADFQPDADFLRRTVPVLQAD---------------------------------------------------CELVAVFDADFQPDADFLRRTVPVLQAD---------------------------------------------------CEFVAVFDADFQPDADFLRHTVPVLEAD---------------------------------------------------CDFLAVFDADFQPEPDFLMRTIPYLARN---------------------------------------------------CDFLAVFDADFQPEPDFLMRTIPYLARN---------------------------------------------------CDFLAVFDADFQPEPDFLMRTIPYLSRN---------------------------------------------------CEFVAIFDADFQPESDFLLKTIPFLVHN---------------------------------------------------CEFVAIFDADFQPESDFLLKTIPFLVHN---------------------------------------------------CEFVAIFDADFQPESDFLLKTIPFLVHN---------------------------------------------------CEYVAIFDADFQPEPDFLLRTVPFFVHN---------------------------------------------------CEYVAIFDADFQPEPDFLLRTVPFFVHN---------------------------------------------------CEYVAIFDADFQPEPDFLLRTVPFFVHN---------------------------------------------------CEYVAIFDADFQPEPDFLLRTIPFFVHN---------------------------------------------------LGIKQLTRSEWQPVVDQALKMMPGWQRGPVTRPGRLPLVNQVVRARPIHHLIVAEAPKRALDRVDKGCRAFFWAGSEEIQ

229 595 233 231 233 234 232 234 199 199 201 195 205 202 198 198 198 206 226 207 197 229 235 231 212 209 209 263 263 262 261 511

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv

----------------------------------------------------PDIALVQTRWKFVNSDECLLTRFQEMSL ----------------------------------------------------PDIALVQTRWKFVNSDECLLTRFLEMSL ----------------------------------------------------PDIALVQTRWKFVNSDKCLLTRFQEMSL ----------------------------------------------------PDIALVQTRWKFVNSDKCLLTRFQEMSL ----------------------------------------------------PDIALVQTRWKFVNSDECLLTRFQEMSL ----------------------------------------------------PDIALVQTRWKFVNSDECLLTRFQEMSL ----------------------------------------------------PDIALVQTRWKFVNSDECLLTRFQEMSL ----------------------------------------------------PDIALVQTRWKFVNSDECLLTRFQEMSL ----------------------------------------------------PEIALVQARWRFVNADECLMTRMQEMSL ----------------------------------------------------PEIALVQARWRFVNADECLMTRMQEMSL ----------------------------------------------------PDIALVQARWKFVNADECLMTRMQEMSL ----------------------------------------------------PDIALVQARWKFVNADECLMTRMQEMSL ----------------------------------------------------PDIALVQARWKFVNADECLMTRMQEMSL ----------------------------------------------------PDIALVQARWKFVNADECLMTRMQEMSL ----------------------------------------------------PEIALVQARWVFVNANECLMTRMQEMSL ----------------------------------------------------PEIALVQARWVFVNANECLMTRMQEMSL ----------------------------------------------------PEIALIQARWVFVNANECLMTRMQEMSL ----------------------------------------------------PAVALVQARWRFVNADECILTRIQEMSL

257 623 261 259 261 262 260 262 227 227 229 223 233 230 226 226 226 234

TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

----------------------------------------------------PSVALVQARWRFVNADECILTRIQEMSL ----------------------------------------------------PAVALVQARWRFVNADECILTRIQEMSL ----------------------------------------------------PAVALVQARWRFVNADECILTRMQEMSL ----------------------------------------------------PQIALVQARWEFVNPNECLMTRIQKMTL ----------------------------------------------------PQIALVQARWEFVNPNECLMTRIQKMTL ----------------------------------------------------PQIALVQARWEFVNPNECLMTRIQKMTL ----------------------------------------------------PKIALVQTRWKFVNYDACLMTRIQKMSL ----------------------------------------------------PKIALVQTRWKFVNYDACLMTRIQKMSL ----------------------------------------------------PKIALVQTRWKFVNYDACLMTRIQKMSL ----------------------------------------------------PEVALVQARWSFVNDTASLLTRVQKMFF ----------------------------------------------------PEVALVQARWSFVNDTASLLTRVQKMFF ----------------------------------------------------PEVALVQARWSFVNDNASLLTRVQKMFF ----------------------------------------------------PKVALVQARWSFVNGTVSLLTRIQKMFF GGQCAVAWRGVYRPKQMGGLGVVDLHKHGIALRLSLSQTSFSEQSRSSCTIQKLLLFKLSGPSNLNGTVSLLTRIQKMFF

254 235 225 257 263 259 240 237 237 291 291 290 289 591

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAINDAGGWKERTTVEDMDLAVRTALLGLKFVYVGAVKVKSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAINDAGGWKERTMVEDMDLVVRTALLGLKFVYIGAVKVKSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAINDAGGWEDRTTVEDMDLAVRTSLLGWKFVYVGAVKVKSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAINDAGGWEDRTTVEDMDLAVRTALLGWKFVYVGAVKVGSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAINDAGGWNDRTTVEDMDLAVRTALLGWKFVYVGDVKVRSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAIDDAGGWKDRTTVEDMDLAVRTALKGWKFVYVGAVKVRSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRIAAIDDAGGWKDRTTVEDMDLAVRTALKGWKFVYVGAVKVRSELPSTFKAYR DYHFKFEQEAGSIVYSFFGFNGTAGVWRISAIDDAGGWKDRTTVEDMDLAVRTALKGWKFVYVGAVKVRSELPSTFKAYR DYHFKVEQEVSSSVCAFFGFNGTAGVWRIAAVNEAGGWKDRTTVEDMDLAIRASLKGWKFVYLGDVQVKSELPSTFKAFR DYHFKVEQEVSSSVCAFFGFNGTAGVWRIAAVNEAGGWKDRTTVEDMDLAIRASLKGWKFVYLGDVQVKSELPSTFKAFR DYHFRVEQEVGSSTYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVYLGDLKVKNELPSTFKAFR DYHFKVEQEVGSSTYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVYLGDLKVKNELPSTFKAFR DYHFKVEQEVGSSTYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVYLGDLKVKNELPSTFKAFR DYHFKVEQEVGSSTYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVYIGDLKVKNELPSTFKAFR DYHFKVEQEVGSSAYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVCLGDLRVKSELPSTFKAFR DYHFKVEQEVGSSAYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVYLGDLRVKSELPSTFKAFR DYHFKVEQEVGSSAYAFFGFNGTAGVWRISALNEAGGWKDRTTVEDMDLAVRASLKGWKFVYLGDLRVKSELPSTFKAFR DYHFSVEQEVGSACHGFFGFNGTAGVWRVQALADAGGWKDRTTVEDMDLAVRASMRGWRFVYAGDVQVRNELPSSFKAYR DYHFSVEQEVGSACHGFFGFNGTAGVWRVQALADAGGWKDRTTVEDMDLAVRASMRGWRFVYAGDVQVRNELPSSFKAYR DYHFSVEQEVGSACHGFFGFNGTAGVWRVQALADAGGWKDRTTVEDMDLAVRASMRGWRFVYAGDVQVRNELPSSFKAYR DYHFSVEQEVGSAFHGFFSFNGTAGVWRLHALADAGGWKDRTTVEDMDLAVRASMRGWRFVYAGDVQVRNELPSSFKAYR DYHFKVEQEAGSSTFAFFGFNGTAGVWRISAIKEAGGWDDRTTVEDMDLAVRAGLKGWKFVYVGDVKVKSELPSNLKAYR DYHFKVEQEAGSSTFAFFGFNGTAGVWRISAIKEAGGWDDRTTVEDMDLAVRAGLKGWKFVYVGDVKVKSELPSNLKAYR DYHFKVEQEAGSSTFAFFGFNGTAGVWRISAIKEAGGWDDRTTVEDMDLAVRAGLKGWKFVYVGDVKVKSELPSNLKAYR DYHFKVEQESGSFMHAFFGFNGTAGVWRVSAINESGGWKDRTTVEDMDLAVRACLKEWEFLYVGDIRVKSELPSTFKAYR DYHFKVEQESGSFMHAFFGFNGTAGVWRVSAINESGGWKDRTTVEDMDLAVRACLKEWEFLYVGDIRVKSELPSTFKAYR DYHFKVEQESGSFMHAFFGFNGTAGVWRVSAINESGGWKDRTTVEDMDLAVRACLKEWEFLYVGDIRVKSELPSTFKAYR DYHFKVEQEAGSATFSFFSFNGTAGVWRTSAIKEAGGWKDRTTVEDMDLAVRATLKGWKFVYVGDIRVKSELPSTYKAYC DYHFKVEQEAGSATFSFFSFNGTAGVWRTAAIKEAGGWKDRTTVEDMDLAVRATLKGWKFVYVGDIRVKSELPSTYKAYC DYHFKVEQEAGSATFSFFSFNGTAGVWRAAAIKEAGGWKDRTTVEDMDLAVRATLKGWKFVYVGDIRVKSELPSTYKAYC DYHFKVEQEAGSATFAFFSFNGTAGVWRTAAIKEAGGWKDRTTVEDMDLAIRATLKGWKFIYVGDIRVKSELPSSYKAYC DYHFKVEQEAGSATFAFFSFNGTAGVWRTAAIKEAGGWKDRTTVEDMDLAIRATLKGWKFIYVGDIRVKSELPSSYKAYC

337 703 341 339 341 342 340 342 307 307 309 303 313 310 306 306 306 314 334 315 305 337 343 339 320 317 317 371 371 370 369 671

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

FQQHRWSCGPANLFKKVLVEILHNKKVSFWCKLHLLYDFFFVGKITAHTVTFIYYCFAIPVSVFFP--EIQIPLWCVVYV FQQHRWSCGPANLFKKMLVEILQNKKVSFWSKLHLLYDFFFVGKIIAHIVTFIYYCFATPVSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFKKILLDILKNKKVSFWSKLHLLYDFFFVGKIAAHTVTFIYYCFAIPVSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFKKMLLDILRKKKVSFWSKLHLLYDFFFVGKIAAHMVTFIYYCFAIPVSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFKKMLVDILENKKVSFWSKLHLLYDFFFVGKIAAHTLTFIYYCFAIPVSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFKKMLVEILESKKVSFWSKLHLLYDFFFVGKIAAHTVTFIYYCFAIPLSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFKKMLVEILESKKVSFWSKLHLLYDFFFVGKIAAHTVTFIYYCFAIPLSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFKKMLVEILENKKVSFWSKLHLLYDFFFVGKIAAHTVTFIYYCFAIPLSVFFP--EIQIPLWGVVYV FQQHRWSCGPANLFRKMLLEIVTNKKVTIWKKFHVIYNFFLVRKIVAHIVTFTFYCIIIPTTIFVP--EVHIPKWGCVYI FQQHRWSCGPANLFRKMLVEIVTNKKVTIWKKFHVIYNFFLVRKIVAHIVTFTFYCIIIPTTIFVP--EVHIPKWGCVYI YQQHRWSCGPANLFRKMVMEIIRNKKVTLWKKIHVVYSFFLVRKVVAHIVTFVFYCLVIPATVLVP--EVEVPKWGCVYI YQQHRWSCGPANLFRKMVMEIIRNKKVTLWKKIHVVYSFFLVRKVVAHIVTFVFYCLVIPATVLVP--EVEVPKWGCVYI YQQHRWSCGPANLFRKMVMEIIKNKKVTLWKKIHVVYNFFFLRKVVAHIMTFVFYCLVIPATVLVP--EVEVPKWGCVYI YQQHRWSCGPANLFRKMVMEIVRNKKVTLWKKIHVIYNFFLVRKVVAHIVTFVFYCVVERIDMVD--------------YQQHRWSCGPANLFRKMLMEIVKNQKVTLWKKIYVIYNFFLVRKIIGHILTSVFYCLVIPATVFVP--EVEIPRWGYFYI YQQHRWSCGPANLFRKMLMEIVKNQKVTLWKKIYVIYNFFFVRKIIGHILTSVFYCLVIPATVFVP--EVEIPRWGYFYI YQQHRWSCGPANLFRKMLMEIVKNQKVTLWKKIYVIYNFFFVRKIIGHILTSVFYCLVIPATVFVP--EIEIPRWGYFYI YQQHRWSCGPANLMRKMFWEIVASRQVSAWKKVHVLYGFFFVRKVVAHLVTFLFYCVVIPAYVLVGGQDVRLPKYVAMYV YQQHRWSCGPANLMRKMFWEIVASRQVSAWKKVHVLYGFFFVRKVVAHLVTFLFYCVVIPAYVLVGGQDVRLPKYVAMYV YQQHRWSCGPANLMRKMFWEIVASRQVSAWKKVHVLYGFFFVRKVVAHLVTFLFYCVVIPAYVLVGGQDVRLPKYVAMYV YQQHRWSCGPPNLMRKMFWEIVANKQVSAWKKLHVLYGFFFVRKVVAHLATFLFCCVVIPVYVLVGGQDVWLPQYVPMYV RQQHRWTCGAANLFRKMGAEILLTKEVSLWWKLYLLYSFFLVRKVVAHVVPFVLYCVVIPFSVLIP--EIKIPAWGVVYI RQQHRWTCGAANLFRKMGAEILLTKEVSLWWKLYLLYSFFLVRKVVAHVLPFVLYCVVIPFSVLIP--EIKIPAWGVVYI RQQHRWTCGAANLFRKMGAEILLTKEVSLWWKLYLLYSFFLVRKVVAHVVPFVLYCVVIPFSVLIP--EIKIPAWGVVYI HQQHRWTCGAANLFRKMGWEIVTNKGVSIWKKWHLLYSFLFVRRVIAPILTFLFYCVVIPLSAMVP--EVHIPVWGLVYI HQQHRWTCGAANLFRKMGWEIVTNKGVSIWKKWHLLYSFLFVRRVIAPILTFLFYCVVIPLSAMVP--EVHIPVWGLVYI HQQHRWTCGAANLFRKMGWEIVTNKGVSIWKKWHLLYSFLFVRRVIAPILTFLFYCVVIPLSAMVP--EVHIPVWGLVYI RQQFRWSCGGAHLFRKVAKDILTAKDVSLIKKFHMLYSFFLVRRVVAPTVACILYNIILPISVMIP--ELFLPVWGIAYI RQQFRWSCGGAHLFRKVAKDILTAKDVSLIKKFHMLYSFFLVRRVMAPTVACILYNIILPISVMIP--ELFLPVWGIAYI RQQFRWSCGGAHLFRKVAKDILTAKDVSLIKKFHMLYSFFLVRRVVAPTLACILYNIILPISVMIP--ELFLPIWGIAYI RQQFRWACGGANLFRKVAIDILTSKDVSVVKKFYMLYSFLFVRRVVAPAVACILSNIIVPLSVMIP--ELYLPVWGVAYI RQQFRWACGGANLFRKVAVDILTSKDVSVIKKFYMLYSFLFVRRVVAPAVACILSNIIVPLSVMIP--ELYLPVWGVAYI

415 781 419 417 419 420 418 420 385 385 387 381 391 375 384 384 384 394 414 395 385 415 421 417 398 395 395 449 449 448 447 749

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv

PTVITLCKALGSPSSFHLVILWVLFDNVMSLHRIKATITGLLDARRVNEWVVTEKLGDANKTEPAVEGLNDVQVIDVELS PTVITLCKALGSPSSFHLVILWVLFDNVMSLHRIKATITGLLDARRVNEWVVTEKLGDTNKTEPAMEGLNDVQVIDVELS PTVITLCKALGSPSSFHLVILWVLFDNVMSLHRIKATITGLLDTRRVNEWVVTEKLGDANKTEPAMERLDDVQVIDVELS PTVITLCKALGSPSSFHLVILWVLFDNVMSLHRIKATITGLLDTRRVNEWVVTEKLGDANKTEPAMEGLDDVQVIDVELS PTVITLCKSLGSPSSFHLVILWVLFENVMSLHRIKATITGLLDTRRVNEWVVTEKLGDANKTEPAMEGLDDVQVIDVELS PTVITLCKALGSPSSFHLVILWVLFENVMSLHRIRAAITGLLDAGRVNEWVVTEKLGDANKTKPATEVLDAVKVIDVELT PTVITLCKALGSPSSFHLVILWVLFENVMSLHRIRAAVTGLLDAGRVNEWVVTEKLGDANKTKPAMEVLDAVKVIDVELT PTVITLCKALGSPSSFHLVILWVLFENVMSLHRIKAAVTGLLDAGRVNEWVVTEKLGDANKTKPAMEALDAVKVIDVELA PTIITLLNSVGTPRSFHLLFFWILFENVMSLHRTKATLIGLLEAGRANEWVVTEKLG-----------------SAMKMK PTIITLLNSVGTPRSFHLLFFWILFENVMSLHRTKATLIGLLEAGRANEWVVTEKLG-----------------SAMKMK PTIITLLNAVGTPRSVHLVVFWVLFENVMSLHRAKATFIGLLEAGTVNEWVVTEKLG-----------------DTLKAK PTIITLLNAVGTPRSVHLVVFWVLFENVMSLHRAKATFIGLLEVGTVNEWVVTEKLG-----------------DTLKAK PAIITLLSVVGTPRSVHLVIFWALFENVMSLHRTKATFIGLLEAHTVNEWVVTEKLG-----------------DTVKTK --------------------------------------------------------------------------------

495 861 499 497 499 500 498 500 448 448 450 444 454 375

TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

PTVITLLNAVGTPRSFHLVIFWVLFENVMSLHRTKATFSGLLELGRVNEWVVTEKLG-----------------DILKMK PTVITLLNAVGTPRSFHLVIFWVLFENVMSLHRTKATFSGLLELGRVNEWVVTEKLG-----------------DVLKMK PTIITLLNAVGTPRSFHLVIFWVLFENVMSLHRTKATFSGLLELGRVNEWVVTEKLG-----------------DVLKMK PAIITLLNAVCTPRSWHLLVFWILFENVMSMHRSKATIIGLVEASRANEWVVTEKLGSV------------TS-TPAATT PAIITLLNAVCTPRSWHLLVFWILFENVMSMHRSKATIIGLVEASRANEWVVTEKLGSV------------TSSTPAATT PAIITLLNAVCTPRSWHLLVFWILFENVMSMHRSKATIIGLVEASRANEWVVTEKLGSV------------TS-TPAAAT AAVLTLLNAVCTPRSCHLLVFWILFENVMSIHRCKATIIGLLEASRANEWVVTEKLGGS------------TTSTPAAAT PTAITVLYAVRNPSSIHFIPFWILFENVMSFHRTKATFIGLLELGSVNEWVVTEKLG------------------SASNT PTAITILYAVRNPSSIHFIPFWILFENVMSFHRTKATFIGLLELGSVNEWVVTEKLG------------------SVSNT PTAITILYAVRNPSSIHFIPFWILFENVMSFHRTKATFIGLLELGSVNEWVVTEKLG------------------SVSNT PTAITVMNAIRNPGSLHLMPFWILFENVMSMHRMRAALTGLLETAHVNDWVVTEKVG-----------------DLVKDD PTAITIMNAIRNPGSLHLMPFWILFENVMSMHRMRAALTGLLETAHVNDWVVTEKVG-----------------DLVKDD PTAITIMNAIRNPGSLHLMPFWILFENVMSMHRMRAALTGLLETAHVNDWVVTEKVG-----------------DVVKDD PTVLLVVTAIRHPKNLHILPFWILFESVMTMHRMRAALSGLFELSEFNEWVVTKKTG-------------------NNFE PTVLLVVTAIRHPKNLHILPFWILFESVMTMHRMRAALSGLFELSEFNEWVVTKKTG-------------------NNFE PTVLLVVTAIRHPKNLHILPFWILFESVMTMHRMRAALSGLFELSEFNEWVVTKKTG-------------------NNFE PAVLLVVTAIRNPKNIHLLPFWILFESVMTIHRTRAALVGLFEFSEFNEWVVTKKTG-------------------NNFE PTVLLVVTAIRNPKNIHLLPFWILFESVMTIHRTRAALVGLFELTEFDEWLVTKKTG-------------------NNFE

447 447 447 461 482 462 453 477 483 479 461 458 458 510 510 509 508 810

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

TPLVPKLEKRRTRLWDKYNCSEIFVGTCIIICGCYDVLYA-NKGYYIYLFIQGLAFLVIGFEYIGTRPPNTE-------TPLVPKLEKR-------YNCSEIFVGTCIIICGCYDVLYA-NKGCYIYLFIQGVAFLVIGFEYIGTRPPGAE-------TPLVPKLEKRRTRLWHKYNCSEIFVGTFIIICGCYDVLYA-KKGYYIYLFIQGLAFLVIGFEYIGTRPPSTE-------TPLVPKLEKRRTRLWDKYNCSEIFVGTCIIICGCYDVLYA-KKGYYIYLFIQGLAFLVIGFEYIGTRPPSIE-------TPLVPKLEKRRTRLWDKYNCSEIPVGTCIIICGCYDVLYA-KKGYYIYLFIQGLAFLVVGFEYIGTRPPSAE-------TPLVPKLKKRRIRLWDKYNCSEIFVGTCIVICGFYDLFYA-NKGYYIYLFIQGLAFLVVGFEYIGTRPPTPSA------TPLVPKLKKRRIRLWDKYNCSEIFVGTCIIISGFYDLFYA-NKGYYIYLFIQGLAFLVVGFEYIGTRPPTPSAG-----TPLVPKLKKRRIKLWDKYNCSEIFVGTSIIICGFYDLFYA-NKGYYIYLFIQGLAFLVVGFEYIGTRPPTPSAE-----SANKASARKSFMRMWERLNVPELGVGAFLFSCGWYDVAFG-KDNFFIYLFFQSMAFFVVGVGYVGTIVPPS--------SANKASARKSFMRMWERLNVPELGVGAFLFSCGWYDVAFG-KDNFFIYLFFQSMAFFVVGVGYVGTIVPQS--------MPSKALK-KLRMRIGERLHLWELGVAAYLFLCGCYDISFG-NNRYFIFLFMQSIAFFIVGVGYVGTFVAQ---------MPSKALR-KLRMRIGERLHLWELGVAAYLFLCGCYDISFG-NNRYFIFLFMQSIAFFIVGVGYVGTFVAQ---------MPSKALK-KLRIGIGERLHLWELGVAAYLFICGCYSISFG-NNHYFIFLLMQSIAFFIVGVGYVGTFVTQ----------------------------------------------------------------------------------------VQSKVTK-KLRMRIRERLQLLELGVAAYIFFCGSYDLLFG-KRYYYVFLFMQSIAFFVVGVGFVGTLVPN---------VQSKVTK-KLRMRIRERLQLLELGVAAYIFFCGSYDLLFG-KRYYYIFLFMQSIAFFVVGVGFVGTLVPN---------VQSKVTK-KLRMRIRERYIIIIGLLMCISQLSYLNFNMES-GCSFWSLVLQPISSFVEVTTFCLAKDITISFSSCNPSLS TMATNKGAMKKKKSQSSILAPEIVMGLCLLYCAVYDIFFG-HDHFYVYLLMQSAAAFVIGFGYVGSQ------------TMATNKGATKKKKSQSSILAPEIVMGLCLLYCAVYDIVFG-HDHFYVYLLMQSAAAFVIGFGYVGSQ------------TMAANKGAMKKKKSQSSILAPEIVMGLCLLYCAVYDIVFG-HDHFYVYLLMQSAAAFVIGFGYVGSQ------------TTMVAK----KKKSSSSFLAPEIVMGLFLLYCALYDIVFG-HDHFYVYLLMQSAAAFVIGFGYVGSQ------------KPVPQILERPRCRFWDRWTVSELLFAVFLFVCATYNLVYG-SDFYFIYIYLQAITFIIVGTGFCGTSNS----------KPVPQILERPRCRFWDRWTVSELLFAVFLFVCATYNLVYG-SDFYFIYIYLQAITFIIVGTGFCGTSNS----------KPVPQILEKPRCRFWDRWTVSELLFAVFLFVCATYNLVYG-SDFYFIYIYLQAITFIIVGTGFCGTSNS----------FDVPLLEPLKPTECVERIYIPELLLALYLLICASYDYVLG-SQTYFMYIYLQALAFIVLGFGFVGMKTPCS--------FDVPLLEPLKPTECVERIYIPELLLALYLLICASYDYVLG-SQTYFMYIYLQALAFIVLGFGFVGTKTPCS--------FEVPLLEPLKPTECVERIYIPELLLALYLLICASYDYVLG-SQTYFMYIYLQALAFIVLGFGFVGMKTPCS--------DSEVPLLQKTRKRLRDRVNFREIVFSAFLFFCASYNLVFTGKTSYYFNLYLQGLAFVCLGLNFTGTCSCCQ--------DNEVPLLQKTRKRLRDRVNFREIVFSAFLFFCASYNLVFPGKTSYYFNLYLQGLAFVCLGLNFTGTCSCCQ--------DNEVPLLQKTRKRLRDRVNFREIVFSAFLFFCASYNLVFPGKTRYYFNLYLQGLAFVCLGLNFTGTCSCCQ--------DNKVPLLQKTRKRLRDRVNFPEILFSAFLFFCASYNLVFPGKTSYYFNLYLQGLAFAFLGLNFSGTCTCFQ--------DNKVPLLQKTRKRLRDRVNFPEILFSAFLFFCASYNLVFPGKTSYYFNLYFQGLAFAFLGLNFTGTCTCFQ---------

566 925 570 568 570 572 571 573 518 518 518 512 522 375 515 515 525 527 548 528 515 545 551 547 531 528 528 581 581 580 579 881

TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AL_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_U_TGACv1_ TRIAE_CS42_7BL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_1AS_TGACv TRIAE_CS42_7DS_TGACv TRIAE_CS42_7AS_TGACv TRIAE_CS42_7BS_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_6BS_TGACv TRIAE_CS42_6AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv

--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------LSSVWDLLGHLSPTDNALRSWCNVVRKDSV ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

566 925 570 568 570 572 571 573 518 518 518 512 522 375 515 515 555 527 548 528 515 545 551 547 531 528 528 581 581 580 579 881

Fig. S_2B: CslC subfamily.

S.No 1 2 3 4 5 6 7 8 9 10 11 12 13

Gene name with number of splice variants (CslC) TRIAE_CS42_1DL_TGACv1_062162_AA0209740.1 TRIAE_CS42_1BL_TGACv1_030501_AA0092480.1 TRIAE_CS42_5BL_TGACv1_404820_AA1311790.1_3_SPLICE TRIAE_CS42_5DL_TGACv1_435778_AA1454840.1_2_SPLICE TRIAE_CS42_5AL_TGACv1_374268_AA1195590.3_3_SPLICE TRIAE_CS42_1DL_TGACv1_061928_AA0205730.1 TRIAE_CS42_1BL_TGACv1_030750_AA0099830.1 TRIAE_CS42_1AL_TGACv1_001272_AA0028090.1 TRIAE_CS42_3DL_TGACv1_251593_AA0882850.1_3_SPLICE TRIAE_CS42_3AL_TGACv1_197197_AA0665370.1_3_SPLICE TRIAE_CS42_3DS_TGACv1_271926_AA0910940.1 TRIAE_CS42_3B_TGACv1_220758_AA0718310.2 TRIAE_CS42_3AS_TGACv1_211225_AA0686890.2

No. of amino acids (aa) 690 aa 656 aa 712 aa 708 a 703 aa 702 aa 702 aa 702 aa 704 aa 704 aa 758 aa 751 aa 750 aa

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

---MAPSFWGREAR--LSDGGGGTPVVVKMENPNWSISEMEQEAVPGSPAGLAAGK-------AGRGKNARQITWVLLLK ---MAPSFWGREAR--LSDGGGGTPVVVKMENPNWSISEMEQEAVPGSPAGLAAGK-------AGRGKNARQITWVLLLK ---MAPSFWGREAR--LSDGGGGTPVVVKMENPNWSISEMEQEPVPGSPAGLAAGK-------AGRGKNARQITWVLLLK ----MAPWWGQEARGGVSGGVTGTPVVVKMQTPDWAISEVPPPGSP-----AAGGK-------DGRGKNARQITWVLLLK ----MAPWWGQEARGGVSGGVTGTPVVVKMQTPDWAISEVPPPGSP-----AAGGK-------DGRGKNARQITWVLLLK MAPWNGLWGGRAAIAGGN-AYRDMPVIVKMENPNWSISEINGGGDNGEDFLARVGG------QRRRVKNTKQITWVFRLK -----------------------------MENPNWSISEINIDDDNSEDFLARVGG------QRRRVKNTKQITWVFRLK MAPWTGLWGARAGAGAGAGAYRGTPVVVKMENPNWSISEISPEDAEDEDFLVSGAGAARRSRKGGRGKNAKQITWVLLLK MAPWTGLWGARAGAGAG--AYRGTPVVVKMENPNWSISEISPEDAEDEDFLVSGAGAARR-RKGGRGKNAKQITWVLLLK MAPWTGLWGARAGAGAYR----GTPVVVKMENPNWSISEISPEDAEDEDFLVS----GAARRKGGRGKNAKQITWVLLLK ----------MASSWWGDKEEHGTPVVVKMDNPYSLVEIDGPGMDSSEK--------------ARRSKNAKQFKWVLLLR ----------MASSWWGDKEEHGTPVVVKMDNPYSLVEIDGPGMDSSEK--------------ARRSKNAKQFKWVLLLR ----------MASSWWGDKEEHGTPVVVKMDNPYSLVEIDGPGMDSSEK--------------ARRSKNAKQFKWVLLLR

68 68 68 64 64 73 45 80 77 72 56 56 56

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

AHRAAGRLTGAASAALAVAAAARRRVAAGRTDGDAAPG--------ESTALRARFYGCLRLFVVLSMLLLAVEVAAYLQG AHRAAGRLTGAASAALAVAAAARRRVAAGRTDGDAAPG--------ESTALRARFYGCLRLFVVLSMLLLAVEVAAYLQG AHRAAGRLTGAASAALAVAAAARRRVAAGRTDGDAAPG--------ESTALRARFYGCLRVFVVLSMLLLAVEVAAYLQG AHRAAGKLTGAATAALSVAAAARRRVAAGRTDSDADNAPPGLG---GSPALRTRLYGFIRASLLLSVLLLAADVAAHAQG AHRAAGKLTGAATAALSVAAAARRRVAAGRTDSDADADGAPPGPGAGRPALRTRLYGFIRASLLLSLLLLAADVAAHAQG AHRAAGCLARLTSAAVALGGAARRRVVAGRTDSDAADGECEDVEERDPASRRSRFYTLIKACLMMSVFLLVVELAAYSNAHRAAGCLSWLTSAAFALGGATRRRVVAGRTDSNATDGECKDVEEWAPASRRSRFYTLIKACLMMFVCLLIVELAAYSNAHRAAGCLASLASAAVTLGAAARRRVADGRTDADAGAPG-SAGES---PVLRSRFYAFIRAFLLLSLLLLAVELAARLHG AHRAAGCLASLASAAVTLGAAARRRVADGRTDADAGATPGSAGES---PVLRSRFYAFIRAFLLLSLLLLAVELAARFHG AHRAAGCLASLASAAVTLGAAARRRVADGRTDADAGAPG-PARES---PVLRSRFYAFIRAFLLLSLLLLAVELAARFHR AHRAVGCVAWLAGGFWGLLGAVNRRVRRSRDADAEPDAEASGRGR--------HMLGFLRAFLLLSLAMLAFETAAYLKG AHRAVGCVAWLAGGFWGLLGAVNRRVRRSRDADAEPDAEASGRGR--------HMLGFLRAFLLLSLAMLAFETAAYLKG AHRAVGCVAWLAGGFWGLLGAVNRRVRRSRDADAEPDAEASGRGR--------HMLGFLRAFLLLSLAMLAFETAAYLKG

140 140 140 141 144 152 124 156 154 148 128 128 128

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

W---------------------------------HLQMPEMPEMPGQLAMDGLLAVDGLAASAYAGWMRVRLQYIAPPLQ W---------------------------------HLQMPEMPEMPGQLAMDGLLAVDGLAAAAYAGWMRVRLQYIAPPLQ W---------------------------------HLQMPQMPEMPGQLAMDGLLAVDGLAAAAYAGWMRVRLQYIAPPLQ W---------------------------------HLAA-----------LPDLEAVEGLFAAGYAAWMRARAAYLGPALQ W---------------------------------HLAA-----------LPDLEAVEGLFAAGYAAWMRARAAYLGPALQ ---------------------------------------------------GRVNLAIFINSFNTSWIRFRATYVAPPLQ ---------------------------------------------------GKGNLAVFINSFNTSWIRFRAAYIAPPLQ WDLAA---------------------------------------------SALALPIIGVESLYASWLRLRAAYLAPLLQ WDLAA---------------------------------------------SALALPIIGVESLYASWLRLRAAYLAPLLQ WDLAA---------------------------------------------SALALPIIGVESLYASWLRLRAAYLAPLLQ WHYFPRDLPEHYLRQLPEHLQNLPEHLRHLPENLRHLPENLRHLPDGLRMPEQQEIQGWLHRAYVAWLAFRIDYIAWAIE WHYFPRDLPEHYLRQLPEHLQ-------NLPENLRHLPENLRHLPDGLRMPEQQEIQGWLHRAYVAWLAFRIDYIAWAIE WHYFPRDLPEHYLRQLPEHLQN-------LPEHLRHLPENLRHLPDGLRMPEQQEIQGWLHRAYVAWLAFRIDYIAWAIE

187 187 187 177 180 181 153 191 189 183 208 201 201

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv

FLTNSCVVLFMIQSVDRLVLCLGCLWIKLRGIKP---VPIAADKD--------------DVEAGDEDFPMVLVQMPMCNE FLTNSCVVLFMIQSVDRLILCLGCLWIKLRGIKP---VPIAADKD--------------DVEAGEEDFPMVLVQMPMCNE FLTNSCVVLFMIQSVDRLVLCLGCLWIKLRGIKP---VPIAADKD--------------DVEAGDEDFPMVLVQMPMCNE FLTNACVVLFMIQSADRLILCLGCFWIKLRGIRP---VPNAAAAAGNGNGKGSDDVEAGAQEE--GDFPMVLVQIPMCNE FLTNACVVLFMIQSADRLILCLGCFWIKLRGIRP---VPNAATAG---NGKGSDDVEAGAQEEEEGEFPMVLVQIPMCNE

250 250 250 252 254

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

LLADACVVLFLVQSADRLFQSLGCFYILVKRIKPKPLSPALA----------------DAEDPDAGYYPMVLVQIPMCNE LLANACVVLFLVQSADRVFQSLGCFYILVKRIKPKPLFLALS----------------DAEDPDAGYYPMVLVQIPMCNE FLTDACVVLFLIQSADRLIQCLGSFYITVKRIKPTLKSPALP----------------DAEDPDAGYYPMVLVQIPMCNE FLTDACVVLFLIQSADRLIQCLGSFYITVKRIKPRLKSPALP----------------DAEDPDAGYYPMVLVQIPMCNE FLTDACVVLFLIQSADRLIQCLGSFYITVKRIKPRLRSPALP----------------DAEDPDAGYYPMVLVQIPMCNE KLSGFCIVLFMVQSIDRILLCLGCFWIKLRGIKPGLKAAANKRGSK-----YADDDDLEDGDDLGAYFPMVLLQMPMCNE KLSGFCIVLFMVQSIDRILLCLGCFWIKLRGIKPGLKAAASKRGSK-----YADENDLEDGDDLGAYFPMVLLQMPMCNE KLSGFCIVLFMVQSIDRILLCLGCFWIKVRGIKPGLVATK-KRGNK-----YADDNDLEDGDDLGAYFPMVLLQMPMCNE

245 217 255 253 247 283 276 275

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

REVYQQSIGAICALDWPRSNFLVQVLDDSDDATTSALIKEEVEKWQREGVRIVYRHRVIRDGYKAGNLKSAMNCSYVKDY REVYQQSIGAICALDWPRSNFLVQVLDDSDDATTSALIKEEVEKWQREGVRIVYRHRVIRDGYKAGNLKSAMNCSYVKDY REVYQQSIGAICALDWPRSNFLVQVLDDSDDATTSALIKEEVEKWQREGVRIVYRHRVIRDGYKAGNLKSAMNCSYVKDY KEVYQQSIGAVCNLDWPRSNFLVQVLDDSDDAATSALIREEVEKWQREGVRVLYRHRVIRDGYKAGNLKSAMNCSYVKDY KEVYQQSIGAVCNLDWPRSNFLVQVLDDSDDAATSALIREEVEKWQREGVRILYRHRVIRDGYKAGNLKSAMNCSYVKDY KEVYRQSIAAVCNLDWPRSNFLVQVLDDSDDVATQALIKEEVEKWRHSGAHIVYRHRVLREGYKAGNLKSAMSCSYVKDY KEVYRQSIAAVCNLDWPRSNFLVQVLDDSDDVTTQALIKDEVEKWRHSGAHIVYRHRVLREGYKAGNLKSAMSCSYVKDY KEVYQQSIAAVCNLDWPRSNFLVQVLDDSDDPTTQSLIREEVAKWQQTGARILYRHRVLRDGYKAGNLKSAMACSYVKDY KEVYQQSIAAVCNLDWPRSNFLVQVLDDSDDPTTQSLIREEVAKWQQTGARILYRHRVLRDGYKAGNLKSAMACSYVKDY KEVYQQSIAAVCNLDWPRSNFLVQVLDDSDDPTTQSLIREEVAKWQQTGARILYRHRVLRDGYKAGNLKSAMACSYVKDY KEVYETSISHVCQIDWPRDRMLVQVLDDSDDETCQMLIRAEVTKWNQRGVNIIYRHRLSRTGYKAGNLKSAMSCEYVKDY KEVYETSISHVCQIDWPRDRMLVQVLDDSDDETCQMLIRAEVTKWSQRGVNIIYRHRLSRTGYKAGNLKSAMSCEYVKDY KEVYETSISHVCQIDWPRDRMLVQVLDDSDDETCQMLIRAEVTKWNQRGVNIIYRHRLSRTGYKAGNLKSAMSCEYVKDY

330 330 330 332 334 325 297 335 333 327 363 356 355

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

EYVVIFDADFQPQADFLKRAMPHFKGKDDVGLVQARWSFVNNDENLLTRLQNINLCFHFEVEQQVNGAFLNFFGFNGTAG EYVVIFDADFQPQADFLKRAMPHFKGKDDVGLVQARWSFVNNDENLLTRLQNINLCFHFEVEQQVNGAFLNFFGFNGTAG EYVVIFDADFQPQADFLKRAMPHFKGKDDVGLVQARWSFVNNDENLLTRLQNINLCFHFEVEQQVNGAFLNFFGFNGTAG EFVVIFDADFQPQEDFLKLTVPHFKGKEDVGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGAFLNFFGFNGTAG EFVVIFDADFQPQEDFLKLTVPHFKGKEDVGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGAFLNFFGFNGTAG EYVAIFDADFQPYPDFLKRTVPHFKDNEDLGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFINFFGFNGTAG EYVAIFDADFQPYPDFLKRTVPHFKDNEDLGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFINFFGFNGTAG EFVAIFDADFQPNPDFLKRTVPHFKDNDELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAG EFVAIFDADFQPNPDFLKRTVPHFKDNDELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAG EFVAIFDADFQPNPDFLKRTVPHFKDNDELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGVFLNFFGFNGTAG EFVAIFDADFQPNPDFLKLTVPHFKGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIYLNFFGFNGTAG EFVAIFDADFQPNPDFLKLTVPHFKGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIYLNFFGFNGTAG EFVAIFDADFQPNPDFLKLTVPHFKGNPELGLVQARWSFVNKDENLLTRLQNINLCFHFEVEQQVNGIYLNFFGFNGTAG

410 410 410 412 414 405 377 415 413 407 443 436 435

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

VWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSK VWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSK VWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSK VWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSK VWRIKALEDSGGWMERTTVEDMDIAVRAHLKGWKFLYLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCFVDIIKSK VWRIKAVEDSGGWMERTTVEDMDIAVRAHLKGWKFVFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCLPDIIRCK VWRIKAVEDSGGWMERTTVEDMDIAG------WKFVFLNDVECQCELPETYEAYRKQQHRWHSGPMQLFRLCLPDIIRCK VWRIKALEESGGWMERTTVEDMDIAVRAHLHGWKFIFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCIPDIIKSK VWRIKALEESGGWMERTTVEDMDIAVRAHLHGWKFIFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCIPDIIKSK VWRIKALEESGGWMERTTVEDMDIAVRAHLHGWKFIFLNDVECQCELPESYEAYRKQQHRWHSGPMQLFRLCIPDIIKSK VWRIEALEDSGGWMERTTVEDMDIAVRAHLQGWKFIYLNDVKVLCELPESYQAYRKQQHRWHSGPMQLFRLCLPAIIKSK VWRIEALEDSGGWMERTTVEDMDIAVRAHLQGWKFIYLNDVKVLCELPESYQAYRKQQHRWHSGPMQLFRLCLPAIIKSK VWRIEALEDSGGWMERTTVEDMDIAVRAHLQGWKFIYLNDVKVLCELPESYQAYRKQQHRWHSGPMQLFRLCLPAIIKSK

490 490 490 492 494 485 451 495 493 487 523 516 515

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

IGFWKKCNLIFLFFLLRKLILPFYSFTLFCVILPMTMFVPEAELPAWVVCYIPATMSIMSILPSPKSFPFIVPYLLFENT IGFWKKCNLIFLFFLLRKLILPFYSFTLFCVILPMTMFVPEAELPAWVVCYIPATMSIMSILPSPKSFPFIVPYLLFENT IGFWKKCNLIFLFFLLRKLILPFYSFTLFCVILPMTMFVPEAELPAWVVCYIPATMSIMSILPSPKSFPFIVPYLLFENT IGFWKKFNLIFLFFLLRKLILPFYSFTLFCVILPMTMFAPEAELPAWVVCYIPATMSLLNILPAPKSFPFIVPYLLFENT IGFWKKFNLIFLFFLLRKLILPFYSFTLFCVILPMTMFAPEAELPAWVVCYIPATMSLLNILPAPKSFPFIVPYLLFENT IVFWKKANLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPVLMSFLNIAPAPKSFPFIIPYLLFENT IVFWKKANLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPVLMSFLNIAPAPKSFPFIIPYLLFENT ISVWKKFNLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPALMSLLNILPSPKSFPFIIPHLLFENT ISVWKKFNLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPALMSLLNILPSPKSFPFIIPYLLFENT ISVWKKFNLIFLFFLLRKLILPFYSFTLFCIILPMTMFVPEAELPDWVVCYIPALMSLLNILPSPKSVPFIIPYLLFENT IPLWKKANLVMLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPIWVICYVPMIMSVLNILPAPKSFPFVIPYLLFENT IPLWKKANLVMLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPIWVICYVPMIMSVLNILPAPKSFPFVIPYLLFENT IPLWKKANLVMLFFLLRKLILPFYSFTLFCVILPLTMFVPEAELPIWVICYVPMIMSVLNILPAPKSFPFVIPYLLFENT

570 570 570 572 574 565 531 575 573 567 603 596 595

TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVALVEKHTVQQQQRVG-----------------------SAPDL MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVALVEKHTVQQQQRVG-----------------------SAPDL MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVALVEKHTAQQQQRVG-----------------------SAPDL MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVALVENEKQSKQLRVG-----------------------SAPNL MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLVALVENEKQSKQQRVG-----------------------SAPNL MSVTKFNAMISGLFQLGSTYEWVVTKKSGRSLEGDLISLAPKGLKQLKYG-----------------------------MSVTKFNAMISGLFQLGSTYEWVVTKKSGRSLEGDLISLAPKGLKQLKYG-----------------------------MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLISLAAAAPPRELRHHPKTGS------------------APSLEA MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLISLAAAPP-RELRHHPKTGS------------------APSMEA MSVTKFNAMISGLFQLGSAYEWVVTKKSGRSSEGDLISLAAAAPPRELRQQQKTGS------------------APSLEA MSVTKFNAMVSGLFQLGSSYEWVVTKKAGRTSSESDIFAMAEETDTATRPAPRLVRGVSEAGLEAWAKTHQLDNKDLQLK MSVTKFNAMVSGLFQLGSSYEWVVTKKAGRTSSESDIFAMAEETNTATRPAPRLVRGVSEAGLEAWAKTHQLDNKDLQLK MSVTKFNAMVSGLFQLGSSYEWVVTKKAGRTSSESDIFAMAEKTDTATRPAPRLVRGVSEAGLEAWAKTHQLDNKDLQLK

627 627 627 629 631 615 581 637 634 629 683 676 675

TRIAE_CS42_1DL_TGACv AGLAAKDSSLPKKDAPKKKQKHNRIYRKELALSFLLLTAAARSVLSAQGIHFYFLLFQGVSFLVMGLDLIGEQVE 702 TRIAE_CS42_1BL_TGACv AGLAAKDSSLPKKDAPKKKQKHNRIYRKELALSFLLLTAAARSVLSAQGIHFYFLLFQGVSFLVMGLDLIGEQVE 702

TRIAE_CS42_1AL_TGACv TRIAE_CS42_3DL_TGACv TRIAE_CS42_3AL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1BL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3AS_TGACv

AGLAAKDSSLPKKDAPKKKQKHNRIYRKELALSFLLLTAAARSVLSAQGIHFYFLLFQGVSFLVMGLDLIGEQVE DSLAAKEELYPKAEPKPKKKKHNRLYRKELALSFLLLTAAARSLLSVQGIHFYFLLFQGVSFLVVGLDLIGEQVE DSLAAKEELYPKSEP--KKKKHNRLYRKELALSFLLLTAAARSLLSVQGIHFYFLLFQGVSFLVVGLDLIGEQVE SVPAINVAIKEQSKAKKESKKYNRIYKKELAMSLLLLSAAARSLLSKQGIHFYFLLFQGISFLLVGLDLIGQDIK SVPAINVAIKEKLKAKKESKKYNRIYKKELAMSLLLLSAAIRSLLSKQGIHFYFLLFQGISFLLVGLDLIGQDIK LMVLKEQQPSPKKEGKKQQKKHNRIYKKELALSLLLLTAAARSLLTKQGIHFYFLLFQGISFLLVGLDLIGEQVE LMVLKEQ-PSPKKEGKKQQKKHNRIYKKELALSLLLLTAAARSLLTKQGIHFYFLLFQGISFLLVGLDLIGEQVE LMVLKEEQASPRKEGKKQ-KKHNRIYKKELALSLLLLTAAARSLLTKQGIHFYFLLFQGISFLLVGLDLIGEQVE AQAEEVTSLAAAIKKTSKAKPPNRIFKKELALAFLLLIAATRSLLSAQGLHFYFLLFQGVTFLVVGLDLIGEQVS AEAEEVTSLAAAIKKTSKAKPPNRIFKKELALAFLLLIAATRSLLSAQGLHFYFLLFQGVTFLVVGLDLIGEQVS AEAEEVTSLAAAIKKTSKAKPPNRIFKKELALAFLLLIAATRSLLSAQGLHFYFLLFQGVTFLVVGLDLIGEQVS

702 704 704 690 656 712 708 703 758 751 750

Fig. S_2C: CslD subfamily. S.No 1 2 3 4 5 6 7 8 9 10 11 12

Gene name with number of splice variants (CslD) TRIAE_CS42_2BS_TGACv1_148683_AA0494520.1 TRIAE_CS42_2DS_TGACv1_177279_AA0572180.1 TRIAE_CS42_2AS_TGACv1_114244_AA0365360.1 TRIAE_CS42_1BL_TGACv1_030586_AA0094860.1 TRIAE_CS42_1AL_TGACv1_001700_AA0034150.2 TRIAE_CS42_1DL_TGACv1_063091_AA0223780.1 TRIAE_CS42_1BS_TGACv1_049706_AA0160220.1 TRIAE_CS42_5BS_TGACv1_425241_AA1392650.1 TRIAE_CS42_5DS_TGACv1_457675_AA1488780.1 TRIAE_CS42_7BL_TGACv1_577301_AA1871610.1 TRIAE_CS42_7AL_TGACv1_559436_AA1799630.1 TRIAE_CS42_7DL_TGACv1_603510_AA1985050.1

No. of amino acids (aa) 1121 aa 1120 aa 1120 aa 1189 aa 1146 aa 1014 aa 330 aa 1022 aa 989 aa 994 aa 993 aa 994 aa

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

--------------------MGSKGILKNSGSSRMPPHGPSKPPTAPTSAPQVVFGRRTESGRFISYSRDDLDS-EISSV --------------------MGSKGILKNSGSSRVPPHGPSKPPTAPTSAPQVVFGRRTESGRFISYSRDDLDS-EISSV --------------------MGSKGILKNSGSSRVPPHGPSKPPTAPTSAPQVVFGRRTESGRFISYSRDDLDS-EISSV -----------------------------MSKAPRNPGGGSAGAPKSSSGQPVKFARRTPSGRYLSLSREDIDMEGEMGP -----------------------------MSKAPRNPGGGSAGAPKSSSGQPVKFARRTPSGRYLSLSREDIDMEGEMGP -----------------------------MSKAPRNPGGGSAGAPKSSSGQPVKFARRTPSGRYLSLSREDIDMEGEMGP -----------------------------------------------------------------------------MAS -----------------------------------------------------------------------------MAS -----------------------------------------------------------------------------MAS -------------------------------------------------------------------------------MSRRLSLPAGSPVTVTVSPTKGKGAGGGSPGDGVVRRGSGLTSPVPRHSIGSSTATLQVSPVRRSGGSRYASRDGADASA MSRRLSLPASSPVTVTVSPTRGKGAGGGSPGDGVVRRGSGLTSPVPRHSIGSSTATLQVSPVRRSGGSRYASRDGADASA

59 59 59 51 51 51 3 3 3 0 80 80

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

DFQDYHVHIPMTPDNQPMEED------GTKADEQYVSSSLFTGGFNSVTRAHVMD--KQGPDSDIGRSGPKGSICMVEGC DFQDYHVHIPMTPDNQPMEED------GTKADEQYVSSSLFTGGFNSVTRAHVMD--KQGPDSDMGRSGPKGSICMVEGC DFQDYHVHIPMTPDNQPMEED------GTKADEQYVSSSLFTGGFNSVTRAHVMD--KQGPDSDMGRSGPKGSICMVEGC DYANYTVHIPPTPDNQPMKDGAERTAVAMKAEEQYVSNSLFTGGFNSVTRAHLMDRVIDSDVKHPQMAGARPARCAMPAC DYANYTVHIPPTPDNQPMKDGAEPTAVAMKAEEQYVSNSLFTGGFNSVTRAHLMDRVIDSDVKHPQMAGAKATRCAMPAC DYANYTVHIPPTPDNQPMKDGSEPTAVAMKAEEQYVSNSLFTGGFNSVTRAHLMDRVIDSDVKHPQMAGAKATRCAMPAC DHTNYTVFMPPTPDNQPGAAPAPASGGSTKPDNLPLP--RYTSGSKLVNRRSGDDGAAGGAKMDRGLS-----------DHTNYTVFMPPTPDNQPGAAPTPASGGSTKPENLPLP--RYTSGSKLVNRRSGDDGAAGGAKMDRWLS-----------DHTNYTVFMPPTPDNQPGAASAPASGGPTKPDNLPLP--RSS-GSKLVNRRSGDDGAAGGGKMDRRLS-----------------------------------------------------------------------------------MSCKMRGC EFVHYTVHIPPTPDRTTASASTDVPAAEEEGEVLPQRSYVSGTIFTGGLNCATRAHVLSNSADGARPAASANMSCKMRGC EFVHYTVHIPPTPDRNTASASTDAPVAEEEGEVLPQRSYVSGTIFTGGLNCTTRAHVLSNSADGARPAASVNMSCKMRGC

131 131 131 131 131 131 69 69 68 8 160 160

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

DSKIMRNGRGEDILPCECDFKICVDCFTDAVKGGGGVCPGCKELYKHTEWEEVLSNSSNELTRALSLPHGPGGKMERRLS DSKIMRNGRGEDILPCECDFKICVDCFTDAVKGGRGVCPGCKELYKHTEWEEVLSNSSNELTRALSLPHGPGGKMERRLS DSKIMRNGRGEDILPCECDFKICVDCFTDAVKGGGGVCPGCKELYKHTEWEEVLSNSSNELTRALSLPHGPGGKMERRLS DGKVMRNERGEEIEPCECRFKICRDCYLDAQKDGCLCPG----------------------CKEHYKIGDYADDDTHDVS DGKVMRNERGEEIDPCECRFKICRDCYLDAQKDGCLCPG----------------------CKEHYKIGDYADDDPHDVS DGKVMRNERGEEVDPCECRFKICRDCYLDAQKDGCLCPG----------------------CKEHYKIGDYADDDPHDVS --------------------------------------------------------------------------TEHVAS --------------------------------------------------------------------------TEQVAS --------------------------------------------------------------------------PVQVAS DMLALAATRP----------MICEECYMDCVAASGNCPGCKEAYSAGSDTDDSVDEDDDDAISSSEERDQMPMTSMSKRF DMPAFLNAGRGGHPPCDCGFMICEECYMDCVAAAGNCPGCKEAYSAGSDTDDSVDEDDDDAISSSEERDQMPMTSMSKRF DMPAFLNAGRGGRPPCDCGFMICEECYMDCVAAAGNCPGCKEAYSAGSDTDDSVDEDDDDAISSSEERDQMPMTSMSKRF

211 211 211 189 189 189 75 75 74 78 240 240

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

LVKQGTMNNQS----------GEFDHNRWLFETKGTYGYGNAIWPD-------DNVDDDGRNGVPGHPKELMSKPWRPLT LVKQGTMNNQS----------GEFDHNRWLFETKGTYGYGNAIWPD-------DNVDDDGRNGVPGHPKELMSKPWRPLT LVKQGTMNNQS----------GEFDHNRWLFETKGTYGYGNAIWPD-------DNVDDDGRNGVPGHPKELMSKPWRPLT AGKSLLARNQN----------GEFDHNRWLFESSGTYGYGNAFMPKG--GMYEDDLDEDGAAGDD-GMQDMNQKPFKPLT SGKSLLARNQN----------GEFDHNRWLFESSGTYGYGNAFMPKG--GMYEDDLDEDGVGGDG-GMQDMNQKPFKPLT AGKSLLARNQN----------GEFDHNRWLFESSGTYGYGNAFMPKG--GMYEDDLDEDGAGGDGGMPADLSQKPFKPLT PSKSLLVRSQT----------GEFDHNRWLFETQGTYGIGNAYWPQ------DENDDGAGMGGGSVKMEDLVDKPWKPLS PSKSLLVRSQT----------GEFDHNRWLFETQGTYGIGNAYWPQ------DDNDDGAGMGGGSVKMEDLVDKPWKPLS PSKSLLVRSQT----------GEFDHNRWLFETQGTYGIGNAYWPQ------EDNDDGAGMGGGSVKMEDLVDKPWKPLS SMVHSIKMPMSSSND----KPADFDHARWLFETKGTYSYGNALWPENEHGGGGNNAGATFGFVGIEEPPNF--------SMVHSIKMPMPSSNG----KPADFDHARWLFETKGTYGYGNALWPKNEHGGGGNNAGATSGFVGIEEPPNFGARCRRPLT SMVHSIKMPMPSSNGNGGGKPADFDHARWLFETKGTYGYGNALWPKNEHGGGGNTAGATSGFVGIEEPPNFGARCRRPLT

274 274 274 256 256 257 139 139 138 145 316 320

TRIAE_CS42_1BL_TGACv RKLQIPAAVISPYRLLVLIRLVALAFFLMWRIKHQNDDAIWLWGMSIVCELWFAFSWVLDQLPKLCPINRATDLSVLKEK 354 TRIAE_CS42_1DL_TGACv RKLQIPAAVISPYRLLVLIRLVALAFFLMWRIKHQNDDAIWLWGMSIVCELWFALSWVLDQLPKLCPINRATDLSVLKEK 354

TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

RKLQIPAAVISPYRLLVLIRLVALAFFLMWRIKHQNDDAIWLWGMSIVCELWFALSWVLDQLPKLCPINRATDLSVLKEK RKIPMPASIISPYRIFIVIRFFVLIFYLTWRIRNPNMEALWLWGMSIVCELWFAFSWLLDMLPKVNPINRSTDLAVLKEK RKIPMPTSIISPYRIFIVIRFFVLIFYLTWRIRNPNMEALWLWGMSIVCELWFAFSWLLDMLPKVNPINRSTDLAVLKEK RKIPMPTSIISPYRIFIVIRFFVLIFYLTWRIRNPNMEALWLWGMSIVCELWFAFSWLLDMLPKVNPINRSTDLAVLKEK RKVAIPPGILSPYRLLVLVRFVALFLFLIWRATNPNPDAMWLWGISIVCEYWFALSWLLDQMPKLNPINRAADLAALREK RKVAIPPGILSPYRLLVLVRFVALFLFLVWRATNPNPDAMWLWGISIVCEYWFALSWLLDQMPKLNPINRAADLAALREK RKVAIPPGILSPYRLLVLVRFVALFLFLVWRATNPNPDAMWLWGISIVCEYWFALSWLLDQMPKLNPINRAADLAALREK -------------------------------------------------------------------------------RKTSVSQAILSPYRMLIAIRLVALGFFLAWRIRHPNPDAMWLWALSVTCEVWFAFSWLLDSLPKLCPVNRSCDLDVLADR RKTSVSQAILSPYRMLIAIRLVALGFFLAWRIRHPNPDAMWLWALSVTCEVWFAFSWLLDSLPKLCPVNRSCDLDVLADR

354 336 336 337 219 219 218 145 396 400

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

FETPTPSNPTGKSDLPGIDIFVSTADPEKEPVLVTANTILSILAVDYPVDKLACYVSDDGGALLTFEAMAEAASFANFWV FETPTPSNPTGKSDLPGIDIFVSTADPEKEPVLVTANTILSILAVDYPVDKLACYVSDDGGALLTFEAMAEAASFANFWV FETPTPSNPTGKSDLPGIDIFVSTADPEKEPVLVTANTILSILAVDYPVDKLACYVSDDGGALLTFEAMAEAASFANFWV FETPSPSNPHGRSDLPGLDVFVSTADPEKEPVLTTANTILSILAVDYPVEKLACYVSDDGGALLTFEAMAEAASFANIWV FETPSPSNPHGRSDLPGLDIFVSTADPEKEPVLTTANTILSILAVDYPVEKLACYVSDDGGALLTFEAMAEAASFANIWV FETHSPSNPHGRSDLPGLDVFVSTADPEKEPVLTTANTILSILAVDYPVEKLACYVSDDGGALLTFEAMAEAASFANIWV FESKTPSNPTGRSDLPGLDVFISTADPYKEPPLVTANTLLSILATDYPVEKLFVYISDDGGALLTFEAMAEACAYAKVWV FESKTPSNPTGRSDLPGLDVFISTADPYKEPPLVTANTLLSILATDYPVEKLFVYISDDGGALLTFEAMAEACAYAKVWV FESKTPSNPTGRSDLPGLDVFISTADPYKEPPLVTANTLLSILATDYPVEKLFVYISDDGGALLTFEAMAEACAYAKVWV -------------------------------------------------------------------------------FELPTARNPKGRSDLPGIDVFVSTADPEKEPPLVTANTILSILAADYPVEKLACYLSDDGGALLTFEALAETASFARTWV FELPTARNPKGRSDLPGIDVFVSTADPEKEPPLVTANTILSILAADYPVEKLACYLSDDGGALLTFEALAETASFARTWV

434 434 434 416 416 417 299 299 298 145 476 480

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

PFCRKHDIEPRNPDSYFNLKRDPFKNKVKADFVKDRRRIKREYDEFKVRVNGLPDSIRRRSDAYHAREEIQAMNLQREKI PFCRKHDIEPRNPDSYFNLKRDPFKNKVKADFVKDRRRIKREYDEFKVRVNGLPDSIRRRSDAYHAREEIQAMNLQREKI PFCRKHDIEPRNPDSYFNLKRDPFKNKVKADFVKDRRRIKREYDEFKVRVNGLPDSIRRRSDAYHAREEIQAMNLQREKI PFCKKHDIEPRNPDSYFALKGDPTKGKRRSDFVKDRRKVKREYDEFKVRINGLPDSIRRRSDAFNAREDMKML----KHL PFCKKHDIEPRNPDSYFALKGDPTKGKRRSDFVKDRRKVKREYDEFKVRINGLPDSIRRRSDAFNAREDMKML----KHL PFCKKHDIEPRNPDSYFALKGDPTKGKRRSDFVKDRRKVKREYDEFKVRINGLPDSIRRRSDAFNAREDMKML----KHL PFCRKHSIEPRNPEAYFTQKGDPTKGKKRPDFVKDRRWIKREYDEYKVRINDLPEAIKRRAKAMNAHERKIAR----ETA PFCRKHSIEPRNPEAYFTQKGDPTKGKKRPDFVKDRRWIKREYDEYKVRINDLPEAIKRRAKAMNAHERKIAR----ETA PFCRKHSIEPRNPEAYFTQKGDPTKGKKRPDFVKDRRWIKREYDEYKVRINDLPEAIKRRAKAMNAHERKIAR----ETA -------------------------------------------------------------------------------PFCRKHGVEPRCPESYFGQKRDFLKNKVRLDFVRERRKVKREYDEFKVRVNSLTEAIRRRSDAYNAGEELRARRRLQEEA PFCRKHGVEPRCPESYFGQKRDFLKNKVRLDFVRERRKVKREYDEFKVRVNSLTEAIRRRSDAYNAGEELRARRRLQEEA

514 514 514 492 492 493 375 375 374 145 556 560

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

KAGGDEQFEPV----KIPKATWMADSTHWPGTWIHSSQDHARGDHAGIIQVMLKPPSDMPMYG--NIEK-SPLDFSEVDT KAGGDEQFEPV----KIPKATWMADSTHWPGTWIHSSQDHARGDHAGIIQVMLKPPSDMPMYG--NIEK-SPLDFSEVDT KAGGDEQFEPV----KIPKATWMADSTHWPGTWIHPSQDHARGDHAGIIQVMLKPPSDMPMYG--NIEK-SPLDFSGVDT RETGADPSEQP----KVKKATWMADGTHWPGTWAVSSPDHAKGNHAGILQVMLRPPSPDPLYG--MHDEDQLIDYSDVDT RETGADPSEQP----KVKKATWMADGTHWPGTWAVSSPDHAKGNHAGILQVMLRPPSPDPLYG--MHDEDQLIDYSDVDT RETGADPSEQP----KVKKATWMADGTHWPGTWAVSSPDHAKGNHAGILQVMLRPPSPDPLYG--MHDEDQLVDYSDVDT AAS---SDAAP----PPVKATWMADGTHWPGTWLDSAPDHGKGDHASIVQVMIKNPHHDVVYG--EADDHAYLDFTNVDV AAS---SDAAP----PPVKATWMADGTHWPGTWLDSAPDHGKGDHASIIQVMIKNPHHDVVYG--DADDHAYLDFTNVDV AAS---SDAAP----PPVKATWMADGTHWPGTWLDSAPDHGKGDHASIVQVMIKNPHHDVVYG--DADDHAYLDFTNVDV -------------------ATWMSDGSQWASTWLAGATDHARGNHAGIIQ-----------------------------VAAGGALGTAPLAETGAVKATWMSDGSQWPGTWLTGATDHARGNHAGIIQAMLAPPTSEPVLGGEPAESGALIDTTGVDI VAAGGALGAAPLAETGAVKGTWMSDGSQWPGTWLTGATDHARGDHAGIIQAMLAPPTSEPVLGGVPAESGALIDTTGVDI

587 587 587 566 566 567 446 446 445 176 636 640

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

RLPMLVYMSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYVYNSKAFREGMCFMMDRGGDRLCYVQFPQRF RLPMLVYMSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYVYNSKAFREGMCFMMDRGGDRLCYVQFPQRF RLPMLVYMSREKRPGYDHNKKAGAMNALVRASAIMSNGPFILNLDCDHYVYNSKAFREGMCFMMDRGGDRLCYVQFPQRF RLPMLVYMSREKRPGYDHNKKAGAMNALVRCSAVMSNAPFILNFDCDHYINNNQAVREAMCFMMDRGGERICYIQFPQRF RLPMLVYMSREKRPGYDHNKKAGAMNALVRCSAVMSNAPFILNFDCDHYINNNQAVREAMCFMMDRGGERICFIQFPQRF RLPMLVYMSREKRPGYDHNKKAGAMNALVRCSAVMSNAPFILNFDCDHYINNTQAIREAMCFMMDRGGERICYIQFPQRF RIPMFVYLSREKRPGYDHNKKAGAMNAMVRASAVLSNGPFMLNFDCDHYVYNCQAIREAMCYMLDRGGDRICYIQFPQRF RIPMFVYLSREKRPGYDHNKKAGAMNAMVRASAVLSNGPFMLNFDCDHYVYNCQAIREAMCYMLDRGGDRICYIQFPQRF RIPMFVYLSREKRPGYDHNKKAGAMNAMVRASAVLSNGPFMLNFDCDHYVYNCQAIREAMCYMLDRGGDRICYIQFPQRF ------------RPGYNHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYVHNSAALREGMCYMLDCRGDRVCYVQFPQRF RLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYVHNSAALREGMCYMLDRGGDRVCYVQFPQRF RLPMLVYVSREKRPGYDHNKKAGAMNALVRTSAIMSNGPFILNLDCDHYVHNSAALREGMCYMLDRGGDRVCYVQFPQRF

667 667 667 646 646 647 526 526 525 244 716 720

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

EGIDPSDRYANHNTVFFDINMRALDGLQGPVYVGTGCLFRRIALYGFDPPRSKDHSPGFCGCCLPRRRKASASNANPEET EGIDPSDRYANHNTVFFDINMRALDGLQGPVYVGTGCLFRRIALYGFDPPRSKDHSPGFCGCCLPRRRKASASNANPEET EGIDPSDRYANHNTVFFDINMRALDGLQGPVYVGTGCLFRRIALYGFDPPRSKDHSPGFCGCCLPRRRKASASNANPEET EGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCMFRRFALYGFDPPRTAEYTG------WLFKKKKVTNFKDPESD EGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCMFRRFALYGFDPPRTAEYTG------WLFKKKKVTNFKDPDSD EGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCMFRRFALYGFDPPRTAEYTG------WLFKKKKVTNFKDPESD EGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCLFRRYAIYGFNPPRAVEYHG------LVG-QTRVPIDPHARSG EGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCLFRRYAIYGFNPPRAVEYHG------LVG-QTRVPIDPHARSG EGIDPSDRYANHNTVFFDGNMRALDGLQGPMYVGTGCLFRRYAIYGFNPPRAVEYHG------LVG-QTRVPIDPNARSG EGIDPNDRYANHNLVFFDVAMRAMDGLQGPMYVGG-CIFRRIALYGFSPPRATKHHGWLG-RKIKLFLRKPTMGKKTDRE EGIDPNDRYANHNLVFFDVAMRAMDGLQGPMYVGTGCIFRRTALYGFSPPRATEHHGWLGRKKIKLFLRKPTMGKKTDRE EGIDPNDRYANHNLVFFDVAMRAMDGLQGPMYVGTGCIFRRTALYGFSPPRATEHHGWLGRKKIKLFLRKPTTGKKTDRE

747 747 747 720 720 721 599 599 598 322 796 800

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv

MALRMGDFDGDS---------MNLATFPKKFGNSSFLIDSIPVAEFQGRPLADHPSVKNGRPPGALTIPREILDASIVAE MALRMGDFDGDS---------MNLATFPKKFGNSSFLIDSIPVAEFQGRPLADHPSVKNGRPPGALTIPREILDASIVAE MALRMGDFDGDS---------MNLATFPKKFGNSSFLIDSIPVAEFQGRPLADHPSVKNGRPPGALTIPREILDASIVAE TQKLKAEDFDAE---------LTAQLVPRRFGNSSAMLASIPIAEFQARPIADHPAVLHGRPPGTLTVPRPPLDPPTVAE TQQLKAEDFDAE---------LTAQLVPRRFGNSSAMLASIPIAEFQARPIADHPAVLHGRPPGTLTVPRPPLDPPTVAE

818 818 818 791 791

TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

TQQLKAEDFDAE---------LTAQLVPRRFGNSSAMLASIPIAEFQARPIADHPAVLHGRPPGTLTVPRPPLDPPTVAE DGVADELRPLSD---------HPDHEAPQRFGKSKMFIESIAVAEYQGRPLADHPSVRNGRPAGALLMPRPPLDAATVAE DGIADELRPLSD---------HPDHEAPQRFGKSKMFIESIAVAEYQGRPLADHPSVRNGRPAGALLMPRPPLDAATVAE DGVADELRPLSD---------HPDHEAPQRFGKSKMFIESIAVAEYQGRPLADHPSVRNGRPPGALLMPRPPLDAATVAE LVMAILQK-----------------------------------------------------------------------SEHESMLPPIEDDDHNQLGDGVRGDLLLPQQRTVRHPADEAPAARGLLQRGHVPVHLHVPHRLLRAPGRLPLHRQVHRPA SEHESMLPPIEDDDHNQLGDIESSALMPKRFGSSATFVSSIPVAEYQGRLLQDMPGVHQGRPAGALAVPREPLDAATVGE

792 670 670 669 330 876 880

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

AISVVSCWYEEKTEWGTRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTQRDAFRGTAPINLTDRLHQVLRWATGSVEIFF AISVVSCWYEEKTEWGTRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTQRDAFRGTAPINLTDRLHQVLRWATGSVEIFF AISVVSCWYEEKTEWGTRVGWIYGSVTEDVVTGYRMHNRGWKSVYCVTQRDAFRGTAPINLTDRLHQVLRWATGSVEIFF AVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRSVYWISKRDAFLGTAPINMTDRLHQVLRWATGSVEIFF AVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRSVYWISKRDAFLGTAPINMTDRLHQVLRWATGSVEIFF AVSVISCWYEDKTEWGDRVGWIYGSVTEDVVTGYRMHNRGWRSVYWISKRDAFLGTAPINMTDRLHQVLRWATGSVEIFF AVSVISCWYEDNTEWGLRVGWIYGSVTEDVVTGYRMHNRGWRSVYCITKRDAFRGTAPINLTDRLHQVLRWATGSVEIFF AVSVISCWYEDNTEWGLRVGWIYGSVTEDVVTGYRMQNRGWRSVYCITKRDAFRGTAPINLTDRLHQVLRWATGSVEIFF AVSVISCWYEDNTEWGLRVGWIYGSVTEDVVTGYRMHNRGWRSVYCITKRDAFRGTAPINLTDRLHQVLRWATGSVEIFF -------------------------------------------------------------------------------PERHVPRLPAHHHHHAVPAGAAGDQVVRDHAARVVAQRAVLGDRRHQRAPGCGAAGPPQGDRRRGHLLHAHVQAGRRRRR AISVISCFYEEKTEWGRRIGWIYGSVTEDVVTGYRMHNRGWRSVYCVTRRDAFRGTAPINLTDRLHQVLRWATGSVEIFF

898 898 898 871 871 872 750 750 749 330 956 960

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

SRNNALFASSKMKVLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLIITITLCLLAMLEIKWS SRNNALFASSKMKVLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLIITVTLCLLAMLEIKWS SRNNALFASSKMKVLQRIAYLNVGIYPFTSIFLIVYCFLPALSLFSGQFIVQTLNVTFLTYLLIITVTLCLLAMLEIKWS SRNNAFLASRKLMFLQRVAYLNVGIYPFTSIFLLTYCFIPALSLFSGFFIVQTLNVAFLFYLLTITVTLIALGILEVKWS SRNNAFLASRKLMFLQRVAYLNVGIYPFTSIFLLTYCFIPALSLFSGFFIVQTLNVAFLFYLLTITITLIALGILEVKWS SRNNAFLASRKLMFLQRVAYLNVGIYPFTSIFLLTYCFIPALSLFSGFFIVQTLNVAFLFYLLTITVTLIALGILEVKWS SKNNAMLASRRLMFLQRMSYINVGIYPFTSLFLIMYCLLPALSLFSGQFIVATLDPTFLCYLLLITVTLVLLCLLEVKWS SKNNAMLASRRLMFLQRMSYINVGIYPFTSLFLIMYCLLPALSLFSGQFIVATLDPTFLCYLLLITVTLVLLCLLEVKWS SKNNALLASRRLMFLQRMSYINVGIYPFTSLFLIMYCLLPALSLFSGQFIVATLDPTFLCYLLLITITLVLLCLLEVKWS -------------------------------------------------------------------------------GGGHVRGAVRGAVELPDGAPRDHHDAERGGAGGGDGEDAVQRVPAVEQAAGRRLLQLLGAVPPLPL-------------SRNNALFATRRMKLLQRVAYFNVGMASSR---------------------------------------------------

978 978 978 951 951 952 830 830 829 330 1022 989

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

GIALEEWWRNEQFWLIGGTSAHLAAVMQGLLKVVAGIEISFTLTSKQVGDDIDDEFAELYEVKWTSLMIPPLTIIMVNLV GIALEEWWRNEQFWLIGGTSAHLAAVMQGLLKNSTK-------------------------------------------GIALEEWWRNEQFWLIGGTSAHLAAVMQGLLKVVAGIEISFTLTSKQVGDDIDDEFAELYEVKWTSLMIP---------GIELEDWWRNEQFWLISGISAHLYAVVQGLLKVMAGIEISFTLTAKAAAEDNEDIYADLYVVKWSSLLIP---------GIELEDWWRNEQFWLISGISAHLYAVVQGLLKVMAGIEISFTLTAKAAADDNEDIYADLYVVKWSSLLIP---------GIELEDWWRNEQFWLISGISAHLYAVVQGLLKVMAGIEISFTLTAKAAAEDNEDIYADLYVVKWSSLLIP---------GIGLEEWWRNEQFWVIGGTSAHLAAVLQGLLKVAAGIEISFTLTAKAAAEDDDDPFAELYLIKWTSLFIP---------GIGLEEWWRNEQFWVIGGTSAHLAAVLQGLLKVAAGIEISFTLTAKAAAEDDDDPFAELYLIKWTSLFIP---------GIGLEEWWRNEQFWVIGGTSAHLAAVLQGLLKVAAGIEISFTLTAKAAAEDDDDPFAELYLIKWTSLFIP-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

1058 1014 1048 1021 1021 1022 900 900 899 330 1022 989

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

AIAVGFSRTIYSTDIDDEFAELYEVKWTSLMIPPLTIIMVNLVAIAVGFSRTIYSTIPQWSKLLGGVFFSFWVLAHLYPF ----------------------------------------------------------------------------------------------------------------PLTIIMVNLVAIAVGFSRTIYSTIPQWSKLLGGVFFSFWVLAHLYPF ---------------------------------PITIGMLNIIAIAFAFARTIYSDNPRWGKFIGGGFFSFWVLAHLNPF ---------------------------------PITIGMLNIIAIAFAFARTIYSENPRWGKFIGGGFFSFWVLAHLNPF ---------------------------------PITIGMLNIIAIAFAFARTIYSDNPRWGKFIGGGFFSFWVLAHLNPF ---------------------------------PLAIIGINIIAMVVGVSRCVYAEIPQYSKLLGGGFFSFWVLAHYYPF ---------------------------------PLAIIGINIIAMVVGVSRCVYAEIPQYSKLLGGGFFSFWVLAHYYPF ---------------------------------PLAIIGINIIAMVVGVSRCVYAEIPQYSKLLGGGFFSFWVLAHYYPF ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

1138 1014 1095 1068 1068 1069 947 947 946 330 1022 989

TRIAE_CS42_1BL_TGACv TRIAE_CS42_1DL_TGACv TRIAE_CS42_1AL_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_1BS_TGACv TRIAE_CS42_5BS_TGACv TRIAE_CS42_5DS_TGACv

AKGLMGRRGRTPTIVYVWAGLVSITISLLWIAINPPSSAANQQLGGSFSFP---------------------------------------------------AKGLMGRRGRTPTIVYVWAGLVSITISLLWIAINPPSTAANQQLGGSFSFPAKGLMGRRGKTPTIIFVWSGLISITISLLWVALSPPEANSTGGARGGGFQFP AKGLMGRRGKTPTIIFVWSGLISITISLLWVALSPPEANSTGGARSGGFQFP AKGLMGRRGKTPTIIFVWSGLISITISLLWVALSPPEANSTGGARGGGFQFP AKGLMGRRGRTPTIVYVWAGLISITVSLLWITISPPDDRVSQSGIEV----AKGLMGRRGRTPTIVYVWAGLISITVSLLWITISPPDDRVSQSGIEV----AKGLMGRRGRTPTIVYVWAGLISITVSLLWITISPPDDRVSQSGIEV--------------------------------------------------------------------------------------------------------------------------------------------------------------

1189 1014 1146 1120 1120 1121 994 994 993 330 1022 989

Fig. S_2D: CslE subfamily. S.No 1 2 3 4 5 6 7 8 9 10

Gene name with number of splice variants (CslE) TRIAE_CS42_6DL_TGACv1_526558_AA1687090.1 TRIAE_CS42_6AL_TGACv1_471004_AA1500600.1_3_SPLICE TRIAE_CS42_6BL_TGACv1_499967_AA1596110.2 TRIAE_CS42_U_TGACv1_683314_AA2158770.1 TRIAE_CS42_5DL_TGACv1_433536_AA1415840.1 TRIAE_CS42_5BL_TGACv1_406235_AA1342610.1 TRIAE_CS42_5AL_TGACv1_376126_AA1232370.2 TRIAE_CS42_5DL_TGACv1_433536_AA1415830.1_2_SPLICE TRIAE_CS42_5BL_TGACv1_406235_AA1342600.1_2_SPLICE TRIAE_CS42_6DS_TGACv1_543277_AA1737920.1

No. of amino acids (aa) 738 aa 737 aa 736 aa 446 aa 756 aa 728 aa 728 aa 728 aa 734 aa 725 aa

Color Align Conservation results TRIAE_CS42_5DL_TGACv MVAIGRRTGQQHGHWRLAAESPPYLGPRDGEEHEAVRDGDSRGPGGVQAPRRHGGRRILLLLYYRATRVPAAGEGRAAWL TRIAE_CS42_5BL_TGACv ----------------------------MERSRRLFETETHGGRAAYRLHAVTVAAGILLVLYYRATHVPAAGEGRATWL TRIAE_CS42_U_TGACv1_ -------------------------------------------------------------------------------TRIAE_CS42_5AL_TGACv -----------------------------MERTRLFETETHGGRAAYRLHAVTVAAGILLLLYYRATRVPAAGEGRAAWL TRIAE_CS42_6DS_TGACv ------------------------------MERRLFETVRHGGRALYRLHAVTVAASTLLVLYYRATRVPGSGGRRAAWL TRIAE_CS42_5DL_TGACv ----------------------------MERTRRLFETETHGGRAAYRLHAVTVAAGILLLLYYRATHVPAAGEGRAAWL TRIAE_CS42_5BL_TGACv ----------------------------MERSRRLFETETHGGRAVYRLHAVTVAAGILLLLYYRATRVPAAGEGRAAWL TRIAE_CS42_6AL_TGACv --------------------MAGSSVSGGGGRPPLFATEKPKRVLAYRVYAGTIFAGILLIWFYRATHIPARGSSSLGWR TRIAE_CS42_6BL_TGACv ---------------------MAGSSVSGGGRPPLFATEKPKRVLAYRLYAGTIFAGILLIWFYRATHIPERGDSSLGWR TRIAE_CS42_6DL_TGACv -------------------MAGSSVSGGGGGRPPLFATEKPKRVLAYRLYAGTIFAGILLIWFYRATHIPARGSSSLGWR

80 52 0 51 50 52 52 60 59 61

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

G---MLAAELWYAAYWVVTQSVRWSPVRRRPFIDRLAARHG-ETLPCVDIFVCTADPYSEPPSLVVSTILSLMAYNYPPE G---MLAAELWYAAYWVVTQSVRWSPVRRRPFIDRLAARHG-ERLPCVDIFVCTADPYSEPPSLVVSTILSLMAYNYPPE -------------------------------------------------------------------------------G---MLAAELWYAAYWVVTQSVRWSPVRRRPFRDRLAARHG-ERLPSVDIFVCTADPYSEPPSLVVSTILSLMAYNYPPE G---MLAAELWYAAYWVVTQSVRWSPVRRCTFRDRLTARYG-DRLPGVDIFVCTADPLSEPPSLVISTILSVMAYNYLAE G---MLAAELWYAAYWAVTQSVRWSPVRRLPFIDRLAARYG-ERLPCVDIFVCTADPHSEPPSLVISTVLSLMAYNYPAE G---MLAAELCYAAYWVVTQSVRWSPLHRRPCRDRLAARYG-ERLPCVDIFVCTADPHSEPPSLVISTVLSLMAYNYPAE AGLGLLVAEILFGLYWVLTLSVRWNPVRRTTFKDRLSERYDDDQLPGVDIFVCTADPALEPPMLVISTVLSVMAYDYPPE AGLGLLVAELLFGLYWVLTLSVRWNPVRRTTFKDRLSERYDDDQLPGVDIFVCTADPALEPPMLVISTVLSVMAYDYPPE AGLGLLVAELWFGLYWVLTLSVRWNPVRRATFKDRLSERYDDDQLPGVDIFVCTADPALEPPMLVISTVLSVMAYDYPPE

156 128 0 127 126 128 128 140 139 141

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

KLSVYLSDDGGSILTFYGMWEASLFAKHWLPFCKRYNIEPRSPAAYFSESDGHQELCNPKEWSLIKDMFDKMTERIDTVV KLSVYLSDDGGSILTLYGMWEASLFAKHWLPFCKRYNIEPRSPAAYFSESDGHQELCTPKEWSLIKDMFDKMTERIDTAV -------------------------------------------------------------------------------KLSVYLSDDGGSILTYYGMWEASLFAKHWLPFCKRYNIEPRSPAAYFSQSDGHQELCTPKEWSLIKDMFDEMTERIDTAV KLSVYLSDDGGSVLTFYAMWEASLFAKHWLPFCKRYNIEPRSPAAYFSES--YQDLCTPKEWSFIKDMYDEMTERIDTAV KISVYLSDDGGSVLTFYALWEASLFAKHWIPFCKRYNIEPRSPAAYFSESDGHQDLCSPKEWSLIREMYEDMTERIDTAV KISVYLSDDGGSILTFYALWEASLFAKHWIPFCKRYNIEPRSPATYFSESDGHQDMCTPKEWSLIREMYEDMTERIDTAA KLNIYLSDDAGSAVTFYALHEASEFAKHWIPFCKNYKVEPRSPAAYFAEGATPHDACSPQELLRMKELYKDLTDRVNSVV KLNIYLSDDAGSAVTFYALHEASEFAKHWIPFCKNYKVEPMSPAAYFAEGATPHDACSPQELLRMKELYKDLTDRVNSVV KLNIYLSDDAGSAVTFYALHEASEFAKHWIPFCKNYKVEPRSPAAYFAKGATPHDACSPQEFLRMKELYKDLTDRMNSVV

236 208 0 207 204 208 208 220 219 221

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

MSGKVPEEIKASHKGFYEWNQEITSKNHQPIVQILIDSKDQNAVDNEGKVLPTLVYMAREKRPQHHHNFKAGAMNALIRV MSGKVPEEIKARHKGFYEWNQEISSKNHQPIVQILIDGKDQNAVDNEGKVLPTLVYMAREKRPQHHHNFKAGAMNALIRV ---------------------------------------------------------------------------MQIRV MSGKVPEEIKARQKGFHEWNQEITSKNHQPIVQILIDGKDQNAVDNEGNVLPTLVYMAREKRPQHHHNFKAGAMNALIRV ISRKIPEEIRSNHKGFYEWNPEITSKNHQPIVQVLIDGKDQKGVDSEGNVLPTLVYMAREKRPQHHHNFKAGAMNALIRV LSGKISEEVKANHKGFHEWDQENTSKNHQPIVQILIEGKDKNANDDEGNVLPTLVYMAREKRPQHHHNFKAGAMNALIRV LSGKISEEVKENHKGFHEWDQENTSKNHQPIVQILIEGKDKNANDDEGNVLPTLVYMAREKRPQHHHNFKAGAMNALIRV HSGKIPEVPECNHRGSSEWNEMITSGDHPSIVQILIDRNKRKAVDVDGNALPKLVYMSREKRPQEQHHFKAGSLNALIRV HSGKIPEVPECNHRGFSVWNETITSGDHPSIVQILIDRNKRKAVDVDGNALPKLVYMAREKRPQEQHHFKAGSLNALIRV HSGKIPEVPECNHRGFSEWNETITSGDHPSVVQILIDRNKRKAVDVDGNALPKLVYMAREKRPQEQHHFKAGSLNALIRV

316 288 5 287 284 288 288 300 299 301

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv

SSVISNSPIIMNVDCDMYSNNNDAVRDALCFFLDEEMGHKIGFVQYPQNYYNLSKNNIYGNSLHVINEVEMGGMDSLGGP SSVISNSPIIMNVDCDMYSNNNDAVRDALCFFLDEEMGHKIGFVQYPQNYNNLSKNNIYGNSLHVINEVEMGGMDSLGGP SSVISNSPIIMNVDCDMYSNNNDAVRDALCFFLDEEMGHKIGFVQYPQNYNNLSKNDIYGNSLHVINEVEMGGMDSLGGP SSVISNSPIIMNVDCDMYSNNNDAVRDALCFFLDEEMGHKIGFVQYPQNYNNLSKNDIYGNSLQVINEVEMAGMDSLGGP SSVISNSPVIMNVDCDMYSNNNDTIRDALCFFLDEEMGHKIGFVQYPQNYNNMTKNNLYGNSLHVINEVEMGGMDSLGGP SSVISNSPIIMNVDCDMYSNNCDTIRDALCFFLDEEMGHKIGFVQFPQNYNNLTKNNIYGNSHQVTNQVLMGGMDSVGGP SSVISNSPIIMNVDCDMYSNNYDTIRDALCFFLDEEMGHKIGFVQFPQNYNNLTKNNIYGNSHQVTNQVLMGGMDSVGGP SSVISNSSVILNVDCDMYSNNSESIRDALCFFLDEEQGQDIGFVQYPQNFDNVVHNDIYGNPINVVNELDNPCLDGWGGM SSVISSSPVILNVDCDMYSNNSESIRDALCFFLDEEQGQDIGFVQYPQNFDNVVHNDIYGNPINVVNELDNTCLDGWGGM

396 368 85 367 364 368 368 380 379

TRIAE_CS42_6DL_TGACv SSVISNSPVILNVDCDMYSNNSESIRDALCFFLDEEQGQDIGFVQYPQNFDNVVHNDSYGNPINVVNELDNPCLDGWGGM 381 TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

MYIGTGCFHRREILCGRKFTEDYQEDWNAGIKDKLQES-IDETEEKAKSLAACTYEHGTQWADEIGVKYGCVVEDVNTGL MYIGTGCFHRREILCGRKFTEDYQEDWNAGIKDKLQES-IDETEEKAKSLAACTYEHGTQWGDEIGVKYGCVVEDVNTGF LYIGTGCFHRREILCGRKFTKDYQEDWNAGIKDKLQES-IDETEEKAKSLAACTYEHGTQWGDEIGVKYGCAVEDVITGL LYIGTGCFHRREILCGRKFTKDYQEDWNAGIKDKLQES-IDETEEKAKSLAACTYEHGTQWGDEIGVKYGCAVEDVITGL LYVGTGCFHRREILRGKRFTKDYQEDWNGGIKGRIQDTSIGEIEEKAKSLATCTYEHDTQWGDEIGLKYGCPVEDVITGL MYVGTGCFHRREILCGRRFTEDYKEDWNGGIKDKTQES-IVEIEEKAKSLAASTYEHDTQWGDEIGIKYGYPAEDIVTGL MYVGTGCFHRREILCGRRFTKDYKEDWDGGIKDKTQES-IDEIEEKAKSLAASTYEHDTQWGDEIGIKYGYPAEDIVTGL CYYGTGCFHRRETLSGQIYSKDYKEDWDRGVGIAENAD---ELEETSKSLVTCTYEHNTPWGIEKGVRYGCPLEDVITGL CYYGTGCFHRRETLSGQIYSKDYKEDWARGVGIAENAD---ELEETSKSLVTCTYEHNTPWGIEKGVRYGCPLEDVITGL CYYGTGCFHRRETLSGQIYSKDYKEDWARGVGIAENAD---ELEETSKSLVTCTYEHNTPWGIEKGVRYGCPLEDVITGL

475 447 164 446 444 447 447 457 456 458

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

AIHCRGWDSVYNNPKRPAFMGVGPTTLAQTILQHKRWSEGIFSIFLSKYNVFLFAHGKTKLRHQMGYHIYGLWAPNSLAT AIHCRGWESVYNNPKKPAFMGVGPTTLAQTILQHKRWSEGIFSIFLSKYNVFLFAHGKTKLQHQMGYHIYGLWAPNSLAT AIHCRGWESVYNNPKKPAFMGVGPTTLAQTILQHKRWSEGNLSIFLSKYNVFLFGHGKTKLRHQMGYHIYGLWAPNSLAT AIHCRGWESVYNNPKKPAFMGVGPTTLAQTILQHKRWSEGNLSIFLSKYNVFLFGHGKTKLRHQMGYHIYGLWAPNSLAT AIHCRGWESVYSNPTRPAFMGLGPTTLAQTLLQHKRWSEGSFSIFLSKYCPFLFGHGKTNLRHQMGYCIYGLWAPNSLPT GIHCRGWKSVHSNPPRPAFLGVAPTTLAQTLLQHKRWSEGSFSIFLSKYCPFMFGHGKIKLRHQMGYSIYGLWAPNSIPT EIHCRGWKSVHSNPPRPAFLGVAPTTLAQTLLQHKRWSEGSFSIFLSKYCPFMFGHGKIKLRHQMGYSIYGLWAPNSIPT QIQCRGWRSVYYNPARKGFLGMAPTSLGQILVQHKRWSEGFLQISLSNYSPFLLGHGKIKLGLQMGYSVCGFWALNSFPT QIQCRGWRSVYYNPARKGFLGMAPTSLGQILVQHKRWSEGFLQISLSNYSPFLLGHGKIKLGLQMGYSVCGFWALNSFPT QIQCHGWRSVYYNPARKGFLGKAPTSLGQILVQHKRWSEGFLQMSLSNYSPFLLGHGKIKLGLQMGYSVCGFWALNSFPT

555 527 244 526 524 527 527 537 536 538

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

LYYVIIPSLALLKGTSLFPEITSPWIAPFVYVFCVKNMYSLYEALLSGDTLKGWWNGQRMWLVKRITSYLFGVLDNLRKL LYYVIIPSLALLKGTSLFPEITSPWIAPFVYVFCVKNMYSLYEALSSGDTLKGWWNGQRMWLVKRITSYLFGVLDNLRKL LYYVIIPSLALLKGTPLFPEITSPWIAPFVYVFCVKNMYSLYEALSSGDTLKGWWNGQRMWLVKRITSYLFGVLDNLRKL LYYVIIPSLALLKGTPLFPEITSPWIAPFVYVFCVKNMYSLYEALSSGDTLKGWWNGQRMWLVKRITSYLFGVLDNLWKL LYYVVIPSLALLKGIPLFPKITSPWIAPFIYAFCVKNMYSLYEALSCGDTLKGWWNGQRMWLVKRITSFLFGAIDTVRKL LYYVIIPSLALLKGISLFPEITSPWMSPFIYVLCVKNMYSLYEALSCGDTLKGWWNEQRMWMVRRITSYLYGLTDTVRKL LYYVIIPSLALLKGISLFPEITSPWISPFIYVVCVKNMYSLYEALSCGDTLKGWWNEQRMWMVRRITSYLYGLTDTVRKL FYYVIIPSLCFLSGVSVFPEITSPWCIPFIYVVVASYSWSLMESLQCGDTAVEWWNAQRMWLMRRTTSYLLAAIDTIGGM FYYVIIPSLCFLSGVSVFPEITSPWCIPFIYVVAASYSWSLMESLQCGDTAVEWWNAQRMWLMRRTTSYLLAAIDTIGGM FYYVIIPSLCFLSGVSVFPEITSPWCIPFIYVVVASYSWSLMESLQCGDTAVEWWNAQRMWLMRRTTSYLLAAIDTIGGM

635 607 324 606 604 607 607 617 616 618

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

LGLSKINFVVTPKVSDEDESKRYEQEIMEFGSSDPEYVIIAAIALLNLVCLMGGLSKVMKGGWN-VHLDALFPQLILCGM LGLSKMNFVVTPKVSDEDESKRYEQEIMEFGSSDPEYVIIGTITLLNLVCLLGGLSKVMKGGWN-EHLDALFPQLILCGM LGLSKMNFVVSPKVSDEDESKRYEQEIMEFGSSDPEYVIIGTITLLNLVCLLGGLSKVMKVGWNNIHLDALFPQLILCGM LGLSKMNFVVSPKVSDEDESKRYEQEIMEFGSSDPEYVIIGTITLLNLVCLLGGLSKVMKVGWNNIHLDALFPQLILCGM LGLSKMTFVVTPKVSIEDESKRYEEEIMEFGSSTPEYVIIATIALLNLVCLLGGLSQIMTGAWN-IHLDAFSPQLILCGM LGLSKMTFAVTSKVSEESESKRYEQEIMEFGSSAPEYVIIATVALLNLNCLVVGLCQIMTGGWN-ILLNVFSPQLILCGM LGLSKMTFAVTSKVSEENESKRYEQEIMEFGSSAPEYVIIATVALLNLICLVGGLSQILTGGWN-ILLNVFSPQLILCGM LGVSESGFELTVKVDESQALERYKKGKMEFGPISGMFVIITTIALFNLVCLVVGLGRALLREGT-AGLGPLFLQAVLCVA LGVSESGFELTVKVDESQALERYKKGKMEFGPISGMFVIITTIALFNLVCLVLGLGRVVLREGA-AGLGPLFLQAVLCVA LGVSESGFELTVKVDESQALERYKKGKMEFGPISGMFVIITTIALFNLVCLVLGLGRVLLRGGA-EGLGPLFLQAVLCVA

714 686 404 686 683 686 686 696 695 697

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_5AL_TGACv TRIAE_CS42_6DS_TGACv TRIAE_CS42_5DL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_6AL_TGACv TRIAE_CS42_6BL_TGACv TRIAE_CS42_6DL_TGACv

LVITSIPFYEAMFLRKDKGRIPFPVTLASIGFVMLALLAKIV-----VVITSIPFYEAMFLRKDKGRIPFPVTLASIGFVMLALLATIV-----VVITSIPFYEAMFLRKDKGRIPFPVTLASIGFVMLALLPAIV-----VVITSIPFYEAMFLRKDKGRIPFPVTLASIGFVMLALLPAIV-----LVITNMPFYEAMFLRNDKGKIPFTVTLASFGFVMLDFLVPIV-----LVITNIPFYEAMFVRKDKGRIPFSVTLASIGFAILALLVPIV-----LVITNIPFYEAMFVRKDKGRMPFSVTLASISFAMLALLVPIVYNSRYI IVVINAPVYEALFIRRDSGSLPYFVTLVSLCFVSSLCLQAI------IVVINAPVYEALFIRRDSGSLPYFVTLVSLCFVSSLCLQAI------IVVINAPVYEALFIRRDSGGLPYFVTLVSLCFVSSLCLQAI-------

756 728 446 728 725 728 734 737 736 738

Fig. S_2E: CslF subfamily. S.No 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29

Gene name with number of splice variants (CslF) No. of amino acids (aa) TRIAE_CS42_2DL_TGACv1_159781_AA0542640.1 845 aa TRIAE_CS42_7BL_TGACv1_580651_AA1914920.1 614 aa TRIAE_CS42_7AL_TGACv1_557532_AA1782680.1 837 aa TRIAE_CS42_7DL_TGACv1_602590_AA1961740.1 835 aa TRIAE_CS42_2AL_TGACv1_094713_AA0301960.1 865 aa TRIAE_CS42_2DL_TGACv1_160109_AA0546890.1 862 aa TRIAE_CS42_2BL_TGACv1_130934_AA0420130.1 663 aa TRIAE_CS42_2DS_TGACv1_178985_AA0603230.1 870 aa TRIAE_CS42_2AS_TGACv1_112790_AA0345230.1 878 aa TRIAE_CS42_2BS_TGACv1_148027_AA0489970.1 877 aa TRIAE_CS42_2AS_TGACv1_113659_AA0359050.1 847 aa TRIAE_CS42_2DS_TGACv1_177641_AA0581710.2_2_SPLICE 847 aa TRIAE_CS42_2BS_TGACv1_148608_AA0494060.1 851 aa TRIAE_CS42_U_TGACv1_641498_AA2096480.1 857 aa TRIAE_CS42_2BS_TGACv1_146146_AA0456710.1 754 aa TRIAE_CS42_2DS_TGACv1_179076_AA0604160.1_2_SPLICE 783 aa TRIAE_CS42_2AS_TGACv1_112322_AA0335290.1 878 aa TRIAE_CS42_2BS_TGACv1_147667_AA0486240.1_2_SPLICE 877 aa TRIAE_CS42_2DS_TGACv1_177329_AA0573830.1 875 aa TRIAE_CS42_2BS_TGACv1_148916_AA0495580.1 701 aa TRIAE_CS42_2DS_TGACv1_178471_AA0596060.1 701 aa TRIAE_CS42_2AS_TGACv1_112322_AA0335280.1 897 aa TRIAE_CS42_5BL_TGACv1_409916_AA1366600.2_2_SPLICE 815 aa TRIAE_CS42_5DL_TGACv1_433902_AA1424880.1 808 aa TRIAE_CS42_5AL_TGACv1_374191_AA1193100.1_2_SPLICE 807 aa TRIAE_CS42_7BL_TGACv1_577473_AA1876170.1 941 aa TRIAE_CS42_7AL_TGACv1_555973_AA1751470.1 899 aa TRIAE_CS42_7DL_TGACv1_607937_AA2011180.1 498 aa TRIAE_CS42_1BS_TGACv1_049866_AA0163180.1 856 aa

Color Align Conservation results TRIAE_CS42_5DL_TGACv -----------------------------------------------------------MSMTYITKKHDYAATLDEKEP TRIAE_CS42_5AL_TGACv -----------------------------------------------------------MSMTYITKKHDYVASLDGKES TRIAE_CS42_5BL_TGACv -----------------------------------------------------------MSMTYISKKHDYAATLDEKEQ TRIAE_CS42_2AS_TGACv -----------MTTSPATHDGAATGLSEPLLPNRNGVHAGALVVTPVVANGHGGGDKLKGDLKAKDKYWKDVDQPDDVAA TRIAE_CS42_2BS_TGACv -----------MTTSPATAAGAATGLSEPLLSNGNGVHAGALVVTPVVANGHGG-DKLKGDLKAKDKYWKDVDQPDDVAA TRIAE_CS42_2DS_TGACv -----------MTTSPATDAGAATGLSEPLLSNRNGVHAGALVVTPVAANGHGG--------KAKDKYWKDVDQPGDMAV TRIAE_CS42_2AS_TGACv ------------MASAAGAGGANAGLADPLLAS------------------------AKKPVGAKGKHWVAADK-DQRRA TRIAE_CS42_2DS_TGACv ------------MASAAGAGGANAGLADPLLAS------------------------AKKPVGAKGKHWVAADK-DQRRA TRIAE_CS42_2BS_TGACv ------------MASAVGAGGANAGLADPLLASRD--------------------GGAKKPVGAKGKHWVAADK-DQRRA TRIAE_CS42_2DL_TGACv -------------------------------MAAAVTRRSNALRVDVPGGEAVAVSVAADSPVAKRGLGAKDDVWVAADE TRIAE_CS42_2BL_TGACv -------------------------------------------------------------------------------TRIAE_CS42_2AL_TGACv ------------------------------MMAAAVTRRSNALRVDVPDGDAVAVSVVADSPVAKRGLGAKEDVWVAVDE TRIAE_CS42_2DL_TGACv -------------------------------MAAAVTRRVNALRVEVPDG-----NADTANAPAAKRILDAKDDVWVSAD TRIAE_CS42_2BS_TGACv --------------------------MAAAVTRRANALRVEAPDGNTESGRASLAADSPVAKRAVDAKDDVWVAADEGEA TRIAE_CS42_2DS_TGACv --------------------------MAAAVTRRANALRAEAPDGNAESGRASLAADSPAAKRAVDAKDDVWVAADEGDT TRIAE_CS42_7AL_TGACv --------------------------------------------MPLRVEALVATDTASAAAAEGRRAKDDVWVAEEGDM TRIAE_CS42_7DL_TGACv --------------------------------------------MALRVEALVATDTAAAEGR--RAKDDVWVAAEEGDM TRIAE_CS42_7BL_TGACv ----------------------------------------------------MATDTVADAAEGRRARDDVWVAAEEGDM TRIAE_CS42_U_TGACv1_ ------------------------------MPSPAAVGGGRLADPLLAAD-------VVVGAKDKYWVPADEREILASQK TRIAE_CS42_1BS_TGACv ------------------------------MASPAAGGGGRLADPLLATD-------VVVGPKDKYWVPADEREILASHR TRIAE_CS42_2AS_TGACv --------------------------MVSPATGGGRGGNAGLAEPLLATNDDSDGAKHVFGAKAKHWVPADEKEMAASRE TRIAE_CS42_2BS_TGACv --------------------------MVSPATSGGRGGNAGLADPLLATNDDSDGARHVFGAKAKYWAPADEKEMTASRE TRIAE_CS42_2DS_TGACv ----------------------------MVSPAASGGGNAGLADPLLATNDNSEGARHVFGAKAKYWVPADEKEIAASRE TRIAE_CS42_2BS_TGACv -------------------------------------------------------------------------------TRIAE_CS42_2DS_TGACv -------------------------------------------------------------------------------TRIAE_CS42_2AS_TGACv ----MGSLAAANGAGHASNGAGVADQALALENGTGNGHKAGVANRATPPLQANGGSKVAKKISPKDKYWVAADEGEMAAA TRIAE_CS42_7AL_TGACv MAPAVAGGGRVRSNEPAAAAAPAAASGKPCVCGFQVCACTGSAAVASAASSLDMDIVAMGQIGAVNDESWVGVELGEDGE TRIAE_CS42_7DL_TGACv -------------------------------------------------------------------------------TRIAE_CS42_7BL_TGACv MAPAVAGGGRVRSNEP----AAAAASDKPCVCGFQVCACTGSAAVASAASSLDMDIVAMGQIGAVNDESWVGVELGEDGE

21 21 21 69 68 61 43 43 47 49 0 50 44 54 54 36 34 28 43 43 54 54 52 0 0 76 80 0 76

TRIAE_CS42_5DL_TGACv SEDQKSASVKNLLVRTTKLTTVTIKLYRLMVFVRLTIFVLFFKWRVSTALTVISDGTTTARAMWTMSIAGELWFALMWVL 101 TRIAE_CS42_5AL_TGACv PEHEKSASVERLLVRTTKLTTVTIKLYRLVVFVRMIIFVLFFKWRSSTALAMISDGTTTVRAMWTMSIAGELWFALMWVL 101

TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

PKDQKSASVESLLVRTTKLTTVTIKLYRIMVFVRMAIFVLFFKWRISTALAMISDGATTVRAMWTMPIAGELWFALMWVL APDLENGGGRPLLFSNRRVKNIILYPYRVLILIRVIAVILFVGWRIK-------HNNSDVMWFWVMSVVADVWFSLSWLS APDLENGGGRPLLFSNRRVKNIILYPYRVLILIRVIAVILFVGWRIK-------HNNSDVMWFWVMSVVADVWFSLSWLS APDLENGGGRPLLFSNRRVKNIILYPYRVLILIRVIAVILFVGWRIK-------NNNSDVMWFWVISVVADVWFSLSWLS AKESGGEDGRPLLFRTYKVKGTLLHPYRALIFIRLIAVLLFFVWRIK-------HNKSDVMWFWTMSVVGDVWFGFSWLL AKESGGEDGRPLLFRTYKVKGTLLHPYRALIFIRLIAVLLFFVWRIK-------HNKSDIMWFWTLSVVGDVWFGFSWLL AKESGGEEGRPLLFRTYKVKGTLLHPYRALIFIRLIAVLLFFVWRIK-------HNKSDIMWFWTMSVVGDVWFGFSWLL GGIMSGDGNRPLLFRTMKVKGSILHPYRFLMLVRLVAVVAFFKWHVE-------HKNQDSVWLWTASMTADPWFGFSWLL -------------------------------------------------------------------------------GG-MSGDGNRPLLFRTMKVKGSILHPYRFLMLMRLVAVVIFFKWRME-------HKNHDGVWLWTVSMTADVWFGFSWLL DGTSAGNGNQPLLFRTMKVKGSILHPYRFLILVRLVAVAAFFAWRLE-------HRNHDGTWLWATSMVADAWFGFSWLL SGSIAGDGNRTPLFRTFKVKGSILHPYRFMILVRLVAIVAFFAWRVK-------HKNHDGVWLWATSMVADVWFGFSWLL SGAIAGDGNRPPLFRTFKVKGSILHPYRFMILVRLVAIVAFFAWRVK-------HKNHDGVWLWATSMVADVWFGFSWLL SGASAG---RPLLFRTMKVKGSILHPYRFLILVRLVAIVAFFAWRVE-------HRNHDGTWLWATSMVADAWFGFSWLL SGASAG---RPLLFRTMKVKGSILHPYRFLILVRLVAIIAFFAWRVE-------HRNHDGMWLWATSMVADAWFGFSWLL PEASAG---RPLLFRTMKVKGSILHPYRFLILVRLVAIVAFFAWRVE-------HRNHDGVWLWATSMVADAWFGFSWLL SGAG-EDGRAPLLYRTFRVKGPLINLYRLLTLVRVIVVILFFTWRMR-------HRDSDAMWLWWISVVGDLWFGVTWLL SGAGGDDGRAPLLYRTFRVKGPLINLYRLLTLVRVIVVILFFTWRMR-------HRDSDAMWLWWISVVGDLWFGVTWLL CGGE---DGRPLLYRTFKVRGFLVNTYRFLNLARLTAVIVFFAWRVQ-------HPDSDAMWLWWISVVGDFWFGLSWWL CSGE---DGRPLLYRTFKVKGFLVNTYRFLNLARLTAVIVFFAWRVQ-------HPDSDAMWLWWISVVGDFWFGLSWWL CGGE---DGRPLLYRTFKVKGMLVNTYRFLNLARLTAVIVFFAWRVQ-------HPDSDAMWLWWISVVGDFWFGLSWWL --------------------------------------------------------------------------------------------------------------------------------------------------------------IADGGEDGRRPLLYRTFKVKGILLHPYRLLSLIRLVAIVLFFVWRVR-------HPYADGMWLWWISMVGDLWFGVTWLL TDESGVAVDDRPVFRTEKIKGVLLHPYRVLIFVRLIAFTLFVIWRIS-------HKNPDAMWLWVTSICGEFWFGFSWLL -------------------------------------------------------------------------------TDESGAAVDDRPVFRTEKIKGVLLHPYRVLIFVRLIAFTLFVIWRIS-------HKNPDAMWLWVTSICGEFWFGFSWLL

101 142 141 134 116 116 120 122 0 122 117 127 127 106 104 98 115 116 124 124 122 0 0 149 153 0 149

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

DQLPKMQPVRRTVYVTALE--------EPRLPTMDVFVTTTDPEKEPPLVTVNTILSILAADYPPDKLTCYVSDDGGALL DQLPKMQPVRRTVYATALE--------ESLLPAMDVFVTTADPEKEPPLVTVNTILSILAADYPPDKLTCYVSDDGGALL DQLPKMQPVRRTVFATALE--------EPLLPTMDVFVTTADPEKEPPLVTVNTILSILAADYPPDKLTCYVSDDGGALL YQLPKYNPIKMIPDLATLRKQFDTPGRSSQLPGIDVIVTTASATDEPILYTMNGVLSILAADYHIGRCNCYLSDDSGSLV YQLPKYNPIKMIPDLATLRKQFDTPGSSSQLPGIDVIVTTASATDEPILYTMNCVLSILAADYHIGRCNCYLSDDSGSLV YQLPKYNPIKMIPDLATLRKQFDTPGRSSQLPGIDVIVTTASATDEPILYTMNCVLSILAADYHIGRCNCYLSDDSGSLV NQLPKFNPVKTIPDMVALRRQYDLPDGTSTLPGIDVFVTTADPIDEPILYTMNCVLSILASDYPVDRCACYLSDDSGALI NQLPKFNPVKTIPDMVALKRQYDLPDGTSTLPGIDVFVTTADPIDEPILYTMNCVLSILASDYPVDRCACYLSDDSGALI NQLPKFNPVKTIPDMVALRRQYDLPDGTSTLPGIDVFVTTADPIDEPILYTMNCVLSILASDYPVDRCACYLSDDSGALI NQLPKLNPIKRVP---DLADRHD----DATLPRIDVFVTTVDPVDEPVLYTVNTILSILAADYPIDNYACYISDDGGTLV -------------------------------------------------------------------------------NQLPKLNPIKRVPDLAALADRHD----DATLPGIDVFVTTVDPVDEPVLYTVNTILSILAADYPVDNYACYLSDDGGTLV NQLTKLNPIKRVPDLATLADQHG----EAILPGIDVFVTTADPVDEPVLYTVNTVLSILAADYPIDKYACYLSDDGGTLV NQLPKLNPVKRVPDLAALADHSG----DANLPGIDIFVTTVDPVDEPLLYTVNTILSILATDYPVDKYACYLSDDGGTLV NQLPKLNPVKRVPDLAALADHSG----DANLPGIDIFVTTVDPVDEPLLYTVNTILSILATDYPVDKYACYLSDDGGTLV NQLPKLNPIKRVPDLVALADRHG----EAILPGIDVFVTTVDPVDEPVLYTVNTILSILAADYPVDKYACYLSDDGGTLV NQLPKLNPIKRVPDLAALADLHG----EAVLPGIDVFVTTVDPVDEPVMYTVNTILSILAADYPVDKYACYLSDDGGSLV NQLPKLNPIKRVPDLAALADRHG----EAVLPGIDVFVTTVDPVDEPVMYTVNTILSILAADYPVDKYACYLSDDGGTLV NQITKLRPRKCVPSISVLRDHLDQPDGGSDLPLLDVFINTVDPVDEPMLYTMNSILSILATDYPVEKYATYFSDDGGSLV NQITKLRPRKCVPSISVLREQLDQPDGGSDLPLLDVFINTVDPVDEPMLYTMNSILSILATDYPVDKYATYFSDDGGSLV NQVPKLNPTICIPTIPLLRQQFDLPDGGSNLPVLDVFISTVDPVEEPMLHTMNSILSILATDYPVDKYATYLSDDGGSLL NQVPKLNPTICIPTIPLLRQQFDLPDGGSNLPVLDVFISTVDPVEEPMLHTMNSILSILATDYPVDKYATYLSDDGGSLL NQVPKLNPTICIPTIPLLRQQFDLPDGGSNLPVLDVFISTVDPVEEPMLHTMNSILSILATDYPVDKYATYLSDDGGSLL -----------------------------------------------MIYTMNSIISILAADYPVDKHACYLSDDGGSII -----------------------------------------------MIYTMNSIISILAADYPVDKHACYLSDDGGSII NQVAKLNPVKRVPNLTLLEQQFDLPDGNSNLPCLDVFINTVDPINEPMIYTMNSIISILAADYPVDKHACYLSDDGGSII DQLPKLNPINRVPDLAVLRQRFDRPDGTSTLPGLDIFVTTADPIKEPILSTANSVLSILAADYPVDRNTCYVSDDSGMLL -------------------------------------------------------------------------------DQLPKLNPINRVPDLAVLRQRFDRPDGTSTLPGLDIFVTTADPIKEPILSTANSVLSILAADYPVDRNTCYVSDDSGMLL

173 173 173 222 221 214 196 196 200 195 0 198 193 203 203 182 180 174 195 196 204 204 202 33 33 229 233 0 229

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv

TREAVAHAACFARLWVPFCRKHGVEPRNPEAYFCPGVKARVVSRADYMGRSWPELARDRRRVRREYEELRLRIDALHAGD TREAVAQAAWFARLWVPFCRKHGVEPRNPEAYFCPGVKARVVSRADYRAKSWPELARDRRRVRREYEELRLRIDALHAGD TREAVAHAARFARLWVPFCRKHGVEPRNPEAYFCPGVKARVVSRAAYMGRSWPELARDRRRVRREYEELRLRIDALHAGD LYEALVETAKFAALWVPFCRKHQIEPRAPESYFELEGT-------LCGGASHKEFIQDYKHVRTQYDEFKKHLDMLPNTI LYEALVETAKFAALWVPFCRKHQTEPRAPARYFELEGP-------LCGGASHKEFIQDYKHVRMQYEEFKKHLDMLPNTI LYEALVETAKFAALWVPFCRKHQIDPRAPESYFELEGP-------LCGGASHKEFIQDYKHVCTQYEEFKKHLDMLPNTI QYEALVETAKFATLWVPFCRKHCIEPRAPESFFEQEAP-------LYTGSAPEEFKNDHNSVYIEYDEFKECLDSLSSAI QYEALVETAKFATLWVPFCRKHCIEPRAPESYFELEAP-------LYTGSAPEDFKNDHSSVHREYDEFKEHLDSISSAI QYEALIETAKFATLWVPFCRKHCIEPRAPESYFELEAP-------LYTGSASEEFKNDHSSVHREYDEFKEHLDSLSSAI HYEAMVQVASFAALWVPFCRKHCVEPRSPESYFGIKTR-------SYIGGMAGEFMRDHRRVRREYEEFKVRIDSLSTTI ----MVQVASFAALWVLFCRKHCVEPRSPESYFGMKTR-------SYAGGMAGEFMRDHRRVRREYEEFKVRIDSLSTTI HYEAMVQVASFAALWVPFCRKHCVEPRSPESYFGIKTH-------SYAGGMAGEFMRDRRRVRREYEEFKVRIDSLSTTI HYEAMTQVASFAALWAPFCRKHCVEPRSPENYFGMKAQ-------PYAGSMPGDFTRDRRRVRREYDEFMVRIDSLSTTI HYEAMIEVANFAVLWVPFCRKYCVEPRSPENYFGMKTQ-------PYAGSMAGEFMRDHRRVRREYDELKVRVDSLSTTI HYEAMIEVANFAVLWVPFCRKYCVEPRSPENYFGMKTQ-------PYAGSMAGEFMRDHRRVRREYDEFKVRVDSLSTTI HYEAMLQVASFAALWVPFCRKHCVEPRSPENYFGMKTR-------PYVGGMAGEFMSDHRRVRREYGEFKVRIDSLSTTI HYEAMIQIVHFAALWVPFCRKHCIEPRSPENYFGMKTR-------PYVGGMAGEFMSDHRRVRREYGEFKVIIDSLSTTI HYEAMLQVASFAALWVPFCRKHCVEPRSPENYFGMKTR-------PYVGGMAGEFMSDHRRVRREYGEFKVRIDSLSSTI HYEGLQLAAEFAASWVPFCRKHCVEPRAPESYFWAKMRG------EYAGSAPKEFLDDHRRMRAAYEEFKARLDGLSAAI HYEGLQLAAEFAASWVPFCRKHCVEPRAPESYFWAKMRG------EYAGTAPKEFLDDHRRMRAAYEEFKVRLDGLSAAI HYDGLVETAKFAALWVPFCRKHHVEPRAPESYFGMKVR-------PYKGNLPEEFLDDHRRLRREYEEFKTRLDALFTVI HYDGLVETAKFAALWVPFCRKHHVEPRAPESYFGMKIR-------PYTGNLPEEFLDDHRRLRREYEEFKTRLDALFTVI HYDGLVETAKFAALWVPFCRKHHVEPRAPESYFGVKIR-------PYMGNLPEEFLDDHGRLRREYEEFKTRLDALFTLI

253 253 253 295 294 287 269 269 273 268 69 271 266 276 276 255 253 247 269 270 277 277 275

TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

HYDGLLETAKFAALWVPFCRKHSIEPRAPESYFSLNTR-------PYTGNAPQDFVNDRRHMCREYDEFKERLDALFTLI HYDGLLETAKFAALWVPFCRKHSIEPRAPESYFSLNTR-------PYTGNAPQDFVNDRRHMCREYDEFKERLDALFTLI HYDGLLETAKFAALWVPFCRKHSIEPRAPESYFSLNTR-------PYTGNAPQDFVNDRRHMCREYDEFKERLDALFTLI TYEALAESSKFATLWVPFCRKHGIEPRGPESYFELKSHP-------YMGRAQDEFVNDRRRVRKEYDEFKARINSLEHDI -------------------------------------------------------------------------------TYEALAESSKFATLWVPFCRKHGIEPRGPESYFELKSHP-------YMGRAQDEFVNDRRRVRKEYDEFKARINSLEHDI

106 106 302 306 0 302

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

VRPQ---------------------------------QWSRGTAENHAGVVEVLVGPPGSTPELG-----VSDLLDLSSV VRRQ---------------------------------QWSRGTAEDHAGVVEVLVGPPGSTPELG-----VSDLLDLGSV VRRQ---------------------------------PWSRGTPEYHAGVVEVLVGPPGSTPELG-----VSDLLDLTSV RQRSDIYSRTGTK--DEDATVTWMADG-TQWPGTWLDPTEKHRPGHHAGIVKIVQSHPEHVVPLG-VQESNDNPLNFDDV RQRSDIYSKTGTK--DEDAKVTWMADG-TQWPGTWVDPAEKHRAGHHAGIVKIVQSHPEHVVPLG-VQESNDNPLNFDDV RQRADIYSKTGTK--DEDAKVTWMADG-TQWPGTWLDPAEKHRAGHHAGIVKIVQSHPEHVVPLG-VHESNDSSLNFDGV SKRSDAYNSMKTE--EGDANATWMANG-TQWPGSWIDTTEIHRKGHHAGIVKVVLDHSIRGHNLG-SQASTHN-LNFAST SKRSDAYNSMKTE--EGDAKATWMANG-TQWPGSWIDTTEIHRKGHHAGIVKVVLGHSIRGHNLG-SQASTNN-LNFAST SKRSDAYNSMKTG--EGDAKATWMANG-TQWPGSWIDTTEIHRKGHHAGIVKVVLDHSVRGHNLG-SQASTHN-LNFANT RQRS---DAYNS-SNKGGVSATWMADG-THWPGTWVEQAENHRRGQHAGIVQVLLDHPSCKPQLGSPASTD-NPFDFSNI RQRS---DAYNS-NNKGGVSATWMADG-TQWPGTWVEQAENHRRGQHAGIVQVLLDHPSFKPQLGSPASTD-NPFDFSNV RQRS---DAYNS-KNKGGVSATWMADG-TQWPGTWVEQAENHRRGQHAGIVQVLLDHPSCEPQLGSPASTD-NPFDFSNV RQRS---DAYN---NGDGVHATRMADG-APWPGTWIEQAENHRRAQHAGIVQVILEHPGCKPQLGSSASTD-NPFDFNNV RQRS---DAYNSSTKGDGVRATWMADG-TQWPGTWIEQVENHRRGQHAGIVQVILGHPSCKPQLGSPASSD-NPLDFSNV RQRS---DAYNSSKKGDGVRATWMADG-TQWPGTWIEQVENHRRGQHAGIVQVILGHPSCKPQLGSPASAD-NPLDFSNV RRRS---DAYN--KGDDGVHATWMADG-TQWAGTWIEQADNHRRGHHAGIVQVMLDHPSCKPQLGSSVSTN-SPIDLSNV RRRS---DAYN--KRDDGVHATWMADG-TQWAGTWIEQADNHRRGQHAGIVQVMLDHPSCKPQLGSSARTN-NPIDLSNV RRRS---DAYN--KGDDDVHATWMADG-TQWPGTWIEQADNHRRGQHAGIVKVMLDHPSCKPQLGSSASTN-KPVDLSNV EQRSEACNRANGKDKEECANATWMADGSTQWQGTWIKPAKGHRKGHHPAILQVMLDQPSKDPELGMAASSD-HPLDFSAV EQRSEACNRANG--KEEGADATWMADGSTQWQGTWIKPAKGHRKGHHPAILQVMLDQPSKDPELGMAASSG-HPLDLSAV PQRSEAHGREDAK-GGGGAKATWMADG-TQWPGTWTEPAEGHRKGDHAGIIQVMLSQPSGEPQLGAPASSDDNPLDFSAV PQRSEAHGREDAK-GGG-GKATWMADG-TQWPGTWTEPAEGHRKGDHAGIIQVMLSQPSSEPQLGEPASSDDGPLDFSAV PQRSEAHGREDAK-GGG-GKATWMADG-TQWPGTWTEPAEGHRKGDHAGIIQVMLSQPSSEPQLGEPASSDHSPLDFSAV PKRSDVYNHAAAK---EGAKATWMADG-TQWPGTWIDPAENHKKGQHVGIVKVMLKHPSYEPELGLGASTN-SPLDFSAV PKRSDVYNHAAAK---EGAKATWMADG-TQWPGTWIDPAENHKKGQHVGIVKVMLKHPSYEPELGLGASTN-SPLDFSAV PKRSDVYNHAAAK---EGAKATWMADG-TQWPGTWIDPAENHKKGQHVGIVKVMLKHPSYEPELGLGASTN-SPLDFSAI KQRNDGYNAANAH-REGEPRPTWMADG-TQWQGTWVDASENHRRGDHAGIVLVLLNHPSHRRQTGPPASAD-NPLDFSGV -------------------------------------------------------------------------------KQRNDGYNAANAH-REGEPRPTWMADG-TQWEGTWVDASENHRRGDHAGIVRVLLNHPSHRRQTGPPASAD-NPLDFSGV

295 295 295 371 370 363 344 344 348 342 143 345 338 351 351 328 326 320 348 347 355 354 352 181 181 377 383 0 379

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

DVRVPAVVYMCREKRHGRVHHRKAGAMNALLRTSAVLSNAPFILNLDCDHYVNNSQALRAGVCLMLD-RGGSNVAFVQFP DVRVPAVVYMCREKRHGRVHHRKAGAMNALLRTSAVLSNAPFILNLDCDHYVNNSQALRAGVCLMLD-RGGSNVAFVQFP DVRVPAVVYMCREKRHGRVHHRKAGAMNALLRTSAVLSNAPFILNLDCDHYVSNSQALRAGVCLMLD-RGGSNVAFVQFP DMRLPMLVYVAREKSPGVEHNKKAGALNAELRISALLSNAPFFINFDCDHYINNSEALRAAICFMLDPREGDNTGFVQFP DMRLPMLVYVAREKSPGVEHNKKAGALNAELRISALLSNAPFFINFDCDHYINNSEALRAAVCFMLDPREGDNTGFVQFP DMRLPMLVYVAREKCPGVEHNKKAGALNAELRISALLSNAPFFINFDCDHYINNSEALHAAVCFMLDPREGDNTGFVQFP DVRLPMLVYISRGKNPSYDHNKKAGALNAQLRASALLSNAQFIINFDCDHYINNSQALRAAMCFMLDQRQGDSTAFVQFP DVRLPMLVYISRGKNPSYDHNKKAGALNAQLRASALLSNAQFIINFDCDHYINNSQALRAAMCFMLDQRQGDSTAFVQFP DVRLPMLVYISRGKNPSYDHNKKAGALNAQLRASALLSNAQFIINFDCDHYINNSQALRAAMCFMLDQRQGDSTAFVQFP DTRLPMLVYISREKRPGYDNQKKAGAMNVMLRVSVLLSNAPFVINFDCDHYINNSQALRAPMCFMLDPHDGQNTAFVQFP DTRLPMLVYISREKRPGYDNQKKAGAMNVMLRVSALLSNAPFVINFDCDHYINNSQALRAPMCFMLDPHDGQNTAFVQFP DTRLPMLVYISREKRPGYDNQKKAGAMNVMLRVSALLSNAPFVINFDCDHYINNSQALRAPMCFMLDPRDGQNTAFVQFP DMRLPMLVYISREKRLGYDNQKKAGAMNAMLRISALLSNAPFIINFDCDHYINNSKALRAPMCFMLDPRDGQNTAFVQFP DTRLPMLVYMSREKRPGYNHQKKAGAMNVMLRVSALLSNAPFVVNFDGDHYINNSQALCAPMCFMLDPRDGQNTAFVQFP DTRLPMLVYMSREKRPGYNHQKKAGAMNVMLRVSAMLSNAPFVVNFDGDHYINNSQALRAPMCFMLDPRDGQNTAFVQFP DTRLPMLVYISREKHPGYDNQKKAGAMNVMLRVSALLSNAPFVINFDCDHYINNSRALRAPMCFMLDPRDGQNTAFVQFP DTRLPMLVYISREKHPGYDNQKKAGAMNVMLRVSALLSNAPFVINFDCDHYINNSQALRAPMCFMLDPRDGQNTAFVQFP DTRLPMLVYISREKRPGYDNQKKAGAMNVMLRVSALLSNAPFVINFDCDHYINNSQALRAPMCFMLDPRDGQNTAFVQFP DARLPMLVYIAREKRPGYDHQKKAGAMNVQLRVSALLSNAPFIINFDGDHYVNNSQAFRAAICFMLDPRDGADTAFVQFP DARLPMLVYIAREKRPGYDHQKKAGAMNVQLRVSALLSNAPFIINFDGDHYVNNSQAFRAAMCFMLDPRDGADTAFVQFP DVRLPMLVYVSREKRPGYDHQKKAGALNVQLRVSALLSNAPFIINFDCDHYINNSQAFRAAMCFMMDRRDGDNVAFVQFP DVRLPMLVYVSREKRPGYDHQKKAGALNVQLRVSALLSNAPFIINFDCDHYINNSQAFRAAMCFMMDRRDGDNVAFVQFP DVRLPMLVYVSREKRPGYDHQKKAGALNVQLRVSALLSNAPFIINFDCDHYINNSQAFRAAMCFMMDRRDGDNVAFVQFP DVRLPMLVYISREKSPSCDHQKKAGAMNVQLRVSALLTNAPFIINFDGDHYVNNSKAFRAGICFMLDRREGDNTAFVQFP DVRLPMLVYISREKSPSCDHQKKAGAMNVQLRVSALLTNAPFIINFDGDHYVNNSKAFRAGICFMLDRREGDNTAFVQFP DVRLPMLVYISREKSPSCDHQKKAGAMNVQLRVSALLTNAPFIINFDGDHYVNNSKAFRAGICFMLDRREGDNTAFVQFP DVRLPMLVYVXXXXX----------------------------------------------XXXXXX-XXXXXXXXXXXX ----------------------------------------------------------------MVG-RDSDTVAFVQFP DARLPMLVYVSREKRPGHDHQKKAGAMNALTRASALLSNSPFILNLDCDHYINNSQALRAGICFMVG-RDSDTVAFVQFP

374 374 374 451 450 443 424 424 428 422 223 425 418 431 431 408 406 400 428 427 435 434 432 261 261 457 416 15 458

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv

QRFDGVDPADRYANHNRVFFDCTELGLDGLQGPIYVGTGCMFRRAALYNADPPLWRPHGGDRDAGK-------------QRFDGVDPADRYANHNRVFFDCTELGLDGLQGPIYVGTGCMFRRAALYNADPPLWRPHG-DRDAGK-------------QRFDGVDPADRYANHNRVFFDCTELGLDGLQGPIYVGTGCMFRRAALYNADPPLWRPHGGDRDAGK-------------QRFDNVDPTDRYGNHNRVFFDGAMYGLNGQQGPTYLGTGCMFRRLALYGIDPPCWRDEDIIVDSN--------------QRFDNVDPTDRYGNHNRVFFDGAMYGLNGQQGPTYLGTGCMFRRLALYGIDPPCWRAEDMIVDSN--------------QRFDNVDPTDRYGNHNRVFFDGAMYGLNGQQGPTYLGTGCMFRRLALYGIDPPCWRAEDIIVDSN--------------QRFDNVDPSDRYGNHNRVFFDGTMLALNGLQGPSYLGTGCMFRRIALYGIDPPEWRHANIVVDDK--------------QRFDNVDPSDRYGNHNRVFFDGTMLALNGLQGPSYLGTGCMFRRIALYGIDPPEWRHDNIVVDDK--------------QRFDNVDPSDRYGNHNRVFFDGTMLALNGLQGPSYLGTGCMFRRIALYGIDPPEWRHDNIVVDDK--------------QRFDDVDPTDRYANHNRVFFDGTMLALNGLQGPTYLGTGTMFRRVSLYGIEPPRYRAENTKLVRK--------------QRFDDVDPTDRYANHNRVFFDGTMLALNGLQGPTYLGTGTMFRRVALYGIEPPHYRAENTKLVCK--------------QRFDDVDPTDRYANHNRVFFDGTMLALNGLQGPTYLGTGTMFRRVSLYGIEPPRYRAENTKLVRK--------------QRFDDVDPTDRYANHNRVFFDGTMLALNGLQGPSYLGTGTMFRRVTLYGMEPPRYRVENIKLVDN--------------QRFDDVDPTDRYANHNRVFFDGTMLSLNGLQGPSYLGTGTMFRRVALYGMEPPRYRAENIKLAGK---------------

440 439 440 516 515 508 489 489 493 487 288 490 483 496

TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

QRFDDVDPTDRYANHNRVFFDGTMLSLNGLQGPSYLGTGTMFRRVALYGMEPPRYRAENIKLAGK--------------QRFDNVDPTDRYSNHNRVFFDGTMLSLNGLQGPTYLGTGTMFRRVALYGMEPPRYKAENIKLVGK--------------QRFDDVDPTDRYSNHNRVFFDGTMLSLNGLQGPTYLGTGTMFRRVALYGMEPPCYRAENIKLVGK--------------QRFDDVDPTDRYSNHNRVFFDGTMLSLNGLQGPTYLGTGTMFHRVALYGMEPQRYRAENIKLVGK--------------QRFDDVDPTDRYCNHNRMFFDATLLGLNGIQGPSFVGTGCMFRRVALYSADPPRWRPDDAKEAKAS-------------QRFDDVDPTDRYCNHNRMFFDATLLGLNGIQGPSFVGTGCMFRRVALYSADPPRWRPDDAKEAKAS-------------QRFDDVDPTDRYANHNRMFFDATMLGMNGIQGPSYVGTGSMFRRVALYGADPPRWRPDDVKVLEN--------------QRFDDVDPTDRYANHNRMFFDATMLGMNGIQGPSYVGTGSMFRRVALYGADPPRWRPDDVKVLEN--------------QRFDDVDPTDRYANHNRMFFDATMLGMNGIQGPSYVGTGSMFRRVALYGADPPRWRPDDVKVLEN--------------QRFDDVDPTDRYCNHNRVFFDATLLGLNGIQGPSYVGTGCMFRRVALYGVDPPRWRPDDVKIVDS--------------QRFDDVDPTDRYCNHNRVFFDATLLGLNGIQGPSYVGTGCMFRRVALYGVDPPRWRPDDVKIVDS--------------QRFDDVDPTDRYCNHNRVFFDATLLGLNGIQGPSYVGTGCMFRRVALYGVDPPRWRPDNVKIVDS--------------XXXXXXXXXXXYANHNRIFFDGTLRALDGMQGPIYVGTGCLFRRITVYGFDPPRINVGGPCFPRLAGLFAKTKYEKPGLE QRFEGVDPTDLYANHNRIFFDGTLRALDGMQGPIYVGTGCLFRRITVYGFDPPRINVGGPCFPRLAGLFAKTKYEKPGLE QRFEGVDPTDLYANHNRIFFDGTLRALDGMQGPIYVGTGCLFRRITVYGFDPPRINVGGPCFPRLAGLFAKTKYEKPSLE

496 473 471 465 494 493 500 499 497 326 326 522 496 95 538

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

------------------DVATEADKFGISTPFLGSVRAALGLNRSEQWNTTTKPPRSFDGAAVGEATALVSCGYEDRTA ------------------DVAAEADKFGISTPFLGSVRAALNLNQSEQWNTTS-PPRSFDGAAVGEATALVSCGYEDRTA ------------------DVATEADKFGISTPFLVSVRAALNLNRSEQWNTTS-PPRSFDGAAVGEATALVSCGYEDRTA -------------------------RFGNSLLFLNSVLAAIKQEEG------VTLQPPLDDSFLEEVTKVVSSSYDDSTD -------------------------RFGNSLPFLNSVLAAIKQEEG------VTLPPPLDDSFLEEMTKVVSSSYDDSTD -------------------------RFGNSLPFLNSVLAAIKQEEG------VTLPPTLDDSFLEEMTKVVSSSYDDSTD -------------------------RFGSSIPFLESVSKAINQER-------STIPPPISETLVAEMERVVSASHDKATG -------------------------RFGSSIPFLESVSKAINQER-------STIPPPISETLVAEMERVVSASHDKATG -------------------------RFGSSIPFLDSVSKAINQER-------STIPPPISETLVAEMERVVSASHDKATG -----------------------TGEFGYSTSFVNSVPDAAIQDR-------SITPVLVDEHLRKDLATLMTCAYEDGSS -----------------------TGEFGYSTSFINSVPDAAIQDR-------SITPVLVDERLSKDLATLMTCAYEDGSS -----------------------AGEFGYSTSFVNSVPDAAIQDR-------SITPVLVDEGLRKDLTTLMTCAYEDGSS -----------------------AHEFGNSTSFTNSMPDGAIQER-------SITPVLVDEGLINDLATLITCAYEDGSS -----------------------VNEFGSSTSFINSMPDGAIQER-------SITPVLVDEALSNDLATLMTCAYEDGSS -----------------------VNEFGSSTSFINSMPDGAIQER-------SITPVLVDEALSNDLATLMTCAYEDGSS -----------------------AAELGNSTPFLKSIPDGAIQER-------SITPVLVDEALTSDLATLMTCAYEDRTS -----------------------AAELGNSTPFLNSIPDGAIQER-------SITPVLVDEGLSNDIATLMTCTYEDGSS -----------------------GAELGKSTPFLNSIPDGAIQDR-------SITPVSVDEGLMSDLATLMTCAYEDRTS --------------------RYRPNMFGKSTSFINSMPAAANQERS------VPSPATVGE---AELADAMTCAYEDGTE --------------------RYRPNMFGKSTSFINSVPAAANQERS------VPSPATVGE---AELADAMTCAYEDGTE -----------------------PNKFGKSMTFINSIPVAANQERS------VMSPVSLDEPATTELADVMTCAYEDGTE -----------------------PNKFGKSMTFINSIPVAANQERS------VMSPVSLDEPATTELADVMTCAYEDGTE -----------------------PNKFGKSMTFINSIPVAANQERS------VMSPVSLDEPATTELADVMTCAYEDGTE -----------------------STKFGSSASFISSILPAADQERS------IMSPPALEEPVMADLAHVMTCAYEDGTE -----------------------STKFGSSASFISSILPAADQERS------IMSPPALEESVMADLAHVMTCAYEDGTE -----------------------STKFGSSASFISSILPAADQERS------IMSPPALEEPVMADLAHVMTCAYEDGTE MTMAKAKAAPVPAKGKHGFLPLPKKTYGKSDAFVDSIPRASHPSPY----AAAAEGIVADEATIVEAVNVTAAAFEKKTG MTMAKAKAAPVPAKGKHGFLPLPKKTYGKSDAFVDSIPRASHPSPY----AAAAEGIVADEATIVEAVNVTAAAFEKKTG MTMAKAKAAPVPAKGKHGFLPLPKKTYGKSDAFVDSIPRASHPSPY----AAAAEGIVADEATIVEAVNVTAAAFEKKTG

502 500 501 565 564 557 537 537 541 537 338 540 533 546 546 523 521 515 545 544 551 550 548 377 377 573 572 171 614

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

WGRDIGWIYGTVTEDVATGFCMHRRGWRSAYCATAPDAFRGTAPINLTDRLHQVLRWAAGSLEIFFSRNNALLAGARLHP WGRDIGWIYGTVTEDVATGFCMHRRGWSSAYCATAPDAFRGTAPINLTDRLHQVLRWAAGSLEIFFSRNNALLAGPRLHP WGRDIGWIYGTVTEDVATGFCMHRRGWRSAYCATAPDAFRGTAPINLTDRLHQVLRWAAGSLEIFFSRNNALLAGARLHP WGRGIGYIYNMATEDIVTGFRIHGQGWRSMYATMEREAFRGTAPINLTERLRQIVRWSGGSLEMFFSHISPLFAGRRLSL WGRGIGYLYNMATEDIVTGFRIHGQGWRSMYVTMEREAFRGTAPINLTERLRQIVRWSGGSLEMFFSHISPLFAGRRLSL WGRGIGYIYNMATEDIVTGFRIHGQGWRSMYATMEREAFRGTAPINLTERLRQIVRWSGGSLEMFFSHISPLFAGRRLSL WGKGVGYIYDIATEDIVTGFRIHGQGWRSMYCTMERDAFCGIAPINLTERLHQIVRWSGGSLEMFFSLNNPLIGGRRIQS WGKGVGYIYDIATEDIVTGFRIHGQGWRSMYCTMERDAFCGIAPINLTERLHQIVRWSGGSLEMFFSLNNPLIGGRRIQA WGKGVGYIYDIATEDIVTGFRIHGQGWRSMYCTMERDAFCGIAPINLTERLHQIVRWSGGSLEMFFSLNNPLIGGRRIQS WGRDAGWVYNIATEDVVTGFRIHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEVFFSHSNALIASRRLHP WGRDAGWVYNIATEDVVTGFRIHRQGWHSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEVFFSHNNALIASRRLHP WGRDAGWVYNIATEDVVTGFRIHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEVFFSHSNALIASRRLHL WGRDIGWVYNIATEDVVTGFRIHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEVFFSHNNALIAGRRLHP WGRDVGWVYNIATEDVVTGFRMHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEMFFSHSNALMAGRRLHP WGRDVGWVYNIATEDVVTGFRMHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEMFFSHSNALMAGRRLHP WGRDVGWVYNIATEDVVTGFRIHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEAFFSHSNALIASRRLHP WGRDVGWVYNIATEDVVTGFRIHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLRWSGGSLEVFFSHSNALIASRRLNP WGRDVGWVYNIATEDVVTGFRIHRQGWRSMYCSMEPAAFRGTAPINLTERLYQVLGG----------------------WGNDVGWVYNIATEDVVTGFRLHRTGWRSTYCAMEPDAFRGTAPINLTERLYQILRWSGGSLEMFFSRFCPLLAGRRLHP WGNDVGWVYNIATEDVVTGFRLHRTGWRSTYCAMEPDAFRGTAPINLTERLYQILRWSGGSLEMFFSRFCPLLAGRRLHP WGDGVGWVYDMATEDAVTGFRLHRTGWRSMYCDMEPPAFCGTAPINMTERMYQILRWSGGSLEVFFSRFCPLLAGRRLHP WGDGVGWVYDMATEDAVTGFRLHRTGWRSMYCDMEPPAFCGTAPINMTERMYQILRWSGGSLEVFFSRFCPLLAGRRLHP WGDGVGWVYDMATEDAVTGFRLHRTGWRSMYCDMEPPAFCGTAPINMTERMYQILRWSGGSLEVFFSRFCPLLAGRRLHP WGREVGWVYNIATEDVVTGFRLHRNGWRSMYCRMEPDAFAGTAPINLTERLYQILRWSGGSLEMFFSRNCPLLAGRRLHP WGSDVGWVYNIATEDVVTGFRLHRNGWRSMYCRMEPDAFAGTAPINLTERLYQILRWSGGSLEMFFSRNCPLLAGRRLHP WGREVGWVYNIATEDVVTGFRLHRNGWRSMYCRMEPDAFAGTAPINLTERLYQILRWSGGSLEMFFSRNCPLLAGRRLHP WGKEIGWVYDTVTEDVVTGYRMHIKGWRSRYCSIYPHAFIGTAPINLTERLFQVLRWSTGSLEIFFSKNNPLFGSTYLHP WGKEIGWVYDTVTEDVVTGYRMHIKGWRSRYCSIYPHAFIGTAPINLTERLFQVLRWSTGSLEIFFSKNNPLFGSTYLHP WGKEIGWVYDTVTEDVVTGYRMHIKGWRSRYCSIYPHAFIGTAPINLTERLFQVLRWSTGSLEIFFSKNNPLFGSTYLHP

582 580 581 645 644 637 617 617 621 617 418 620 613 626 626 603 601 572 625 624 631 630 628 457 457 653 652 251 694

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv

LQRLAYLNTTVYPFTSIFLLLYCLLPAIPLVTRSASASAFSVTMPPSGTYMGFVAALMLTLAMVAVLEVRWSGITLGEWW LQRLAYLNTTVYPFTSIFLLLYCLLPAIPLVTRNASTSAFSVNTPPSATYIAFVAALMLTLAMVAVLEVRWSGITLGDWW LQRLAYLNTTVYPFTSIFLLLYCLLPAIPLVTRSASTSAFSVNTPPSATYIGFVAALMLTLAMVAALEVRWSGITLGEWW VQRLSYINFTIYPLTSLFILMYAFCPVMWLLP------TEILVQRPYTRYIVYLIIVIAMIHVIGMFEIMWAGITWLDWW VQRLSYINFTIYPLTSLFILMYAFCPVMWLLP------TEILIQRPYTRYIVYLLIVIAMIHVIGMFEIMWAGITWLDWW

662 660 661 719 718

TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

VQRLSYINFTIYPLTSLFILMYAFCPVMWLLP------TEILVQRPYTRYIVYLIIVIVMIHVIGMFEIMWAGITWLDWW LQRVSYLNMTVYPVTSLFILLYALSPVMWLIP------DEVYIQRPFTKYVVFLLVIILMIHVIGWLEIKWAGVTWLDYW LQRVSYLNMTVYPVTSLFILLYALSPVMWLIP------DEVYIQRPFTKYVVFLLVIILMIHVIGWLEIKWAGVTWLDYW LQRVSYLNMTVYPVTSLFILLYALSPVMWLIP------DEVYIQRPFTKYVVFLLVIILMIHVIGWLEIKWAGVTWLDYW LQRIAYLNMSTHPIVTVFILSYNFFPVMWLFS------EQLYIQRPFGMYMGYLVAIIAMVHLIGMFEVRWSGITLLDWF LQRITYLNMSTYPIVTVFILSYNFFPVMWLFS------EQLYIQRPFGTYMAYLVGIIAMVHLIGMFEVRWSGITLLDWF LQRIAYLNMSTYPIVTVFILSYNFFPVMWLFS------EQLYIQRPFGTYMAYLVAIIAMVHLIGMFEVRWSGITLLDWF LQRIAYFNMSTYPIVTVFILAYNFFPVMWLFS------EQLYIQRPFGTYIAYLVAVIAMMHVIGMFEVKWAGITLLDWC LQRIAYLNMSTYPIVTVFILAYNLFPVLWLFS------EQFYIQRPFAWGFFTDQARHVLLGML-------FNVWILVLL LQRVAYLNMSTYPIVTVFILAYNLFPVLWLFS------EQFYIQRPFGTYIMYLVAVIAMIHVIGMFEVKWAGITLLDWC LQRIAYLNMSIYPIATMFILAYSFFPVMWLFSE-----ESYYIQRPFGTFIMYLVAVIAMMHVIGMFEVKWAGITLQDWW LQRIAYLNMSIYPIATMFILAYSFFPVMWLFSE-----QSYYIQRPFGTFIMYLVVVIAMMHVIGMFEVKWAGITLQDWW -------------------------------------------------------------------------------MQRVAYINMTTYPVSTFFICMYYLYPVMWLFQ------GEFYIQRPFQTFALFVVVIIATVELIGMVEIRWAGLTLLDWV MQRIAYINMTTYPVSTFFICMYYFYPVMWLFQ------GEFYIQRPFQTFALFVVIVIATVELIGMVEIRWAGLTPLDWF MQRVAYTNMTFYPLSALFVVCYHLLPLMWVFN------GRFYIQKPYPTYVMYVLVIIVSNEVIGMVEIVWAGLTLLDWF MQRVAYTNMTFYPLSALFVVCYHLLPLMWVFN------GRFYIQKPYPTYVMYVLIIIISNEVIGMVEIVWAGLTLLDWF MQRVAYTNMTFYPLSALFVVCYHLLPLMWVFN------GQFYIQKPYPTYVMYVLIIIVSNEVIGMVEIVWAGLTLLDWF MQRIAYANMTAYPVSSVFLVFYLLFPVIWIFR------GQFYIQKPFPTYVLYLVIVIALTELIGMVEIKWAGLTLLDWI MQRIAYANMTAYPVSSVFLVFYLLFPVIWIFR------GQFYIQKPFPTYVLYLVIVIALTELIGMVEIKWAGLTLLDWI MQRIAYANMTAYPVSSVFLVFYLLFPVIWIFR------GQFYIQKPFPTYVLYLVIVIALTELIGMVEIKWAGLTLLDWI LQRVAYINITTYPFTAIFLIFYTTVPALSFVTG------HFIVQRPTTMFYVYLGIVLSTLLVIAVLEVKWAGVTVFEWF LQRVAYINITTYPFTAIFLIFYTTVPALSFVTG------HFIVQRPTTMFYVYLGIVLSTLLVIAVLEVKWAGVTVFEWF LQRVAYINITTYPFTAIFLIFYTTVPALSFVTG------HFIVQRPTTMFYVYLGIVLSTLLVIAVLEVKWAGVTVFEWF

711 691 691 695 691 492 694 687 693 700 678 676 572 699 698 705 704 702 531 531 727 726 325 768

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

RNEQFWMVSATSAYAAAVVQVALKVSAGKEIAFKLTSKQRAS-SPGGGVKERFAELYAVRWTVLMVPTAVVLAVNVMSMA RNEQFWMVSATSAYAAAVVQVALKVAAGKEIAFKLTSKHRASNSGGGVVKDRFAELYAVRWTVLMVPTAVVLAVNVTSMA RNEQFWMVSATSAYAAAVVQVALKVAAGKEIAFKLTSKQRAPSAGGGVVKRQVRGAVRREMDGADGSDGGGADGERGVHG RNEQFFMIGSVTAYPTAVLHMVVNLLTKKGIHFRVTTKQPVADTDDK-----YAEMYEVHWVPMMVPAVVILFSNILAIG RNEQFFMIGSVTAYPTAVLHMVVNLLTKKGIHFRVTTKQPVADTDDK-----YAEMYEVHWVPMMVPAVVVLFSNILAIG RNEQFFMIGSVTAYPTAVLHMVVNILTKKGIHFRVTTKQPVADTDDK-----YAEMYEVHWVPMMIPAVVVLFSNILAIG RNEQFFMIGSTSAYPAAVLHMVVNLLTKKGIHFRVTSKQTAADTNDK-----FADLYDMRWVPMLIPTTVVLIANVGAIG RNEQFFMIGSTSAYPAAVLHMVVNLLTKKGIHFRVTSKQTAADTNDK-----FADLYDMRWVPMLIPTTVVLIANVGAIG RNEQFFMIGSTSAYPAAVLHMVVNLLTKKGIHFRVTSKQTAADTNDK-----FADLYDMRWVPMLIPTTVVLIANVGAIG RNEQFYMIGATGVYPTAVLYMLLKLATGKGIYFRLTSKQTEACSNDK-----FADLYTVRWVPLLIPTTAVIIVNVAAVG RNEQFYMIGATGVYPTAVLYMLLKLVTGKGIYFRLTSKQTEGCSNDK-----FADLYTVRWVPLLIPTAAVIIVNVAAIG RNEQFYMIGATGVYPTAVLYMLLKLVTGKGIYFRLTSKQTEACSNDK-----FADLYTVRWVPLLIPTTAVIIVNVAAVG RNEQFYLIAATGVYPTAVLYMALKLVTGKGMHFRLTSKQTEACSRDK-----FANLYTVRWVPLLIPTTAVLVVNVAAVG YPFALGIMGKWGKRPVILFVMLVMAVGAVGLLYVAFHAPYPADFSEV-----AASLGEASLTGPSG-------------RNEQFYMIGATGVYPTAVLYMALKLVTGKGIYFRLTSKQTDACSNDK-----XXXXXXXXXXXXXXXXXXXXXXXGCRCC RNEQFYMITATGVYPTAVLYMALKLIRGKGIYFRLTSKQTEACSGEK-----FADLYTVRWVPLLIPTVAVLVVNIAAIG RNEQFYMIAATGVYPTAVLYMALKLIRGKGIYFRLTSKQTEACSDEK-----FADLYTVRWVPLLIPTVAVLIVNVTAVG ----------------------------QAAPSRCSSPTATLSSPAV-----GSTLYSASLTSTCRSTRSPRCSS----RNEQFYIIGTTGVYPMAMLHILLRSLGIKGVSFKLTAKKLTGGARER-----LAELYDVQWVPLLVPTVVVMAVNVAAIG RNEQFYIIGTTGVYPMAMLHIILRSLGIKGVSFKLTAKKLTSGTRER-----LAELYDVQWVPLLVPTVVVMAVNVAAIG RNEQFYMICATGVYPTAVLHVVLRSLGLKGMSFKMTAKQLATGARER-----FAELYNVQWAPLLIPTLVVIAVNVVAIG RNEQFYMICATGVYPTAVLHVVLRSLGLKGISFKMTAKQLATGARER-----FAELYDVQWAPLLIPTLVVIAVNVVAIG RNEQFYMICATGVYPTAVLHVVLRSLGLKGMSFKMTAKQLATGARER-----FAELYDVQWAPLLIPTLVVIAVNVVAIG RNEQFYIIGATAVYPTAVFHIVLKLFGLKGVSFKLTAKQVASSTSDK-----FAELYAVQWAPMLIPTMVVIAVNVCAIG RNEQFYIIGATAVYPTAVFHIVLKLFGLKGVSFKLTAKQVASSTSDK-----FAELYAVQWAPMLIPTMVVIAVNVCAIG RNEQFYIIGATAVYPTAVFHIVLKLFGLKGVSFKLTAKQVASSTSDK-----FAELYAVQWAPMLIPTMVVIAVNVCAIG RNGQFWMTASCSAYLAAVCQVLTKVIFRRDISFKLTSKLPSGDEKKD----PYADLYVVRWTPLMITPIIIIFVNIIGSA RNGQFWMTASCSAYLAAVCQVLTKVIFRRDISFKLTSKLPSGDEKKD----PYADLYVVRWTPLMITPIIIIFVNIIGSA RNGQFWMTASCSAYLAAVCQVLTKVIFRRDISFKLTSKLPSGDEKKD----PYADLYVVRWTPLMITPIIIIFVNIIGSA

741 740 741 794 793 786 766 766 770 766 567 769 762 754 775 753 751 614 774 773 780 779 777 606 606 802 802 401 844

TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv

AAVQEGRWRK-------GPAAVLAMAFNAWVVVHLHPFALGLMGRWSKTLSPLLLLVVGFTVLSLCFVLHLHML-----AAVQEGRWRK-------GPAAVLAMAFNAWVVVHLYPFALGLMGRWSKTLSPLLLLVVVFTVLSLCFVLHLHML-----SSGTRGTVEE-------RPRGGARDGVQRVGGGASPPVRPWSHGPLEQDVEPPALARRSVHSSITMFCPPFAYALIWLLF VAIGKSVLYMGTWSAAQRRHGALGLLFNLWIMVLLYPFALAIIGRWAKRTGILFILLPIAFLSTALMYIGIHTFLLHFFP VAIGKSVLYMGTWSAAQKRHGALGLLFNMWIMVLLYPFALAIIGRWAKRTGILFILLPIAFLSTALMYIGIHTFLLHFFP VAIGKSILYMGTWSAAQKRHGALGLLFNLWIMVLLYPFALAIIGRWAKRTGILFILLPIAFLSTSLMYIGVHTFLLHFFP VAMGKTIVYMGAWTIAQKTHAALGLLFNVWIMVLLYPFALAIMGRWAKRPVILVVLLPVAFTIVCLVYVAVHILLLSYLT VAMGKTIVYMGAWTIAQKTHAALGLLFNVWIMVLLYPFALAIMGRWAKRPVILLVLLPVAFTIVCLVYVAVHILLLSYLT VAMGKTIVYMGAWTIAQKTHAALGLLFNVWIMVLLYPFALAIMGRWAKRPVILLVLLPVAFTIVCLVYVAVHILLLSYLT AAIGKAATWG--FFTDEARHALLGMVFNMGILVLLYPFALGIMGKWAKRPIILFIVLVMAISVVGLLYVSLHAPYTGEWS AAIGKAATWG--FFTDEARHALLGMVFNMGILVLLYPFALGIMGKWGKRPIILFIVLVMAISVVGLLYVTLHAPYTGEWS AAIGKAATWG--FFTDEARHALLGMVFNMGILVLLYPFALGIMGKWGKRPIILFIVLVMAISAVGLLYVMLHAPYTGEWS AAIGKAAAWG--FSTDQARHVLLGMVFNVGTLMLLYPFALGIMGKRGKTPVILFVLLLMAIAAVGLLYVTLYAPYPQESL -------------------------------------------------------------------------------SRPSWCSS-----------------------------------------------------------------------AAIGKAATWG--FFTDQAWHAVLGMVFNVGTLVLLYPFALGIMGQWGKRPGILLVMLVMAIGTVGLLYVTLQQDGHRMSF AAIGKAATWG--FFTDQAWHAVLGMVFNVGTLVLLYPFALGIMGQWGKRPGILLVMLVMAIGTVGLLYVTLQQDGHRMSF -------------------------------------------------------------------------------AAAGKAIVGR--WSAAQVAGAASGLVFNVWMLLLLYPFALGIMGRWSKRPYILFIVLVTAVAATASMYVALAGSLPYLHS AAAGKAIAGR--WSAAQVAGAASGLVFNVWMLLLLYPFALGIMGRWSKRPYILFIVLVTAVAATASVYVALAGSLPYLHS AAVGKAITWG--WSAGQVVEAASGLMFNVWILLMFYPFALGVIGRWGKRPYVLFAMFVAAFAAIAAVYVAVQAALAGNLL AAVGKAITWG--WSAGQVVEAASGLMFNVWILLMFYPFALGVIGRWGKKPYVLFAMFVAAFAAIAAVYVAVQAALAGNLP VAVGKAITWG--WSAGQVVEAASGLMFNVWILLMFYPFALGVIGRWGKRPFVLFAMFVAAFAAIAAVYVAVQAALAGNLP ASIGKAIVGG--WSLMQMADAGLGLVFNAWILVLIYPFALGMIGRWSKRPYILFILFVIAFILIALVDIAIQAMRSGFVR ASIGKAIVGG--WSLMQMADAGLGLVFNAWILVLIYPFALGMIGRWSKRPYILFILFVIAFILIALVDIAIQAMRSGFVR ASIGKAIVGG--WSLMQMADAGLGLVFNAWILVLIYPFALGMIGRWSKRPYILFILFVIAFILIALVDIAIQAMRSGFVR

808 807 814 874 873 866 846 846 850 844 645 847 840 754 783 831 829 614 852 851 858 857 855 684 684 880

TRIAE_CS42_7AL_TGACv VAFAKVLDGEW----THWLKVAGGVFFNFWVLFHLYPFAKGILGKHGKTPVVVLVWWAFTFVITAVLYINIPHMHSSGGK 878 TRIAE_CS42_7DL_TGACv VAFAKVLDGEW----THWLKVAGGVFFNFWVLFHLYPFAKGILGKHGKTPVVVLVWWAFTFVITAVLYINIPHMHSSGGK 477 TRIAE_CS42_7BL_TGACv VAFAKVLDGEW----THWLKVAGGVFFNFWVLFHLYPFAKGILGKHGKTPVVVLVWWAFTFVITAVLYINIPHMHSSGGK 920 TRIAE_CS42_5DL_TGACv TRIAE_CS42_5AL_TGACv TRIAE_CS42_5BL_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2AL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv TRIAE_CS42_U_TGACv1_ TRIAE_CS42_1BS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2BS_TGACv TRIAE_CS42_2DS_TGACv TRIAE_CS42_2AS_TGACv TRIAE_CS42_7AL_TGACv TRIAE_CS42_7DL_TGACv TRIAE_CS42_7BL_TGACv

----------------------------------------G-------------------SMLI----------------SMLI----------------SMLI----------------F-------------------F-------------------F-------------------QVAVSLGKASLTGPSGSG--QVAVSLGKASLTGPSGSG--QVAVSLGKASLTGPSGSG--TFLSW-------------------------------------------------------LTRPSG--------------LTRPSG----------------------------------GIKLV---------------GIKLV---------------YFQLGHWSIGGAVSLPSRRVYFQLGHWSIGGAVSLPSRRVYFQLGHRSIGGAVSLASRRVFHFKSSGGATFPTSWGL---FHFKSSGGATFPTSWGL---FHFKSSGGATFPTSWGL---HTTVHGHHGKKFVDAGYYNWP HTTVHGHHGKKFVDAGYYNWP HTTVHGHHGKKFVDAGYYNWP

808 807 815 878 877 870 847 847 851 862 663 865 845 754 783 837 835 614 857 856 878 877 875 701 701 897 899 498 941

Fig. S_2F: CslH & CslJ subfamilies. S.No 1 2 3 4 5 6 7 8

Gene name with number of splice variants (CslH) No. of amino acids (aa) TRIAE_CS42_3DS_TGACv1_271739_AA0907200.1 714 aa TRIAE_CS42_3AS_TGACv1_212952_AA0704280.1 331 aa TRIAE_CS42_3B_TGACv1_222234_AA0760340.1 751 aa TRIAE_CS42_3B_TGACv1_221049_AA0728260.1 458 aa TRIAE_CS42_3DS_TGACv1_273502_AA0931770.1 579 aa TRIAE_CS42_2AL_TGACv1_094351_AA0296300.3_3_SPLICE 752 aa TRIAE_CS42_2DL_TGACv1_158387_AA0517170.1 752 aa TRIAE_CS42_2BL_TGACv1_129372_AA0380770.1 799 aa

S.No 1 2 3 4

Gene name with number of splice variants (CslJ) No. of TRIAE_CS42_3DS_TGACv1_272297_AA0918580.1 738 TRIAE_CS42_3AS_TGACv1_210908_AA0681280.2_2_SPLICE 766 TRIAE_CS42_3B_TGACv1_221705_AA0747940.1 734 TRIAE_CS42_3DS_TGACv1_272756_AA0924850.1_2_SPLICE 734

amino acids (aa) aa aa aa aa

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

-----------------------------------------------MAGGKKLHERVALGRTAWMLADFVILLLLLALV MHRGEDSLSGLYKCTLAFVACGCGWSCGVVLLASLLLLVASYLSATAMAGGKKLQERVALGRSAWMLADFVILFLVLALV -----------------------------------------------MAGGKKLQERVALGRTAWMLADFVILLLLLALV ---------------------------------------------------------------------------------------------------------------------------------MSSAMKLQERVSVPRTAWKLADIFILCLLF -----------------------------------------------MSSAMKLQERVIVPRTAWKLADIFILCLLFALL -----------------------------------------------MSSAMKLQERVTVPRTAWKLADIFILCLLLVLL -----------------------------------------------MGSAMKLQERVILPRTAWKLADIFILCLLFALL

33 80 33 0 30 33 33 33

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

ARRAASLGE--RGGTWLAALVCEAWFAFVWILNMNGKWSPVRFDTYPENLSHRLEELPAVDMFVTTADPALEPPLITVNT ARRAASLGE--RGGTWLAALVCEAWFAFVWILNMNGKWSPVRFDTYPENLSHRMEELPAVDMFVTTADPALEPPLITVNT ARRAASLGE--RGGTWLAALVCEAWFAFVWILNMNGKWSPVRFDTYPDNLSHRMEELPAVDMFVTTADPALEPPLITVNT -------------------------------------------------------------------------------ALLSCRVASLREGGASVAALVCEAWFTFVWIINMNIKWNPVRFNTYPENLSQRTDELPAVDMLVTTADPELEPPLMTVNT SCRVLSLGEGGAGAASVAALVCEAWFTFVWILNMNIRWNPVRFHTYPENLSQRMDGLPAVDMLVTTADPELEPPLMTVNT SCRVASLGEGGAG---AAALVCEAWFTFVWILNMNIKWNPVRFHTYPENLSQRMDELPAVDMLVTTADPELEPPLMTVNT SCRVASLGDGGAGAASVAALVCEAWFTFVWILNMNIKWNPVRFHTYPENLSQRMDELPAVDMLVTTADPELEPPLMTVNT

111 158 111 0 110 113 110 113

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

VLSLLALDYPDVGKLACYVSDDGCSPVTCYALREAAKFASLWIPFCKRYDVGVRAPFMYFSSAPEVGTGTADHEFLESWA VLSLLALDYPHVGKLACYVSDDGCSPLTCYSLREAAKFASLWVPFCKRHDVGVRAPFMYFSSAPEVDTGTVDHEFLESWA VLSLLALDYPDVGRLACYVSDDGCSPVTCYALREAAKFAGLWVPFCKRHDVGVRAPFMYFSSAPEVGNGTVDHEFLESWA -------------------------------------------------------------------------------VLSLLAVDYPDVDKLACYVSDDGCSPVTCYALREAAGFARLWVPFCKRHGVGVRAPFMYFAS--SRPEPELAG--DWTFI VLSLLAMDYPDVDKLACYVSDDGCSPVTCYALHEAARFAGLWVPFCKRHGVGVRAPFMYFAS--RPEPELAGDNFSDEWT VLSLLAVDYPDVDKLACYVSDDGCSPVTCYALREAAGFARLWVPFCKRHGVGVRAPFIYFASS-RPEPDLAGDKFSDDWI VLSLLAVDYPDVDKLACYVSDDGCSPATCYALREAAWFARLWVPFCKRHDVRVRAPIIYFAS--RLEPELAGDTFSDEWT

191 238 191 0 186 191 189 191

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

LMKTEYEKLASRIENADEVSILR-DGGEEFAEFIDAERGNHPTIVKVLWDNSKSK-AGEGFPHLVYLSREKSPRHRHNFK LMKSEYEKLASRIENADEVSILR-DGGDEFAEFIDAERGNHPTIVKVLWDNSKNK-TGEGFPHLVYLSREKSPRHRHNFK LMKSQYEKLARRIENADEGTIMR-DGGDEFAEFIDAERGNHPTIVKVLWDNSKSK-AGEEFPHLVYLSREKSPRHRHNFK -------------------------------------------------------------------------------KSEYDKLVSRIESADEGSLLRHDDAADFTEFKEAKRGDHPAIVKVLWDNSKSSRTGSGDGFPNLVYVSREKTRKHDHHYK FIKSEYDKLVSRIESADEGSLLRDDDAGEFTEFMEAKRGDHPGIVKVLWDNSKSSRTGEGFPNLVYVSREKSRKHDHHYK FIKSEYDKLVSLIESADEASLLRHDHAGEFTEFKGAECGDHPAIVKVLWDNSKSSGTGEGFPNLVYVSREKSRKHDHHYK FIKSEYDKLVSRIESADEGSLLRHDDAGEFTEFMEAERTDHPAIVKVLWDNSKSSRTGEAFPHLVYVSSEKSRKHHHHYK

269 316 269 0 266 271 269 271

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

AGAMNVLTRVSAVMTNAPIMLNVDCDMFANNPQVALHAMCLLLGFDDEIHSGFVQAPQKFYGGLKDDPFGNQMQVITKKI AGAMNVLTRVSAVMTNAPIMLNVDCDMFANNPQVALHAMCLLLGFDDEIHSGFVQAPQKFYGGLKDDPFGNQMQVITKKI AGAMNVLTRVSAVMTNAPIMLNVDCDMFANNPQVALHAMCLLLGFDDEIHSGFVQAPQKFYGGLKDDPFGNQMQVITKKI -------------------------------------------------------------------------------AGAMNVLARVSAVMTNAPIILNMDCDMFVNNPQVVLHAMCLLLGFNDETCSGFVQVPQRFYAKLKDDPFGNQIEVLREKL AGAMNVLARVSAVMTNAPIILNVDCDMFVNNSQVVLHAMCLLLGFDDETCSGFVQVPQRFYGKLKDDPFGNQMEVLREKL AGAMNVLARVSAVMTNAPIILNVDCDMFVNNPQVVLHATCLLLGFDDETCSGFVQVPQRFYGKLKDDPFGNQMEVLRS-AGAMNVLARVSAVMTNAPIILNVDCDMFVNNSQVVLHAMCLLLGFDDETCSGFVQVPQRFYGKLKDDPFGNQMEVLREKL

349 396 349 0 346 351 347 351

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1

GGGLAGIQGTFYGGTGCFHRRKVIYGMPPPD--TVKHETRGSPSYKELQAKFGSSKELIESSRNIISGDLLARPTVDISS GGGLAGIQGTFYGGTGCFHRRKVIYGMPPPD--TVKHETRGSPSYKELQAKFGSSKELIESSRNIISGDLLARPTVDISS GGGLAGIQGMFYGGTGCFHRRKVIYGVPPPD--TVKHEMKGSPSYKELQAKFGSSKELIESSRNIISGDLLARPTVDLSS --------------------------------------------------------------------------MIDISS LGGLSGLQGIYYLGTGCFHRRKIIYGVAPPSFAAVKHERQGSLTYEDLRTKFGASVELAESARNIYSREIPLKPMIDISS

427 474 427 6 426

TRIAE_CS42_3DS_TGACv FGGLAGLQGIYYLGMGCFHRRKIIYGVAPSSSAAIKHEREGSRSYEDLRTKFGASVELVESARNIYSGEIPPSPMIDISS 431 TRIAE_CS42_3B_TGACv1 ------------------------------------------LSYEDLLTKFGASMELVESSRNIYSVEIPPKPMIDITS 385 TRIAE_CS42_3DS_TGACv LGGLSGLQGIFYLGTGCFHRRKIIYGVAPSSFAAVKHEREGSLSYEDLRTKFGASVELVESTRNIYSREIPPKPMVNISS 431 TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

RVEMAKQVGDCNYEAGTCWGQEIGWVYGSMTEDILTGQRIQAAGWESALLDTDPPAFLGCAPTGGPASLTQFKRWATGLL RVEMAKQVGDCNYEAGTCWGQEIGWVYGSMTEDILTGQRIQAAGWESALLDTDPPAFLGCAPTGGPASLTQFKRWATGLL RVEMAKQVGDCNYEAGTCWGQEIGWVYGSMTEDILTGLRIHAAGWESALLDTEPPAFLGCAPTGGPASLTQFKRWATGLL RIQVAKQVSSCNYETDTHWGQEIGWSYGSMAEDILTGQRIHSSGWKSTLLDTNPPAFLGCAPTGGPASLTQYKRWATGLL RIQVAKQVSSCNYETGTHWGQEIGWSYGSMAEDILTGQRIHSAGWKSTSPDTNPPAFLGCAPTGGPASLTQYKRWATGLL RIQVAKQVSSCNYETDTHWGQEIGWSYGSMAEDILTGQRIHSSGWKSTLLDTNPPAFLGCAPTGGPASLTQYKRWATGLL RIQVAKQVSTCNYETGTHWGEEASNHG----------------------------------------------------CIQVAKQVSSCNYETGTHWGQEIGWSYGSMAEDILTGQRIHSAGWKSTLLDTNPPAFLGCAPTGGPASLTQYKRWATGVL

507 554 507 86 506 511 412 511

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

EILISRNSPILGTIFKGLQLRQCLGYLIVDAWPVRAPFELCYALLGPFCLLTNQSFLPTASDEGFHIPAALFLTYNIYHL EILISRNSPILGTIFRRLQLRQCLAYLIVNAWPMRAPFEMCYALLGPFCLLTNQSFLPTTSNEGFRIPAALFLSYHVYHL EILISQNSPILGTIFRRLQLRQCLAYLIVEAWPVRAPFELCYALLGPFCLLTNQSFLPTASDEGFRIPAALFLTCHIYHL EILLGQNSPIIATIFKRLQFRQFLAYLVFYVWSMRAPFELCYALLGPFCLFRNQSFLLKASNHGFSIQLALFLSYNIYNF EILLGPNTPIIATIFKRLQFRQYLGYLVFYVWSMRAPFELCYALLGPFCLFRNHSFLLKASNHGFSIQLALFLSYNIYNF EILLGQNSPIMATVFKRLQFRQSLAYLVFYVWSMRAPFELCYALLGPFCLFRNQSFLLKASNHGFSIQLALFLSYNIYNF -FSIQLALFLSYNIYNFVEYKECGLSARTWWNNMRMRINLLLAPCFP--------------------------------EILLGQNCPIIATIFKRLQFRQCLAYLVFYVWSMRAPFELCYALLGPFCLFRNHSFLLKHQTMVSASN------------

587 634 587 166 586 591 458 579

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

MEYKECGLSVRAWWNNHRMQRITSASAWLLAFLTVILKTLGLSETVFEVTRKESSTSSDGGAGTDDADPGLFTFDSAPVF MEYKECGLSVRAWWNNHRMQRITSASAWLLAFLTVILKTLGLSETVFEVTRKESSTSSDGGTGTDEADTGLFTFDSAPVF MEYKECGLSVRAWWNNHRMQRITSASAWLLAFLTVILKTLGLSETVFEVTRKESSTSSDGGAGTDEADPGLFTFDSAPVF VEYMDCGLSARTWWNNMRMQRIVSISSWLLAFLSVVLKTIGLSKTVFEVTREDKST-SDGDPSTHETDLGWFTFDSSLVF VEYMECGLSARTWWNNMRMQRIVSLSSWLLAFLSVVLKTIGLSKTVFEVTRKDKST-SDGDPSTHETDLGWFTFDSSPVF VEYMECGLSARTWWNNMRMQRIVSISSWLLDFLSVVLKTIGLSKTVFEVTRKDKST-SDGDPSTHETDLGWFTFDSSPVF ---------------------------------------------------------------------------------------------------------------------------------------------------------------

667 714 667 245 665 670 458 579

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

IPVTALSVLNIVALTVAAWRAVVGTVAG-VHGGPGVGEFVCCGWMVLCFWPFVRGLVSSGKYGIPWSVRVKAGLIVAAFV IPVTALSMLNIVALAVAAWRAVVGTAAG-VHGGPGVGEFVCCGWMVLCFWPFMRGLVSSGKYGIPWSVRVKAGLIVAAFV IPVTVLSMLNIVALAVAAWRAVVGAAAG-VHGGPGIGEFVCCGWIVLCFWPFVRGLVSRGKYGIPWSVRVKAGLIVAAFV IPVTTVAILNIATIAIGVWRHAIFWMITGNHDWQNIGEFICCGWAILYFWPFIKGLVGRGRYGIPWNVKLKAWVIVVAFL IPMTAVAILNIVTIAIGVWRHAIFWMTTGNHDCQNIGEFLCCGLMILYFWPFIKGLVGRGRYGIPWNVKLKAWVIVVAFL ILVTTVAILNIATIAIGVWRHAIFWMITGNHDCQNIGELCVLDG--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

746 793 746 325 745 714 458 579

TRIAE_CS42_2AL_TGACv TRIAE_CS42_2BL_TGACv TRIAE_CS42_2DL_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv

HLCTRN HLCTRN HICTRN YFCRGD YFCRGD ----------------

752 799 752 331 751 714 458 579

Color Align Conservation results TRIAE_CS42_3B_TGACv1 MAAKPSQDAPLQLHTVEVDQPIATVNRLLAVLHVALAAAAIAHRGAHVMLAADLVLLFLWALSQAPMWRPVSRAAFPSRL TRIAE_CS42_3DS_TGACv MATKPSQDAPLPLHTVQTDQPLATVNRLLAAVHLALGAAAIAHRGAHVMLAADLVLLFLWALSQAPMWRPVSRTAFPSRL TRIAE_CS42_3AS_TGACv MAARPSQDAPLQLHTVQTDQPLATVNRLLAALHVALAAAAIAHRGAHVMLAPDLVLLFLWALSQAPMWRPVSRAAFPSRL TRIAE_CS42_3DS_TGACv MAAEPSQDAPLQLHTVQTDQPLATVNRHLAALHVALAAAAIAHRGAHVMLAADLVLLFLWALSQAPMWRPVSRAAFPSRL

80 80 80 80

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

SRAALPAVDVMVVTADPDKEPAAKVMNTVVSAMALNYPGGRLSVYLSDDAGSPRTLLAARKAYAFARAWVPFCRKYGVRC SRAALPAVDVMVVTADPEKEPAAKVMNTVVSAMALDYPGGRLSVYLSDDAGSPRTLLAARKAYAFARAWVPFCRKYGVRC SRPALPAVDVMVVTADPDKEPAAKVMNTVVSAMALDYPGGRLSVYLSDDAGSPRTLLAARKAYALARAWVPFCRKYGVRC SRAALPAVDVMVVTADPDKEPAAKVMNTVVSAMALDYPGGRLSVYLSDDAGSPRTLLAARKAYAFARAWVPFCRKYGVRC

160 160 160 160

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

PCPDRFFAGDDQLDLDGHHRQELDDDRLRIKKMYETFKEGVEEVMSDAALSQSWTKADHDAHVEIITGDE-QDSSNSNSG PCPDRFFAGDDKLDLGSHHHHELADDRLRIKNMYETFNEGVREVMSDADLSQSCTKADHDAHVEIITGDE-QDSSNSNSG PCPDRFFAGDDQLDLGDHHRQELDDDRLRIKNVYETFKEGVEEVMNDATLSQSWTKADHHAHVEIITDEQGQDSSHSNSG PCPDRFFAGDDQLDLGGHHRQELDDDRLRIKNMYETFKEGVEKVMNDAALSQSWTKADHDAHVE-------QDSSNSNSG

239 239 240 233

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

DGEEDEDATPLLVYVSRGKRRSSTHHFKAGALNVLLRVSSLMSNSPYVMVLDCDMYCNSRSSILEAMCFHLDGRRRADLA DGEEDEDAMPLLVYVSREKRRSSTHHFKAGALNVLLRVSSLLSNSPYVMVLDCDMYCNSRSSILEAMCFHLDGRRRADLA DGDGDEDAMPLLVYVSREKRRSSTHHFKAGALNVLLRVSSLMSNSPYVMVLDCDMYCNSRSSLLEAMCFHLDGRRRADLA DGEEDEDAMPLLVYVSREKRRSSAHHFKAGALNVLLRVSSLMSNSPYVMVLDCDMYCNSWSSVLEAMCFHLDGRRRADLA

319 319 320 313

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

FVQFPQMFHNLSTSDIYANELRSIFWT----------------------------RWKGLDGLRGPILSGTGFCARRDAI FVQFPQMFHNLSSSDIYANELRSIFWT----------------------------RWKGLDGLRGPILSGTGFCARRDAI FVQFPQMFHNLSSSDIYANELRPIFWVRKKTNRPCIASVIFSEFSSNLGACMVQTRWKGLDGLRGPILSGTGFCVRRDAV FVQFPQMFHNLSSSDIYANELRSIFWAGPTG---------------------LRDAVERRGRPPGPILSGTGFCVRRDAV

371 371 400 372

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

YGALPASSQDQ-FSGVEVGELKRRFGVSNGHIASLRRPGTGSTIVARDALP---QDAELVACCDYETGTEWGEEVGFLYQ YGARPASSQDQ-FSGVEVGELKRRFGVSNGHIASLRRSGTGSTIVARDALPQ--EDAELVASCAYETGTEWGEQVGFLYQ YGAGPGSSQEQ-FSGVEVGELKRRFGVSNGHIASLRRSGTGSTIVAAGDVLP--QDAELVASCDYETGTEWGEDVGFLYQ YGAGPGSSQDHQSSGVEVGELKRRFGVSNGHIASLRRSGTGSTIVARDGLPQPQEDAELVASCDYETGTEWGEEVGFLYQ

447 448 477 452

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

SVVEDYFTGYRQLYCRGWTSVYCFPATGSRPPFLGSVPTNLNDALVQNKRWMSGMLAVGLSRHCPLASAAAISVPESMGF SVVEDYFTGYRQLYCRGWTSVYCFPAAASRPPFLGSVPTNLNDALVQNKRWMSGMLAVGLSRHCPLAS-AAICVPQSMGF SVVEDYFTGYRQLYCRGWTSVYCFPATGSRPPFLGSVPTNLNDALVQNKRWMSGLLAVGLSRHCPLASAAAISVPQSMGF SVVEDYFTGYRQLYCPGWTSVYCFPATGTRPPFLGSVPTNLNDALVQNKRWMSGMLAVGLSRHCPLASAAAVSVPQSMGF

527 527 557 532

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

AYYAFMALYAFPVLCYAIVPQLCFFRGGTSFP-EASTLWFAAVFVSSSLQHLVEVSVAKRGLAARTCWNEQRFWALNAVT AYYAFMALYAFPVLCYATVPQLCFLRGGTSFPGAASTLWFAAVFASSSLQHLVEVSVAKRGLALRTWWNEQRFWALNAVT AYYAFTPLYAFPLLCYATVPQLCFLRGATSFPEAASTLWFAAVFASSSLQHLVEVSVAKRGLAARTWWNEQRFWALNAVT AYYAFMALYAFPVLCYATVPQLCFLRGGTSFP-GESALWFAAVLASSSLQHLVEVSFAKRGLAARAWWNEQRFWALNAVT

606 607 637 611

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

GQLFACLSVALNLVDGAGGRAVDFDLTSKASDDRLYRDGVFDFAGCSTLLLPATTLCLLNAAALVGGVWKMVGRGGNMPGQLFACLGVALNLVG-AGGRAVDFDLTSKASDDRLYRDGVFDFAGCTTLLLPATTLCLLNAAALVGGVWKMVGRGGSVSGQIFACLGVALSLVG-AGGRAVDFDLTSKASGDRLYRDGVFDFAGCSALLLPATTLCLLNAAALVGGVWKMVGRGGNVSG GQLFACVSVALSLVG-AGGRAVDFDLTSKASDDRLYRDSVFDFAGCSALLLPATTLCLLNTAALVGGVWKMVGRGGSVS-

685 685 716 689

TRIAE_CS42_3B_TGACv1 TRIAE_CS42_3DS_TGACv TRIAE_CS42_3AS_TGACv TRIAE_CS42_3DS_TGACv

-GELFLLCYIAALSYPLLQGMFLRRDLARVPARITAMSVAMVATLLSLFG -GELFLLCYVAALSYPLLQGMFLRRDPARVPAPITAMSVAMVAALLSLFG TGELFLLCYVAALSYPLLQGMFLRRDPARVPARITAVSVAIVATLLSLFG -GELFLLCYVAALSYPLLEGMFLRRDPARVPAWITAMSVAMVATLLSLFG

734 734 766 738