Network Protein Sequence @nalysis
(NPS@ is the IBCP contribution to PBIL in Lyon, France)

Thompson, J.D., Higgins, D.G. and Gibson, T.J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. (1994) Nucleic Acids Research, 22, 4673-4680.




CLUSTAL W (1.74) multiple sequence alignment

                          10        20        30        40        50        60
                           |         |         |         |         |         |
AquifexFDR        ---------------------------------------------------------MEI
Sulfol-acidocauld ---------------------------------------------------------MEK
ProteusvFDR       ---------------------------------------------------------MQT
EcoliFDR          ---------------------------------------------------------MQT
Hinfluenz         ---------------------------------------------------------MQT
Archeoglobus      ------------------------------------------------------------
M.tubercul        --------------------------------------------------------MICQ
Mycobac           --------------------------------------------------------MIQQ
Ecoli             -------------------------------------------------------MKLPV
Ecoli2            -------------------------------------------------------MKLPV
Shewanella        -------------------------------------------------------MSIPV
Coxiella          ------------------------------------------------------MSSIRV
Rf-FDR            ---------------------------------------------------MTATSKLTT
Bos               MSGVAAVSRLWRARRLALTCTKWSAAWQTGTRSFHFTVDGNKRSSAKVSDAISAQYPVVD
bos2              MSGVAAVSRLCARPALALTCTKWSAAWQTGTRSFHFTVDGNKRSSAKVSDAISAQYPVVD
Homo              MSGVRGLSRLLSARRLALA-KAWPTVLQTGTRGFHFTVDGNKRASAKVSDSISAQYPVVD
Homo2             MSGVRGLSRLLSARRLALA-KAWPTVLQTGTRGFHFTVDGNKRASAKVSDSISAQYPVVD
Mus               ------------------------------------------------------------
Gallus            ------------------------------------------------------------
Celegans          --------------------MLRAASNGLRNTVAARSVSLSAANHSDAKRSDIAQYKVVD
Celegans2         --------------------MLRAASNGLRNTVAARSVSLSAANHSDAKRSDIAQYKVVD
Celegans3         ---------------------------MLNVVKSINRAKTPVRTYMKKQVSATTNFDVVD
ascaris           ---------------------MLRAVRALICRIGARRTLSVSSSRLDVSTSNIAQYKVID
nemato4           ---------------------MLSVAHRLVTKACQRRTLSLSTIRKETRTSSIAEYRIVD
Drosophila        ----------------------MQRAAAVGVQRSYHITHGRQQASAANPDKISKQYPVVD
Plasmodiumf       ----------------------------------------MQSSFCRFSNIKTKAYDIID
sacharro          ----------MLSLKKSALSKLTLLRNTRTFTSSALVRQTQGSVNGSASRSADGKYHIID
sacharro2         ----------MLSLKKGITKSYILQRTFTSSSVVRQIGEVK-SESKPPA-----KYHIID
Spombe            ------------------------------------------------------------
Arabidopsis       -------------MWRCVSRGFRAPASKTSSLFDGVSGSRFSRFFSTGSTDTRSSYTIVD
Rickettsia        ---------------------------------------------------MTKAYNIIH
Bradyrhizo        ------------------------------------MANETNGKGNGAPATNGKAYPIED
Rrubrum           ----------------------------------------------------MASYEIVD
Paradeni          ----------------------------------------------------MAAYKYET
Synecho           -----------------------------------------------------------M
Natronobac        ---------------------------------------------------------MTI
Mjanaischii       ------------------------------------------------------------
Methanobac        -------------------------------------------------MLKILSMESQV
                                                                              

                          70        80        90       100       110       120
                           |         |         |         |         |         |
AquifexFDR        KKYDVVVIGGGGAGLRTAIEVAKD-PNISVALVSKVYPTRSHTGAAQGGMNAAL-GNVIP
Sulfol-acidocauld LSYDAVIIGAGLAGLMAAHEISK--AGYSAAVISKVFPTRSHSAAAEGGIAAYVNGNSDP
ProteusvFDR       FNADIAIIGAGGAGLRAAIAAAEANPQLKIALISKVYPMRSHTVAAEGGSAAVT----QA
EcoliFDR          FQADLAIVGAGGAGLRAAIAAAQANPNAKIALISKVYPMRSHTVAAEGGSAAVA----QD
Hinfluenz         VNVDIAIVGAGGGGLRAAIAAAEANPNLKIALVSKVYPMRTHTVAAEGGAAAVI----KE
Archeoglobus      MEHDIVIVGAGIAGMRAAIAAAEKSRKLSIALIAKTYPIRCHSVCAEGGTAAVL----RE
M.tubercul        HRYDVVIVGAGGAGMRAAVEAGPRVR---TAVLTKLYPTRSHTGAAQGGMCAAL-ANVE-
Mycobac           HRYDVVIVGAGGAGMRAAVEAGPRVR---TAVLTKLYPTRSHTGAAQGGMCAAL-ANVE-
Ecoli             REFDAVVIGAGGAGMRAALQISQSGQ--TCALLSKVFPTRSHTVSAQGGITVAL-GNTH-
Ecoli2            REFDAVVIGAGGAGIARALQISQSGQ--TCALLSKVFPTRSHTVSAQGGITVAL-GNTH-
Shewanella        REFDVIVIGAGGAGMRAALQISKEGK--SCALLSKVFPTRSHTVSAQGGITVAL-GNAH-
Coxiella          KQYDALIVGAGGAGLRAALEMAQSRQY-KVAVVSKVFPTRSHTVSAQGGIAAAL-GNVV-
Rf-FDR            RKFDVVIVGAGGSGMSASLRLSNAGL--NVAVLTKVFPTRSHTVAAQGGIGASL-GNMA-
Bos               HEFDAVVVGAGGAGLRAAFGLSEAGF--NTACVTKLFPTRSHTVAAQGGINAAL-GNME-
bos2              HEFDAVVVGAGGAGLRAAFGLSEAGF--NTACVTKLFPTRSHTVAAQGGINAAL-GNME-
Homo              HEFDAVVVGAGGAGLRAAFGLSEAGF--NTACVTKLFPTRSHTVAAQGGINAAL-GNME-
Homo2             HEFDAVVVGAGGAGLRAAFGLSEAGF--NTACVTKLFPTRSHTVAAQGGINAAL-GNME-
Mus               --------------LRAAFGLSEAGF--NTACLTKLFPTRSHTVAAQGGINAAL-GNME-
Gallus            ------------------------------------------------------------
Celegans          HAYDAVVVGAGGAGLRAAMGLAEGGL--KTAVITKLFPTRSHTVAAQGGINAAL-GNMN-
Celegans2         HAYDAVVVGAGGAGLRAAMGLAEEGL--KTAVITKLFPTRSHTVAAQGGINAAL-GNMN-
Celegans3         HTFDAVVVGAGGAGLRAAMGLSEGGM--KTAVITKLFPTRSHTVAAQGGVNAAL-GNMN-
ascaris           HAYDVVIIGAGGAGLRAAMGLGEAGF--KTAVVTKMFPTRSHTTAAQGGINAAL-GSMN-
nemato4           HAFDAVVVGAGGAGLRAAMGLSENGQ--NVAVITKLFPTRSHTVAAQGGVNAAL-GNMN-
Drosophila        HAYDAIVVGAGGAGLRAAFGLVAEGF--RTAVITKLFPTRSHTIAAQGGINAAL-GNME-
Plasmodiumf       HHYDAVIVGAGGAGLRSALELSKNKY--KVACISKLFPTRSHTVAAQGGINAAL-GNMT-
sacharro          HEYDCVVIGAGGAGLRAAFGLAEAGY--KTACISKLFPTRSHTVAAQGGINAAL-GNMH-
sacharro2         HEYDCVVVGAGGAGLRAAFGLAEAGY--KTACLSKLFPTRSHTVAAQGGINAAL-GNMH-
Spombe            ------------------------------------------------------------
Arabidopsis       HTYDAVVVGAGGAGLRAAIGLSEHGF--NTACITKLFPTRSHTVAAQGGINAAL-GNMS-
Rickettsia        HKFDVVVVGAGGAGLRSAFGMAKEGL--NTACITKIFPTRSHTVAAQGGISAAL-GNMG-
Bradyrhizo        HTYDVVVVGAGGAGLRAVVGCSEAGL--RTACITKVFPTRSHTVAAQGGISASL-GNMH-
Rrubrum           HEYDALVVGAGGAGLRATFGLVEQGL--KTACITKVFPTRSHTVAAQGGIGAAL-GNMA-
Paradeni          HEYDVVVVGAGGAGLRATLGMAEQGL--RTACVTKVFPTRSHTVAAQGGIAASL-SNMG-
Synecho           LEQDVVIVGGGLAGCRAALEIKRLAPDTKVAIVAKTHPIRSHSVAAQGGIAASL-KNVDA
Natronobac        HEHDVIVVGAGGAGLRAAIAAHEEGA--DVAMVTKLHPVRSHTGAAEGGINAAI----RD
Mjanaischii       MKTDILIIGGGGAAARAAIECRDKNV---IIAVKGLFGKSGCTVMAEGGYNAVF----NP
Methanobac        YECDVLIIGSGGAGCRAAIEVSEHNLT-PLIVSKGLSFKSGCTGMAEGGYNAAFAC-VDP
                                                                              

                         130       140       150       160       170       180
                           |         |         |         |         |         |
AquifexFDR        DDSPEVHAYDTIKGSDFLADQDAVFFMTEKAPEIIYELDRWGVPFSRLPDGRIAQRPFGG
Sulfol-acidocauld NDSPDYMAYDTIKGGDYLVDQDAAELLAYKSGEIVELLEKWGALFNRQPDGRIALRYFGG
ProteusvFDR       HDSYDFHFNDTVSGGDWLCEQDVVDYFVEHCPTEMTQLELWGCPWSRKEDGSVNVRRFGG
EcoliFDR          HDSFEYHFHDTVAGGDWLCEQDVVDYFVHHCPTEMTQLELWGCPWSRRPDGSVNVRRFGG
Hinfluenz         EDSYDKHFQDTVAGGDWLCEQDVVEYFVQHSPVEMTQLERWGCPWSRKADGDVNVRRFGG
Archeoglobus      GDSFDLHAWDTVKGADFLADQDAVEFFVRECPKEIIRLENWGCPWSRNEDGTIAQRPFGG
M.tubercul        DDNWEWHTFDTVKGGDYLADQDAVEIMCKEAIDAVLDLEKMGMPFNRTPEGRIDQRRFGG
Mycobac           DDNWEWHAFDTVKGGDYLADQDAVEIMCKEAIDAVLDLEKMGMPFNRTPEGRIDQRRFGG
Ecoli             EDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFGG
Ecoli2            EDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFGG
Shewanella        EDHWEQHMYDTVKGSDFIGDQEAIEFMCQTGPEAIIELEQMGLPFSRFEDGTIYQRPFGG
Coxiella          PDKPIWHMFDTVKGSDYLGDQDAIQYMCEQAPPSVYELEHYGLPFSRLDDGRIYQRAFGG
Rf-FDR            EDNWHYHFYDTIKGSDWLGDQDAIEFMCRERTQVVYELEHFGMPFDRNADGTIYQRPFGG
Bos               EDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPASVVELENYGMPFSRTEDGKIYQRAFGG
bos2              EDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPASVVELENYGMPFSRTEDGKIYQRAFGG
Homo              EDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPAAVVELENYGMPFSRTEDGKIYQRAFGG
Homo2             EDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPAAVVELENYGMPFSRTEDGKIYQRAFGG
Mus               EDNWRWHFYDTVKGSDWLGDQDAIHYMTEQAPASVVELENYGMPFSRTEDGKIYQRAFGG
Gallus            --------------------------------------------------GKIYQRAFGG
Celegans          PDNWRWHFYDTVKGSDWLGDQDAIHYMTREAERAVIELENYGMPFSRTTDGKIYQRAFGG
Celegans2         PDNWRWHFYDTVKGSDWLGDQDAIHYMTREAERAVIELENYGMPFSRTTDGKIYQRAFGG
Celegans3         PDNWRWHFYDTVKGSDWLGDQDAIHYMTREAERAIIELENYGMPFSRTTDGKIYQRAFGG
ascaris           PDDWKWHFYDTAKGSDWLGDQNAMHYLTRNAVEAVTELENFGMPFSRTPEGKIYQRSFGG
nemato4           PDDWRWHFYDTVKGSDWLGDQNAIHYMTREAVRAVIEMENYGMPFSRTEEGKIYQRSFGG
Drosophila        EDDWKWHMYDTVKGSDWLGDQDAIHYMTREAPKAVIELENYGMPFSRTQDGKIYQRAFGG
Plasmodiumf       EDDWRWHAYDTIKGSDWLGDQNAIHYMCREAPDSVLELEEFGLPFSRTKDGKIYQRAFGG
sacharro          KDNWKWHMYDTVKGSDWLGDQDSIHYMTREAPKSIIELEHYGVPFSRTENGKIYQRAFGG
sacharro2         PDDWKSHMYDTVKGSDWLGDQDAIHYMTREAPKSVIELEHYGMPFSRTEDGRIYQRAFGG
Spombe            ----------------------------------------------RTKEGKIYQRAFGG
Arabidopsis       EDDWRWHMYDTVKGSDWLGDQDAIQYMCREAPKAVIELENYGLPFSRTEEGKIYQRAFGG
Rickettsia        EDDWRWHMYDTVKGSDWLGDQDAIEYMCKNAPDAILELEHYGVPFSRTVDGKIYQRPFGG
Bradyrhizo        KDDWRWHMYDTVKGSDWLGDQDAIEYMVRNAPDAVYELEHWGVPFSRTEDGKIYQRPFGG
Rrubrum           EDNWKWHMYDTVKGADWLGDQDAIEYMCREAIPAVYELEHYGVPFSRTEDGRIYQRPFGG
Paradeni          PDNWQWHMYDTVKGSDWLGDTDAMEYLAREAPKAVYELEHYGVPFSRTEEGKIYQRPFGG
Synecho           EDSWEAHAFDTVKGSDYLADQDAVEILTKEAPEVIIELEHLGVLFSRLPDGKIAQRAFGG
Natronobac        GDDWELHAYDTMKGSDYLGDAPAIETLAQDAPEEVIQLEHWGMPFSREDDGRVSQRPFGG
Mjanaischii       KDSFKKHFYDTVKGGGFINNPKLVEILVKNAPKELLNLERFGALFDRTEDGFIAQRPFGG
Methanobac        EDSPDVHFEDTMRGGGFLNDPQLVRILVDEAPDRLRDLETYGALFDRQESGLLDQRPFGG
                                                                    * :  * ***

                         190       200       210       220       230       240
                           |         |         |         |         |         |
AquifexFDR        AS--------FPRTVFAADKTGHVLLHTLFEQALARDNITFFNEYFLLDLIH------DG
Sulfol-acidocauld QT--------YPRTRFVGDKTGMALLHTLYERTSGSGKVDFYFEWFAWELIR------DE
ProteusvFDR       MK--------IERTWFAADKTGFHMLHTLFQTSLKYPQIQRFDEHFVLDILV------DE
EcoliFDR          MK--------IERTWFAADKTGFHMLHTLFQTSLQFPQIQRFDEHFVLDILV------DD
Hinfluenz         MK--------IERTWFAADKTGFHLLHTLFQTSIQYPQIQRFDEHFVLDILV------DD
Archeoglobus      HS--------FNRATYAKDRTGFHEVHTLYERMLMYDNVEIFPEYFITNLAI------EN
M.tubercul        HTRDHGKAP-VRRACYAADRTGHMILQTLYQNCVKH-DVEFFNEFYALDLALTQ--TPSG
Mycobac           HTRDHGKAP-VRRACYAADRTGHMILQTLYQNCVKH-DVEFFNEFYALDLVLTQ--TPSG
Ecoli             QSKNFGGEQ-AARTAAAADRTGHALLHTLYQQNLKN-HTTIFSEWYALDLVKN-----QD
Ecoli2            QSKNFGGEQ-AARTAAAADRTGHALLHTLYQQNLKN-HTTIFSEWYALDLVKN-----QD
Shewanella        QSRNFGGEQ-AARTAAAADRTGHALLHCLYQQNVKH-KTDVYSEWYALDLVKN-----ED
Coxiella          HTRDFGKEM-ARRTCACADRTGHAMLHTLYQKNVEA-GTHFYYEWYGIDLVRG-----AQ
Rf-FDR            HTANYGEKP-VQRACAAADRTGHAMLHTLYQQNVKA-RTNFFVEWMALDLIRD-----AE
Bos               QSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRY-DTSYFVEYFALDLLM------ES
bos2              QSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRY-DTSYFVEYFALDLLM------ES
Homo              QSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRY-DTSYFVEYFALDLLM------EN
Homo2             QSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRY-DTSYFVEYFALDLLM------EN
Mus               QSLKFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRY-DTSYFVEYFALDLLM------EN
Gallus            QSLQFGKGGQAHRCCCVADRTGHSLLHTLYGRSLRY-DTSYFVEYFALDLLM------EN
Celegans          QSNDFGRGGQAHRTCCVADRTGHSLLHTLYGASLQY-NCNYFVEYFALDLIM------EN
Celegans2         QSNDFGRGGQAHRTCCVADRTGHSLLHTLYGASLQY-NCNYFVEYFALDLIM------EN
Celegans3         QSNDFGRGGQAHRTCCVADRTGHSLLHTLYGASLQY-DCNYFVEYFALDLIM------DK
ascaris           QSNNYGKGGVAKRTCCVADRTGHSMLHTLYGNSLRC-HCTFFIEYFALDLLM------DK
nemato4           QSNNFGKGGMARRTCCVADRTGHSMLHTLYGSSLQY-NCRYFIEYFALDLLM------DK
Drosophila        QSLKFGKGGQAHRCCAVADRTGHSLLHTLYGQSLNY-DCNYFVEYFALDLIF------ED
Plasmodiumf       QSLKYGKGGQAYRCAAAADRTGHAMLHTLYGQSLSY-NCIFFVEYFVLDLLML-----NS
sacharro          QTKEYGKGAQAYRTCAVADRTGHALLHTLYGQALRH-DTHFFIEYFALDLLT------HN
sacharro2         QSKDFGKGGQAYRTCAVADRTGHAMLHTLYGQALKN-NTHFFIEYFAMDLLT------HN
Spombe            QSLEYGKGGQAYRCAAVADRTGHSILHTLYGQSLKH-NTNFFIEYFGMDLLM------EG
Arabidopsis       QSLDFGKGGQAYRCACAADRTGHALLHTLYGQAMKH-NTQFFVEYFALDLLMA-----SD
Rickettsia        MTTEYGKGKAAQRTCAAADRTGHAILHTLYQQSLKH-KVQFFIEYFAIDLLME-----D-
Bradyrhizo        MTLDFGKGQA-QRTCAAADRTGHAMLHTMYGQSLRH-AAEFFIEFFAIDLIMD-----DQ
Rrubrum           HMRNFGEAP-VQRACAAADRTGHAILQTLYQQSLRF-NAEFFVEYFALDLIIE-----D-
Paradeni          HTTEFGEGPPVQRTCAAADRTGHAILHTLYGQSLKE-KAEFFIEYFALDLIIT-----D-
Synecho           H----S----HNRTCYAADKTGHAILHELVNNLRRN-KVEIYDEWYVMKLIY------EE
Natronobac        LS--------FPRTTYAGAETGHHLLHTMYEQVVKR-GIKVYDEFYVSELAVTDHDDPED
Mjanaischii       QS--------FNRTCYCGDRTGHEIMRGLMEYISKFERIKILEEVMAIKLIV------KD
Methanobac        QT--------YRRTCYQGDRTGHEMITALKEEVIRRD-IETVEEIMITSLLV------EE
                              *      .**   :  :              *    .:          

                         250       260       270       280       290       300
                           |         |         |         |         |         |
AquifexFDR        ERVKGVTIYDIRNGEVLFLQAKAVVLATGGFARIYWFRSTNAIGNTGDGQAVALRAGVPL
Sulfol-acidocauld SRVRGVVAFDMRNMVPFFFKAKAVVIAAGGMGMLYR-HTTNSYIGTGDGYAMALRARVAL
ProteusvFDR       GHARGVVAINMMEGTKVQIRANAVIMATGGAGRVYR-FNTNGGIVTGDGMGIALRHGVPL
EcoliFDR          GHVRGLVAMNMMEGTLVQIRANAVVMATGGAGRVYR-YNTNGGIVTGDGMGMALSHGVPL
Hinfluenz         GHARGMVAMNMMEGSLVQINANAVVIATGGGCRAFK-FNTNGGIVTGDGLSMAYRHGVPL
Archeoglobus      GAVQGVSAIQLKTGEMEFFEAKAVIFATGGAGRLYG-FTTYSHQVTGDGLAIAYRNGIPL
M.tubercul        PVATGVIAYELATGDIHVFHAKAVVIATGGSGRMYK-TTSNAHTLTGDGIGIVFRKGLPL
Mycobac           PVTTGVVAYELATGDIHVFHTKAVVIATGGSGRMYK-TTSNAHTLTGDGIGIVFRKGLPL
Ecoli             GAVVGCTALCIETGEVVYFKARATVLATGGAGRIYQ-STTNAHINTGDGVGMAIRAGVPV
Ecoli2            GAVVGCTALCIETGEVVYFKARATVLATGGAGRIYQ-STTNAHINTGDGVGMAIRAGVPV
Shewanella        DVVVGCTAIEIETGEIVYFKAKATILATGGAGRIYA-STTNAHINTGDGVGMAARAGVQL
Coxiella          GGIAGMIAMNMETSELVFFKSRATIFATGGAGRIYE-TTSNAYTNTGDGIGMVLRAGLPV
Rf-FDR            GDVVGVTALEMETGEIHVLRAKSVLLATGGAGRIFA-ASTNAFINTGDGLGIAARAGIPL
Bos               GECRGVIALCIEERVHPPHQGQEHCHRHRSYGRTYF-SCTSAHTSTGDGTAMVTRAGLPC
bos2              GECRGVIALCIEDGSIHRIRARNTVIATGGYGRTYF-SCTSAHTSTGDGTAMVTRAGLPC
Homo              GECRGVIALCIEDGSIHRIRAKNTVVATGGYGRTYF-SCTSAHTSTGDGTAMITRAGLPC
Homo2             GECRGVIALCIEDGSIHRIRAKNTVVATGGYGRTYF-SCTSAHTSTGDGTAMITRAGLPC
Mus               GECRGVIALCIEDGSIHRIRAKNTVIATGGYGRTYF-SCTSAHTSTGDGTAMVTRAGLPC
Gallus            GECRGVIALCIEDGTIHRFRAKNTVIATGGYGRTYF-SCTSAHTSTGDGTAMVTRAGLPC
Celegans          GVCVGVIAMDLEDGTIHRFRSKNTVLATGGYGRAFF-SCTSAHTCTGDGTALTARAGINN
Celegans2         GVCVGVIAMDLEDGTIHRFRSKNTVLATGGYGRAFF-SCTSAHTCTGDGTALTARAGISN
Celegans3         GKCIGVVALDIETGQIHRFRAKNTVLATGGYGRAYF-SCTSAHTCTGDGTALTARAGIRN
ascaris           GRCVGVIALCLEDGTIHRFRSKRTIVATGGYGRAYF-SCTTAHMNTGDGTALATRAGIAL
nemato4           GRCIGIIAMDLEDGSIHRFRAKNTVIATGGYGRAFF-SCTSAHTCTGDGTAMITRAGLQN
Drosophila        GECRGVLALNLEDGTLHRFRAKNTVLATGGYGRAFF-SCTSAHTCTGDGTAMVARQGLPS
Plasmodiumf       NECIGVICINIADGKIHRFFTPHTVIATGGYGRAYL-SCTSAHACTGDGNAIVARSKLPL
sacharro          GEVVGVIAYNQEDGTIHRFRAHKTIIATGGYGRAYF-SCTSAHTCTGDGNAMVSRAGFPL
sacharro2         GEVVGVIAYNQEDGTIHRFRAHKTVIATGGYGRAYF-SCTSAHTCTGDGNAMVSRAGFPL
Spombe            GECRGVIAMNLEDGSIHRFRAHKTILATGGYGRAYF-SCTSAHTCTGDGNAMVSRAGLPL
Arabidopsis       GSCQGVIALNMEDGTLHRFRSSQTILATGGYGRAYF-SATSAHTCTGDGNAMVARAGLPL
Rickettsia        GECRGVVAWNLDDGSLHCFRAHNVVLATGGYGRAYF-SATSAHTCTGDGGGMVIRAGLPL
Bradyrhizo        GTCRGVIALKLDDGTLHRFRAQTVILATGGYGRAYA-SCTSAHTCTGDGGGMVLRAGLPM
Rrubrum           GVCRGVIAWCMEDGTIHRFKSHTTVLATGGYGRAYF-SCTSAHTCTGDGNGMVARAGLPL
Paradeni          GACTGVVCWKLDDGTIHVFNAKMVVLATGGYGRAYF-SATSAHTCTGDGGGMVARAGLPL
Synecho           GEAKGLVMYEIATGRIEIVRAKAVMVATGGYGRVYN-TTSNDYASTGDGLAMAAIAGIPL
Natronobac        RECHGCVAYDIKSGDIVGFRATGGVILATGGDGQVFDHTTNAVANTGDGPAMAYRAGVPV
Mjanaischii       NRCYGAIFLDLKTGNIFPIFAKATILATGGAGQLYP-ITSNPIQKTGDGFAIAYNEGAEL
Methanobac        DRVLGAMGVSIRDSSTVAFRASSTILAAGGAGHIYP-VTSNTIQKGGDGFSVAWKAGADL
                      *                        .         :      *** .:        

                         310       320       330       340       350       360
                           |         |         |         |         |         |
AquifexFDR        KDMEFIQFHPTGLA----KTGILLSEACRGEGGYLLNKEGERFMKRYAPDK---------
Sulfol-acidocauld KDPEFVQFHPTALY----PSDILISEAARGEGAVLKNAKGERFMTRYAPKK---------
ProteusvFDR       RDMEFVQYHPTGLP----GSGILMTEGCRGEGGILVNKDGYRYLQDYGLGPETPLGKPEN
EcoliFDR          RDMEFVQYHPTGLP----GSGILMTEGCRGEGGILVNKNGYRYLQDYGMGPETPLGEPKN
Hinfluenz         RDMEFVQYHPTGLP----NTGILMTEGCRGEGGILVNKDGYRYLQDYGLGPETPIGKPQN
Archeoglobus      KDMEFFQFHPTGLV----PSGILMTEGCRGEGGYLLNKNGERFMKRYAPEK---------
M.tubercul        EDMEFHQFHPTGLA----GLGILISEAVRGEGGRLLNGEGERFMERYAPTI---------
Mycobac           EDMEFHQFHPTGLA----GLGILISEAVRGEGGRLLNGENERFMEHYAPTI---------
Ecoli             QDMEMWQFHPTGIA----GAGVLVTEGCRGEGGYLLNKHGERFMERYAPNA---------
Ecoli2            QDMEMWQFHPTGIA----GAGVLVTEGCRGEGGYLLNKHGERFMERYAPNA---------
Shewanella        QDMEMWQFHPTGIA----GAGVLVTEGCRGEGGYLLNKDGERFMERYAPNA---------
Coxiella          QDMEFWQFHPTGIY----GVGCLITEGARGEGGYLINKDGERFMERYSPHL---------
Rf-FDR            QDMEFWQFHPTGVA----GAGVLLTEGCRGEGAILRNSAGERFMERYAPTL---------
Bos               QDLEFVQFHPTGIY----GAGCLITEGCRGEGGILINSQGERFMERYAPVA---------
bos2              QDLEFVQFHPTGIY----GAGCLITEGCRGEGGILINSQGERFMERYAPVA---------
Homo              QDLEFVQFHPTGIY----GAGCLITEGCRGEGGILINSQGERFMERYAPVA---------
Homo2             QDLEFVQFHPTGIY----GAGCLITEGCRGEGGILINSQGERFMERYAPVA---------
Mus               QDLEFVQFHPTGIY----GAGCLITEGCRGEGGILINSQGERFMERYAPVA---------
Gallus            QDLEFVQFHPTGIY----GAGCLITEGCRGEGGILINSQGERFMERYAPVA---------
Celegans          SDMEFVQFHPTGIY----GAGCLITEGSRGEGGYLVNSAGERFMERYAPNA---------
Celegans2         SDMEFVQFHPTGIY----GAGCLITEGSRGEGGYLVNSSGERFMERYAPNS---------
Celegans3         SDMEFVQFHPTGIY----GVGCLITEGSRGEGGYLVNSQGERFMERYAPNA---------
ascaris           EDLEFIQFHPTGIY----GVGCLITEGSRGEGGFLVNSEGERFMERYAPKA---------
nemato4           SDMEFVQFHPTGIY----GAGCLITEGSRGEGGFLVNSEGERFMKKYAPNA---------
Drosophila        QDLEFVQFHPTGIY----GAGCLITEGCRGEGGYLINGNGERFMERYAPVA---------
Plasmodiumf       QDLEFVQFHPTGIY----PAGCLITEGCRGEGGILRNKEGEAFMMRYAPKA---------
sacharro          QDLEFVQFHPSGIY----GSGCLITEGARGEGGFLVNSEGERFMERYAPTA---------
sacharro2         EDLEFVQFHPSGIY----GSGCLITEGARGEGGFLLNSEGERFMERYAPTA---------
Spombe            QDLEFVQFHPTGIY----GAGCLITEGCRGEGGYLLNSKGERFMERYAPTA---------
Arabidopsis       QDLEFVQFHPTGIY----GAGCLITEGSRGEGGILRNSEGERFMERYAPTA---------
Rickettsia        QDMEFVQFHPTGIY----SAGCLITEGARGEGGYLVNANGERFMERYAPAA---------
Bradyrhizo        QDMEFVQFHPTGIY----GSGCLVTEGARGEGGYLVNSEGERFMERYAPSA---------
Rrubrum           QDMEFVQFHPTGIY----GAGCLITEGARGEGGYLTNSEGERFMERYAPTA---------
Paradeni          QDMEFVQFHPTGIY----GSGCLITEGARGEGGYLTNSEGERFMERYAPTY---------
Synecho           EDMEFVQFHPTGLY----PVGVLISEAVRGEGAYLINSEGRRFMEDYAPSR---------
Natronobac        EDMEFVQFHPTTLP----STGVLISEGVRGEGGILYNSEGERFMFEYGYANN----D---
Mjanaischii       IDMEMVQFHPTGMV----GTGILVTEAVRGEGGILYNKYKERFMVRYDKER---------
Methanobac        IDMEQVQFHPTGMVYPESRRGVLVTEAVRGEGGILLNSEGERFMKRYDP-R---------
                   * *  *:**: :       . *::*. ****. * *     ::  *             

                         370       380       390       400       410       420
                           |         |         |         |         |         |
AquifexFDR        --MELAPRDIVSRAIETEIREGRGVGE--GAR-AYVYLDLRHLGEEKIKERLPQVRQLAI
Sulfol-acidocauld --LDLAPRDIVSRAIITEIKEGRGFP---GG---YVGLDISHLGEEYIKERLALAYEAAK
ProteusvFDR       KYMELGPRDKVSQAFWHEWRAGRTIKT--HRG-DVVHLDLRHLGAKKLHERLPFICELAK
EcoliFDR          KYMELGPRDKVSQAFWHEWRKGNTIST--PRG-DVVYLDLRHLGEKKLHERLPFICELAK
Hinfluenz         KYMELGPRDKVSQAFWQEWKKGNTLKT--AKGVDVVHLDLRHLGEKYLHERLPFICELAS
Archeoglobus      --MEIAPRDVVSRAMWTEIIEGRGFEG--EYG-PYIALDLRHLGEEKIEERLPLIRDAAI
M.tubercul        --VDLAPRDIVARSMVLEVLEGRGA----GPLKDYVYIDVRHLGEEVLEAKLPDITEFAR
Mycobac           --VDLAPRDIVARSMVLEVLEGRGA----GPHKDYVYIDVRHLGEEVLESKLPDITEFSR
Ecoli             --KDLAGRDVVARSIMIEIREGRGCD---GPWGPHAKLKLDHLGKEVLESRLPGILELSR
Ecoli2            --KDLAGRDVVARSIMIEIREGRGCD---GPWGPHAKLKLDHLGKEVLESRLPGILELSR
Shewanella        --KDLASRDVVARSMMTEIREGRGLD---GPLGPHCLLKLDHLGKETLEARLPGVCELSR
Coxiella          --KDLDCRDVVARSILQEVMAGGGV----GPKKDHVLLKLDHLGEKVLRERLPGIIELSE
Rf-FDR            --KDLAPRDFVSRCMDQEIKEGRGC----GPNKNYINLDMTHLGAETIAKRLPSVFEIGH
Bos               --KDLASRDVVSRSMTLEIREGRGC----GPEKDHVYLQLHHLPPAQLAMRLPGISETAM
bos2              --KDLASRDVVSRSMTLEIREGRGC----GPEKDHVYLQLHHLPPAQLAMRLPGISETAM
Homo              --KDLASRDVVSRSMTLEIREGRGC----GPEKDHVYLQLHHLPPEQLATRLPGISETAM
Homo2             --KDLASRDVVSRSMTLEIREGRGC----GPEKDHVYLQLHHLPPEQLATRLPGISETAM
Mus               --KDLASRDVVSRSMTLEIREGRGC----GPEKDHVYLQLHHLPPEQLATRLPGISETAM
Gallus            --KDLASRDVVSRSMTIEIREGRGC----GPEKDHVYLQLHHLPPQQLATRLPGISETAM
Celegans          --KDLASRDVVSRSMTVEIMEGRGV----GPDKDHIYLQLHHLPAEQLQQRLPGISETAM
Celegans2         --KDLASRDVVSRSMTVEIMEGRGV----GPDKDHIYLQLHHLPAEQLQQRLPGISETAM
Celegans3         --KDLASRDVVSRAMTMEINEGRGV----GPNKDHIYLQLHHLPAEQLQQRLPGISETAQ
ascaris           --KDLASRDVVSRAETIEIMEGRGV----GPEKDHIYLQLHHLPAEQLHQRLPGISETAK
nemato4           --LDLASRDVVSRAMTIEIMEGRGV----GKDKDHIYLQLHHLPAKDLHAKLPGIMETAM
Drosophila        --KDLASRDVVSRSMTIEIMEGRGA----GPEKDHVYLQLHHLPPKQLAERLPGISETAM
Plasmodiumf       --KDLASRDVVSRAMTIEINEQRGC----GPNADHIYLDLTHLPYETLKERLPGIMETAK
sacharro          --KDLACRDVVSRAITMEIREGRGV----GKKKDHMYLQLSHLPPEVLKERLPGISETAA
sacharro2         --KDLASRDVVSRAITMEIRAGRGV----GKNKDHILLQLSHLPPEVLKERLPGISETAA
Spombe            --KDLASRDVVSRAMTVEIREGRGV----GPEKDHCYLQLSHLPAEILKERLPGISETAA
Arabidopsis       --KDLASRDVVSRSMTMEIREGRGV----GPHKDHIYLHLNHLPPEVLKERLPGISETAA
Rickettsia        --KDLASRDVVSRAMTIEIREGRGV----GEHKDHVFLHLNHLSPEILHRRLPGISETAK
Bradyrhizo        --KDLASRDVVSRAMTIEIREGRGV----GKKKDHIFLHLDHLDPAVLAERLPGISESAK
Rrubrum           --KDLASRDVVSRAMTVEIREGRGV----GPKKDHINLHLEHLGPEVLHSRLPGITETAK
Paradeni          --KDLASRDVVSRCITIEIREGRGV----GPHKDHMHLNLMHLPPESLAERLPGISESAK
Synecho           --MELAPRDITSRAITLEIRAGRGVNADGSAGGPYVYLDLRHMGREKIMSRIPFCWEEAH
Natronobac        --GELASRDVVSRAELTEVNEGRGI------NDEYVFLDMRHLGDERINDRLENIIHLAE
Mjanaischii       --MELSTRDVVARAIYKEIQEGRGV-------NGGVYLDVSHLPNEVIEKKLETMLKQFL
Methanobac        --GELATRDVVARAIYTEIMEGRGTG------NGGVYLDVSHLPDEVIEEKLETMLLQFQ
                     ::  ** .::.   *                   :.: *:    :  ::        

                         430       440       450       460       470       480
                           |         |         |         |         |         |
AquifexFDR        DFEGVDPAKELVPIRPSAHYCMGGIHVENY----KTSETP-------------LKGLYAV
Sulfol-acidocauld TFSGVDATKELIPIRPAHHYYMGGIDVDI-----TGKNPD-------------VIGLFAA
ProteusvFDR       AYVGVDPVNEPIPVRPTAHYTMGGIETNQ------RTETR-------------IKGLFAV
EcoliFDR          AYVGVDPVKEPIPVRPTAHYTMGGIETDQ------NCETR-------------IKGLFAV
Hinfluenz         AYEGVNPVNEPIPVRPVVHYTMGGIEVDF------NSETR-------------IKGLFAV
Archeoglobus      KFAGVDPVEEPIPVRPVAHYTMGGIDTNV------RCETA-------------VKGFFAA
M.tubercul        TYLGVDPVTELVPVYPTCHYLMGGIPTTV-TGQVLRDNT------------SVVPGLYAA
Mycobac           TYLGVDPVHELVPVYPTCHYVMGGIPTTV-TGQVLRDNT------------STVPGLYAA
Ecoli             TFAHVDPVKEPIPVIPTCHYMMGGIPTKV-TGQALTVNE--------KGEDVVVPGLFAV
Ecoli2            TFAHVDPVKEPIPVIPTCHYMMGGIPTKV-TGQALTVNE--------KGEDVVVPGLFAV
Shewanella        TFAHIDPADGPIPVLPTCHYMMGGLPTKV-SGQVIRMND--------DGTEQDVLGLFAV
Coxiella          KFANVDITKEPIPILPTCHYMMGGIPTNI-HGQALTVDE--------NGKDQIIEGLFAA
Rf-FDR            NFANVDITKESIPVVPTIHYQMGGIPTNI-YGQVVTPDG--------SGSQKVVKGLYAV
Bos               IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-KGQVLRHV---------NGQDQGVPGLYAC
bos2              IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-KGQVLRHV---------NGQDQVVPGLYAC
Homo              IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-KGQVLRHV---------NGQDQIVPGLYAC
Homo2             IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-KGQVLRHV---------NGQDQIVPGLYAC
Mus               IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-KGQVLKHV---------NGQDQIVPGLYAC
Gallus            IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-KGQVITHV---------NGEDKVVPGLYAC
Celegans          IFAGVDVTKEPIPVIPTVHYNMGGVPTNY-KGQVLNYTP--------KKGDEVVPGLYAA
Celegans2         IFAGVDVTKEPIPVIPTVHYNMGGVPTNY-KGQVLNYTP--------KKGDEVVPGLYAA
Celegans3         IFAGVDVTKEPIPVIPTVHYNMGGVPTNY-KGQVLDFTP--------EGGDKVIPGLYAA
ascaris           IFAGVDVTKEPIPVIPTVHYNMGGIPTNY-KAQVIKYTK--------EGGDKIVPGLYAC
nemato4           IFAGVDAAKEPIPVLPTVHYNMGGIPTNY-MGQVLTHKR--------DKGDQLVPGLYAC
Drosophila        IFAGVDVTREPIPVLPTVHYNMGGVPTNY-RGQVITIDK--------DGKDVIVPGLYAA
Plasmodiumf       IFAGVDVTKQYIPVLPTVHYNMGGIPTNY-KTQVLTQNVNFNKQTNKSNEDIIVKGLYAA
sacharro          IFAGVDVTKEPIPIIPTVHYNMGGIPTKW-NGEALTIDEET-------GEDKVIPGLMAC
sacharro2         VFAGVDVTQEPIPVLPTVHYNMGGIPTKW-TGEALTIDEET-------GEDKVIPGLMAC
Spombe            IFAGVDVTKEPIPVLPTVHYNMGGIPTRF-TGEVLTIDEN--------GKDKIVPGLYAA
Arabidopsis       IFAGVDVTKEPIPVLPTVHYNMGGIPTNY-HGEVVTIKG--------DDPDAVIPGLMAA
Rickettsia        IFAGVDVTKDPIPVLPTVHYNMGGIPTNY-QGQVIIKDG--------KNHNSIVNGIMAI
Bradyrhizo        IFANVDVTREPIPIVPTVHYNMGGIPTNY-HGEVLTKKD--------GDDNAVIPGLMAI
Rrubrum           VFSGVDATRDPIPVLPTVHYNMGGIPTNY-HGEVLNPTK--------ADPEAIFPGLMAV
Paradeni          IFAGVDVTREPIPILPTVHYNMGGIPTNY-WGEVLNPTQ--------DNPDQVFPGLMAV
Synecho           RLVGIDAVEQPMPVRPTVHYCMGGIPVNT-DGRVRKNAN------------ELTEGFFAA
Natronobac        DFEGVNPVEEPMPVKPGQHYHMGGVETNE-------------------HGETCIDGLYAA
Mjanaischii       R-VGIDIRKEPMIVSPTAHHFMGGLKINE-----RCETN--------------IIGLFAC
Methanobac        D-VGIDIRSEPMEVAPTAHHFMGGVRIDE-----WGRTN--------------LRNLFAA
                      ::     : : *  *: ***:                              .: * 

                         490       500       510       520       530       540
                           |         |         |         |         |         |
AquifexFDR        GECACVSVHGANRLGGNSLTELVVFGKYCGM-AVREFVK---------------------
Sulfol-acidocauld GEAACVSVHGANRLGSNSLLETLVFGRETGN-EVVKFLQ---------------------
ProteusvFDR       GECSSVGLHGANRLGSNSLAELVVFGRLAGEEAVRRAQE---------------------
EcoliFDR          GECSSVGLHGANRLGSNSLAELVVFGRLAGEQATERAAT---------------------
Hinfluenz         GECASSGLHGANRLGSNSLAELVVLGRVAGEYAAQRAVE---------------------
Archeoglobus      GECACVSIHGANRLGSNSTAECLVFGRVAGEVAAEYALK---------------------
M.tubercul        GECACVSVHGANRLGTNSLLDINVFGRRAGIAAASYAQ----------------------
Mycobac           ANAHVCPCTAPTGWAPTRC-----------------------------------------
Ecoli             GEIACVSVHGANRLGGNSLLDLVVFGRAAGLHLQESIA----------------------
Ecoli2            GEIACVSVHGANRLGGNSLLDLVVFGRAAGLHLQESIA----------------------
Shewanella        GEIACVSVHGANRLGGNSLLDLVVFGRAAGQHLGKALD----------------------
Coxiella          GECACVSVHGANRLGTNSLLDLVVFGRAIGLHLEEALK----------------------
Rf-FDR            GECACVSVHGANRLGTNSLLDIVVFGRAAGKHIVQFNN----------------------
Bos               GEAACASVHGANRLGANSLLDLVVFGRACALSIAESCR----------------------
bos2              GEAACASVHGANRLGANSLLDLVVFGRACALSIAESCR----------------------
Homo              GEAACASVHGANRLGANSLLDLVVFGRACALSIEESCR----------------------
Homo2             GEAACASVHGANRLGANSLLDLVVFGRACALSIEESCR----------------------
Mus               GEAACASVHGANRLGANSLLDLVVFGRACALSIAESCR----------------------
Gallus            GEAASASVHGANRLGANSLLDLVVFGRACALTIAETCK----------------------
Celegans          GECGAHSVHGANRLGANSLLDLVIFGRACAIDILKNTS----------------------
Celegans2         GECGAHSVHGANRLGANSLLDLVIFGRACAIDILKNTS----------------------
Celegans3         GECAAHSVHGANRLGANSLLDLVIFGRSCALTILNENK----------------------
ascaris           GECACHSVHGANRLGANSLLDAVVFGRACSINIKEELK----------------------
nemato4           GEAAAHSVHGANRLGANSLLDLVVFGRACAIDILEKAKK---------------------
Drosophila        GEAASSSVHGANRLGANSLLDLVVFGRACAKTIAELNK----------------------
Plasmodiumf       GEAASASVHGANRLGANSLLDIVVFGKRAALTIMEIDK----------------------
sacharro          GEAACVSVHGANRLGANSLLDLVVFGRAVAHTVADTLQ----------------------
sacharro2         GEAACVSVHGANRLGANSLLDLVVFGRAVANTIADTLQ----------------------
Spombe            GEAACVSVHGGNRLGANSLLDIVVFGRACALHIKDTLE----------------------
Arabidopsis       GEAACASVHGANRLGANSLLDIVVFGRACANRVAEISK----------------------
Rickettsia        GEAACVSVHGANRLGSNSLLDLVVFGRSSALKAAELIK----------------------
Bradyrhizo        GEAACVSVHGANRLGSNSLIDLVVFGRAAALRLAEKLT----------------------
Rrubrum           GECACVSVHGANRLGTNSLLDIVVFGRAAALRAAEVVK----------------------
Paradeni          GEAGCASVHGANRLGSNSLIDLVVFGRAAAIRAGQVID----------------------
Synecho           GECACVSVHGGNRLGSNSLLECVVYGRRTGRSIAEYVQ----------------------
Natronobac        GECACASVHGSNRLGGNALPELIVFGARAGHHAAGRDLGTAEVPTGPSAETEREEGLETP
Mjanaischii       GEVT-GGVHGANRLGGNALADTQVFGAIAGKSAKEFVE----------------------
Methanobac        GEVT-GGVHGANRLGGNALADTQVFGRRAGIAAARNAI----------------------
                  .:       . .  . .                                           

                         550       560       570       580       590       600
                           |         |         |         |         |         |
AquifexFDR        ----------------E-TDFAPVSESEPKKSEEFIEELMKREGNESLAQVRAQMGEITW
Sulfol-acidocauld ----------------SFTEPSSDIDKEAEKAEQSAYDIMKKESGVHFGDILEKLRDYMW
ProteusvFDR       ----------------ATPANASALDAQTRDIEDNLKKLMNQKGSENWAQIRDEMGEAME
EcoliFDR          ----------------AGNGNEAAIEAQAAGVEQRLKDLVNQDGGENWAKIRDEMGLAME
Hinfluenz         ----------------AQSVNQSAVDAQAKDVVARLEALHKQEGNESWSEIRDEMGTVME
Archeoglobus      ----------------AKQGKISA-EFREKEEKRIFDELLGKSGDESPYQIKKELNETME
M.tubercul        -----------------GHDFVDMPPNPEAMVVGWVSDILSEHGNERVADIRGALQQSMD
Mycobac           ------------------------------------------------------------
Ecoli             ----------------EQGALRDASESDVEASLDRLNRWNNNRNGEDPVAIRKALQECMQ
Ecoli2            ----------------EQGALRDASESDVEASLDRLNRWNNNRNGEDPVAIRKALQECMQ
Shewanella        ----------------ATPDPKAASDVEISASLARLNRWESNKDGEEPAVIRKDLQLCMQ
Coxiella          ----------------TELKHRSENPDDIDAAIARLKRWEKPNNVENPALLRQEMRKAMS
Rf-FDR            ----------------ESDTHKPVPENGADISLDRLNRLDNSTSGEYAQVVRDDIRNTMQ
Bos               ----------------PGDKVPSIKPNAGEESVMNLDKLRFANGSIRTSELRLNMQKSMQ
bos2              ----------------PGDKVPSIKPNAGEESVMNLDKLRFANGSIRTSELRLNMQKSMQ
Homo              ----------------PGDKVPPIKPNAGEESVMNLDKLRFADGSIRTSELRLSMQKSMQ
Homo2             ----------------PGDKVPPIKPNAGEESVMNLDKLRFADGSIRTSELRLSMQKSMQ
Mus               ----------------PGDKVPSIKANAGEESVMNLDKLRFADGSIRTSELRLNMQKSMQ
Gallus            ----------------PGEPVPSIKPNAGEESVANLDKLRFADGTIRTSEARLNMQKTMQ
Celegans          ----------------AGVGVPELPKNAGEASVANIDKLRHNKGDISTAELRLTMQKSMQ
Celegans2         ----------------AGVGGPELPKNAGEASVANIDKLRTTREDISTAELRLTMQKSMQ
Celegans3         ----------------PGDSIPELPVNCEEKSCDNLNGLLHSKGDISSIELRQKMQMTMQ
ascaris           ----------------PDEKIPELPEGAGEESIANLDAVRYANGDVPTAELRLTMQKTMQ
nemato4           ----------------SPEKIPELPENAGESTIANVDKLRFAKGDIPTAALRLKMQKTMQ
Drosophila        ----------------PGAPAPTLKENAGEASVANLDKLRHANGLITTADLRLKMQKTMQ
Plasmodiumf       ----------------PNIPKINANTNIGEESIQRLDHIRFNKGSIQTSQLRKKMQICMQ
sacharro          ----------------PGLPHKPLPSDLGKESIANLDKLRNANGSRSTAEIRMNMKQTMQ
sacharro2         ----------------PGLPHKPLASNIGHESIANLDKVRNARGSLKTSQIRLNMQRTMQ
Spombe            ----------------PNTPHKPLAADAGLDSLKFLDQIRTSQGPKHTSEIRLDMQKTMQ
Arabidopsis       ----------------PGEKQKPLEKDAGEKTIAWLDRLRNSNGSLPTSTIRLNMQRIMQ
Rickettsia        ----------------PASPHKPLQKETLEKIINRFDKVRYANGNILVADLRLKMQRTMQ
Bradyrhizo        ----------------PNAKQPELPANSAELALGRLDHYRYASGGTPTAKLREGMQHVMQ
Rrubrum           ----------------PGIAHKPLKADAADLALSRLDRLRSAKGAKLTAEIRDDLQQAMQ
Paradeni          ----------------REAQIPTTNKEQVDKALDRFDRIRNADGSVSTADLRLEMQRTMQ
Synecho           ----------------GRSLPEIDEAVYKTEAQTRIDQLLNQQGTVRINTLRQAFQDCMT
Natronobac        VEPGALDTGDSDVAADGALVEPDELVEQAVETERQRVEELLESDGINHAEIREDLQKAMT
Mjanaischii       -------------------NHDFNNIDAEEDVAKILEEINSLKGDLNVYNLIEDLRKVMW
Methanobac        -------------------KSAPASIKSTVEEEEYRIKSMVAEGSHSPSEIRDRLHEAMW
                                                                              

                         610       620       630       640       650       660
                           |         |         |         |         |         |
AquifexFDR        AKMGIFRDEKSLKEAYDELSELLERWNNIPV-VDKAKVFNTNLIEVLELRNMLELARVVA
Sulfol-acidocauld EFVGIYRDENGLKNAVSEILKLREQMKNMYV-LDKSKVYNTEFYNALELKNMIDLGLVIA
ProteusvFDR       EGCGIYRTPELMQKTIDKLTELKERFKHVEI-KDTSSVFNTDLLYKIELGFGLDVAECMA
EcoliFDR          EGCGIYRTPELMQKTIDKLAELQERFKRVRI-TDTSSVFNTDLLYTIELGHGLNVAECMA
Hinfluenz         EGCGIYRDQASMQKAVDKIAELKERYKRIRV-SDNSSVFNTDVLYTVELGYILDVAQSIA
Archeoglobus      ANMWIFREEKGLKEAVKKIKELKERYRNVEI-NDKSRGFNTDLTSAIEIGYMLDLAEVVA
M.tubercul        NNAAVFRTEETLKQALTDIHALKERYSRITV-HDKGKRFNTDLLEAIELGFLLELAEVTV
Mycobac           ------------------------------------------------------------
Ecoli             HNFSVFREGDAMAKGLEQLKVIRERLKNARL-DDTSSEFNTQRVECLELDNLMETAYATA
Ecoli2            HNFSVFREGDAMAKGLEQLKVIRERLKNARL-DDTSSEFNTQRVECLELDNLMETAYATA
Shewanella        LNFSVFRSGDAMAEGLEQLQAIQKRLENAKL-SDNSTEFNTQRIECLELDNLMATALATA
Coxiella          EDFGVFREEQKMKQGLERLQKLNERLQRAKL-TDTSRTFNNARIEALELDNLMEVSYATA
Rf-FDR            THASVFRTQASMDEGVIKIAAMRERVKNITL-ADKSKIFNTARIEALEVDNMIEAAQATM
Bos               SHAAVFRVGSVLQEGCEKISSLYGDLRHLKT-FDRGMVWNTDLVETLELQNLMLCALQTI
bos2              SHAAVFRVGSVLQEGCEKISSLYGDLRHLKT-FDRGMVWNTDLVETLELQNLMLCALQTI
Homo              NHAAVFRVGSVLQEGCGKISKLYGDLKHLKT-FDRGMVWNTDLVETLELQNLMLCALQTI
Homo2             NHAAVFRVGSVLQEGCGKISKLYGDLKHLKT-FDRGMVWNTDLVETLELQNLMLCALQTI
Mus               NHAAVFRVGSVLQEGCEKISQLYGDLKHLKT-FDRGMVWNTDLVETLELQNLMLCALQTI
Gallus            SHAAVFRTGSILQEGCEKLSQIYCDLAHLKT-FDRGIVWNTDLVETLELQNLMLCALQTI
Celegans          NHAAVFRRGDILKEGVKVLSKLYKDQAHLNV-ADKGLVWNSDLIETLELQNLLINATQTI
Celegans2         NHAAVFRRGDILKEGVKVLSKLYKDQAHLNV-ADKGLVWNSDLIETLELQNLLINATQTI
Celegans3         KHAAVFRRGDLLKEGVDKMSSIYKEQQNLKACADSGKVWNSELVETLELQNLLINANQTI
ascaris           KHAGVFRRGDILAEGVKKMMDLSKELKRLKT-TDRSLIWNSDLTESLELQNLMLNATQTI
nemato4           QHAAVFRRGDILKEGITKMEALFKEQKLLKT-TDRGLIWNSDLAETLELQNLMLNATQTI
Drosophila        HLAAVFRDG---------------------------------------------------
Plasmodiumf       KHAAVFRIGPLLQEGYKQILEICSIFKDIEI-TDKTLTWNTDLLETLELENLLTLASQTI
sacharro          KDVSVFRTQSSLDEGVRNITAVEKTFDDVKT-TDRSMIWNSDLVETLELQNLLTCASQTA
sacharro2         KDVSVFRTQDTLDEGVRNITEVDKTFEDVHV-SDKSMIWNSDLVETLELQNLLTCATQTA
Spombe            RDVSVFRMEETLQEGVKNIARVDGTYKDIGI-RDRGLIWNTDLVEALELRNLLTCAVQTA
Arabidopsis       NNAAVFRTQETLEEGCQLIDKAWESFGDVQV-KDRSMIWNSDLIETLELENLLINASITM
Rickettsia        SHVSVFRTQKLLDEGVGMISEIRNRYKDIKI-NDKSLIWNSDLVEALELDNLLDQALVTV
Bradyrhizo        SNCAVFRTGEVLSEGQNLIEKVHSGITDIAV-SDRSLVWNSDLVETLEFDNLIAQAVVTM
Rrubrum           RDAAVFRTTKSLAEGVARVDQVAASLADIKL-VDTSMIFNTDLAEALELENLMACAQTTI
Paradeni          ADAAVFRTDKTLAEGVDKMRVIAGKLSDLKV-TDRSLIWNSDLMETLELTNLMPNALATI
Synecho           SHCGVFRSESFMAEGLEQVQNLKAQYGQIFL-DDKQPQWNTEVIEALELQSIMAVGELIL
Natronobac        ENVNVFREEEGLKEALEVIRECRERYQNVAV-SDPSRTFNTDLIHTIETRNLIDIAETIT
Mjanaischii       DYVSIIRNEDGLKKALEKIDEIERNIDNVKV--NGIIDLQNYF----ELKNMVVVAKLVT
Methanobac        NGVAIVRSRESLESARAVIQDLTTMMGDLNV--PETSGFNTYLIEALELENMLVTSSMVV
                                                                              

                         670       680       690       700       710       720
                           |         |         |         |         |         |
AquifexFDR        YSALHRRESRGGHSRED-----------------YPQRDDKNFLKHSLVYYDKNGN-LKL
Sulfol-acidocauld STALNRKESRGAHYRTD-----------------YPKRDDQNWLKHTIAYLSGN-T-VEI
ProteusvFDR       HSAFNRKESRGAHQRLDE----G-----------CTERDDVNFLKHTLAFYNPEGA-PRL
EcoliFDR          HSAMARKESRGAHQRLDE----G-----------CTERDDVNFLKHTLAFRDADGT-TRL
Hinfluenz         NSAIERKESRGAHQRLD-----------------YTERDDVNYLKHTLAFYNENGA-PRI
Archeoglobus      IGALKRQESRGAHYRLD-----------------YPKRDDENWLKHTLAYYTPEG--PKF
M.tubercul        VGALNRKESRGGHAREDYPNRDDV-----------------NYMRHTMAYKEIGADKEGP
Mycobac           ------------------------------------------------------------
Ecoli             VSANFRTESRGAHSRFDF-----------------PDRDDENWLCHSLYLPESES----M
Ecoli2            VSANFRTESRGAHSRFDF-----------------PDRDDENWLCHSLYLPESES----M
Shewanella        YAANFRTESRGAHSREDY-----------------LDRDDDNWLCHSLYNPVTQG----M
Coxiella          VSAQQRTESRGAHSRYDY-----------------KERDDANWLKHTVYFRDGH-----I
Rf-FDR            VSAAARRECRGAHTVLDYDRPADDATCP-------LGRDDVNWMKHTLWDRDTNS----L
Bos               YGAEARKESRGGPRREDFKERVDEYDYSKPIQGQQKKPFEQHWRKHTLSYVDIKTGKVTL
bos2              YGAEARKESRGGPRREDFKERVDEYDYSKPIQGQQKKPFEQHWRKHTLSYVDIKTGKVTL
Homo              YGAEARKESRGAHAREDYKVRIDEYDYSKPIQGQQKKPFEEHWRKHTLSYVDVGTGKVTL
Homo2             YGAEARKESRGAHAREDYKVRIDEYDYSKPIQGQQKKPFEEHWRKHTLSFVDVGTGKVTL
Mus               YGAEARKESRGAHAREDYKVRVDEYD----------------------------------
Gallus            YGAEARKESRGAHAREDYKFRIDDFDYSKPLQGQQKRPFEEHWRKHTLSYVDVKSGKVTL
Celegans          VAAENREESRGAHARDDFPDRLDELDYSKPLEGQTKKELKDHWRKHSIIRSNIETGEVSL
Celegans2         VAAENREESRGAHARDDFPDRLDELDYSKPLEGQTKKELKDHWRKHSIIRSNIETGEVSL
Celegans3         VAAENRTESRGAHARDDFQERIDEYDYSNPLEGQQKKPFDQHWRKHSIIGIDTKTGAVDL
ascaris           VAAENRKESRGAHARDDFPKREDEYDYSKPIEGQTKRPFEKHWRKHTLTKQDPRTGHITL
nemato4           TAAEARKESRGAHARDDFPTRIDEFDYSRSLDNQTKKPFDQHWRKHTMIEQNHETGKITL
Drosophila        ------------------------------------------------------------
Plasmodiumf       LAAVERKESRGAHARDDFPERDDKNYLKHSLTWMTDRNIENTKYFTTYRDVITKPLDNEM
sacharro          VSAANRKESRGAHAREDYPNRDDE-----------------HWMKHTLSWQKDVAAPVTL
sacharro2         VSASKRKESRGAHAREDYAKRDDV-----------------NWRKHTLSWQKGTSTPVKI
Spombe            NAALNRKESRGAHAREDYPERDDK-----------------NWIKHTLTWQHKTGDPVTL
Arabidopsis       HSAEARKESRGAHAREDFTKREDG-------------------EWMKHTLGYWEDEKVRL
Rickettsia        CSAAARKESRGAHAREDYPDRNDR-----------------DWIKHTLSSIDDSGK-VVL
Bradyrhizo        NSAANRTESRGAHAREDFSERDDK-----------------NWMKHTLAWLDDAGK-VKI
Rrubrum           HGAAARQESRGAHAHEDFPDRDDK-----------------TWMKHTLAWLDPKGK-VTL
Paradeni          VAAEARKESRGAHAHEDYPERDDA-----------------NWRKHSLAWIEGND--VKL
Synecho           TSAIQRQESRGSHAREDFPSRDDE-----------------QFLRHTLASFDGEQ--IKV
Natronobac        LGALAREEFRGAHWRQQYQERRDD-----------------EWLKHTMISWNDGS--PKL
Mjanaischii       KSALYRKESRGAHYREDFPETKE------------------EWRGNIIIKGKKMW---FE
Methanobac        ESALIREESRGSHYRKDFPETRP------------------EWLKSIVLNRNRRP---GF
                                                                              

                         730       740       750       760       770
                           |         |         |         |         |
AquifexFDR        EYIPVRITKYK-------------PEERKY---------------------
Sulfol-acidocauld TYKPVKMTKWK-------------PEERVY---------------------
ProteusvFDR       EYSDVKITKSA-------------PAKRVYGGEATAQDKQNKEKANG----
EcoliFDR          EYSDVKITTLP-------------PAKRVYGGEADAADKAEAANKKEKANG
Hinfluenz         EYSPVKITKSQ-------------PAKRVYGAEAEAQEAAAKAKEQANG--
Archeoglobus      DYKPVTITKWQ-------------PVERKY---------------------
M.tubercul        ELRSDVRLDFKP-----VVQTRYEPKERKY---------------------
Mycobac           ---------------------------------------------------
Ecoli             TRRSVNMEPKLR--------PAFPPKIRTY---------------------
Ecoli2            TRRSVNMEPKLR--------PAFPPKIRTY---------------------
Shewanella        SKRDVNMTPKLR--------EAFPPVKRTY---------------------
Coxiella          AYRPVNMKPKGM--------DPFPPKSRD----------------------
Rf-FDR            SYKPVNLKPLTV--------ASVPPKVRTF---------------------
Bos               EYRPVIDRTLNE-----TDCATVPPAIGSY---------------------
bos2              EYRPVIDRTLNE-----TDCATVPPAIRSY---------------------
Homo              EYRPVIDKTLNE-----ADCATVPPAIRSY---------------------
Homo2             EYRPVIDKTLNE-----ADCATIPPAIRSY---------------------
Mus               ---------------------------------------------------
Gallus            KYRPVIDRTLNE-----EDCSSVPPAIRSY---------------------
Celegans          DYRPVIDTTLDK-----SETDWVPPKVRSY---------------------
Celegans2         DYRPVIDTTLDK-----SETDWVAPKVRSY---------------------
Celegans3         TYRPVIDKTLDK-----SETDWVPPKVRSY---------------------
ascaris           DYRPVIDKTLDP-----AEVDWIPPIIRSY---------------------
nemato4           LYRPVIDQTLDK-----SETDWIQPMIRSY---------------------
Drosophila        ---------------------------------------------------
Plasmodiumf       EYVPPVKRVY-----------------------------------------
sacharro          KYRRVIDHTLDE-----KECPSVPPTVRAY---------------------
sacharro2         KYRNVIAHTLDE-----NECAPVPPAVRSY---------------------
Spombe            KYRAVTRTTMDE-----NEVKPVPPFKRVY---------------------
Arabidopsis       DYRPVHMDTLDD-----EIDTFPPKARVY----------------------
Rickettsia        DYKPVTLTTLT------DAISAIPPVKRVY---------------------
Bradyrhizo        EYRPVHDYTMT------NDVQYIPPKARVY---------------------
Rrubrum           DYRPVHTFTLT------DEIDYIEPKARVY---------------------
Paradeni          AYRPVHLEPLTRQDEGGIDLKKIAPKARVY---------------------
Synecho           EYMPVVINRFE-------------PKERKY---------------------
Natronobac        YYKPVILEGENK-----E----YEPKVRSY---------------------
Mjanaischii       KLDYSVFQNFLE---------------------------------------
Methanobac        IERGLKSA-------------------------------------------
                                                                     


Color code for secondary states:
h,g: Helix   e: Sheet   t: Turn   c,s,b: Coil   ?: Ambiguous

Sequences:
 1 AquifexFDR         571 aa
 2 Sulfol-acidocauld  566 aa
 3 ProteusvFDR        598 aa
 4 EcoliFDR           602 aa
 5 Hinfluenz          599 aa
 6 Archeoglobus       563 aa
 7 M.tubercul         590 aa
 8 Mycobac            401 aa
 9 Ecoli              588 aa
10 Ecoli2             588 aa
11 Shewanella         588 aa
12 Coxiella           587 aa
13 Rf-FDR             601 aa
14 Bos                665 aa
15 bos2               665 aa
16 Homo               664 aa
17 Homo2              664 aa
18 Mus                532 aa
19 Gallus             499 aa
20 Celegans           646 aa
21 Celegans2          646 aa
22 Celegans3          640 aa
23 ascaris            645 aa
24 nemato4            646 aa
25 Drosophila         509 aa
26 Plasmodiumf        620 aa
27 sacharro           640 aa
28 sacharro2          634 aa
29 Spombe             487 aa
30 Arabidopsis        634 aa
31 Rickettsia         596 aa
32 Bradyrhizo         611 aa
33 Rrubrum            594 aa
34 Paradeni           600 aa
35 Synecho            575 aa
36 Natronobac         611 aa
37 Mjanaischii        539 aa
38 Methanobac         558 aa

Consensus length: 771
Number of perfect matches (*) = 37 => 4.80 %
Number of high similarity (:) = 31 => 4.02 %
Number of low similarity  (.) = 16 => 2.08 %
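
The percentages in the summary are consistent with dividing each count by the consensus length of 771 columns. The following is a minimal sketch of that arithmetic, assuming two-decimal rounding; the function name `conservation_percent` is hypothetical and not part of NPS@ or CLUSTAL W.

```python
def conservation_percent(count, consensus_length):
    """Return count as a percentage of the alignment columns, rounded to 2 decimals."""
    return round(100.0 * count / consensus_length, 2)


# Values copied from the summary: consensus length and per-symbol column counts.
CONSENSUS_LENGTH = 771
counts = {"*": 37, ":": 31, ".": 16}

# Assumption: each percentage is taken relative to the full consensus length.
percentages = {
    symbol: conservation_percent(n, CONSENSUS_LENGTH) for symbol, n in counts.items()
}

for symbol, pct in percentages.items():
    print(f"{symbol} {counts[symbol]:3d} => {pct:.2f} %")
```

Under these assumptions the three values come out as 4.80 %, 4.02 % and 2.08 %, matching the figures reported in the summary.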

Fast Pairwise Alignment Parameters
  Ktupl size = 1
  Window size = 5
  Scoring method = Percentage
  Number of top diagonals = 5
  Gap penalty = 3

Multiple Alignment Parameters
  Weight Matrix = pam
  Gap opening penalty = 10.0
  Gap extension penalty = 0.05
  Hydrophilic gaps = On
  Hydrophilic residues = GPSNDQERK
  Residue-specific gap penalties = On