Probable amino acid sequence:
MAALVLRCVARRCLLARLSPGPSVHHVVPMATTAKEEMARFWEKNTKSSRPLSPHI
SIYKWSLPMAMSITHRGTGVALSLGVSLFSLAALLLPEQFPHYVAVVKSLSLSPALIYSA
KFALVFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSGVVVLILTLLSSAAIAAM0

              1 10 20 30 40 50 60
              | | | | | |
patpk0076f8f  -----CGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
pgl1npk003i14 ----------------------------GTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
pgf1npk011j7  ---------------------------TGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
pgm2npk010m23 --GAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
pgm1npk001h3  CGGAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
patpk0079d4f  ----------------GGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
pgl1npk002m9  ------------------CGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
pgl1npk010j14 -----------------------CTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
                                          ********************************
Prim.cons.    CGGAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGG
              (E R F K) M A A L V L R C V A R R C L L

              50 60 70 80 90 100 110 120
              | | | | | |
patpk0076f8f  CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
pgl1npk003i14 CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
pgf1npk011j7  CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCNNCCACAGCCAAGG
pgm2npk010m23 CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
pgm1npk001h3  CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
patpk0079d4f  CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
pgl1npk002m9  CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
pgl1npk010j14 CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
              **********************************************  ************
Prim.cons.    CCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGG
              A R L S P G P S V H H V V P M A T T A K

              110 120 130 140 150 160 170 180
              | | | | | |
patpk0076f8f  AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
pgl1npk003i14 AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
pgf1npk011j7  AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
pgm2npk010m23 AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
pgm1npk001h3  AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
patpk0079d4f  AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
pgl1npk002m9  AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
pgl1npk010j14 AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
              ************************************************************
Prim.cons.    AGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACA
              E E M A R F W E K N T K S S R P L S P H

              170 180 190 200 210 220 230 240
              | | | | | |
patpk0076f8f  TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
pgl1npk003i14 TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
pgf1npk011j7  TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
pgm2npk010m23 TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
pgm1npk001h3  TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
patpk0079d4f  TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
pgl1npk002m9  TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
pgl1npk010j14 TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
              ************************************************************
Prim.cons.    TCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCG
              I S I Y K W S L P M A M S I T H R G T G

              230 240 250 260 270 280 290 300
              | | | | | |
patpk0076f8f  TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
pgl1npk003i14 TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
pgf1npk011j7  TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
pgm2npk010m23 TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
pgm1npk001h3  TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
patpk0079d4f  TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
pgl1npk002m9  TTGCTCTCAGCTTAAGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
pgl1npk010j14 TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
              ************** *********************************************
Prim.cons.    TTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGT
              V A L S L G V S L F S L A A L L L P E Q

              290 300 310 320 330 340 350 360
              | | | | | |
patpk0076f8f  TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
pgl1npk003i14 TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
pgf1npk011j7  TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
pgm2npk010m23 TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
pgm1npk001h3  TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
patpk0079d4f  TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
pgl1npk002m9  TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
pgl1npk010j14 TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
              ************************************************************
Prim.cons.    TCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTG
              F P H Y V A V V K S L S L S P A L I Y S

              350 360 370 380 390 400 410 420
              | | | | | |
patpk0076f8f  CTAAATTCCGCCCTGGGTCTTCCCCCTCTCCTACCACACCTGGGAACGGAATCCGACACC
pgl1npk003i14 CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
pgf1npk011j7  CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
pgm2npk010m23 CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
pgm1npk001h3  CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AATGGAATCCGACACC
patpk0079d4f  CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
pgl1npk002m9  CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
pgl1npk010j14 CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
              ******** ******* ************************** ** *************
Prim.cons.    CTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACC
              A K F A L V F P L S Y H T W N G I R H

              410 420 430 440- - 450 460 470 480
              | | | | | |
patpk0076f8f  TCCGTGTGGGATATGGGGAAAGGGCTTCAAACTCAACCAAGGTGGGAGCAATCNGGGGGT
pgl1npk003i14 TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGNNNN
pgf1npk011j7  TC-GTGTGGGNT-TGNNNNAAGGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGNNNNN
pgm2npk010m23 TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGGG-T
pgm1npk001h3  TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGGG-T
patpk0079d4f  TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGGGGT
pgl1npk002m9  TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGGG-T
pgl1npk010j14 TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGNNNN
              ** ******* * **    * ************** *** ***** **** ** *
Prim.cons.    TC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGGG?T
              L V W D M G K G F K L S Q V E Q S G -V

              460 470 480 490 500 507
              | | | | | |
patpk0076f8f  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGGCATCGCA-GCGAT-TA-T-------
pgl1npk003i14 TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA-G---------------
pgf1npk011j7  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCAT---------------------
pgm2npk010m23 TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGC------------------
pgm1npk001h3  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA-GNNNNNNNNNNNNAGNC
patpk0079d4f  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGNCATCGCAAGCGAT-TA-CCCAAGGCC
pgl1npk002m9  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGNNNT----------------------
pgl1npk010j14 NGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCAT---------------------
               *********************************
Prim.cons.    TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA-GCGATGTAGCCCAAGGCC
              V V L I L T L L S S A A I A A M #

Prim.cons.    TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAAGCGATTACCCAAGGCC
              V V L I L T L L S S A A I A S D Y (2h88- aaiasE<stop>)
wrong frame -> C G A D S D A A L L R G H R K R L P K A <-wrong

Or Prim.cons. TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA-GCGATTACCCAAGGCC
              V V L I L T L L S S A A I A - A I T

To account for final E140, consistent with mammalian termination codon TGA:
suggested     TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAAGCGAaTgACCCAAGGCC (change T->a/c, insert g)
              V V L I L T L L S S A A I A S E # (2h88- aaiasE<stop>)

1. Extra G at 479 in two sequences, not believed in consensus. Density for 121QSGVVVLIL is very good, confirming the consensus sequence here.

2. Top sequence: 516G codes for G136, which I had been using, but it is clearly in the minority and density shows A136.

3. Longest sequence 79d4f has an extra A inserted at 524. This is believed in the first consensus, giving IASDY. Deleting that A is supported by 3 apparently weaker sequences and gives IAAIS. A139 is pretty clear, but 140 is not; I140 uses both T's, leaving no termination sequence. D140 gives TAC or TAT as the next triplet; changing the last base gives termination. Or insert g -> TGA stop. Density fits E140 well in 2803.

4. No avian sequence for SDHC extends to the end (2013), but the 2nd underlined triplet TAC (TAT) aligns with the bovine stop codon TGA (same in rhino, platypus; TGG in anole). Turkey ends in SAGIAAM. Guinea fowl same but I->L. Density for A instead of G is clear, and this is the consensus result above.

Final M and termination appear to require one "G" insert each:
patpk0076f8f  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGGCATCGCA-GCGAT-TA-T--------
pgm1npk001h3  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA-GNNNN-NN-NNNNNNAGNC
patpk0079d4f  TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGNCATCGCAAGCGAT-TA-CCCAAGGCC
PROPOSED:     TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA-GCGATGTAGCCCAAGGCC
              V V L I L T L L S S A A I A - A M #

https://www.ncbi.nlm.nih.gov/nucest/CO504141.1?report=genbank confirms C-term up to:
GCTCTCCTCCGCGGCCATCGCA-GCGA

              550 560
              | |
patpk0076f8f  ----------------------------
pgl1npk003i14 ----------------------------
pgf1npk011j7  ----------------------------
pgm2npk010m23 ----------------------------
pgm1npk001h3  CCCCCCCCCTTTCCCTCCCCCNNNCCCA
patpk0079d4f  CCCCC----TTTCCCCCC----------
pgl1npk002m9  ----------------------------
pgl1npk010j14 ----------------------------
Prim.cons.    CCCCCCCCCTTTCCC2CCCCCNNNCCCA
              P P P F P S P X P ~

Prim.cons.
CGGAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTC-GCCCTGG-TCTTCCCCCTCTCCTACCACACCTGG-AACGGAATCCGACACCTC-GTGTGGGATATGGGGAA-GGGCTTCAAACTCAGCCA-GGTGG-AGCAGTCGGGGG?TTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAAGCGATTACCCAAGGCCCCCCCCCCCTTTCCCTCCCCCNNNCCCA

Prim.cons. (blanks removed) (561 bases including "?")
CGGAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGGG?TTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAAGCGATTACCCAAGGCCCCCCCCCCCTTTCCCTCCCCCNNNCCCA

Prim.cons. (Revised C-term) (561 bases including "?")
CGGAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGGG?TTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAGCGATGTAG

Which codes for:
(ERFK)MAALVLRCVARRCLLARLSPGPSVHHVVPMATTAKEEMARFWEKNTKSSRPLSPHI
SIYKWSLPMAMSITHRGTGVALSLGVSLFSLAALLLPEQFPHYVAVVKSLSLSPALIYSA
KFALVFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSGVVVLILTLLSSAAIAAM0

C-terminus is well conserved but usually (always?) G not I at minus 5:
chick vs turkey:
Query 121 VFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSGVVVLILTLLSSAAIAAM 169
          VFPLSYHTWNGIRHLVWDMGKGFKL++V+QSGV+VLILTLLSSA IAAM
Sbjct 121 VFPLSYHTWNGIRHLVWDMGKGFKLTEVQQSGVLVLILTLLSSAGIAAM 169
Density for Ala CB is clear, and several sequences including CO504141.1 confirm.

=====================old stuff===========================================
TblastX consensus against cow (with "?" which got removed?)
Query: 15  MAALVLRCVARRCLLARLSPGPSVHHVVPMATTAKEEMARFWEKNTKSSRPLSPHISIYK 194
           MAAL+LR V R CL A LSP   + + VP+ TTAKEEM RFW KNT  +RPLSPHISIY
Sbjct: 1   MAALLLRHVGRHCLRAHLSPQLCIRNAVPLGTTAKEEMERFWSKNTTLNRPLSPHISIYG 60
Query: 195 WSLPMAMSITHRGTGVALSLGVSLFSLAALLLPEQFPHYVAVVKSLSLSPALIYSAKFAL 374
           WSLPMAMSI HRGTG+ALS GVSLF L+ALL+P  F  ++  VKSL L PALI++AKFAL
Sbjct: 61  WSLPMAMSICHRGTGIALSAGVSLFGLSALLVPGSFESHLEFVKSLCLGPALIHTAKFAL 120
Query: 375 VFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSGVVVLILTLLSSAAIASD 518
           VFPL YHTWNGIRHL+WD+GKG  +SQ+ QSGV VL+LT+LSS  +A+
Sbjct: 121 VFPLMYHTWNGIRHLMWDLGKGLTISQLHQSGVAVLVLTVLSSVGLAAM 169 (end)

Try again with N for ?
Query: 375 VFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSG 470
           VFPL YHTWNGIRHL+WD+GKG  +SQ+ QSG
Sbjct: 121 VFPLMYHTWNGIRHLMWDLGKGLTISQLHQSG 152

But where is the stop codon?
Frame 2
(ERFK)MAALVLRCVARRCLLARLSPGPSVHHVVPMATTAKEEMARFWEKNTKSSRPLSPHISIYKWSLPMAMSITHRGTGVALSLGVSLFSLAALLLPEQFPHYVAVVKSLSLSPALIYSAKFALVFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSGVVVLILTLLSSAAIASDYPRPPPPFPPXXP~
Put N for "?":
Frame 2
(ERFK)MAALVLRCVARRCLLARLSPGPSVHHVVPMATTAKEEMARFWEKNTKSSRPLSPHISIYKWSLPMAMSITHRGTGVALSLGVSLFSLAALLLPEQFPHYVAVVKSLSLSPALIYSAKFALVFPLSYHTWNGIRHLVWDMGKGFKLSQVEQSGXCGADSDAALLRGHRKRLPKAPPPFPSPXP~

Need to check reverse frame for corroboration. Brings the same hits; apparently blastn checks both strands. Only coincidence that all hits are in the same direction (or a result of the primer strategy used).

FASTA nucleotide
sequences:
>pgm2n.pk010.m23
GAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGGGTTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGC
>pgm1n.pk001.h3
CGGAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAATGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGGGTTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAGNNNNNNNNNNNNAGNCCCCCCCCCCTTTCCCTCCCCCNNNCCCA
>pat.pk0079.d4.f
GGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGGGGTTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGNCATCGCAAGCGATTACCCAAGGCCCCCCCTTTCCCCCC
>pgl1n.pk002.m9
CGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAAGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGGGTTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGNNNT
>pgl1n.pk003.i14
GTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGNNNNTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCAG
>pgl1n.pk010.j14
CTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGATATGGGGAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGGNNNNNGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCAT
>pat.pk0076.f8.f
CGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCCACCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCCGCCCTGGGTCTTCCCCCTCTCCTACCACACCTGGGAACGGAATCCGACACCTCCGTGTGGGATATGGGGAAAGGGCTTCAAACTCAACCAAGGTGGGAGCAATCNGGGGGTTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGGCATCGCAGCGATTAT
>pgf1n.pk011.j7
TGTTGAGATGCGTCGCTCGGCGGTGCCTCCTGGCCCGGCTCAGCCCCGGCCCCTCCGTGCACCACGTTGTCCCTATGGCNNCCACAGCCAAGGAGGAGATGGCTCGTTTCTGGGAGAAGAACACCAAATCCAGCCGTCCTTTGTCCCCTCACATCTCCATTTACAAGTGGTCGCTGCCCATGGCCATGTCCATCACGCACCGCGGCACCGGCGTTGCTCTCAGCTTAGGCGTTTCTCTCTTCAGCCTGGCTGCTTTGCTGCTCCCGGAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGNTTGNNNNAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGNNNNNTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCAT
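
The C-terminal alternatives weighed in the notes (keep the A at 524, delete it, the "suggested" S-E-stop edit, the "PROPOSED" A-M-stop edit) can be checked by translating each candidate. The following is a minimal sketch, not part of the original analysis. The names `translate`, `PREFIX` and `CANDIDATES` are hypothetical. The one-base offset is an assumption inferred from the translation rows under the alignment, and the lowercase a/g edits are treated as ordinary bases. It uses the standard genetic code only.

```python
# Standard genetic code (NCBI translation table 1), built from the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, offset=0):
    """Translate DNA from `offset`, dropping alignment gaps; stop after the first stop codon."""
    seq = dna.upper().replace("-", "")[offset:]
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for codons containing N or ?
        protein.append(aa)
        if aa == "*":
            break
    return "".join(protein)


# Shared start of the last alignment block, copied from the Prim.cons. row.
PREFIX = "TGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCATCGCA"

# Candidate 3' ends, copied from the notes (labels are descriptive only).
CANDIDATES = {
    "first consensus (A kept at 524)": PREFIX + "AGCGATTACCCAAGGCC",
    "A deleted": PREFIX + "-GCGATTACCCAAGGCC",
    "suggested (T->a, insert g)": PREFIX + "AGCGAaTgACCCAAGGCC",
    "PROPOSED (two G inserts)": PREFIX + "-GCGATGTAGCCCAAGGCC",
}

for label, dna in CANDIDATES.items():
    # offset=1: the first codon of this block starts at its second base (assumed frame)
    print(f"{label:34s} {translate(dna, offset=1)}")
```

Under these assumptions only the last two candidates reach a stop codon within the block, which matches the reasoning in notes 3 and 4.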
GAGCAGTTCCCTCACTACGTGGCCGTGGTGAAATCCCTCAGCCTGAGCCCCGCTCTCATCTACTCTGCTAAATTCGCCCTGGTCTTCCCCCTCTCCTACCACACCTGGAACGGAATCCGACACCTCGTGTGGGNTTGNNNNAAGGGCTTCAAACTCAGCCAGGTGGAGCAGTCGGNNNNNTGTGGTGCTGATTCTGACGCTGCTCTCCTCCGCGGCCAT
CO504141.1 confirm.
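As an illustration of how the reads listed above relate to the probable amino acid sequence, the following is a minimal sketch, not the method used to produce this record. It assumes the standard genetic code and takes the first ATG in a read as the start codon; the names `translate`, `CODON_TABLE`, and `read_start` are hypothetical. Codons containing an ambiguous base (N) are reported as X.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative translation table applies to these reads).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, frame=0):
    """Translate a DNA string codon by codon, starting at offset `frame`.

    Codons that are not in the table (for example, those containing N)
    are written as 'X'. A trailing partial codon is ignored.
    """
    dna = dna.upper()
    protein = []
    for i in range(frame, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First 57 bases of read pgm2n.pk010.m23, copied from the listing above.
read_start = "GAGCGCTTCAAGATGGCGGCGCTGGTGTTGAGATGCGTCGCTCGGCGGTGCCTCCTG"

# Assumption for illustration: the first ATG marks the start codon.
start = read_start.find("ATG")
print(translate(read_start, frame=start))  # MAALVLRCVARRCLL
```

The printed peptide matches the first 15 residues of the probable amino acid sequence given with the alignment. Reads that begin downstream of the ATG would need their frame set by hand or inferred from the alignment.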