PDB Codes and their Amino Acid Sequences for a set of 77 proteins

All-alpha proteins

1LMB    
PLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALL
AKILKVSVEEFSPSIAREIYEMYEAVS

2ABD SQAEFDKAAEEVKHLKTKPADEEMLFIYSHYKQATVGDINTERPGMLDFKGKAKWDAWNE LKGTSKEDAMKAYIDKVEELKKKYGI

1IMQ MELKHSISDYTEAEFLQLVTTICNADTSSEEELVKLVTHFEEMTEHPSGSDLIYYPKEGD DDSPSGIVNTVKQWRAANGKSGFKQG

2PDD VIAMPSVRKYAREKGVDIRLVQGTGKNGRVLKEDIDAFLAGGA

1HRC GDVEKGKKIFVQKCAQCHTVEKGGKHKTGPNLHGLFGRKTGQAPGFTYTDANKNKGITWK EETLMEYLENPKKYIPGTKMIFAGIKKKTEREDLIAYLKKATNE

1YCC GSAKKGATLFKTRCLQCHTVEKGGPHKVGPNLHGIFGRHSGQAEGYSYTDANIKKNVLWD ENNMSEYLTNPKKYIPGTKMAFGGLKKEKDRNDLITYLKKACE

256b ADLEDNMETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEMKD FRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYHQKYR

1a6n VLSEGEWQLVLHVWAKVEADVAGHGQDILIRLFKSHPETLEKFDRFKHLKTEAEMKASED LKKHGVTVLTALGAILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISEAIIHVLHSRHP GDFGADAQGAMNKALELFRKDIAAKYKELGY

1cei LKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDNRD DSPEGIVKEIKEWRAANGKPGFKQG

2cro MQTLSERLKKRRIALKMTQTELATKAGVKQQSIQLIEAGVTKRPRFLFEIAMALNCDPVW LQYGT

2a5e MEPAAGSSMEPSADWLATAAARGRVEEVRALLEAGALPNAPNSYGRRPIQVMMMGSARVA ELLLLHGAEPNCADPATLTRPVHDAAREGFLDTLVVLHRAGARLDVRDAWGRLPVDLAEE LGHRDVARYLRAAAGGTRGSNHARIDAAEGPSDIPD

1VII MLSDEDFKAVFGMTRSAFANLPLWKQQNLKKEKGLF

1BDD TADNKFNKEQQNAFYEILHLPNLNEEQRNGFIQSLKDDPSQSANLLAEAKKLNDAQAPKA

1l8w GGLVAEAFGFKSDPKKSDVKTYFTTVAAKLEKTKTDLNSLPTAVEGAIKEVSELLDKLVK AVKTAEGASSGTAAIGEVVADADAAKVADKASVKGIAKGIKEIVEAAGGSEKLKAVAAAK GENNKGAGKLFGKAGAAAHGDSEAASKAAGAVSAVSGEQILSAIVTAADAAEQDGKKPEE AKNPIAAAIGDKDGGAEFGQDEKKDDQIAAAIALRGAKDGKFAVKDGEKEKAEGAIKGAA ESAVRKVLGAITGLIGDAVSSGLRKVGDS

1ENH RPRTAFSSEQLARLKREFNENRYLTERRRQQLSSELGLNEAQIKIWFQNKRAKI 1EBD IAMPSVRKYAREKGVDIRLVQGTGKNGRVLKEDIDAFLAGG

All-beta proteins

1NYF    
VTLFVALYDYEARTEDDLSFHKGEKFQILNSSEGDWWEARSLTTGETGYIPSNYVAPV

1PKS EGYQYRALYDYKKEREEDIDLHLGDILTVNKGSLVALGFSDGQEARPEEIGWLNGYNETT GERGDFPGTYVEYIGR

1SHG KELVLALYDYQEKSPREVTMKKGDILTLLNSTNKDWWKVEVNDRQGFVPAAYVKKLD

1SRL TFVALYDYESRTETDLSFKKGERLQIVNNTEGDWWLAHSLTTGQTGYIPSNYVAPS

1FNF-9 LDSPTGIDFSDITANSFTVHWIAPRATITGYRIRHHPEHFSGRPREDRVPHSRNSITLTN LTPGTEYVVSIVALNGREESPLLIGQQST

1FNF-10 VSDVPRDLEVVAATPTSLLISWDAPAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATIS GLKPGVDYTITVYAVTGRGDSPASSKPISINYRT

1HNG DSGTVWGALGHGINLNIPNFQMTDDIDEVRWERGSTLVAEFKRKMKPFLKSGAFEILANG DLKIKNLTRDDSGTYNVTVYSTNGTRILNKALDLRILE

1TEN LDAPSQIEVKDVTDTTALITWFKPLAEIDGIELTYGIKDVPGDRTTIDLTEDENQYSIGN LKPDTEYEVSLISRRGDMSSNPAKETFTT

1TIT LIEVEKPLYGVEVFVGETAHFEIELSEPDVHGQWKLKGQPLTASPDCEIIEDGKKHILIL HNCQLGMTGEVSFQAANAKSAANLKVKEL

1WIT LKPKILTASRKIKIKAGFTHNLEVDFIGAPDPTATWTVGDSGAALAPELLVDAKSSTTSI FFPSAKRADSGNYKLKVKNELGEDEAIFEVIVQ

1CSP MLEGKVKWFNSEKGFGFIEVEGQDDVFVHFSAIQGEGFKTLEEGQAVSFEIVEGNRGPQA ANVTKEA

1MJC SGKMTGIVKWFNADKGFGFITPDDGSKDVFVHFSAIQNDGYKSLDEGQKVSFTIESGAKG PAAGNVTSL

2AIT DTTVSEPAPSCVTLYQSWRYSQADNGCAETVTVKVVYEDDTEGLCYAVAPGQITTVGDGY IGSHGHARYLARCL

1pnj GSMSAEGYQYRALYDYKKEREEDIDLHLGDILTVNKGSLVALGFSDGQEAKPEEIGWLNG YNETTGERGDFPGTYVEYIGRKKISP

1shf VTLFVALYDYEARTEDDLSFHKGEKFQILNSSEGDWWEARSLTTGETGYIPSNYVAPVD

1c9o MQRGKVKWFNNEKGYGFIEVEGGSDVFVHFTAIQGEGFKTLEEGQEVSFEIVQGNRGPQA ANVVKL

1g6p MRGKVKWFDSKKGYGFITKDEGGDVFVHWSAIEMEGFKTLKEGQVVEFEIQEGKKGPQAA HVKVVE

1lop MVTFHTNHGDIVIKTFDDKAPETVKNFLDYCREGFYNNTIFHRVINGFMIQGGGFEPGMK QKATKEPIKNEANNGLKNTRGTLAMARTQAPHSATAQFFINVVDNDFLNFSGESLQGWGY CVFAEVVDGMDEVDKIKGVATGRSGMHQDVPKEDVIIESVTVSE

1ifc AFDGTWKVDRNENYEKFMEKMGINVVKRKLGAHDNLKLTITQEGNKFTVKESSNFRNIDV VFELGVDFAYSLADGTELTGTWTMEGNKLVGKFKRVDNGKELIAVREISGNELIQTYTYE GVEAKRIFKKE

1eal AFTGKYEIESEKNYDEFMKRLALPSDAIDKARNLKIISEVKQDGQNFTWSQQYPGGHSIT NTFTIGKECDIETIGGKKFKATVQMEGGKVVVNSPNYHHTAEIVDGKLVEVSTVGGVSYE RVSKKLA

1opa TKDQNGTWEMESNENFEGYMKALDIDFATRKIAVRLTQTKIIVQDGDNFKTKTNSTFRNY DLDFTVGVEFDEHTKGLDGRNVKTLVTWEGNTLVCVQKGEKENRGWKQWVEGDKLYLELT CGDQVCRQVFKKK

1cbi PNFAGTWKMRSSENFDELLKALGVNAMLRKVAVAAASKPHVEIRQDGDQFYIKTSTTVRT TEINFKVGEGFEEETVDGRKCRSLPTWENENKIHCTQTLLEGDGPKTYWTRELANDELIL TFGADDVVCTRIYVRE

1hx5 IKPLEDKILVQATTASGLVIPPQEGTVVAVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGG TEIKYNGEEYLILSARDVLAVV

1pin KLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGGKNGQGEPARVRCSHLLVKHSQSRRP SSWRQEKITRTKEEALELINGYIQKIKSGEEDFESLASQFSDCSSAKARGDLGAFSRGQM QKPFEDASFALRTGEMSGPVFTDSGIHIILRTE

1c8c MATVKFKYKGEEKQVDISKIKKVWRVGKMISFTYDEGGGKTGRGAVSEKDAPKELLQMLA KQKK

1psf AIERGSKVKILRKESYWYGDVGTVASIDKSGIIYPVIVRFNKVNYNGFSGSAGGLNTNNF AEHELEVVG

Mixed class proteins

1APS    
STARPLKSVDYEVFGRVQGVCFRMYAEDEARKIGVVGWVKNTSKGTVTGQVQGPEEKVNS
MKSWLSKVGSPSSRIDRTNFSNEKTISKLEYSNFSVRY

1HDN MFQQEVTITAPNGLHTRPAAQFVKEAKGFTSEITVTSNGKSASAKSLFKLQTLGLTQGTV VTISAEGEDEQKAVEHLVKLMAELE

1URN AVPETRPNHTIYINNLNEKIKKDELKKSLHAIFSRFGQILDILVSRSLKMRGQAFVIFKE VSSATNALRSMQGFPFYDKPMRIQYAKTDSDIIAKM

2HQI ATQTVTLAVPGMTCAACPITVKKALSKVEGVSKVDVGFEKREAVVTFDDTKASVQKLTKA TADAGYPSSVKQ

1PBA HHSGEHFEGEKVFRVNVEDENDISELHELASTRQIDFWKPDSVTQIKPHSTVDFRVKAED ILAVEDFLEQNELQYEVLINN

1UBQ MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN IQKESTLHLVLRLRGG

2PTL ENKEETPETPETDSEEEVTIKANLIFANGSTQTAEFKGTFEKATSEAYAYADTLKKDNGE YTVDVADKGYTLNIKFAG

1FKB GVQVETISPGDGRTFPKRGQTCVVHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWE EGVAQMSVGQRAKLTISPDYAYGATGHPGIIPPHATLVFDVELLKLE

1COA MKTEWPELVGKSVEEAKKVILQDKPEAQIIVLPVGTIVTMEYRIDRVRLFVDKLDNVAEV PRVG

1DIV MKVIFLKDVKGKGKKGEIKNVADGYANNFLFKQGLAIEATPANLKALEAQKQKE

2VIK VELSKKVTGKLDKTTPGIQIWRIENMEMVPVPTKSYGNFYEGDCYVLLSTRKTGSGFSYN IHYWLGKNSSQDEQGAAAIYTTQMDEYLGSVAVQHREVQGHESETFRAYFKQGLIYKQGG VASGMK

1CIS MKTEWPELVGKSVEEAKKVILQDKPEAQIIVLEKQAVDNAYAEYRIDRVRLAVDKLDNIA QVPRVG

1PCA KEDFVGHQVLRISVDDEAQVQKVKELEDLEHLQLDFWRGPARPGFPIDVRVPFPSIQAVK VFLEAHGIRYTIMIEDVQLLLDEEQEQMFASQGR

1hz6 HHAMEEVTIKANLIFANGSTQTAEFKGTFEKATSEAYAYADTLKKDNGEWTVDVADKGYT LNIKFAG

1pgb MTYKLILNGKTLKGETTTEAVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTE

2ci2 NLKTEWPELVGKSVEEAKKVILQDKPEAQIIVLPVGTIVTMEYRIDRVRLFVDKLDNIAE VPRVG

1ay2 NFNFGAYHTLEEISQEMDNLVAEHPGLVSKVNIGSSFENRPMNVLKFSTGGDKPAIWLDA GIHAREWVTQATALW

1ris MRRYEVNIVLNPNLDQSQLALEKEIIQRALENYGARVEKVEELGLRRLAYPIAKDPQGYF LWYQVEMPEDRVNDLARELRIRDNVRRVMVVKSQEPF

1poh MFQQEVTITAPNGLHTRPAAQFVKEAKGFTSEITVTSNGKSASAKSLFKLQTLGLTQGTV VTISAEGEDEQKAVEHLVKLMAELE

2a5e MEPAAGSSMEPSADWLATAAARGRVEEVRALLEAGALPNAPNSYGRRPIQVMMMGSARVA ELLLLHGAEPNCADPATLTRPVHDAAREGFLDTLVVLHRAGARLDVRDAWGRLPVDLAEE LGHRDVARYLRAAAGGTRGSNHARIDAAEGPSDIPD

1aon EGMQFDRGYLSPYFINKPETGAVELESPFILLADKKISNIREMLPVLEAVAKAGKPLLII AEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIATLTGGTVISEEIGMELE KATLEDLGQAKRVVINKDTTTIIDGVGEEAAIQGR

1brs KKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDALTGWVEYPLVLEWRQFEQSK QLTENGAESVLQVFREAKAEGADITIILS

3chy ADKELKFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMP NMDGLELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPFTAATLEEKLN KIFEKLGM

2rn2 MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELMAAIVALEALK EHCEVILSTDSQYVRQGITQWIHNWKKRGWKTADKKPVKNVDLWQRLDAALGQHQIKWEW VKGHAGHPENERCDELARAAAMNPTLEDTGYQVEV

1ra9 MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLDKPVIMGRHTWESIGRPLPGRKNI ILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVE GDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR

1bni VINTFDGVADYLQTYHKLPDNYITKSEAQALGWVASKGNLADVAPGKSIGGDIFSNREGK LPGKSGRTWREADINYTSGFRNSDRILYSSDWLIYKTTDHYQTFTKIR

2lzm MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNCNGVITK DEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRCALINMVFQMGETGVAGFTNSLRM LQQKRWDEAAVNLAKSRWYNQTPNRAKRVITTFRTGTWDAYKNL

1ubq MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYN IQKESTLHLVLRLRGG

1sce VPRLLTASERERLEPFIDQIHYSPRYADDEYEYRHVMLPKAMLKAIPTDYFNPETGTLRI LQEEEWRGLGITQSLGWEMYEVHVPEPHILLFKREKD

1gxt NTSCCGVQLRIRGKVQGVGFRPFVWQLAQQLNLHGDVCNDGDGVEVRLREDPEVFLVQLY QHCPPLARIDSVEREPFIWSQLPTEFTIR

2acy AEGDTLISVDYEIFGKVQGVFFRKYTQAEGKKLGLVGWVQNTDQGTVQGQLQGPASKVRH MQEWLETKGSPKSHIDRASFHNEKVIVKLDYTDFQIVK

1php-N terminal MNKKTIRDVDVRGKRVFCRVDFNVPMEQGAITDDTRIRAALPTIRYLIEHGAKVILASHL GRPKGKVVEELRLDAVAKRLGELLERPVAKTNEAVGDEVKAAVDRLNEGDVLLLENVRFY PGEEKNDPELAKAFAELADLYVNDAFGAAHRAHASTEGIAHYLPAVAGFL

1php-C terminal VLGKALSNPDRPFTAIIGGAKVKDKIGVIDNLLEKVDNLIIGGGLAYTFVKALGHDVGKS LLEEDKIELAKSFMEKAKEKGVRFYMPVDVVVADRFANDANTKVVPIDAIPADWSALDIG PKTRELYRDVIRESKLVVWNGPMGVFEMDAFAHGTKAIAEALAEALDTYSVIGGGDSAAA VEKFGLADKMDHISTGGGASLEFME

1qop-alpha subunit MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVPFSDPLAD GPTIQNANLRAFAAGVTPAQCFEMLAIIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQ VGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADDDLLRQVASYGRGYTYLLSRS GVTGAENRGPLHHLIEKLKEYHAAPALQGFGISSPEQVSAAVRAGAAGAISGSAIVKIIE KNLASPKQMLAELRSFVSAMKAASR

1qop-beta subunit TTLLNPYFGEFGGMYVPQILMPALNQLEEAFVSAQKDPEFQAQFADLLKNYAGRPTALTK CQNITAGTRTTLYLKREDLLHGGAHKTNQVLGQALLAKRMGKSEIIAETGAGQHGVASAL ASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVHSGSATLKDACNEALRDWSGSY ETAHYMLGTAAGPHPYPTIVREFQRMIGEETKAQILDKEGRLPDAVIACVGGGSNAIGMF ADFINDTSVGLIGVEPGGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTADGQIEESYSIS AGLDFPSVGPQHAYLNSIGRADYVSITDDEALEAFKTLCRHEGIIPALESSHALAHALKM MREQPEKEQLLVVNLSGRGDKDIFTVHDIL