**** file header **** TRAINING dataset Format of data: >CyBase/PDB_id Protein name Species annotation(cyclic/non-cyclic) sequence **** file header end **** >00084 Ent AS-48 Enterococcus faecalis cyclic MAKEFGIPAAVAGTVLNVVEAGGWVTTIVSILTAVGSGGLSLLAAAGRESIKAYLKKEIKKKGKRAVIAW >00119 circularin A Clostridium beijerinckii cyclic VAGALGVQTAAATTIVNVILNAGTLVTVLGIIASIASGGAGTLMTIGWATFKATVQKLAKQSMARAIAY >00005 palicourein Palicourea condensata cyclic GDPTFCGETCRVIPVCTYSAALGCTCDDRSDGLCKRN >00346 CD-1 Chassalia discolor cyclic GADGFCGESCYVIPCISYLVGCSCDTIEKVCKRN >00007 tricyclon A Viola tricolor cyclic GGTIFDCGESCFLGTCYTKGCSCGEWKLCYGTN >00348 cycloviolacin Y2 Viola yedoensis cyclic GGTIFDCGESCFLGTCYTAGCSCGNWGLCYGTN >00542 Glopa A Gloeospermum pauciflorum Hekking cyclic GGSIPCIETCVWTGCFLVPGCSCKSDKKCYLN >00032 cyclopsychotride A Psychotria longipes cyclic SIPCGESCVFIPCTVTALLGCSCKSKVCYKN >00045 circulin B Chassalia parvifolia cyclic GVIPCGESCVFIPCISTLLGCSCKNKVCYRN >00141 Hyfl A Hybanthus floribundus E cyclic SISCGESCVYIPCTVTALVGCTCKDKVCYLN >00168 kalata B8 Oldenlandia affinis cyclic GSVLNCGETCLLGTCYTTGCTCNKYRVCTKD >00183 cycloviolacin O23 Viola odorata cyclic GLPTCGETCFGGTCNTPGCTCDSSWPICTHN >00244 kalata B9 Oldenlandia affinis cyclic GSVFNCGETCVLGTCYTPGCTCNTYRVCTKD >00357 vibi F Viola biflora cyclic GTIPCGESCVFIPCLTSALGCSCKSKVCYKN >00361 vibi J Viola biflora cyclic GTFPCGESCVWIPCISKVIGCACKSKVCYKN >00392 mram 4 Melicytus ramiflorus cyclic GSIPCGESCVFIPCISSVVGCSCKNKVCYKN >00436 Viba 7 Viola baoshanensis cyclic GVIPCGESCVFIPCISSVIGCSCKSKVCYRN >00502 Vpl-1 Viola pinetorum cyclic GSQSCGESCVLIPCISGVIGCSCSSMICYFN >00540 Glopa D Gloeospermum pauciflorum Hekking cyclic GVPCGESCVWVPCTVTALMGCSCVREVCRKD >00002 cycloviolacin O1 Viola odorata cyclic GIPCAESCVYIPCTVTALLGCSCSNRVCYN >00021 cycloviolacin O2 Viola odorata cyclic GIPCGESCVWIPCISSAIGCSCKSKVCYRN >00043 Hypa A Hybanthus parviflorus cyclic GIPCAESCVYIPCTITALLGCSCKNKVCYN >00054 cycloviolacin O5 Viola odorata cyclic GTPCGESCVWIPCISSAVGCSCKNKVCYKN >00057 cycloviolacin O10 Viola odorata cyclic GIPCGESCVYIPCLTSAVGCSCKSKVCYRN >00061 varv peptide B Viola arvensis cyclic GLPVCGETCFGGTCNTPGCSCDPWPMCSRN >00070 cycloviolin D Leonia cymosa cyclic GFPCGESCVFIPCISAAIGCSCKNKVCYRN >00114 htf-1 Hedyotis terminalis cyclic GIPCGDSCHYIPCVTSTIGCSCTNGSCMRN >00139 cycloviolacin H3 Viola hederacea cyclic GLPVCGETCFGGTCNTPGCICDPWPVCTRN >00151 Hyfl K Hybanthus floribundus W cyclic GTPCGESCVYIPCFTAVVGCTCKDKVCYLN >00178 cycloviolacin O18 Viola odorata cyclic GIPCGESCVYIPCTVTALAGCKCKSKVCYN >00254 kalata B16 Oldenlandia affinis cyclic GIPCAESCVYIPCTITALLGCKCQDKVCYD >00350 cycloviolacin Y4 Viola yedoensis cyclic GVPCGESCVFIPCITGVIGCSCSSNVCYLN >00356 vibi E Viola biflora cyclic GIPCAESCVWIPCTVTALIGCGCSNKVCYN >00376 Viba 5 Viola baoshanensis cyclic GIPCGESCVWIPCLTATIGCSCKSKVCYRN >00411 mram 12 Melicytus ramiflorus cyclic GSAILCGESCTLGECYTPGCTCSWPICTKN >00439 Viba 8 Viola baoshanensis cyclic GAGCIETCYTFPCISEMINCSCKNSRCQKN >00503 Vpf-1 Viola pinetorum cyclic GIPCGESCVFIPCLTAAIGCSCRSKVCYRN >00510 Vo3006 Viola odorata cyclic GLPTCGETCFGGTCNTPGCTCDPFPVCTHN >00534 Globa A Gloeospermum blakeanum (Standl.) Hekking cyclic GIPCGESCVFIPCITAAIGCSCKTKVCYRN >00538 Globa E Gloeospermum blakeanum (Standl.) Hekking cyclic GSAFGCGETCVKGKCNTPGCVCSWPVCKKN >00004 kalata B2 Oldenlandia affinis cyclic GLPVCGETCFGGTCNTPGCSCTWPICTRD >00031 vodo M Viola odorata cyclic GAPICGESCFTGKCYTVQCSCSWPVCTRN >00062 varv peptide C Viola arvensis cyclic GVPICGETCVGGTCNTPGCSCSWPVCTRN >00072 violapeptide 1 Viola arvensis cyclic GLPVCGETCVGGTCNTPGCSCSRPVCTXN >00145 Hyfl E Hybanthus floribundus E cyclic GEIPCGESCVYLPCFLPNCYCRNHVCYLN >00175 cycloviolacin O15 Viola odorata cyclic GLVPCGETCFTGKCYTPGCSCSYPICKKN >00181 cycloviolacin O21 Viola odorata cyclic GLPVCGETCVTGSCYTPGCTCSWPVCTRN >00352 vibi A Viola biflora cyclic GLPVCGETCFGGTCNTPGCSCSYPICTRN >00377 Viba 10 Viola baoshanensis cyclic GIPCAESCVYLPCVTIVIGCSCKDKVCYN >00413 mram 13 Melicytus ramiflorus cyclic GHPICGETCVGNKCYTPGCTCTWPVCYRN >00512 cO34 Viola odorata cyclic GLPVCGETCVGGTCNTEYCTCSWPVCTRD >00068 cycloviolin B Leonia cymosa cyclic GTACGESCYVLPCFTVGCTCTSSQCFKN >00250 kalata B12 Oldenlandia affinis cyclic GSLCGDTCFVLGCNDSSCSCNYPICVKD >00089 RTD-2 Macaca mulatta (rhesus monkey) cyclic GVCRCLCRRGVCRCLCRR >00081 SFTI-1 Helianthus annus cyclic GRCTKSIPPICFPD >1epw_A Neurotoxin B. Clostridium botulinum non-cyclic PVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYTFGYKPEDFNKSSGIFNRDVCEYYDPDYLNTNDKKNIFLQTMIKLFNRIKSKPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERKKGIFANLIIFGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVSVFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLYGIKVDDLPIVPNEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVDRLNKVLVCISDPNININIYKNKFKDKYKFVEDSEGKYSIDVESFDKLYKSLMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNISDKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCKSVKAPGICIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSNYIENDFPINELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQYLYSQTFPLDIRDISLTSSFDDALLFSNKVYSFFSMDYIKTANKVVEAGLFAGWVKQIVNDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETAKGNFENAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKRNEKWSDMYGLIVAQWLSTVNTQFYTIKEGMYKALNYQAQALEEIIKYRYNIYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIPLAVEKLLDFDNTLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFDLSIYTNDTILIEMFNKYNSEILNNIILNLRYKDNNLIDLSGYGAKVEVYDGVELNDKNQFKLTSSANSKIRVTQNQNIIFNSVFLDFSVSFWIRIPKYKNDGIQNYIHNEYTIINCMKNNSGWKISIRGNRIIWTLIDINGKTKSVFFEYNIREDISEYINRWFFVTITNNLNNAKIYINGKLESN >2gag_A heterotetrameric sarcosine oxidase alpha-subunit Stenotrophomonas maltophilia non-cyclic MSKPQRLSAEQSSRARINREEALSLTVDGAKLSAFRGDTVASALLANGVRRAGNSLYLDRPRGIFAAGVEEPNALVTVSARHEQDIDESMLPATTVPVTEDLNATLLSGLGVLDPTKDPAYYDHVHVHTDVLVVGAGPAGLAAAREASRSGARVMLLDERAEAGGTLLDTAGEQIDGMDSSAWIEQVTSELAEAEETTHLQRTTVFGSYDANYLIAAQRRTVHLDGPSGPGVSRERIWHIRAKQVVLATGAHERPIVFENNDRPGIMLAGAVRSYLNRYGVRAGARIAVATTNDSAYELVRELAATGGVVAVIDARSSISAAAAQAVADGVQVISGSVVVDTEADENGELSAIVVAELDEARELGGTQRFEADVLAVAGGFNPVVHLHSQRQGKLDWDTTIHAFVPADAVANQHLAGAMTGRLDTASALSTGAATGAAAATAAGFATVARTPQALETALGETRPVWLVPSVSGDDAVNYKFHFVDLQRDQTVADVLRATGAGMKSVEHIKRYTSISTANDQGKTSGVAAIGVIAAVLGIENPAAIGTTTFRAPYTPVAFAALAGRNRGDQLDPARITAMHSWHLSHGAEFEDVGQWKRPWYYPQAGETMDQAVYRESKAVRDSVGMLDATTLGKIEIRGKDAAEFLNRIYTNGYTKLKVGMGRYGVMCKADGMIFDDGVTLRLAEDRFLLHTTTGGAADVLDWLEEWLQTEWPDLDVTCTSVTEQLATVAVVGPRSRDVIAKLASTVDVSNEGFKFMAFKDVVLDSGIEARISRISFSGELAFEIAVPAWHGLRVWEDVYAAGEEFNITPYGTETMHVLRAEKGFIIVGQDTDGTVTPQDAGMEWVVSKLKDFIGNRSYSRADNAREDRKQLVSVLPVDKSLRLPEGAALVASDALASEGITPMEGWVTSSYDSPNLGRTFGLALIKNGRNRIGEVLKTPVGDQLVDVVVSETVLYDPEGSRRDG >1ti2_A Pyrogallol hydroxytransferase large subunit Pelobacter acidigallici non-cyclic MGEVVRLTNSSTGGPVFVYVKDGKIIRMTPMDFDDAVDAPSWKIEARGKTFTPPRKTSIAPYTAGFKSMIYSDLRIPYPMKRKSFDPNGERNPQLRGAGLSKQDPWSDYERISWDEATDIVVAEINRIKHAYGPSAILSTPSSHHMWGNVGYRHSTYFRFMNMMGFTYADHNPDSWEGWHWGGMHMWGFSWRLGNPEQYDLLEDGLKHAEMIVFWSSDPETNSGIYAGFESNIRRQWLKDLGVDFVFIDPHMNHTARLVADKWFSPKIGTDHALSFAIAYTWLKEDSYDKEYVAANAHGFEEWADYVLGKTDGTPKTCEWAEEESGVPACEIRALARQWAKKNTYLAAGGLGGWGGACRASHGIEWARGMIALATMQGMGKPGSNMWSTTQGVPLDYEFYFPGYAEGGISGDCENSAAGFKFAWRMFDGKTTFPSPSNLNTSAGQHIPRLKIPECIMGGKFQWSGKGFAGGDISHQLHQYEYPAPGYSKIKMFWKYGGPHLGTMTATNRYAKMYTHDSLEFVVSQSIWFEGEVPFADIILPACTNFERWDISEFANCSGYIPDNYQLCNHRVISLQAKCIEPVGESMSDYEIYRLFAKKLNIEEMFSEGKDELAWCEQYFNATDMPKYMTWDEFFKKGYFVVPDNPNRKKTVALRWFAEGREKDTPDWGPRLNNQVCRKGLQTTTGKVEFIATSLKNFEEQGYIDEHRPSMHTYVPAWESQKHSPLAVKYPLGMLSPHPRFSMHTMGDGKNSYMNYIKDHRVEVDGYKYWIMRVNSIDAEARGIKNGDLIRAYNDRGSVILAAQVTECLQPGTVHSYESCAVYDPLGTAGKSADRGGCINILTPDRYISKYACGMANNTALVEIEKWDGDKYEIY >2f2h_A Putative family 31 glucosidase yicI Escherichia coli non-cyclic MKISDGNWLIQPGLNLIHPLQVFEVEQQDNEMVVYAAPRDVRERTWQLDTPLFTLRFFSPQEGIVGVRIEHFQGALNNGPHYPLNILQDVKVTIENTERYAEFKSGNLSARVSKGEFWSLDFLRNGERITGSQVKNNGYVQDTNNQRNYMFERLDLGVGETVYGLGERFTALVRNGQTVETWNRDGGTSTEQAYKNIPFYMTNRGYGVLVNHPQCVSFEVGSEKVSKVQFSVESEYLEYFVIDGPTPKAVLDRYTRFTGRPALPPAWSFGLWLTTSFTTNYDEATVNSFIDGMAERNLPLHVFHFDCFWMKAFQWCDFEWDPLTFPDPEGMIRRLKAKGLKICVWINPYIGQKSPVFKELQEKGYLLKRPDGSLWQWDKWQPGLAIYDFTNPDACKWYADKLKGLVAMGVDCFKTDFGERIPTDVQWFDGSDPQKMHNHYAYIYNELVWNVLKDTVGEEEAVLFARSASVGAQKFPVHWGGDCYANYESMAESLRGGLSIGLSGFGFWSHDIGGFENTAPAHVYKRWCAFGLLSSHSRLHGSKSYRVPWAYDDESCDVVRFFTQLKCRMMPYLYREAARANARGTPMMRAMMMEFPDDPACDYLDRQYMLGDNVMVAPVFTEAGDVQFYLPEGRWTHLWHNDELDGSRWHKQQHGFLSLPVYVRDNTLLALGNNDQRPDYVWHEGTAFHLFNLQDGHEAVCEVPAADGSVIFTLKAARTGNTITVTGAGEAKNWTLCLRNVVKVNGLQDGSQAESEQGLVVKPQGNALTITLH >1bf2_A ISOAMYLASE Pseudomonas amyloderamosa non-cyclic AINSMSLGASYDAQQANITFRVYSSQATRIVLYLYSAGYGVQESATYTLSPAGSGVWAVTVPVSSIKAAGITGAVYYGYRAWGPNWPYASNWGKGSQAGFVSDVDANGDRFNPNKLLLDPYAQEVSQDPLNPSNQNGNVFASGASYRTTDSGIYAPKGVVLVPSTQSTGTKPTRAQKDDVIYEVHVRGFTEQDTSIPAQYRGTYYGAGLKASYLASLGVTAVEFLPVQETQNDANDVVPNSDANQNYWGYMTENYFSPDRRYAYNKAAGGPTAEFQAMVQAFHNAGIKVYMDVVYNHTAEGGTWTSSDPTTATIYSWRGLDNATYYELTSGNQYFYDNTGIGANFNTYNTVAQNLIVDSLAYWANTMGVDGFRFDLASVLGNSCLNGAYTASAPNCPNGGYNFDAADSNVAINRILREFTVRPAAGGSGLDLFAEPWAIGGNSYQLGGFPQGWSEWNGLFRDSLRQAQNELGSMTIYVTQDANDFSGSSNLFQSSGRSPWNSINFIDVHDGMTLKDVYSCNGANNSQAWPYGPSDGGTSTNYSWDQGMSAGTGAAVDQRRAARTGMAFEMLSAGTPLMQGGDEYLRTLQCNNNAYNLDSSANWLTYSWTTDQSNFYTFAQRLIAFRKAHPALRPSSWYSGSQLTWYQPSGAVADSNYWNNTSNYAIAYAINGPSLGDSNSIYVAYNGWSSSVTFTLPAPPSGTQWYRVTDTCDWNDGASTFVAPGSETLIGGAGTTYGQCGQSLLLLISK >1oao_C CARBON MONOXIDE DEHYDROGENASE/ACETYL-COA SYNT Moorella thermoacetica non-cyclic MTDFDKIFEGAIPEGKEPVALFREVYHGAITATSYAEILLNQAIRTYGPDHPVGYPDTAYYLPVIRCFSGEEVKKLGDLPPILNRKRAQVSPVLNFENARLAGEATWYAAEIIEALRYLKYKPDEPLLPPPWTGFIGDPVVRRFGIKMVDWTIPGEAIILGRAKDSKALAKIVKELMGMGFMLFICDEAVEQLLEENVKLGIDYIAYPLGNFTQIVHAANYALRAGMMFGGVTPGAREEQRDYQRRRIRAFVLYLGEHDMVKTAAAFGAIFTGFPVITDQPLPEDKQIPDWFFSVEDYDKIVQIAMETRGIKLTKIKLDLPINFGPAFEGESIRKGDMYVEMGGNRTPAFELVRTVSESEITDGKIEVIGPDIDQIPEGSKLPLGILVDIYGRKMQADFEGVLERRIHDFINYGEGLWHTGQRNINWLRVSKDAVAKGFRFKNYGEILVAKMKEEFPAIVDRVQVTIFTDEAKVKEYMEVAREKYKERDDRMRGLTDETVDTFYSCVLCQSFAPNHVCIVTPERVGLCGAVSWLDAKASYEINHAGPNQPIPKEGEIDPIKGIWKSVNDYLYTASNRNLEQVCLYTLMENPMTSCGCFEAIMAILPECNGIMITTRDHAGMTPSGMTFSTLAGMIGGGTQTPGFMGIGRTYIVSKKFISADGGIARIVWMPKSLKDFLHDEFVRRSVEEGLGEDFIDKIADETIGTTVDEILPYLEEKGHPALTMDPIM >1y79_1 Peptidyl-Dipeptidase Dcp Escherichia coli non-cyclic TTMNPFLVQSTLPYLAPHFDQIANHHYRPAFDEGMQQKRAEIAAIALNPQMPDFNNTILALEQSGELLTRVTSVFFAMTAAHTNDELQRLDEQFSAELAELANDIYLNGELFARVDAVWQRRESLGLDSESIRLVEVIHQRFVLAGAKLAQADKAKLKVLNTEAATLTSQFNQRLLAANKSGGLVVNDIAQLAGMSEQEIALAAEAAREKGLDNKWLIPLLNTTQQPALAEMRDRATREKLFIAGWTRAEKNDANDTRAIIQRLVEIRAQQATLLGFPHYAAWKIADQMAKTPEAALNFMREIVPAARQRASDELASIQAVIDKQQGGFSAQPWDWAFYAEQVRREKFDLDEAQLKPYFELNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVWEIFDHNGVGLALFYGDFFARDSKSGGAWMGNFVEQSTLNKTHPVIYNVCNYQKPAAGEPALLLWDDVITLFHEFGHTLHGLFARQRYATLSGTNTPRDFVEFPSQINEHWATHPQVFARYARHYQSGAAMPDELQQKMRNASLFNKGYEMSELLSAALLDMRWHCLEENEAMQDVDDFELRALVAENMDLPAIPPRYRSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGGLTRENGLRFREAILSRGNSEDLERLYRQWRGKAPKIMPMLQHRGLNI >1gof_A GALACTOSE OXIDASE Hypomyces rosellus non-cyclic ASAPIGSAISRNNWAVTCDSAQSGNECNKAIDGNKDTFWHTFYGANGDPKPPHTYTIDMKTTQNVNGLSMLPRQDGNQNGWIGRHEVYLSSDGTNWGSPVASGSWFADSTTKYSNFETRPARYVRLVAITEANGQPWTSIAEINVFQASSYTAPQPGLGRWGPTIDLPIVPAAAAIEPTSGRVLMWSSYRNDAFGGSPGGITLTSSWDPSTGIVSDRTVTVTKHDMFCPGISMDGNGQIVVTGGNDAKKTSLYDSSSDSWIPGPDMQVARGYQSSATMSDGRVFTIGGSWSGGVFEKNGEVYSPSSKTWTSLPNAKVNPMLTADKQGLYRSDNHAWLFGWKKGSVFQAGPSTAMNWYYTSGSGDVKSAGKRQSNRGVAPDAMCGNAVMYDAVKGKILTFGGSPDYQDSDATTNAHIITLGEPGTSPNTVFASNGLYFARTFHTSVVLPDGSTFITGGQRRGIPFEDSTPVFTPEIYVPEQDTFYKQNPNSIVRVYHSISLLLPDGRVFNGGGGLCGDCTTNHFDAQIFTPNYLYNSNGNLATRPKITRTSTQSVKVGGRITISTDSSISKASLIRYGTATHTVNTDQRRIPLTLTNNGGNSYSFQVPSDSGVALPGYWMLFVMNSAGVPSVASTIRVTQ >1b25_A PROTEIN (FORMALDEHYDE FERREDOXIN OXIDOREDUCTA Pyrococcus furiosus non-cyclic MYGWWGRILRVNLTTGEVKVQEYPEEVAKKFIGGRGLAAWILWNEARGVEPLSPENKLIFAAGPFNGLPTPSGGKLVVAAKSPLTGGYGDGNLGTMASVHLRRAGYDALVVEGKAKKPVYIYIEDDNVSILSAEGLWGKTTFETERELKEIHGKNVGVLTIGPAGENLVKYAVVISQEGRAAGRPGMGAVMGSKKLKAVVIRGTKEIPVADKEELKKLSQEAYNEILNSPGYPFWKRQGTMAAVEWCNTNYALPTRNFSDGYFEFARSIDGYTMEGMKVQQRGCPYCNMPCGNVVLDAEGQESELDYENVALLGSNLGIGKLNEVSVLNRIADEMGMDTISLGVSIAHVMEAVERGILKEGPTFGDFKGAKQLALDIAYRKGELGNLAAEGVKAMAEKLGTHDFAMHVKGLEVSGYNCYIYPAMALAYGTSAIGAHHKEAWVIAWEIGTAPIEGEKAEKVEYKISYDPIKAQKVVELQRLRGGLFEMLTACRLPWVEVGLSLDYYPKLLKAITGVTYTWDDLYKAADRVYSLIRAYWVREFNGKWDRKMDYPPKRWFTEGLKSGPHKGEHLDEKKYDELLSEYYRIRGWDERGIPKKETLKELDLDFVIPELEKVTNLE >1lq2_A Beta-D-glucan glucohydrolase isoenzyme Exo1 Hordeum vulgare non-cyclic DYVLYKDATKPVEDRVADLLGRMTLAEKIGQMTQIERLVATPDVLRDNFIGSLLSGGGSVPRKGATAKEWQDMVDGFQKACMSTRLGIPMIYGIDAVHGQNNVYGATIFPHNVGLGATRDPYLVKRIGEATALEVRATGIQYAFAPCIAVCRDPRWGRCYESYSEDRRIVQSMTELIPGLQGDVPKDFTSGMPFVAGKNKVAACAKHFVGDGGTVDGINENNTIINREGLMNIHMPAYKNAMDKGVSTVMISYSSWNGVKMHANQDLVTGYLKDTLKFKGFVISDWEGIDRITTPAGSDYSYSVKASILAGLDMIMVPNKYQQFISILTGHVNGGVIPMSRIDDAVTRILRVKFTMGLFENPYADPAMAEQLGKQEHRDLAREAARKSLVLLKNGKTSTDAPLLPLPKKAPKILVAGSHADNLGYQCGGWTIEWQGDTGRTTVGTTILEAVKAAVDPSTVVVFAENPDAEFVKSGGFSYAIVAVGEHPYTETKGDNLNLTIPEPGLSTVQAVCGGVRCATVLISGRPVVVQPLLAASDALVAAWLPGSEGQGVTDALFGDFGFTGRLPRTWFKSVDQLPMNVGDAHYDPLFRLGYGLTTNAT >2ips_A Lactoperoxidase Bos taurus non-cyclic SWEVGCGAPVPLVKCDENSPYRTITGDCNNRRSPALGAANRALARWLPAEYEDGLALPFGWTQRKTRNGFRVPLAREVSNKIVGYLDEEGVLDQNRSLLFMQWGQIVDHDLDFAPETELGSNEHSKTQCEEYCIQGDNCFPIMFPKNDPKLKTQGKCMPFFRAGFVCPTPPYQSLAREQINAVTSFLDASLVYGSEPSLASRLRNLSSPLGLMAVNQEAWDHGLAYLPFNNKKPSPCEFINTTARVPCFLAGDFRASEQILLATAHTLLLREHNRLARELKKLNPHWNGEKLYQEARKILGAFIQIITFRDYLPIVLGSEMQKWIPPYQGYNNSVDPRISNVFTFAFRFGHMEVPSTVSRLDENYQPWGPEAELPLHTLFFNTWRIIKDGGIDPLVRGLLAKKSKLMNQDKMVTSELRNKLFQPTHKIHGFDLAAINLQRCRDHGMPGYNSWRGFCGLSQPKTLKGLQTVLKNKILAKKLMDLYKTPDNIDIWIGGNAEPMVERGRVGPLLACLLGRQFQQIRDGDRFWWENPGVFTEKQRDSLQKVSFSRLICDNTHITKVPLHAFQANNYPHDFVDCSTVDKLDLSPWASREN >1c4a_A PROTEIN (FE-ONLY HYDROGENASE) Clostridium pasteurianum non-cyclic MKTIIINGVQFNTDEDTTILKFARDNNIDISALCFLNNCNNDINKCEICTVEVEGTGLVTACDTLIEDGMIINTNSDAVNEKIKSRISQLLDIHEFKCGPCNRRENCEFLKLVIKYKARASKPFLPKDKTEYVDERSKSLTVDRTKCLLCGRCVNACGKNTETYAMKFLNKNGKTIIGAEDEKCFDDTNCLLCGQCIIACPVAALSEKSHMDRVKNALNAPEKHVIVAMAPSVRASIGELFNMGFGVDVTGKIYTALRQLGFDKIFDINFGADMTIMEEATELVQRIENNGPFPMFTSCCPGWVRQAENYYPELLNNLSSAKSPQQIFGTASKTYYPSISGLDPKNVFTVTVMPCTSKKFEADRPQMEKDGLRDIDAVITTRELAKMIKDAKIPFAKLEDSEADPAMGEYSGAGAIFGATGGVMEAALRSAKDFAENAELEDIEYKQVRGLNGIKEAEVEINNNKYNVAVINGASNLFKFMKSGMINEKQYHFIEVMACHGGCVNGGGQPHVNPKDLEKVDIKKVRASVLYNQDEHLSKRKSHENTALVKMYQNYFGKPGEGRAHEILHFKYKK >1ie7_C UREASE ALPHA SUBUNIT Sporosarcina pasteurii non-cyclic MKINRQQYAESYGPTVGDEVRLADTDLWIEVEKDYTTYGDEVNFGGGKVLREGMGENGTYTRTENVLDLLLTNALILDYTGIYKADIGVKDGYIVGIGKGGNPDIMDGVTPNMIVGTATEVIAAEGKIVTAGGIDTHVHFINPDQVDVALANGITTLFGGGTGPAEGSKATTVTPGPWNIEKMLKSTEGLPINVGILGKGHGSSIAPIMEQIDAGAAGLKIHEDWGATPASIDRSLTVADEADVQVAIHSDTLNEAGFLEDTLRAINGRVIHSFHVEGAGGGHAPDIMAMAGHPNVLPSSTNPTRPFTVNTIDEHLDMLMVCHHLKNNIPEDVAFADSRIRPETIAAEDILHDLGIISMMSTDALAMGRAGEMVLRTWQTADKMKKQRGPLAEEKNGSDNFRLKRYVSKYTINPAIAQGIAHEVGSIEEGKFADLVLWEPKFFGVKADRVIKGGIIAYAQIGDPSASIPTPQPVMGRRMYGTVGDLIHDTNITFMSKSSIQQGVPAKLGLKRRIGTVKNCRNIGKKDMKWNDVTTDIDINPETYEVKVDGEVLTCEPVKELPMAQRYFLF >2fvc_A polyprotein Hepatitis c virus subtype 1b non-cyclic SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSKFGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQRVEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEARQAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKASAACRAAKLQDCTMLVNGDDLVVICESAGVQEDAASLRVFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETARHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVWRHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQLDLSGWFVAGYSGGDIYHS >2cvp_A Glucose-6-phosphate isomerase Mus musculus non-cyclic MAALTRNPQFQKLLEWHRANSANLKLRELFEADPERFNNFSLNLNTNHGHILVDYSKNLVSKEVMQMLVELAKSRGVEAARDNMFSGSKINYTEDRAVLHVALRNRSNTPIKVDGKDVMPEVNRVLDKMKSFCQRVRSGDWKGYTGKSITDIINIGIGGSDLGPLMVTEALKPYSKGGPRVWFVSNIDGTHIAKTLASLSPETSLFIIASKTFTTQETITNAETAKEWFLEAAKDPSAVAKHFVALSTNTAKVKEFGIDPQNMFEFWDWVGGRYSLWSAIGLSIALHVGFDHFEQLLSGAHWMDQHFLKTPLEKNAPVLLALLGIWYINCYGCETHALLPYDQYMHRFAAYFQQGDMESNGKYITKSGARVDHQTGPIVWGEPGTNGQHAFYQLIHQGTKMIPCDFLIPVQTQHPIRKGLHHKILLANFLAQTEALMKGKLPEEARKELQAAGKSPEDLEKLLPHKVFEGNRPTNSIVFTKLTPFILGALIAMYEHKIFVQGIMWDINSFDQWGVELGKQLAKKIEPELEGSSAVTSHDSSTNGLISFIKQQRDTKL >3crv_A XPD/Rad3 related DNA helicase Sulfolobus acidocaldarius non-cyclic MVKLRDWQEKLKDKVIEGLRNNFLVALNAPTGSGKTLFSLLVSLEVKPKVLFVVRTHNEFYPIYRDLTKIREKRNITFSFLVGKPSSCLYAEKGAESEDIPCKYCELKGSIVEVKTDDSPLSLVKKLKKDGLQDKFCPYYSLLNSLYKADVIALTYPYFFIDRYREFIDIDLREYMIVIDEAHNLDKVNELEERSLSEITIQMAIKQSKSEESRRILSKLLNQLREVVLPDEKYIKVENVPKLSKEELEILADDYEDIRKDSLKQGKVNKIHIGSILRFFSLLSIGSFIPFSYSKRLVIKNPEISYYLNLLNDNELSIILMSGTLPPREYMEKVWGIKRNMLYLDVEREIQKRVSGSYECYIGVDVTSKYDMRSDNMWKRYADYLLKIYFQAKANVLVVFPSYEIMDRVMSRISLPKYVESEDSSVEDLYSAISANNKVLIGSVGKGKLAEGIELRNNDRSLISDVVIVGIPYPPPDDYLKILAQRVSLKMNRENEEFLFKIPALVTIKQAIGRAIRDVNDKCNVWLLDKRFESLYWKKNLKCLNANKMKL >2dka_A Phosphoacetylglucosamine mutase Candida albicans non-cyclic MSIEQTLSQYLPSHPKPQGVTFTYGTAGFRMKADKLDYVTFTVGIIASLRSKYLQGKTVGVMITASHNPPEDNGVKVVDPLGSMLESSWEKYATDLANASPSPSNDSEGEKNSLVEVIKNLVSDLKIDLSIPANVVIARDSRESSPALSMATIDGFQSVPNTKYQDFGLFTTPELHYVTRTLNDPDFGKPTEDGYYSKLAKSFQEIYTICESNNEKIDITIDAANGVGAPKIQELLEKYLHKEISFTVVNGDYKQPNLLNFDCGADYVKTNQKLPKNVKPVNNKLYASFDGDADRLICYYQNNDNKFKLLDGDKLSTLFALFLQQLFKQIDPTKISLNIGVVQTAYANGSSTKYVEDVLKIPVRCTPTGVKHLHHEAENFDIGVYFEANGHGTVIFNPEAEKKIFDYKPNNDNEAKAIKVLQNFSQLINQTVGDAISDLLAVLIVVHYLKLSPSDWDNEYTDLPNKLVKVIVPDRSIFKTTNAERTLVEPKGMQDEIDKLVAQYPNGRSFVRASGTEDAVRVYAEADTQNNVEELSKAVSELVK >1hyu_A ALKYL HYDROPEROXIDE REDUCTASE SUBUNIT F Salmonella typhimurium non-cyclic MLDTNMKTQLRAYLEKLTKPVELIATLDDSAKSAEIKELLAEIAELSDKVTFKEDNTLPVRKPSFLITNPGSQQGPRFAGSPLGHEFTSLVLALLWTGGHPSKEAQSLLEQIRDIDGDFEFETYYSLSCHNCPDVVQALNLMAVLNPRIKHTAIDGGTFQNEITERNVMGVPAVFVNGKEFGQGRMTLTEIVAKVDTGAEKRAAEALNKRDAYDVLIVGSGPAGAAAAVYSARKGIRTGLMGERFGGQVLDTVDIENYISVPKTEGQKLAGALKAHVSDYDVDVIDSQSASKLVPAATEGGLHQIETASGAVLKARSIIIATGAKWRNMNVPGEDQYRTKGVTYCPHCDGPLFKGKRVAVIGGGNSGVEAAIDLAGIVEHVTLLEFAPEMKADQVLQDKVRSLKNVDIILNAQTTEVKGDGSKVVGLEYRDRVSGDIHSVALAGIFVQIGLLPNTHWLEGALERNRMGEIIIDAKCETSVKGVFAAGDCTTVPYKQIIIATGEGAKASLSAFDYLIRTKIA >1nkg_A Rhamnogalacturonase B Aspergillus aculeatus non-cyclic AFGITTSSSAYVIDTNAPNQLKFTVSRSSCDITSIIHYGTELQYSSQGSHIGSGLGSATVTATQSGDYIKVTCVTDTLTQYMVVHNGDPIIHMATYITAEPSIGELRFIARLNSDLLPNEEPFGDVSTTADGTAIEGSDVFLVGSETRSKFYSSERFIDDQRHCIAGDAHRVCMILNQYESSSGGPFHRDINSNNGGSYNALYWYMNSGHVQTESYRMGLHGPYSMYFSRSGTPSTSIDTSFFADLDIKGYVAASGRGKVAGTASGADSSMDWVVHWYNDAAQYWTYTSSSGSFTSPAMKPGTYTMVYYQGEYAVATSSVTVSAGSTTTKNISGSVKTGTTIFKIGEWDGQPTGFRNAANQLRMHPSDSRMSSWGPLTYTVGSSALTDFPMAVFKSVNNPVTIKFTATSAQTGAATLRIGTTLSFAGGRPQATINSYTGSAPAAPTNLDSRGVTRGAYRGLGEVYDVSIPSGTIVAGTNTITINVISGSSGDTYLSPNFIFDCVELFQ >1ayx_A GLUCOAMYLASE Saccharomycopsis fibuligera non-cyclic AYPSFEAYSNYKVDRTDLETFLDKQKEVSLYYLLQNIAYPEGQFNNGVPGTVIASPSTSNPDYYYQWTRDSAITFLTVLSELEDNNFNTTLAKAVEYYINTSYNLQRTSNPSGSFDDENHKGLGEPKFNTDGSAYTGAWGRPQNDGPALRAYAISRYLNDVNSLNEGKLVLTDSGDINFSSTEDIYKNIIKPDLEYVIGYWDSTGFDLWEENQGRHFFTSLVQQKALAYAVDIAKSFDDGDFANTLSSTASTLESYLSGSDGGFVNTDVNHIVENPDLLQQNSRQGLDSATYIGPLLTHDIGESSSTPFDVDNEYVLQSYYLLLEDNKDRYSVNSAYSAGAAIGRYPEDVYNGDGSSEGNPWFLATAYAAQVPYKLAYDAKSASNDITINKINYDFFNKYIVDLSTINSAYQSSDSVTIKSGSDEFNTVADNLVTFGDSFLQVILDHINDDGSLNEQLNRYTGYSTGAYSLTWSSGALLEAIRLRNKVKALA >1u8v_A Gamma-aminobutyrate metabolism dehydratase/is Clostridium aminobutyricum non-cyclic MLMTAEQYIESLRKLNTRVYMFGEKIENWVDHPMIRPSINCVRMTYELAQDPQYADLMTTKSNLIGKTINRFANLHQSTDDLRKKVKMQRLLGQKTASCFQRCVGMDAFNAVFSTTYEIDQKYGTNYHKNFTEYLKYIQENDLIVDGAMTDPKGDRGLAPSAQKDPDLFLRIVEKREDGIVVRGAKAHQTGSINSHEHIIMPTIAMTEADKDYAVSFACPSDADGLFMIYGRQSCDTRKMEEGADIDLGNKQFGGQEALVVFDNVFIPNDRIFLCQEYDFAGMMVERFAGYHRQSYGGCKVGVGDVVIGAAALAADYNGAQKASHVKDKLIEMTHLNETLYCCGIACSAEGYPTAAGNYQIDLLLANVCKQNITRFPYEIVRLAEDIAGGLMVTMPSEADFKSETVVGRDGETIGDFCNKFFAAAPTCTTEERMRVLRFLENICLGASAVGYRTESMHGAGSPQAQRIMIARQGNINAKKELAKAIAGIK >3c7e_A Endo-1,4-beta-xylanase Bacillus subtilis non-cyclic ATSTTIAKHIGNSNPLIDHHLGADPVALTYNGRVYIYMSSDDYEYNSNGTIKDNSFANLNRVFVISSADMVNWTDHGAIPVAGANGANGGRGIAKWAGASWAPSIAVKKINGKDKFFLYFANSGGGIGVLTADSPIGPWTDPIGKPLVTPSTPGMSGVVWLFDPAVFVDDDGTGYLYAGGGVPGVSNPTQGQWANPKTARVIKLGPDMTSVVGSASTIDAPFMFEDSGLHKYNGTYYYSYCINFGGTHPADKPPGEIGYMTSSSPMGPFTYRGHFLKNPGAFFGGGGNNHHAVFNFKNEWYVVYHAQTVSSALFGAGKGYRSPHINKLVHNADGSIQEVAANYAGVTQISNLNPYNRVEAETFAWNGRILTEKSTAPGGPVNNQHVTSIQNGDWIAVGNADFGAGGARSFKANVASTLGGKIEVRLDSADGKLVGTLNVPSTGGAQTWREIETAVSGATGVHKVFFVFTGTGTGNLFNFDYWQFTQR >2bra_A NEDD9 INTERACTING PROTEIN WITH CALPONIN HOMOL Mus musculus non-cyclic MASPASTNPAHDHFETFVQAQLCQDVLSSFQGLCRALGVESGGGLSQYHKIKAQLNYWSAKSLWAKLDKRASQPVYQQGQACTNTKCLVVGAGPCGLRAAVELALLGARVVLVEKRIKFSRHNVLHLWPFTIHDLRALGAAAFYGRFCTGTLDHISIRQLQLLLLKVALLLGVEIHWGVKFTGLQPPPRKGSGWRAQLQPNPPAQLASYEFDVLISAAGGKFVPEGFTIREMRGKLAIGITANFVNGRTVEETQVPEISGVARIYNQKFFQSLLKATGIDLENIVYYKDETHYFVMTAKKQCLLRLGVLRQDLSETDQLLGKANVVPEALQRFARAAADFATHGKLGKLEFAQDARGRPDVAAFDFTSMMRAESSARVQEKHGARLLLGLVGDCLVEPFWPLGTGVARGFLAAFDAAWMVKRWAEGAGPLEVLAERESLYQLLSQTSPENMHRNVAQYGLDPATRYPNLNLRAVTPNQVQDLYD >1k7h_A ALKALINE PHOSPHATASE Pandalus borealis non-cyclic EEDKAYWNKDAQDALDKQLGIKLREKQAKNVIFFLGDGMSLSTVTAARIYKGGLTGKFEREKISWEEFDFAALSKTYNTDKQVTDSAASATAYLTGVKTNQGVIGLDANTVRTNCSYQLDESLFTYSIAHWFQEAGRSTGVVTSTRVTHATPAGTYAHVADRDWENDSDVVHDREDPEICDDIAEQLVFREPGKNFKVIMGGGRRGFFPEEALDIEDGIPGEREDGKHLITDWLDDKASQGATASYVWNRDDLLAVDIANTDYLMGLFSYTHLDTVLTRDAEMDPTLPEMTKVAIEMLTKDENGFFLLVEGGRIDHMHHANQIRQSLAETLDMEEAVSMALSMTDPEETIILVTADHGHTLTITGYADRNTDILDFAGISDLDDRRYTILDYGSGPGYHITEDGKRYEPTEEDLKDINFRYASAAPKHSATHDGTDVGIWVNGPFAHLFTGVYEENYIPHALAYAACVGTGRTFCD >1gqq_A UDP-N-ACETYLMURAMATE-L-ALANINE LIGASE Haemophilus influenzae non-cyclic MKHSHEEIRKIIPEMRRVQQIHFIGIGGAGMSGIAEILLNEGYQISGSDIADGVVTQRLAQAGAKIYIGHAEEHIEGASVVVVSSAIKDDNPELVTSKQKRIPVIQRAQMLAEIMRFRHGIAVAGTHGKTTTTAMISMIYTQAKLDPTFVNGGLVKSAGKNAHLGASRYLIAEADESDASFLHLQPMVSVVTNMEPDHMDTYEGDFEKMKATYVKFLHNLPFYGLAVMCADDPVLMELVPKVGRQVITYGFSEQADYRIEDYEQTGFQGHYTVICPNNERINVLLNVPGKHNALNATAALAVAKEEGIANEAILEALADFQGAGRRFDQLGEFIRPNGKVRLVDDYGHHPTEVGVTIKAAREGWGDKRIVMIFQPHRYSRTRDLFDDFVQVLSQVDALIMLDVYAAGEAPIVGADSKSLCRSIRNLGKVDPILVSDTSQLGDVLDQIIQDGDLILAQGAGSVSKISRGLAESWKN >3dam_A Cytochrome P450 74A2 Parthenium argentatum non-cyclic MDPSSKPLREIPGSYGIPFFQPIKDRLEYFYGTGGRDEYFRSRMQKYQSTVFRANMPPGPFVSSNPKVIVLLDAKSFPILFDVSKVEKKDLFTGTYMPSTKLTGGYRVLSYLDPSEPRHAQLKNLLFFMLKNSSNRVIPQFETTYTELFEGLEAELAKNGKAAFNDVGEQAAFRFLGRAYFNSNPEETKLGTSAPTLISSWVLFNLAPTLDLGLPWFLQEPLLHTFRLPAFLIKSTYNKLYDYFQSVATPVMEQAEKLGVPKDEAVHNILFAVCFNTFGGVKILFPNTLKWIGLAGENLHTQLAEEIRGAIKSYGDGNVTLEAIEQMPLTKSVVYESLRIEPPVPPQYGKAKSNFTIESHDATFEVKKGEMLFGYQPFATKDPKVFDRPEEYVPDRFVGDGEALLKYVWWSNGPETESPTVENKQCAGKDFVVLITRLFVIELFRRYDSFEIELGESPLGAAVTLTFLKRASI >2eq7_A 2-oxoglutarate dehydrogenase E3 component Thermus thermophilus non-cyclic MYDLLVIGAGPGGYVAAIRAAQLGMKVGVVEKEKALGGTCLRVGCIPSKALLETTERIYEAKKGLLGAKVKGVELDLPALMAHKDKVVQANTQGVEFLFKKNGIARHQGTARFLSERKVLVEETGEELEARYILIATGSAPLIPPWAQVDYERVVTSTEALSFPEVPKRLIVVGGGVIGLELGVVWHRLGAEVIVLEYMDRILPTMDLEVSRAAERVFKKQGLTIRTGVRVTAVVPEAKGARVELEGGEVLEADRVLVAVGRRPYTEGLSLENAGLSTDERGRIPVDEHLRTRVPHIYAIGDVVRGPMLAHKASEEGIAAVEHMVRGFGHVDYQAIPSVVYTHPEIAAVGYTEEELKAQGIPYKVGKFPYSASGRARAMGETEGFIKVLAHAKTDRILGVHGIGARVGDVLAEAALALFFKASAEDLGRAPHAHPSLSEILKEAALAAWERPIHL >1ra6_A Genome polyprotein Human poliovirus 1 non-cyclic GEIQWMRPSKEVGYPIINAPSKTKLEPSAFHYVFEGVKEPAVLTKNDPRLKTDFEEAIFSKYVGNKITEVDEYMKEAVDHYAGQLMSLDINTEQMCLEDAMYGTDGLEALDLSTSAGYPYVAMGKKKRDILNKQTRDTKEMQKLLDTYGINLPLVTYVKDELRSKTKVEQGKSRLIEASSLNDSVAMRMAFGNLYAAFHKNPGVITGSAVGCDPDLFWSKIPVLMEEKLFAFDYTGYDASLSPAWFEALKMVLEKIGFGDRVDYIDYLNHSHHLYKNKTYCVKGGMPSGCSGTSIFNSMINNLIIRTLLLKTYKGIDLDHLKMIAYGDDVIASYPHEVDASLLAQSGKDYGLTMTPADKSATFETVTWENVTFLKRFFRADEKYPFLIHPVMPMKEIHESIRWTKDPRNTQDHVRSLCLLAWHNGEEEYNKFLAKIRSVPIGRALDLPEYSTLYDRWLDSF >2ez1_A Tyrosine phenol-lyase Citrobacter freundii non-cyclic MNYPAEPFRIKSVETVSMIPRDERLKKMQEAGYNTFLLNSKDIYIDLLTDSGTNAMSDKQWAGMMMGDEAYAGSENFYHLERTVQELFGFKHIVPTHQGRGAENLLSQLAIKPGQYVAGNMYFTTTRYHQEKNGAVFVDIVRDEAHDAGLNIAFKGDIDLKKLQKLIDEKGAENIAYICLAVTVNLAGGQPVSMANMRAVRELTEAHGIKVFYDATRCVENAYFIKEQEQGFENKSIAEIVHEMFSYADGCTMSGKKDCLVNIGGFLCMNDDEMFSSAKELVVVYEGMPSYGGLAGRDMEAMAIGLREAMQYEYIEHRVKQVRYLGDKLKAAGVPIVEPVGGHAVFLDARRFCEHLTQDEFPAQSLAASIYVETGVRSMERGIISAGRNNVTGEHHRPKLETVRLTIPRRVYTYAHMDVVADGIIKLYQHKEDIRGLKFIYEPKQLRFFTARFDYI >2rkv_A Trichothecene 3-O-acetyltransferase Gibberella zeae non-cyclic MAFKIQLDTLGQLPGLLSIYTQISLLYPVSDSSQYPTIVSTFEQGLKRFSEAVPWVAGQVKAEGISEGNTGTSFIVPFEDVPRVVVKDLRDDPSAPTIEGMRKAGYPMAMFDENIIAPRKTLPIGPGTGPDDPKPVILLQLNFIKGGLILTVNGQHGAMDMVGQDAVIRLLSKACRNDPFTEEEMTAMNLDRKTIVPYLENYTIGPEVDHQIVKADVAGGDAVLTPVSASWAFFTFSPKAMSELKDAATKTLDASTKFVSTDDALSAFIWKSASRVRLERIDGSAPTEFCRAVDARPAMGVSNNYPGLLQNMTYHNSTIGEIANESLGATASRLRSELDPASMRQRTRGLATYLHNNPDKSNVSLTADADPSTSVMLSSWAKVGLWDYDFGLGLGKPETVRRPIFEPVESLMYFMPKKPDGEFCAALSLRDEDMDRLKADKEWTKYAQYVG >2hp3_A IDS-epimerase Agrobacterium tumefaciens non-cyclic MFTTKLAEKVVSAWKAKISQPALKAAQDGVIDTVAAALGGVTEHSVQVALKYVAATGGSGDSKLWGVNQRSNMFDAAFVNGMAAHAIDFDDSFPVMRGHPSSSLVPAIFAVGEHVGANGHNCLKSYVLGIEVVATLGRAVGKGHYLAGWHPTSTLGVFGATTAAALLLGADEEQLRNAWGIAASNSCGIIKNFGTMTKPMHTGSAARNGVLSAWLSMQSFTGCQTVFDDAEGILAMYGAQPGPELFNAMQKFGTPWAIIAPGLYKKSWPSCYANHKPLAGLFAIMKEHGLTGQDISHVDVGFLPGVEKPLLYMDPRTTEEAKFSIEANIGAALLDGEVSLASFEIEHLDRPAMRAAMKKVTRFDMPSETTFSGTTGYTDIVVHTADGKIERRIEATPGSLEDPMDDAHLERKFKDCTAWMPFGESGLLFDRLRSLTADQGIKTVQP >3hxu_A Alanyl-tRNA synthetase Escherichia coli non-cyclic SKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYSRATTSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKLDAILFAWLLLTSEKWFALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDNKGAPYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMGLERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVMPSNENRGYVLRRIIRRAVRHGNMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYDTYGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGF >2cts_A CITRATE SYNTHASE Sus scrofa non-cyclic ASSTNLKDILADLIPKEQARIKTFRQQHGNTAVGQITVDMMYGGMRGMKGLVYETSVLDPDEGIRFRGYSIPECQKMLPKAKGGEEPLPEGLFWLLVTGQIPTEEQVSWLSKEWAKRAALPSHVVTMLDNFPTNLHPMSQLSAAITALNSESNFARAYAEGIHRTKYWELIYEDCMDLIAKLPCVAAKIYRNLYREGSSIGAIDSKLDWSHNFTNMLGYTDAQFTELMRLYLTIHSDHEGGNVSAHTSHLVGSALSDPYLSFAAAMNGLAGPLHGLANQEVLVWLTQLQKEVGKDVSDEKLRDYIWNTLNSGRVVPGYGHAVLRKTDPRYTCQREFALKHLPHDPMFKLVAQLYKIVPNVLLEQGKAKNPWPNVDAHSGVLLQYYGMTEMNYYTVLFGVSRALGVLAQLIWSRALGFPLERPKSMSTDGLIKLVDSK >2hne_A L-fuconate dehydratase Xanthomonas campestris pv. campestris non-cyclic MRTIIALETHDVRFPTSRELDGSDAMNPDPDYSAAYVVLRTDGAEDLAGYGLVFTIGRGNDVQTAAVAALAEHVVGLSVDKVIADLGAFARRLTNDSQLRWLGPEKGVMHMAIGAVINAAWDLAARAANKPLWRFIAELTPEQLVDTIDFRYLSDALTRDEALAILRDAQPQRAARTATLIEQGYPAYTTSPGWLGYSDEKLVRLAKEAVADGFRTIKLKVGANVQDDIRRCRLARAAIGPDIAMAVDANQRWDVGPAIDWMRQLAEFDIAWIEEPTSPDDVLGHAAIRQGITPVPVSTGEHTQNRVVFKQLLQAGAVDLIQIDAARVGGVNENLAILLLAAKFGVRVFPHAGGVGLCELVQHLAMADFVAITGKMEDRAIEFVDHLHQHFLDPVRIQHGRYLAPEVPGFSAEMHPASIAEFSYPDGRFWVEDLAA >2oit_A Nucleoporin 214kDa Homo sapiens non-cyclic MGDEMDAMIPEREMKDFQFRALKKVRIFDSPEELPKERSSLLAVSNKYGLVFAGGASGLQIFPTKNLLIQNKPGDDPNKIVDKVQGLLVPMKFPIHHLALSCDNLTLSACMMSSEYGSIIAFFDVRTFSNEAKQQKRPFAYHKLLKDAGGMVIDMKWNPTVPSMVAVCLADGSIAVLQVTETVKVCATLPSTVAVTSVCWSPKGKQLAVGKQNGTVVQYLPTLQEKKVIPCPPFYESDHPVRVLDVLWIGTYVFAIVYAAADGTLETSPDVVMALLPKKEEKHPEIFVNFMEPCYGSCTERQHHYYLSYIEEWDLVLAASAASTEVSILARQSDQINWESWLLEDSSRAELPVTDKSDDSLPMGVVVDYTNQVEITISDEKTLPPAPVLMLLSTDGVLCPFYMINQNPGVKSLIKTPERLSLEGERQPKSPGST >2qlr_A Kynurenine/alpha-aminoadipate aminotransferas Homo sapiens non-cyclic MNYARFITAASAARNPSPIRTMTDILSRGPKSMISLAGGLPNPNMFPFKTAVITVENGKTIQFGEEMMKRALQYSPSAGIPELLSWLKQLQIKLHNPPTIHYPPSQGQMDLCVTSGSQQGLCKVFEMIINPGDNVLLDEPAYSGTLQSLHPLGCNIINVASDESGIVPDSLRDILSRWKPEDAKNPQKNTPKFLYTVPNGNNPTGNSLTSERKKEIYELARKYDFLIIEDDPYYFLQFNKFRVPTFLSMDVDGRVIRADSFSKIISSGLRIGFLTGPKPLIERVILHIQVSTLHPSTFNQLMISQLLHEWGEEGFMAHVDRVIDFYSNQKDAILAAADKWLTGLAEWHVPAAGMFLWIKVKGINDVKELIEEKAVKMGVLMLPGNAFYVDSSAPSPYLRASFSSASPEQMDVAFQVLAQLIKESL >1szn_A alpha-galactosidase Hypocrea jecorina non-cyclic IVMPDGVTGKVPSLGWNSWNAYHCDIDESKFLSAAELIVSSGLLDAGYNYVNIDDCWSMKDGRVDGHIAPNATRFPDGIDGLAKKVHALGLKLGIYSTAGTATCAGYPASLGYEDVDAADFADWGVDYLKYDNCNVPSDWQDEYVACNPDFVKTGPNGTCTTALDPTLAPPGYDWSTSKSAERFGAMRNALAKQSHEIVLSMCIWGQADVFSWGNSTGISWRMSDDISPNWGSVTRILNLNSFKLNSVDFWGHNDADMLEVGNGNLTAAETRTHFALWAAMKSPLLIGTDLAQLSQNNINLLKNKHLLAFNQDSVYGQPATPYKWGINPDWTFNVTYPAEFWAGPSSKGHLVLMVNTLDITATKEAKWNEIPGLSAGHYEVRDVWSDKDLGCLSSYKAAVAAHDTAVILVGKKCQRW >1zzg_A glucose-6-phosphate isomerase Thermus thermophilus non-cyclic MLRLDTRFLPGFPEALSRHGPLLEEARRRLLAKRGEPGSMLGWMDLPEDTETLREVRRYREANPWVEDFVLIGIGGSALGPKALEAAFNESGVRFHYLDHVEPEPILRLLRTLDPRKTLVNAVSKSGSTAETLAGLAVFLKWLKAHLGEDWRRHLVVTTDPKEGPLRAFAEREGLKAFAIPKEVGGRFSALSPVGLLPLAFAGADLDALLMGARKANETALAPLEESLPLKTALLLHLHRHLPVHVFMVYSERLSHLPSWFVQLHDESLGKVDRQGQRVGTTAVPALGPKDQHAQVQLFREGPLDKLLALVIPEAPLEDVEIPEVEGLEAASYLFGKTLFQLLKAEAEATYEALAEAGQRVYALFLPEVSPYAVGWLMQHLMWQTAFLGELWEVNAFDQPGVELGKVLTRKRLAG >2cb1_A O-ACETYL HOMOSERINE SULFHYDRYLASE Thermus thermophilus non-cyclic MEYTTLAVLAGLPEDPHGAVGLPIYAVAAYGFKTLEEGQERFATGEGYVYARQKDPTAKALEERLKALEGALEAVVLASGQAATFAALLALLRPGDEVVAAKGLFGQTIGLFGQVLSLMGVTVRYVDPEPEAVREALSAKTRAVFVETVANPALLVPDLEALATLAEEAGVALVVDNTFGAAGALCRPLAWGAHVVVESLTKWASGHGSVLGGAVLSRETELWRNYPQFLQPDLKGQIPWEALRARCFPERVRTLGLSLCGMALSPFNAYLLFQGLETVALRVARMSETARFLAERLQGHPKVKALRYPGLPEDPAHRNARKYLASGGPILTLDLGDLERASRFLGAIRLLKAANLGDARTLLVHPWTTTHSRLKEEARLQAGVTPGLVRVSVGLEDPLDLLALFEEALEAV >2qjj_A Mandelate racemase/muconate lactonizing enzym Novosphingobium aromaticivorans non-cyclic MKITAARVIITCPGRNFVTLKIETDQGVYGIGDATLNGRELSVVAYLQEHVAPCLIGMDPRRIEDIWQYVYRGAYWRRGPVTMRAIAAVDMALWDIKAKMAGMPLYQLLGGRSRDGIMVYGHANGSDIAETVEAVGHYIDMGYKAIRAQTGVPGIKDAYGVGRGKLYYEPADASLPSVTGWDTRKALNYVPKLFEELRKTYGFDHHLLHDGHHRYTPQEAANLGKMLEPYQLFWLEDCTPAENQEAFRLVRQHTVTPLAVGEIFNTIWDAKDLIQNQLIDYIRATVVGAGGLTHLRRIADLASLYQVRTGCHGATDLSPVTMGCALHFDTWVPNFGIQEYMRHTEETDAVFPHDYWFEKGELFVGETPGHGVDIDEELAAKYPYKPAYLPVARLEDGTMWNW >1gc0_A METHIONINE GAMMA-LYASE Pseudomonas putida non-cyclic MHGSNKLPGFATRAIHHGYDPQDHGGALVPPVYQTATFTFPTVEYGAACFAGEQAGHFYSRISNPTLNLLEARMASLEGGEAGLALASGMGAITSTLWTLLRPGDEVLLGNTLYGCTFAFLHHGIGEFGVKLRHVDMADLQALEAAMTPATRVIYFESPANPNMHMADIAGVAKIARKHGATVVVDNTYCTPYLQRPLELGADLVVHSATKYLSGHGDITAGIVVGSQALVDRIRLQGLKDMTGAVLSPHDAALLMRGIKTLNLRMDRHCANAQVLAEFLARQPQVELIHYPGLASFPQYTLARQQMSQPGGMIAFELKGGIGAGRRFMNALQLFSRAVSLGDAESLAQHPASMTHSSYTPEERAHYGISEGLVRLSVGLEDIDDLLADVQQALKASA >1jhd_A SULFATE ADENYLYLTRANSFERASE Sulfur-oxidizing endosymbiont of non-cyclic MIKPVGSDELKPLFVYDPEEHHKLSHEAESLPSVVISSQAAGNAVMMGAGYFSPLQGFMNVADAMGAAEKMTLSDGSFFPVPVLCLLENTDAIGDAKRIALRDPNVEGNPVLAVMDIEAIEEVSDEQMAVMTDKVYRTTDMDHIGVKTFNSQGRVAVSGPIQVLNFSYFQADFPDTFRTAVEIRNEIKEHGWSKVVAFQTRNPMHRAHEELCRMAMESLDADGVVVHMLLGKLKKGDIPAPVRDAAIRTMAEVYFPPNTVMVTGYGFDMLYAGPREAVLHAYFRQNMGATHFIIGRDHAGVGDYYGAFDAQTIFDDEVPEGAMEIEIFRADHTAYSKKLNKIVMMRDVPDHTKEDFVLLSGTKVREMLGQGIAPPPEFSRPEVAKILMDYYQSINS >1o06_A Vacuolar protein sorting-associated protein V null non-cyclic EEDPDLKAAIQESLREAEEA >3pe7_A Oligogalacturonate lyase Yersinia enterocolitica subsp. enterocolitica non-cyclic MAKGKQIPLTFDTYQDASTGAQVTRLTPPDVTCHRNYFYQKCFTRDGSKLLFGGAFDGPWNYYLLDLNTQVATQLTEGRGDNTFGGFLSPDDDALFYVKDGRNLMRVDLATLEENVVYQVPAEWVGYGTWVANSDCTKLVGIEIRREDWVPLTDWKKFHEFYFTKPCCRLMRVDLKTGESTVILQENQWLGHPIYRPYDDSTVAFCHEGPHDLVDARMWLINEDGTNMRKVKTHAEGESCTHEFWVPDGSALVYVSYLKGSPDRFIYSADPETLENRQLTSMPACSHLMSNYDGSLMVGDGSDAPVDVQDDSGYKIENDPFLYVFNMKNGTQHRVARHDTSWKVFEGDRQVTHPHPSFTPDDKQILFTSDVHGKPALYLATLPESVWK >2eh6_A Acetylornithine aminotransferase Aquifex aeolicus non-cyclic TYLMNNYARLPVKFVRGKGVYLYDEEGKEYLDFVSGIGVNSLGHAYPKLTEALKEQVEKLLHVSNLYENPWQEELAHKLVKHFWTEGKVFFANSGTESVEAAIKLARKYWRDKGKNKWKFISFENSFHGRTYGSLSATGQPKFHKGFEPLVPGFSYAKLNDIDSVYKLLDEETAGIIIEVIQGEGGVNEASEDFLSKLQEICKEKDVLLIIDEVQTGIGRTGEFYAYQHFNLKPDVIALAKGLGGGVPIGAILAREEVAQSFTPGSHGSTFGGNPLACRAGTVVVDEVEKLLPHVREVGNYFKEKLKELGKGKVKGRGLMLGLELERECKDYVLKALEKGLLINCTAGKVLRFLPPLIIQKEHIDRAISVLREIL >1bjw_A ASPARTATE AMINOTRANSFERASE Thermus thermophilus non-cyclic MRGLSRRVQAMKPSATVAVNAKALELRRQGVDLVALTAGEPDFDTPEHVKEAARRALAQGKTKYAPPAGIPELREALAEKFRRENGLSVTPEETIVTVGGKQALFNLFQAILDPGDEVIVLSPYWVSYPEMVRFAGGVVVEVETLPEEGFVPDPERVRRAITPRTKALVVNSPNNPTGAVYPKEVLEALARLAVEHDFYLVSDEIYEHLLYEGEHFSPGRVAPEHTLTVNGAAKAFAMTGWRIGYACGPKEVIKAMASVSSQSTTSPDTIAQWATLEALTNQEASRAFVEMAREAYRRRRDLLLEGLTALGLKAVRPSGAFYVLMDTSPIAPDEVRAAERLLEAGVAVVPGTDFAAFGHVRLSYATSEENLRKALERFARVL >1u6r_A Creatine kinase, M chain Oryctolagus cuniculus non-cyclic PFGNTHNKYKLNYKSEEEYPDLSKHNNHMAKVLTPDLYKKLRDKETPSGFTLDDVIQTGVDNPGHPFIMTVGCVAGDEESYTVFKDLFDPIIQDRHGGFKPTDKHKTDLNHENLKGGDDLDPHYVLSSRVRTGKSIKGYTLPPHCSRGERRAVEKLSVEALNSLTGEFKGKYYPLKSMTEQEQQQLIDDHFLFDKPVSPLLLASGMARDWPDARGIWHNDNKSFLVWVNEEDHLRVISMEKGGNMKEVFRRFCVGLQKIEEIFKKAGHPFMWNEHLGYVLTCPSNLGTGLRGGVHVKLAHLSKHPKFEEILTRLRLQKRGTGGVDTAAVGSVFDISNADRLGSSEVEQVQLVVDGVKLMVEMEKKLEKGQSIDDMIPAQK >2ahf_A unsaturated glucuronyl hydrolase Bacillus sp. non-cyclic MWQQAIGDALGITARNLKKFGDRFPHVSDGSNKYVLNDNTDWTDGFWSGILWLCYEYTGDEQYREGAVRTVASFRERLDRFENLDHHNIGFLYSLSAKAQWIVEKDESARKLALDAADVLMRRWRADAGIIQAWGPKGDPENGGRIIIDCLLNLPLLLWAGEQTGDPEYRRVAEAHALKSRRFLVRGDDSSYHTFYFDPENGNAIRGGTHQGNTDGSTWTRGQAWGIYGFALNSRYLGNADLLETAKRMARHFLARVPEDGVVYWDFEVPQEPSSYRDSSASAITACGLLEIASQLDESDPERQRFIDAAKTTVTALRDGYAERDDGEAEGFIRRGSYHVRGGISPDDYTIWGDYYYLEALLRLERGVTGYWYERGR >1yt3_A Ribonuclease D Escherichia coli non-cyclic MNYQMITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEHLALIDPLGITDWSPLKAILRDPSITKFLHAGSEDLEVFLNVFGELPQPLIDTQILAAFCGRPMSWGFASMVEEYSGVTLDKSESRTDWLARPLTERQCEYAAADVWYLLPITAKLMVETEASGWLPAALDECRLMQMRRQEVVAPEDAWRDITNAWQLRTRQLACLQLLADWRLRKARERDLAVNFVVREEHLWSVARYMPGSLGELDSLGLSGSEIRFHGKTLLALVEKAQTLPEDALPQPMLNLMDMPGYRKAFKAIKSLITDVSETHKISAELLASRRQINQLLNWHWKLKPQNNLPELISGWRGELMAEALHNLLQEYPQ >1anf_A MALTODEXTRIN-BINDING PROTEIN Escherichia coli non-cyclic KIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK >3lnl_A UPF0135 protein SA1388 Staphylococcus aureus subsp. aureus non-cyclic AMDPMKIADLMTLLDHHVPFSTAESWDNVGLLIGDEDVEVTGVLTALDCTLEVVNEAIEKGYNTIISHHPLIFKGVTSLKANGYGLIIRKLIQHDINLIAMHTNLDVNPHGVNMMLAKVMGLKNISIINNQQDVYYKVQTYIPKDNVGPFKDKLSENGLAQEGNYEYCFFESEGRGQFKPVGEANPTIGQIDKIEDVDEVKIEFMIDAYQKSRAEQLIKQYHPYETPVFDFIEIKQTSLYGLGVMAEVDNQMTLEDFAADIKSKLNIPSVRFVGESNQKIKRIAIIGGSGIGYEYQAVQQGADVFVTGDIKHHDALDAKIHGVNLIDINHYSEYVMKEGLKTLLMNWFNIEKINIDVEASTINTDPFQYI >1ue8_A 367aa long hypothetical cytochrome P450 Sulfolobus tokodaii non-cyclic MYDWFKQMRKESPVYYDGKVWNLFKYEDCKMVLNDHKRFSSNLTGYNDKLEMLRSGKVFFDIPTRYTMLTSDPPLHDELRNLTADAFNPSNLPVDFVREVTVKLLSELDEEFDVIESFAIPLPILVISKMLGINPDVKKVKDWSDLVALRLGRADEIFSIGRKYLELISFSKKELDSRKGKEIVDLTGKIANSNLSELEKEGYFILLMIAGNETTTNLIGNAIEDFTLYNSWDYVREKGALKAVEEALRFSPPVMRTIRVTKEKVKIRDQVIDEGELVRVWIASANRDEEVFKDPDSFIPDRTPNPHLSFGSGIHLCLGAPLARLEARIALEEFAKKFRVKEIVKKEKIDNEVLNGYRKLVVRVERT >1ado_A ALDOLASE Oryctolagus cuniculus non-cyclic PHSHPALTPEQKKELSDIAHRIVAPGKGILAADESTGSIAKRLQSIGTENTEENRRFYRQLLLTADDRVNPCIGGVILFHETLYQKADDGRPFPQVIKSKGGVVGIKVDKGVVPLAGTNGETTTQGLDGLSERCAQYKKDGADFAKWRCVLKIGEHTPSALAIMENANVLARYASICQQNGIVPIVEPEILPDGDHDLKRCQYVTEKVLAAVYKALSDHHIYLEGTLLKPNMVTPGHACTQKYSHEEIAMATVTALRRTVPPAVTGVTFLSGGQSEEEASINLNAINKCPLLKPWALTFSYGRALQASALKAWGGKKENLKAAQEEYVKRALANSLACQGKYTSSGQAGAAASESLFISNHAY >1g1a_A DTDP-D-GLUCOSE 4,6-DEHYDRATASE Salmonella enterica subsp. enterica non-cyclic MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYALLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >2c1l_A RESTRICTION ENDONUCLEASE Bacillus firmus non-cyclic MNFFSLHPNVYATGRPKGLIGMLENVWVSNHTPGEGTLYLISGFSNYNGGVRFYETFTEHINQGGRVIAILGGSTSQRLSSRQVVEELLNRGVEVHIINRKRILHAKLYGTSNNLGESLVVSSGNFTGPGMSQNIEASLLLDNNTTQSMGFSWNDMISEMLNQNWHIHNMTNATDASPGWNLLYDERTTNLTLDETERVTLIVTLGHADTARIQAAPGTTAGQGTQYFWLSKDSYDFFPPLTIRNRRGTKATYSSLINMNYIDINYTDTQCRVTFEAENNFDFRLGTGKLRYTGVAKSNDIAAITRVGDSDYELRIIKQGTPEHSQLDPYAVSFIGNRGKRFGYISNEEFGRIIGVTF >2vka_A VERSATILE PEROXIDASE VPL2 Pleurotus eryngii non-cyclic ATCDDGRTTANAACCILFPILDDIQENLFDGAQCGEEVHESLRLTFHDAIGFSPTLGGGGADGSIIAFDTIETNFPANAGIDEIVSAQKPFVAKHNISAGDFIQFAGAVGVSNCPGGVRIPFFLGRPDAVAASPDHLVPEPFDSVDSILARMGDAGFSPVEVVWLLASHSIAAADKVDPSIPGTPFDSTPGVFDSQFFIETQLKGRLFPGTADNKGEAQSPLQGEIRLQSDHLLARDPQTACEWQSFVNNQPKIQNRFAATMSKMALLGQDKTKLIDCSDVIPTPPALVGAAHLPAGFSLSDVEQACAATPFPALTA >1b6r_A PROTEIN (N5-CARBOXYAMINOIMIDAZOLE RIBONUCLEOT Escherichia coli non-cyclic MKQVCVLGNGQLGRMLRQAGEPLGIAVWPVGLDAEPAAVPFQQSVITAEIERWPETALTRQLARHPAFVNRDVFPIIADRLTQKQLFDKLHLPTAPWQLLAERSEWPAVFDRLGELAIVKRRTGGYDGRGQWRLRANETEQLPAECYGECIVEQGINFSGEVSLVGARGFDGSTVFYPLTHNLHQDGILRTSVAFPQANAQQQARAEEMLSAIMQELGYVGVMAMECFVTPQGLLINELAPRVHNSGHWTQNGASISQFELHLRAITDLPLPQPVVNNPSVMINLIGSDVNYDWLKLPLVHLHWYDKEVRPGRKVGHLNLTDSDTSRLTATLEALIPLLPPEYASGVIWAQSKFG >1uim_A Threonine Synthase Thermus thermophilus non-cyclic MRPPLIERYRNLLPVSEKTPVISLLEGSTPLIPLKGPEEARKKGIRLYAKYEGLNPTGSFKDRGMTLAVSKAVEGGAQAVACASTGNTAASAAAYAARAGILAIVVLPAGYVALGKVAQSLVHGARIVQVEGNFDDALRLTQKLTEAFPVALVNSVNPHRLEGQKTLAFEVVDELGDAPHYHALPVGNAGNITAHWMGYKAYHALGKAKRLPRMLGFQAAGAAPLVLGRPVERPETLATAIRIGNPASWQGAVRAKEESGGVIEAVTDEEILFAYRYLAREEGIFCEPASAAAMAGVFKLLREGRLEPESTVVLTLTGHGLKDPATAERVAELPPPVPARLEAVAAAAGLL >1ujn_A dehydroquinate synthase Thermus thermophilus non-cyclic MQRLEVREPVPYPILVGEGVLKEVPPLAGPAALLFDRRVEGFAQEVAKALGVRHLLGLPGGEAAKSLEVYGKVLSWLAEKGLPRNATLLVVGGGTLTDLGGFVAATYLRGVAYLAFPTTTLAIVDASVGGKTGINLPEGKNLVGAFHFPQGVYAELRALKTLPLPTFKEGLVEAFKHGLIAGDEALLKVEDLTPQSPRLEAFLARAVAVKVRVTEEDPLEKGKRRLLNLGHTLGHALEAQTRHALPHGMAVAYGLLYAALLGRALGGEDLLPPVRRLLLWLSPPPLPPLAFEDLLPYLLRDKKKVSESLHWVVPLAPGRLVVRPLPEGLLREAFAAWREELKGLGLLR >1rqx_A 1-aminocyclopropane-1-carboxylate deaminase Pseudomonas sp. non-cyclic MNLQRFPRYPLTFGPTPIQPLARLSKHLGGKVHLYAKREDCNSGLAFGGNKTRKLEYLIPEALAQGCDTLVSIGGIQSNQTRQVAAVAAHLGMKCVLVQENWVNYSDAVYDRVGNIQMSRILGADVRLVPDGFDIGFRRSWEDALESVRAAGGKPYAIPAGCSDHPLGGLGFVGFAEEVRAQEAELGFKFDYVVVCSVTGSTQAGMVVGFAADGRADRVIGVDASAKPAQTREQITRIARQTAEKVGLERDIMRADVVLDERFAGPEYGLPNEGTLEAIRLCARTEGMLTDPVYEGKSMHGMIEMVRNGEFPEGSRVLYAHLGGVPALNGYSFIFRDG >1wkr_A Polyporopepsin Irpex lacteus non-cyclic AAGSVPATNQLVDYVVNVGVGSPATTYSLLVDTGSSNTWLGADKSYVKTSTSSATSDKVSVTYGSGSFSGTEYTDTVTLGSLTIPKQSIGVASRDSGFDGVDGILGVGPVDLTVGTLSPHTSTSIPTVTDNLFSQGTIPTNLLAVSFEPTTSESSTNGELTFGATDSSKYTGSITYTPITSTSPASAYWGINQSIRYGSSTSILSSTAGIVDTGTTLTLIASDAFAKYKKATGAVADNNTGLLRLTTAQYANLQSLFFTIGGQTFELTANAQIWPRNLNTAIGGSASSVYLIVGDLGSDSGEGLDFINGLTFLERFYSVYDTTNKRLGLATTSFTTATSN >1pm2_A Ribonucleoside-diphosphate reductase 1 beta c Escherichia coli non-cyclic AYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLESIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFNTRSNPIPWINTWL >2x0k_A RIBOFLAVIN BIOSYNTHESIS PROTEIN RIBF Corynebacterium ammoniagenes non-cyclic MDIWYGTAAVPKDLDNSAVTIGVFDGVHRGHQKLINATVEKAREVGAKAIMVTFDPHPVSVFLPRRAPLGITTLAERFALAESFGIDGVLVIDFTRELSGTSPEKYVEFLLEDTLHASHVVVGANFTFGENAAGTADSLRQICQSRLTVDVIDLLDDEGVRISSTTVREFLSEGDVARANWALGRHFYVTGPVVRGAGRGGKELGFPTANQYFHDTVALPADGVYAGWLTILPTEAPVSGNMEPEVAYAAAISVGTNPTFGDEQRSVESFVLDRDADLYGHDVKVEFVDHVRAMEKFDSVEQLLEVMAKDVQKTRTLLAQDVQAHKMAPETYFLQAES >1txg_A glycerol-3-phosphate dehydrogenase [NAD(P)+] Archaeoglobus fulgidus non-cyclic MIVSILGAGAMGSALSVPLVDNGNEVRIWGTEFDTEILKSISAGREHPRLGVKLNGVEIFWPEQLEKCLENAEVVLLGVSTDGVLPVMSRILPYLKDQYIVLISKGLIDFDNSVLTVPEAVWRLKHDLRERTVAITGPAIAREVAKRMPTTVVFSSPSESSANKMKEIFETEYFGVEVTTDIIGTEITSALKNVYSIAIAWIRGYESRKNVEMSNAKGVIATRAINEMAELIEILGGDRETAFGLSGFGDLIATFRGGRNGMLGELLGKGLSIDEAMEELERRGVGVVEGYKTAEKAYRLSSKINADTKLLDSIYRVLYEGLKVEEVLFELATFK >3exe_B Pyruvate dehydrogenase E1 component subunit b Homo sapiens non-cyclic LQVTVRDAINQGMDEELERDEKVFLLGEEVAQYDGAYKVSRGLWKKYGDKRIIDTPISEMGFAGIAVGAAMAGLRPICEFMTFNFSMQAIDQVINSAAKTYYMSGGLQPVPIVFRGPNGASAGVAAQHSQCFAAWYGHCPGLKVVSPWNSEDAKGLIKSAIRDNNPVVVLENELMYGVPFEFPPEAQSKDFLIPIGKAKIERQGTHITVVSHSRPVGHCLEAAAVLSKEGVECEVINMRTIRPMDMETIEASVMKTNHLVTVEGGWPQFGVGAEICARIMEGPAFNFLDAPAVRVTGADVPMPYAKILEDNSIPQVKDIIFAIKKTLNI >1nns_A L-asparaginase II Escherichia coli non-cyclic LPNITILATGGTIAGGGDSATKSNYTVGKVGVENLVNAVPQLKDIANVKGEQVVNIGSQDMNDNVWLTLAKKINTDCDKTDGFVITHGTDTMEETAYFLDLTVKCDKPVVMVGAMRPSTSMSADGPFNLYNAVVTAADKASANRGVLVVMNDTVLDGRDVTKTNTTDVATFKSVNYGPLGYIHNGKIDYQRTPARKHTSDTPFDVSKLNELPKVGIVYNYANASDLPAKALVDAGYDGIVSAGVGNGNLYKSVFDTLATAAKTGTAVVRSSRVPTGATTQDAEVDDAKYGFVASGTLNPQKARVLLQLALTQTKDPQQIQQIFNQY >1j96_A 3alpha-hydroxysteroid dehydrogenase type 3 Homo sapiens non-cyclic DDSKYQCVKLNDGHFMPVLGFGTYAPAEVPKSKALEAVKLAIEAGFHHIDSAHVYNNEEQVGLAIRSKIADGSVKREDIFYTSKLWSNSHRPELVRPALERSLKNLQLDYVDLYLIHFPVSVKPGEEVIPKDENGKILFDTVDLCATWEAMEKCKDAGLAKSIGVSNFNHRLLEMILNKPGLKYKPVCNQVECHPYFNQRKLLDFCKSKDIVLVAYSALGSHREEPWVDPNSPVLLEDPVLCALAKKHKRTPALIALRYQLQRGVVVLAKSYNEQRIRQNVQVFEFQLTSEEMKAIDGLNRNVRYLTLDIFAGPPNYPFSDEY >1dxr_M PHOTOSYNTHETIC REACTION CENTER M SUBUNIT Rhodopseudomonas viridis non-cyclic ADYQTIYTQIQARGPHITVSGEWGDNDRVGKPFYSYWLGKIGDAQIGPIYLGASGIAAFAFGSTAILIILFNMAAEVHFDPLQFFRQFFWLGLYPPKAQYGMGIPPLHDGGWWLMAGLFMTLSLGSWWIRVYSRARALGLGTHIAWNFAAAIFFVLCIGCIHPTLVGSWSEGVPFGIWPHIDWLTAFSIRYGNFYYCPWHGFSIGFAYGCGLLFAAHGATILAVARFGGDREIEQITDRGTAVERAALFWRWTIGFNATIESVHRWGWFFSLMVMVSASVGILLTGTFVDNWYLWCVKHGAAPDYPAYLPATPDPASLPGAPK >1wq3_A Tyrosyl-tRNA synthetase Escherichia coli str. k12 substr. non-cyclic MASSNLIKQLQERGLVAQVTDEEALAERLAQGPIALVCGFDPTADSLHLGHLVPLLCLKRFQQAGHKPVALVGGATGLIGDPSFKAAERKLNTEETVQEWVDKIRKQVAPFLDFDCGENSAIAANNYDWFGNMNVLTFLRDIGKHFSVNQMINKEAVKQRLNREDQGISFTEFSYNLLQGYDFACLNKQYGVVLCIGGSDQWGNITSGIDLTRRLHQNQVFGLTVPLITKADGTKFGKTEGGAVWLDPKKTSPYKFYQFWINTADADVYRFLKFFTFMSIEEINALEEEDKNSGKAPRAQYVLAEQVTRLVHGEEGLQAAKR >1zxx_A 6-phosphofructokinase Lactobacillus delbrueckii subsp. bulgaricus non-cyclic MKRIGILTSGGDAPGMNAAVRAVTRVAIANGLEVFGIRYGFAGLVAGDIFPLESEDVAHLINVSGTFLYSARYPEFAEEEGQLAGIEQLKKHGIDAVVVIGGDGSYHGALQLTRHGFNSIGLPGTIDNDIPYTDATIGYDTACMTAMDAIDKIRDTASSHHRVFIVNVMGRNCGDIAMRVGVACGADAIVIPERPYDVEEIANRLKQAQESGKDHGLVVVAEGVMTADQFMAELKKYGDFDVRANVLGHMQRGGTPTVSDRVLASKLGSEAVHLLLEGKGGLAVGIENGKVTSHDILDLFDESHRGDYDLLKLNADLSR >1dl5_A PROTEIN-L-ISOASPARTATE O-METHYLTRANSFERASE Thermotoga maritima non-cyclic MREKLFWILKKYGVSDHIAKAFLEIPREEFLTKSYPLSYVYEDIVLVSYDDGEEYSTSSQPSLMALFMEWVGLDKGMRVLEIGGGTGYNAAVMSRVVGEKGLVVSVEYSRKICEIAKRNVERLGIENVIFVCGDGYYGVPEFSPYDVIFVTVGVDEVPETWFTQLKEGGRVIVPINLKLSRRQPAFLFKKKDPYLVGNYKLETRFITAGGNLGNLLERNRKLLREFPFNREILLVRSHIFVELVDLLTRRLTEIDGTFYYAGPNGVVEFLDDRMRIYGDAPEIENLLTQWESCGYRSFEYLMLHVGYNAFSHISCSI >1qlm_A METHENYLTETRAHYDROMETHANOPTERIN CYCLOHYDROLAS Methanopyrus kandleri non-cyclic MVSVNENALPLVERMIERAELLNVEVQELENGTTVIDCGVEAAGGFEAGLLFSEVCMGGLATVELTEFEHDGLCLPAVQVTTDHPAVSTLAAQKAGWQVQVGDYFAMGSGPARALALKPKETYEEIDYEDDADVAILCLESSELPDEDVAEHVADECGVDPENLYLLVAPTASIVGSVQVSARVVETGLYKLLEVLEYDVTRVKYATGTAPIAPVADDDGEAMGRTNDCILYGGTVYLYVEGDDELPEVVEELPSEASEDYGKPFMKIFEEADYDFYKIDPGVFAPARVVVNDLSTGKTYTAGEINVDVLKESFSL >1fmt_A METHIONYL-TRNA FMET FORMYLTRANSFERASE Escherichia coli non-cyclic SESLRIIFAGTPDFAARHLDALLSSGHNVVGVFTQPDRPAGRGKKLMPSPVKVLAEEKGLPVFQPVSLRPQENQQLVAELQADVMVVVAYGLILPKAVLEMPRLGCINVHGSLLPRWRGAAPIQRSLWAGDAETGVTIMQMDVGLDTGDMLYKLSCPITAEDTSGTLYDKLAELGPQGLITTLKQLADGTAKPEVQDETLVTYAEKLSKEEARIDWSLSAAQLERCIRAFNPWPMSWLEIEGQPVKVWKASVIDTATNAAPGTILEANKQGIQVATGDGILNLLSLQPAGKKAMSAQDLLNSRREWFVPGNRLV >1ogq_A POLYGALACTURONASE INHIBITING PROTEIN Phaseolus vulgaris non-cyclic ELCNPQDKQALLQIKKDLGNPTTLSSWLPTTDCCNRTWLGVLCDTDTQTYRVNNLDLSGLNLPKPYPIPSSLANLPYLNFLYIGGINNLVGPIPPAIAKLTQLHYLYITHTNVSGAIPDFLSQIKTLVTLDFSYNALSGTLPPSISSLPNLVGITFDGNRISGAIPDSYGSFSKLFTSMTISRNRLTGKIPPTFANLNLAFVDLSRNMLEGDASVLFGSDKNTQKIHLAKNSLAFDLGKVGLSKNLNGLDLRNNRIYGTLPQGLTQLKFLHSLNVSFNNLCGEIPQGGNLQRFDVSAYANNKCLCGSPLPACT >1ppr_M PERIDININ-CHLOROPHYLL PROTEIN Amphidinium carterae non-cyclic DEIGDAAKKLGDASYAFAKEVDWNNGIFLQAPGKLQPLEALKAIDKMIVMGAAADPKLLKAAAEAHHKAIGSISGPNGVTSRADWDNVNAALGRVIASVPENMVMDVYDSVSKITDPKVPAYMKSLVNGADAEKAYEGFLAFKDVVKKSQVTSAAGPATVPSGDKIGVAAQQLSEASYPFLKEIDWLSDVYMKPLPGVSAQQSLKAIDKMIVMGAQADGNALKAAAEAHHKAIGSIDATGVTSAADYAAVNAALGRVIASVPKSTVMDVYNAMAGVTDTSIPLNMFSKVNPLDANAAAKAFYTFKDVVQAAQ >2v04_A CHOLINE BINDING PROTEIN F Streptococcus pneumoniae non-cyclic NTTGGRFVDKDNRKYYVKDDHKAIYWHKIDGKTYYFGDIGEMVVGWQYLEIPGTGYRDNLFDNQPVNEIGLQEKWYYFGQDGALLEQTDKQVLEAKTSENTGKVYGEQYPLSAEKRTYYFDNNYAVKTGWIYEDGNWYYLNKLGNFGDDSYNPLPIGEVAKGWTQDFHVTIDIDRSKPAPWYYLDASGKMLTDWQKVNGKWYYFGSSGSMATGWKYVRGKWYYLDNKNGDMKTGWQYLGNKWYYLRSSGAMVTGWYQDGLTWYYLNAGNGDMKTGWFQVNGKWYYAYSSGALAVNTTVDGYSVNYNGEWVQ >2cex_A PROTEIN HI0146 Haemophilus influenzae non-cyclic ADYDLKFGMNAGTSSNEYKAAEMFAKEVKEKSQGKIEISLYPSSQLGDDRAMLKQLKDGSLDFTFAESARFQLFYPEAAVFALPYVISNYNVAQKALFDTEFGKDLIKKMDKDLGVTLLSQAYNGTRQTTSNRAINSIADMKGLKLRVPNAATNLAYAKYVGASPTPMAFSEVYLALQTNAVDGQENPLAAVQAQKFYEVQKFLAMTNHILNDQLYLVSNETYKELPEDLQKVVKDAAENAAKYHTKLFVDGEKDLVTFFEKQGVKITHPDLVPFKESMKPYYAEFVKQTGQKGESALKQIEAINP >1d9v_A PROTEIN (iron-utilization periplasmic protein Haemophilus influenzae non-cyclic DITVYNGQHKEAATAVAKAFEQETGIKVTLNSGKSEQLAGQLKEEGDKTPADVFYTEQTATFADLSEAGLLAPISEQTIQQTAQKGVPLAPKKDWIALSGRSRVVVYDHTKLSEKDMEKSVLDYATPKWKGKIGYVSTSGAFLEQVVALSKMKGDKVALNWLKGLKENGKLYAKNSVALQAVENGEVPAALINNYYWYNLAKEKGVENLKSRLYFVRHQDPGALVSYSGAAVLKASKNQAEAQKFVDFLASKKGQEALVAARAEYPLRADVVSPFNLEPYEKLEAPVVSATTAQDKEHAIKLIEEAGLK >1iqc_A di-heme peroxidase Nitrosomonas europaea non-cyclic ANEPIQPIKAVTPENADMAELGKMLFFDPRLSKSGFISCNSCHNLSMGGTDNITTSIGHKWQQGPINAPTVLNSSMNLAQFWDGRAKDLKEQAAGPIANPKEMASTHEIAEKVVASMPQYRERFKKVFGSDEVTIDRITTAIAQFEETLVTPGSKFDKWLEGDKNALNQDELEGYNLFKGSGCVQCHNGPAVGGSSYQKMGVFKPYETKNPAAGRMDVTGNEADRNVFKVPTLRNIELTYPYFHDGGAATLEQAVETMGRIQLNREFNKDEVSKIVAFLKTLTGDQPDFKLPILPPSNNDTPRSQPYE >2egu_A Cysteine synthase Geobacillus kaustophilus non-cyclic MARTVNSITELIGDTPAVKLNRIVDEDSADVYLKLEFMNPGSSVKDRIALAMIEAAEKAGKLKPGDTIVEPTSGNTGIGLAMVAAAKGYKAVLVMPDTMSLERRNLLRAYGAELVLTPGAQGMRGAIAKAEELVREHGYFMPQQFKNEANPEIHRLTTGKEIVEQMGDQLDAFVAGVGTGGTITGAGKVLREAYPNIKIYAVEPADSPVLSGGKPGPHKIQGIGAGFVPDILDTSIYDGVITVTTEEAFAAARRAAREEGILGGISSGAAIHAALKVAKELGKGKKVLAIIPSNGERYLSTPLYQFED >3df7_A Putative ATP-grasp superfamily protein Archaeoglobus fulgidus non-cyclic SLKLFLFEFATCGERIEDSTAVEGLAMFKSAFDGFKNYYEITGFVRPEFSCLFTLPVDSMDSMEKYLEKSDAFLIIAPEDDFLLYTLTKKAEKYCENLGSSSRAIAVTSDKWELYKKLRGEVQVPQTSLRPLDCKFIIKPRTACAGEGIGFSDEVPDGHIAQEFIEGINLSVSLAVGEDVKCLSVNEQIINNFRYAGAVVPARISDEVKREVVEEAVRAVECVEGLNGYVGVDIVYSDQPYVIEINARLTTPVVAFSRAYGASVADLLAGGEVKHVRRQMVRKSKSAEKPYVSVGDYTLEIIDLD >2yv5_A YjeQ protein Aquifex aeolicus non-cyclic MGKKELKRGLVVDREAQMIGVYLFEDGKTYRGIPRGKVLKKTKIYAGDYVWGEVVDPNTFAIEEVEERKNLLIRPKVANVDRVIIVETLKMPEFNNYLLDNMLVVYEYFKVEPVIVFNKIDLLNEEEKKELERWISIYRDAGYDVLKVSAKTGEGIDELVDYLEGFICILAGPSGVGKSSILSRLTGEELRTQEVSEKTERGRHTTTGVRLIPFGKGSFVGDTPGFSKVEATMFVKPREVRNYFREFLRYQCKYPDCTHTNEPGCAVKEAVKNGEISCERYKSYLKIIKVYLEEIKELCRED >1r4p_A shiga-like toxin type II A subunit Escherichia coli non-cyclic REFTIDFSTQQSYVSSLNSIRTEISTPLEHISQGTTSVSVINHTPPGSYFAVDIRGLDVYQARFDHLRLIIEQNNLYVAGFVNTATNTFYRFSDFTHISVPGVTTVSMTTDSSYTTLQRVAALERSGMQISRHSLVSSYLALMEFSGNTMTRDASRAVLRFVTVTAEALRFRQIQREFRQALSETAPVYTMTPGDVDLTLNWGRISNVLPEYRGEDGVRVGRISFNNISAILGTVAVILNCHHQGARSVRAVNEESQPECQITGDRPVIKINNTLWESNTAAAFLNRKSQFLYTTGK >1p4k_A N(4)-(Beta-N-acetylglucosaminyl)-L-asparagina Elizabethkingia meningoseptica non-cyclic TTNKPIVLSTWNFGLHANVEAWKVLSKGGKALDAVEKGVRLVEDDPTERSVGYGGRPDRDGRVTLDACIMDENYNIGSVACMEHIKNPISVARAVMEKTPHVMLVGDGALEFALSQGFKKENLLTAESEKEWKEWLKTSQYKPIVNIENHNTIGMIALDAQGNLSGACTTSGMAYKMHGRVGDSPIIGAGLFVDNEIGAATATGHGEEVIRTVGTHLVVELMNQGRTPQQACKEAVERIVKIVNRRGKNLKDIQVGFIALNKKGEYGAYCIQDGFNFAVHDQKGNRLETPGFALK >1f5z_A N-ACETYLNEURAMINATE LYASE Haemophilus influenzae non-cyclic MRDLKGIFSALLVSFNEDGTINEKGLRQIIRHNIDKMKVDGLYVGGSTGENFMLSTEEKKEIFRIAKDEAKDQIALIAQVGSVNLKEAVELGKYATELGYDCLSAVTPFYYKFSFPEIKHYYDTIIAETGSNMIVYSIPFLTGVNMGIEQFGELYKNPKVLGVKFTAGDFYLLERLKKAYPNHLIWAGFDEMMLPAASLGVDGAIGSTFNVNGVRARQIFELTKAGKLKEALEIQHVTNDLIEGILANGLYLTIKELLKLEGVDAGYCREPMTSKATAEQVAKAKDLKAKFLS >1dhp_A DIHYDRODIPICOLINATE SYNTHASE Escherichia coli non-cyclic MFTGSIVAIVTPMDEKGNVCRASLKKLIDYHVASGTSAIVSVGTTGESATLNHDEHADVVMMTLDLADGRIPVIAGTGANATAEAISLTQRFNDSGIVGCLTVTPYYNRPSQEGLYQHFKAIAEHTDLPQILYNVPSRTGCDLLPETVGRLAKVKNIIGIKEATGNLTRVNQIKELVSDDFVLLSGDDASALDFMQLGGHGVISVTANVAARDMAQMCKLAAEGHFAEARVINQRLMPLHNKLFVEPNPIPVKWACKELGLVATDTLRLPMTPITDSGRETVRAALKHAGLL >3i0w_A 8-oxoguanine-DNA-glycosylase Clostridium acetobutylicum non-cyclic MDFDMIEEKKDSVIVRNVENFELKDIFDCGQCFRWHRQENGNYIGIAFEKVVEVQKIGEDVVIYNINEEEFKNVWSEYFDLYRDYGEIKKELSRDPLLKKSVDFGEGIRILRQDPFEILLSFIISANNRIPMIKKCINNISEKAGKKLEYKGKIYYAFPTVDKLHEFTEKDFEECTAGFRAKYLKDTVDRIYNGELNLEYIKSLNDNECHEELKKFMGVGPQVADCIMLFSMQKYSAFPVDTWVKKAMMSLYVAPDVSLKKIRDFGREKFGSLSGFAQQYLFYYARENNI >1wp4_A 3-hydroxyisobutyrate dehydrogenase Thermus thermophilus non-cyclic MEKVAFIGLGAMGYPMAGHLARRFPTLVWNRTFEKALRHQEEFGSEAVPLERVAEARVIFTCLPTTREVYEVAEALYPYLREGTYWVDATSGEPEASRRLAERLREKGVTYLDAPVSGGTSGAEAGTLTVMLGGPEEAVERVRPFLAYAKKVVHVGPVGAGHAVKAINNALLAVNLWAAGEGLLALVKQGVSAEKALEVINASSGRSNATENLIPQRVLTRAFPKTFALGLLVKDLGIAMGVLDGEKAPSPLLRLAREVYEMAKRELGPDADHVEALRLLERWGGVEIR >2jl1_A TRIPHENYLMETHANE REDUCTASE Citrobacter sp. my-5 non-cyclic FSIAVTGATGQLGGLVIQHLLKKVPASQIIAIVRNVEKASTLADQGVEVRHGDYNQPESLQKAFAGVSKLLFISGPHYDNTLLIVQHANVVKAARDAGVKHIAYTGYAFAEESIIPLAHVHLATEYAIRTTNIPYTFLRNALYTDFFVNEGLRASTESGAIVTNAGSGIVNSVTRNELALAAATVLTEEGHENKTYNLVSNQPWTFDELAQILSEVSGKKVVHQPVSFEEEKNFLVNAGVPEPFTEITAAIYDAISKGEASKTSDDLQKLIGSLTPLKETVKQALKM >2qnk_A 3-hydroxyanthranilate 3,4-dioxygenase Homo sapiens non-cyclic SERRLGVRAWVKENRGSFQPPVCNKLMHQEQLKVMFIGGPNTRKDYHIEEGEEVFYQLEGDMVLRVLEQGKHRDVVIRQGEIFLLPARVPHSPQRFANTVGLVVERRRLETELDGLRYYVGDTMDVLFEKWFYCKDLGTQLAPIIQEFFSSEQYRTGKPIPDQLLKEPPFPLSTRSIMEPMSLDAWLDSHHRELQAGTPLSLFGDTYETQVIAYGQGSSEGLRQNVDVWLWQLEGSSVVTMGGRRLSLAPDDSLLVLAGTSYAWERTQGSVALSVTQDPACKKPLG >1diz_A 3-METHYLADENINE DNA GLYCOSYLASE II Escherichia coli non-cyclic MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA >3fcx_A S-formylglutathione hydrolase Homo sapiens non-cyclic MALKQISSNKCFGGLQKVFEHDSVELNCKMKFAVYLPPKAETGKCPALYWLSGLTCTEQNFISKSGYHQSASEHGLVVIAPDTSPRGCNIKGEDESWDFGTGAGFYVDATEDPWKTNYRMYSYVTEELPQLINANFPVDPQRMSIFGHSMGGHGALICALKNPGKYKSVSAFAPICNPVLCPWGKKAFSGYLGTDQSKWKAYDATHLVKSYPGSQLDILIDQGKDDQFLLDGQLLPDNFIAACTEKKIPVVFRLQEDYDHSYYFIATFITDHIRHHAKYLNA >1dxr_L PHOTOSYNTHETIC REACTION CENTER L SUBUNIT Rhodopseudomonas viridis non-cyclic ALLSFERKYRVRGGTLIGGDLFDFWVGPYFVGFFGVSAIFFIFLGVSLIGYAASQGPTWDPFAISINPPDLKYGLGAAPLLEGGFWQAITVCALGAFISWMLREVEISRKLGIGWHVPLAFCVPIFMFCVLQVFRPLLLGSWGHAFPYGILSHLDWVNNFGYQYLNWFYNPGHMSSVSFLFVNAMALGLHGGLILSVANPGDGDKVKTAEHENQYFRDVVGYSIGALSIHRLGLFLASNIFLTGAFGTIASGPFWTRGWPEWWGWWLDIPFWS >1a7u_A CHLOROPEROXIDASE T Streptomyces aureofaciens non-cyclic PFITVGQENSTSIDLYYEDHGAGQPVVLIHGFPLSGHSWERQSAALLDAGYRVITYDRRGFGQSSQPTTGYDYDTFAADLNTVLETLDLQDAVLVGFSMGTGEVARYVSSYGTARIAKVAFLASLEPFLLKTDDNPDGAAPKEFFDGIVAAVKADRYAFYTGFFNDFYNLDENLGTRISEEAVRNSWNTAASGGFFAAAAAPTTWYTDFRADIPRIDVPALILHGTGDRTLPIENTARVFHKALPSAEYVEVEGAPHGLLWTHAEEVNTALLAFLAK >3aax_A Putative thiosulfate sulfurtransferase Mycobacterium tuberculosis non-cyclic MARCDVLVSADWAESNLHAPKVVFVEVDEDTSAYDRDHIAGAIKLDWRTDLQDPVKRDFVDAQQFSKLLSERGIANEDTVILYGGNNNWFAAYAYWYFKLYGHEKVKLLDGGRKKWELDGRPLSSDPVSRPVTSYTASPPDNTIRAFRDEVLAAINVKNLIDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSRAANEDGTFKSDEELAKLYADAGLDNSKETIAYCRIGERSSHTWFVLRELLGHQNVKNYDGSWTEYGSLVGAPIELGS >3hea_A Arylesterase Pseudomonas fluorescens non-cyclic STFVAKDGTQIYFKDWGSGKPVLFSHGWPLDADMWEYQMEYLSSRGYRTIAFDRRGFGRSDQPWTGNDYDTFADDIAQLIEHLDLKEVTLVGFSMGGGDVARYIARHGSARVAGLVLLGAVTPIFGQKPDYPQGVPLDVFARFKTELLKDRAQFISDFNAPFYGINKGQVVSQGVQTQTLQIALLASLKATVDCVTAFAETDFRPDMAKIDVPTLVIHGDGDQIVPFETTGKVAAELIKGAELKVYKDAPHGFAVTHAQQLNEDLLAFLKR >1ti2_B Pyrogallol hydroxytransferase small subunit Pelobacter acidigallici non-cyclic MEQYYMVIDVAKCQDCNNCFMGCMDEHELNEWPGYTASMQRGHRWMNIERRERGTYPRNDINYRPTPCMHCENAPCVAKGNGAVYQREDGIVLIDPEKAKGKKELLDTCPYGVMYWNEEENVAQKCTMCAHLLDDESWAPKMPRCAHNCGSFVYEFLKTTPEAMAKKVEEEGLEVIKPELGTKPRVYYKNLYRFEKNYVTAGILVQGDCFEGAKVVLKSGGKEVASAETNFFGEFKFDALDNGEYTVEIDADGKSYSDTVVIDDKSVDLGFIKL >2ggs_A 273aa long hypothetical dTDP-4-dehydrorhamnos Sulfolobus tokodaii non-cyclic MRTLITGASGQLGIELSRLLSERHEVIKVYNSSEIQGGYKLDLTDFPRLEDFIIKKRPDVIINAAAMTDVDKCEIEKEKAYKINAEAVRHIVRAGKVIDSYIVHISTDYVFDGEKGNYKEEDIPNPINYYGLSKLLGETFALQDDSLIIRTSGIFRNKGFPIYVYKTLKEGKTVFAFKGYYSPISARKLASAILELLELRKTGIIHVAGERISRFELALKIKEKFNLPGEVKEVDEVRGWIAKRPYDSSLDSSRARKILSTDFYTLDLDGMVV >1wtd_A EcoO109IR Escherichia coli non-cyclic MNKQEVILKVQECAAWWILERQSKLTKLMSETMSINPFMTPFIFDYHSLNDFDELVEAIIAKHLMTGHDTGFGKLIDEKILPRVFGAYKLDKSYRAANEPFIHPCFDEIDHVIQRDDGRIELLSLKAGKWTIQLTMAVQLNKAFHEIINNYPGVADNIVVGVFYGNSHGLTDKYRILRGINTGANHNVIDIRDKVHVYAGKEFWSWLNNGEAETQHWVLEGIERAVKEADIKEKNKDLIEKFKEHVAKKYNEQVLNADGTAQWHKLLEMINE >1e0c_A SULFURTRANSFERASE Azotobacter vinelandii non-cyclic MDDFASLPLVIEPADLQARLSAPELILVDLTSAARYAEGHIPGARFVDPKRTQLGQPPAPGLQPPREQLESLFGELGHRPEAVYVVYDDEGGGWAGRFIWLLDVIGQQRYHYLNGGLTAWLAEDRPLSRELPAPAGGPVALSLHDEPTASRDYLLGRLGAADLAIWDARSPQEYRGEKVLAAKGGHIPGAVNFEWTAAMDPSRALRIRTDIAGRLEELGITPDKEIVTHCQTHHRSGLTYLIAKALGYPRVKGYAGSWGEWGNHPDTPVEL >3p72_A Platelet glycoprotein Ib alpha chain Homo sapiens non-cyclic HPICEVSKVASHLEVNCDKRQLTALPPDLPKDTTILHLSENLLYTFSLATLMPYTRLTQLNLDRCELTKLQVDGTLPVLGTLDLSHNQLQSLPLLGQTLPALTVLDVSFNRLTSLPLGALRGLGELQELYLKGNELKTLPPGLLTPTPKLEKLSLANNQLTELPAGLLNGLENLDTLLLQENSLYTIPKGFFGSHLLPFAFLHGNPWLCNCEILYFRRWLQDNAENVYVWKQGVDVKAMTSNVASVQCDNSDKFPVYKYPGKGCPLVPR >1h2r_S PROTEIN (PERIPLASMIC [NIFE] HYDROGENASE SMALL Desulfovibrio vulgaris str. 'miyazaki non-cyclic LMGPRRPSVVYLHNAECTGCSESVLRAFEPYIDTLILDTLSLDYHETIMAAAGDAAEAALEQAVNSPHGFIAVVEGGIPTAANGIYGKVANHTMLDICSRILPKAQAVIAYGTCATFGGVQAAKPNPTGAKGVNDALKHLGVKAINIAGCPPNPYNLVGTIVYYLKNKAAPELDSLNRPTMFFGQTVHEQCPRLPHFDAGEFAPSFESEEARKGWCLYELGCKGPVTMNNCPKIKFNQTNWPVDAGHPCIGCSEPDFWDAMTPFYQN >1ee8_A MUTM (FPG) PROTEIN Thermus thermophilus non-cyclic PELPEVETTRRRLRPLVLGQTLRQVVHRDPARYRNTALAEGRRILEVDRRGKFLLFALEGGVELVAHLGMTGGFRLEPTPHTRAALVLEGRTLYFHDPRRFGRLFGVRRGDYREIPLLLRLGPEPLSEAFAFPGFFRGLKESARPLKALLLDQRLAAGVGNIYADEALFRARLSPFRPARSLTEEEARRLYRALREVLAEAVELGGSTLSDQSYRQPDGLPGGFQTRHAVYGREGLPCPACGRPVERRVVAGRGTHFCPTCQGEGP >1vce_A diphthine synthase Pyrococcus horikoshii non-cyclic MVLYFIGLGLYDERDITVKGLEIAKKCDYVFAEFYTSLMAGTTLGRIQKLIGKEIRVLSREDVELNFENIVLPLAKENDVAFLTPGDPLVATTHAELRIRAKRAGVESYVIHAPSIYSAVGITGLHIYKFGKSATVAYPEGNWFPTSYYDVIKENAERGLHTLLFLDIKAEKRMYMTANEAMELLLKVEDMKKGGVFTDDTLVVVLARAGSLNPTIRAGYVKDLIREDFGDPPHILIVPGKLHIVEAEYLVEIAGAPREILRVNV >3en3_A Glutamate receptor 4,Glutamate receptor Rattus norvegicus non-cyclic TVVVTTIMESPYVMYKKNHEMFEGNDKYEGYCVDLASEIAKHIGIKYKIAIVPDGKYGARDADTKIWNGMVGELVYGKAEIAIAPLTITLVREEVIDFSKPFMSLGISIMIKKGTPIESAEDLAKQTEIAYGTLDSGSTKEFFRRSKIAVYEKMWTYMRSAEPSVFTRTTAEGVARVRKSKGKFAFLLESTMNEYTEQRKPCDTMKVGGNLDSKGYGVATPKGSSLRTPVNLAVLKLSEAGVLDKLKNKWWYDKGEC >1lxa_A UDP N-ACETYLGLUCOSAMINE O-ACYLTRANSFERASE Escherichia coli non-cyclic MIDKSAFVHPTAIVEEGASIGANAHIGPFCIVGPHVEIGEGTVLKSHVVVNGHTKIGRDNEIYQFASIGEVNQDLKYAGEPTRVEIGDRNRIRESVTIHRGTVQGGGLTKVGSDNLLMINAHIAHDCTVGNRCILANNATLAGHVSVDDFAIIGGMTAVHQFCIIGAHVMVGGCSGVAQDVPPYVIAQGNHATPFGVNIEGLKRRGFSREAITAIRNAYKLIYRSGKTLDEVKPEIAELAETYPEVKAFTDFFARSTRGLIR >1ef8_A METHYLMALONYL COA DECARBOXYLASE Escherichia coli non-cyclic MSYQYVNVVTINKVAVIEFNYGRKLNALSKVFIDDLMQALSDLNRPEIRCIILRAPSGSKVFSAGHDIHELPSGGRDPLSYDDPLRQITRMIQKFPKPIISMVEGSVWGGAFEMIMSSDLIIAASTSTFSMTPVNLGVPYNLVGIHNLTRDAGFHIVKELIFTASPITAQRALAVGILNHVVEVEELEDFTLQMAHHISEKAPLAIAVIKEELRVLGEAHTMNSDEFERIQGMRRAVYDSEDYQEGMNAFLEKRKPNFVGH >2hl6_A Feruloyl esterase A Aspergillus niger non-cyclic ASTQGISEDLYNRLVEMATISQAAYADLCNIPSTIIKGEKIYNAQTDINGWILRDDTSKEIITVFRGTGSDTNLQLDTNYTLTPFDTLPQCNDCEVHGGYYIGWISVQDQVESLVKQQASQYPDYALTVTGHSLGASMAALTAAQLSATYDNVRLYTFGEPRSGNQAFASYMNDAFQVSSPETTQYFRVTHSNDGIPNLPPAEQGYAHGGVEYWSVDPYSAQNTFVCTGDEVQCCEAQGGQGVNDAHTTYFGMTSGACTW >1wmb_A D(-)-3-hydroxybutyrate dehydrogenase Pseudomonas fragi non-cyclic MLKGKVAVVTGSTSGIGLGIATALAAQGADIVLNGFGDAAEIEKVRAGLAAQHGVKVLYDGADLSKGEAVRGLVDNAVRQMGRIDILVNNAGIQHTALIEDFPTEKWDAILALNLSAVFHGTAAALPHMKKQGFGRIINIASAHGLVASANKSAYVAAKHGVVGFTKVTALETAGQGITANAICPGWVRTPLVEKQISALAEKNGVDQETAARELLSEKQPSLQFVTPEQLGGTAVFLASDAAAQITGTTVSVDGGWTAR >1qgi_A PROTEIN (CHITOSANASE) Bacillus circulans non-cyclic ASPDDNFSPETLQFLRNNTGLDGEQWNNIMKLINKPEQDDLNWIKYYGYCEDIEDERGYTIGLFGATTGGSRDTHPDGPDLFKAYDAAKGASNPSADGALKRLGINGKMKGSILEIKDSEKVFCGKIKKLQNDAAWRKAMWETFYNVYIRYSVEQARQRGFTSAVTIGSFVDTALNQGATGGSDTLQGLLARSGSSSNEKTFMKNFHAKRTLVVDTNKYNKPPNGKNRVKQWDTLVDMGKMNLKNVDSEIAQVTDWEMK >3g7n_A Lipase Penicillium expansum non-cyclic ATADAAAFPDLHRAAKLSSAAYTGCIGKAFDVTIVKRIYDLVTDTNGFVGYSTEKKTIAVIMRGSTTITDFVNDIDIALITPELSGVTFPSDVKIMRGVHRPWSAVHDTIITEVKALIAKYPDYTLEAVGHSLGGALTSIAHVALAQNFPDKSLVSNALNAFPIGNQAWADFGTAQAGTFNRGNNVLDGVPNMYSSPLVNFKHYGTEYYSSGTEASTVKCEGQRDKSCSAGNGMYAVTPGHIASFGVVMLTAGCGYLS >1geg_A ACETOIN REDUCTASE Klebsiella pneumoniae non-cyclic MKKVALVTGAGQGIGKAIALRLVKDGFAVAIADYNDATAKAVASEINQAGGHAVAVKVDVSDRDQVFAAVEQARKTLGGFDVIVNNAGVAPSTPIESITPEIVDKVYNINVKGVIWGIQAAVEAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVRGLTQTAARDLAPLGITVNGYCPGIVKTPMWAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPDSDYMTGQSLLIDGGMVFN >1mg5_A alcohol dehydrogenase Drosophila melanogaster non-cyclic SFTLTNKNVIFVAGLGGIGLDTSKELLKRDLKNLVILDRIENPAAIAELKAINPKVTVTFYPYDVTVPIAETTKLLKTIFAQLKTVDVLINGAGILDDHQIERTIAVNYTGLVNTTTAILDFWDKRKGGPGGIICNIGSVTGFNAIYQVPVYSGTKAAVVNFTSSLAKLAPITGVTAYTVNPGITRTTLVHKFNSWLDVEPQVAEKLLAHPTQPSLACAENFVKAIELNQNGAIWKLDLGTLEAIQWTKHWDSGI >3bf7_A Esterase YbfF Escherichia coli non-cyclic MKLNIRAQTAQNQHNNSPIVLVHGLFGSLDNLGVLARDLVNDHNIIQVDVRNHGLSPREPVMNYPAMAQDLVDTLDALQIDKATFIGHSMGGKAVMALTALAPDRIDKLVAIDIAPVDYHVRRHDEIFAAINAVSESDAQTRQQAAAIMRQHLNEEGVIQFLLKSFVDGEWRFNVPVLWDQYPHIVGWEKIPAWDHPALFIPGGNSPYVSEQYRDDLLAQFPQARAHVIAGAGHWVHAEKPDAVLRAIRRYLNDH >1ufk_A TT0836 protein Thermus thermophilus non-cyclic MWVYRLKGTLEALDPILPGLFDGGARGLWEREGEVWAFFPAPVDLPYEGVWEEVGDEDWLEAWRRDLKPALAPPFVVLAPWHTWEGAEIPLVIEPGMAFGTGHHETTRLALKALARHLRPGDKVLDLGTGSGVLAIAAEKLGGKALGVDIDPMVLPQAEANAKRNGVRPRFLEGSLEAALPFGPFDLLVANLYAELHAALAPRYREALVPGGRALLTGILKDRAPLVREAMAGAGFRPLEEAAEGEWVLLAYGR >1xm8_A glyoxalase II Arabidopsis thaliana non-cyclic MQIELVPCLKDNYAYILHDEDTGTVGVVDPSEAEPIIDSLKRSGRNLTYILNTHHHYDHTGGNLELKDRYGAKVIGSAMDKDRIPGIDMALKDGDKWMFAGHEVHVMDTPGHTKGHISLYFPGSRAIFTGDTMFSLSCGKLFEGTPKQMLASLQKITSLPDDTSIYCGHEYTLSNSKFALSLEPNNEVLQSYAAHVAELRSKKLPTIPTTVKMEKACNPFLRSSNTDIRRALRIPEAADEAEALGIIRKAKDDF >1hxh_A 3BETA/17BETA-HYDROXYSTEROID DEHYDROGENASE Comamonas testosteroni non-cyclic TNRLQGKVALVTGGASGVGLEVVKLLLGEGAKVAFSDINEAAGQQLAAELGERSMFVRHDVSSEADWTLVMAAVQRRLGTLNVLVNNAGILLPGDMETGRLEDFSRLLKINTESVFIGCQQGIAAMKETGGSIINMASVSSWLPIEQYAGYSASKAAVSALTRAAALSCRKQGYAIRVNSIHPDGIYTPMMQASLPKGVSKEMVLHDPKLNRAGRAYMPERIAQLVLFLASDESSVMSGSELHADNSILGMGL >1gq9_A 3-DEOXY-MANNO-OCTULOSONATE CYTIDYLYLTRANSFERA Escherichia coli non-cyclic SKAVIVIPARYGSSRLPGKPLLDIVGKPMIQHVYERALQVAGVAEVWVATDDPRVEQAVQAFGGKAIMTRNDHESGTDRLVEVMHKVEADIYINLQGDEPMIRPRDVETLLQGMRDDPALPVATLCHAISAAEAAEPSTVKVVVNTRQDALYFSRSPIPYPRNAEKARYLKHVGIYAYRRDVLQNYSQLPESMPEQAESLEQLRLMNAGINIRTFEVAATGPGVDTPACLEKVRALMAQELAENA >1nxq_A R-alcohol dehydrogenase Lactobacillus brevis non-cyclic SNRLDGKVAIITGGTLGIGLAIATKFVEEGAKVMITGRHSDVGEKAAKSVGTPDQIQFFQHDSSDEDGWTKLFDATEKAFGPVSTLVNNAGIAVNKSVEETTTAEWRKLLAVNLDGVFFGTRLGIQRMKNKGLGASIINMSSIEGFVGDPSLGAYNASKGAVRIMSKSAALDCALKDYDVRVNTVHPGYIKTPLVDDLPGAEEAMSQRTKTPMGHIGEPNDIAYICVYLASNESKFATGSEFVVDGGYTAQ >2nxv_A ATP synthase subunits region ORF 6 Rhodobacter blasticus non-cyclic MKPVPTYVQDKDESTLMFSVCSLVRDQAKYDRLLESFERFGFTPDKAEFLAADNREGNQFHGFSWHKQMLPRCKGRYVIFCHEDVELVDRGYDDLVAAIEALEEADPKWLVAGVAGSPWRPLNHSVTAQALHISDVFGNDRRRGNVPCRVESLDECFLLMRRLKPVLNSYDMQGFHYYGADLCLQAEFLGGRAYAIDFHLHHYGRAIADENFHRLRQEMAQKYRRWFPGRILHCVTGRVALGGGWYEAR >1ilv_A STATIONARY-PHASE SURVIVAL PROTEIN SURE HOMOLO Thermotoga maritima non-cyclic MRILVTNDDGIQSKGIIVLAELLSEEHEVFVVAPDKERSATGHSITIHVPLWMKKVFISERVVAYSTTGTPADCVKLAYNVVMDKRVDLIVSGVNRGPNMGMDILHSGTVSGAMEGAMMNIPSIAISSANYESPDFEGAARFLIDFLKEFDFSLLDPFTMLNINVPAGEIKGWRFTRQSRRRWNDYFEERVSPFGEKYYWMMGEVIEDDDRDDVDYKAVREGYVSITPIHPFLTNEQCLKKLREVYD >1jp5_A single-chain Fv fragment 1696 Mus musculus non-cyclic DILMTQTPLYLPVSLGDQASISCRSSQTIVHNNGNTYLEWYLQKPGQSPQLLIYKVSNRFSGVPDRFSGSGSGTDFTLKISRVEAEDLGIYYCFQGSHFPPTFGGGTKLEIKGGGGSGGGGSGGGGSEVQLQQSGPELKKPGETVKISCKATNYAFTDYSMHWVKQAPGGDLKYVGWINTETDEPTFADDFKGRFAFSLDTSTSTAFLQINNLKNEDTATYFCVRDRHDYGEIFTYWGQGTTVTVSS >2ag5_A dehydrogenase/reductase (SDR family) member 6 Homo sapiens non-cyclic MGRLDGKVIILTAAAQGIGQAAALAFAREGAKVIATDINESKLQELEKYPGIQTRVLDVTKKKQIDQFANEVERLDVLFNVAGFVHHGTVLDCEEKDWDFSMNLNVRSMYLMIKAFLPKMLAQKSGNIINMSSVASSVKGVVNRCVYSTTKAAVIGLTKSVAADFIQQGIRCNCVCPGTVDTPSLQERIQARGNPEEARNDFLKRQKTGRFATAEEIAMLCVYLASDESAYVTGNPVIIDGGWSLG >3lyf_A Nucleocapsid protein Rift valley fever virus non-cyclic MDNYQELAIQFAAQAVDRNEIEQWVREFAYQGFDARRVIELLKQYGGADWEKDAKKMIVLALTRGNKPRRMMMKMSKEGKATVEALINKYKLKEGNPSRDELTLSRVAAALAGRTCQALVVLSEWLPVTGTTMDGLSPAYPRHMMHPSFAGMVDPSLPGDYLRAILDAHSLYLLQFSRVINPNLRGRTKEEVAATFTQPMNAAVNSNFISHEKRREFLKAFGLVDSNGKPSAAVMAAAQAYKTAA >1cns_A CHITINASE Hordeum vulgare non-cyclic SVSSIVSRAQFDRMLLHRNDGACQAKGFYTYDAFVAAAAAFSGFGTTGSADVQKREVAAFLAQTSHETTGGWATAPDGAFAWGYCFKQERGASSDYCTPSAQWPCAPGKRYYGRGPIQLSHNYNYGPAGRAIGVDLLANPDLVATDATVSFKTAMWFWMTAQPPKPSSHAVIVGQWSPSGADRAAGRVPGFGVITNIINGGIECGHGQDSRVADRIGFYKRYCDILGVGYGNNLDCYSQRPFA >1uay_A Type II 3-hydroxyacyl-CoA dehydrogenase Thermus thermophilus non-cyclic MERSALVTGGASGLGRAAALALKARGYRVVVLDLRREGEDLIYVEGDVTREEDVRRAVARAQEEAPLFAVVSAAGVGLAEKILGKEGPHGLESFRRVLEVNLLGTFNVLRLAAWAMRENPPDAEGQRGVIVNTASVAAFEGQIGQAAYAASKGGVVALTLPAARELAGWGIRVVTVAPGLFDTPLLQGLPEKAKASLAAQVPFPPRLGRPEEYAALVLHILENPMLNGEVVRLDGALRMAPR >1c1m_A PROTEIN (PORCINE ELASTASE) Sus scrofa non-cyclic VVGGTEAQRNSWPSQISLQYRSGSSWAHTCGGTLIRQNWVMTAAHCVDRELTFRVVVGEHNLNQNDGTEQYVGVQKIVVHPYWNTDDVAAGYDIALLRLAQSVTLNSYVQLGVLPRAGTILANNSPCYITGWGLTRTNGQLAQTLQQAYLPTVDYAICSSSSYWGSTVKNSMVCAGGDGVRSGCQGDSGGPLHCLVNGQYAVHGVTSFVSRLGCNVTRKPTVFTRVSAYISWINNVIASN >1bxt_A PROTEIN (STREPTOCOCCAL SUPERANTIGEN) Streptococcus pyogenes non-cyclic SSQPDPTPEQLNKSSQFTGVMGNLRCLYDNHFVEGTNVRSTGQLLQHDLIFPIKDLKLKNYDSVKTEFNSKDLATKYKNKDVDIFGSNYYYNCYYSEGNSCKNAKKTCMYGGVTEHHRNQIEGKFPNITVKVYEDNENILSFDITTNKKQVTVQELDCKTRKILVSRKNLYEFNNSPYETGYIKFIESSGDSFWYDMMPAPGAIFDQSKYLMLYNDNKTVSSSAIAIEVHLTKK >1x1e_A 2-deoxy-D-gluconate 3-dehydrogenase Thermus thermophilus non-cyclic MERKALVTGGSRGIGRAIAEALVARGYRVAIASRNPEEAAQSLGAVPLPTDLEKDDPKGLVKRALEALGGLHVLVHAAAVNVRKPALELSYEEWRRVLYLHLDVAFLLAQAAAPHMAEAGWGRVLFIGSVTTFTAGGPVPIPAYTTAKTALLGLTRALAKEWARLGIRVNLLCPGYVETEFTLPLRQNPELYEPITARIPMGRWARPEEIARVAAVLCGDEAEYLTGQAVAVDGGFLAY >1lp9_F T-cell Receptor beta chain Homo sapiens non-cyclic MEAAVTQSPRSKVAVTGGKVTLSCHQTNNHDYMYWYRQDTGHGLRLIHYSYVADSTEKGDIPDGYKASRPSQENFSLILELASLSQTAVYFCASSDWVSYEQYFGPGTRLTVLEDLRNVTPPKVSLFEPSKAEIANKQKATLVCLARGFFPDHVELSWWVNGKEVHSGVSTDPQAYKESNYSYALSSRLRVSATFWHNPRNHFRCQVQFHGLSEEDKWPEGSPKPVTQNISAEAWGRA >2ej9_A Putative biotin ligase Methanocaldococcus jannaschii non-cyclic MEIIHLSEIDSTNDYAKELAKEGKRNFIVLADKQNNGKGRWGRVWYSDEGGLYFSMVLDSKLYNPKVINLLVPICIIEVLKNYVDKELGLKFPNDIMVKVNDNYKKLGGILTELTDDYMIIGIGINVNNQIRNEIREIAISLKEITGKELDKVEILSNFLKTFESYLEKLKNKEIDDYEILKKYKKYSITIGKQVKILLSNNEIITGKVYDIDFDGIVLGTEKGIERIPSGICIHVR >1wnl_A biotin--[acetyl-CoA-carboxylase] ligase Pyrococcus horikoshii non-cyclic MLGLKTSIIGRRVIYFQEITSTNEFAKTSYLEEGTVIVADKQTMGHGRLNRKWESPEGGLWLSIVLSPKVPQKDLPKIVFLGAVGVVETLKEFSIDGRIKWPNDVLVNYKKIAGVLVEGKGDKIVLGIGLNVNNKVPNGATSMKLELGSEVPLLSVFRSLITNLDRLYLNFLKNPMDILNLVRDNMILGVRVKILGDGSFEGIAEDIDDFGRLIIRLDSGEVKKVIYGDVSLRFL >3h7t_A Group 3 allergen SMIPP-S YvT004A06 Sarcoptes scabiei type hominis non-cyclic IIGGKKSDITKEPWAVGVLVDEKPFCGGSILTANFVITAAQCVDGTKPSDISIHYGSSYRTTKGTSVMAKKIYIVRYHPLTMQNNYAVIETEMPIKLDDKTTKKIELPSLLYDPEPDTSVLVSGWGSTNFKSLEYSGDLMEANFTVVDRKSCEEQYKQIEADKYIYDGVFCAGGEYDETYIGYGDAGDPAVQNGTLVGVASYISSMPSEFPSVFLRVGYYVLDIKDIISGKVKPQ >1y6i_A Mg-chelatase cofactor GUN4 Synechocystis sp. non-cyclic MSDNLTELSQQLHDASEKKQLTAIAALAEMGEGGQGILLDYLAKNVPLEKPVLAVGNVYQTLRNLEQETITTQLQRNYPTGIFPLQSAQGIDYLPLQEALGSQDFETADEITRDKLCELAGPGASQRQWLYFTEVEKFPALDLHTINALWWLHSNGNFGFSVQRRLWLASGKEFTKLWPKIGWKSGNVWTRWPKGFTWDLSAPQGHLPLLNQLRGVRVAESLYRHPVWSQYGW >3nor_A ThiJ/PfpI family protein Pseudomonas fluorescens non-cyclic GSHMAVQIGFLLFPEVQQLDLTGPHDVLASLPDVQVHLIWKEPGPVVASSGLVLQATTSFADCPPLDVICIPGGTGVGALMEDPQALAFIRQQAARARYVTSVCSGSLVLGAAGLLQGKRATTHWAYHELLAPLGAIPVHERVVRDGNLLTGGGITAGIDFALTLAAELFDAATAQRVQLQLEYAPAPPFNAGSPDTAPASVVQQARQRAADSLHKRREITLRAAARLAAG >1fx7_A IRON-DEPENDENT REPRESSOR IDER Mycobacterium tuberculosis non-cyclic MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHRLAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLDELGVGPEPGADDANLVRLTELPAGSPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV >1qr2_A PROTEIN (QUINONE REDUCTASE TYPE 2) Homo sapiens non-cyclic AGKKVLIVYAHQEPKSFNGSLKNVAVDELSRQGCTVTVSDLYAMNFEPRATDKDITGTLSNPEVFNYGVETHEAYKQRSLASDITDEQKKVREADLVIFQFPLYWFSVPAILKGWMDRVLCQGFAFDIPGFYDSGLLQGKLALLSVTTGGTAEMYTKTGVNGDSRYFLWPLQHGTLHFCGFKVLAPQISFAPEIASEEERKGMVAAWSQRLQTIWKEEPIPCTAHWHFGQ >2o02_A 14-3-3 protein zeta/delta Homo sapiens non-cyclic MDKNELVQKAKLAEQAERYDDMAACMKSVTEQGAELSNEERNLLSVAYKNVVGARRSSWRVVSSIEQKTEGAEKKQQMAREYREKIETELRDICNDVLSLLEKFLIPNASQAESKVFYLKMKGDYYRYLAEVAAGDDKKGIVDQSQQAYQEAFEISKKEMQPTHPIRLGLALNFSVFYYEILNSPEKACSLAKTAFDEAIAELDTLSEESYKDSTLIMQLLRDNLTLWTS >1xss_A fluorescent protein Favia favus non-cyclic VSVITSEMKMELRMEGAVNGHKFVITGKGSGQPFEGIQNMDLTVIEGGPLPFAFDILTTVFDYGNRVFVKYPEEIVDYFKQSFPEGYSWERSMSYEDGGICLATNNITMKKDGSNCFVYEIRFDGVNFPANGPVMQRKTVKWEPSTEKMYVRDGVLKGDVNMALLLQGGGHYRCDFRTTYKAKKVVQLPDYHFVDHRIEITSHDKDYNKVKLYEHAKAHSGLPRLAK >1fy3_A HEPARIN-BINDING PROTEIN Homo sapiens non-cyclic IVGGRKARPRQFPFLASIQNQGRHFCGGALIHARFVMTAASCFQSQNPGVSTVVLGAYDLRRRERQSRQTFSISSMSENGYDPQQNLNDLMLLQLDREANLTSSVTILPLPLQNATVEAGTRCQVAGWGSQRSGGRLSRFPRFVNVTVTPEDQCRPNNVCTGVLTRRGGICNGDQGTPLVCEGLAHGVASFSLGPCGRGPDFFTRVALFRDWIDGVLNNPGPGPA >1kqj_A A/G-SPECIFIC ADENINE GLYCOSYLASE Escherichia coli non-cyclic MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKHSLCPLQNGCIAAANNSWALYPGKKPK >1tje_A Threonyl-tRNA synthetase Escherichia coli non-cyclic MPVITLPDGSQRHYDHAVSPMDVALDIGPGLAKACIAGRVNGELVDACDLIENDAQLSIITAKDEEGLEIIRHSCAHLLGHAIKQLWPHTKMAIGPVIDNGFYYDVDLDRTLTQEDVEALEKRMHELAEKNYDVIKKKVSWHEARETFANRGESYKVSILDENIAHDDKPGLYFHEEYVDMCRGPHVPNMRFCHHFKLMKTAGAYWRGDSNNKMLQRIYGTAWA >1bol_A PROTEIN (RIBONUCLEASE RH) Rhizopus niveus non-cyclic SSCSSTALSCSNSANSDTCCSPEYGLVVLNMQWAPGYGPDNAFTLHGLWPDKCSGAYAPSGGCDSNRASSSIASVIKSKDSSLYNSMLTYWPSNQGNNNVFWSHEWSKHGTCVSTYDPDCYDNYEEGEDIVDYFQKAMDLRSQYNVYKAFSSNGITPGGTYTATEMQSAIESYFGAKAKIDCSSGTLSDVALYFYVRGRDTYVITDALSTGSCSGDVEYPTK >1c08_B ANTI-HEN EGG WHITE LYSOZYME ANTIBODY (HYHEL-1 Mus musculus non-cyclic DVQLQESGPSLVKPSQTLSLTCSVTGDSITSDYWSWIRKFPGNRLEYMGYVSYSGSTYYNPSLKSRISITRDTSKNQYYLDLNSVTTEDTATYYCANWDGDYWGQGTLVTVSAA >1rc9_A cysteine-rich secretory protein Viridovipera stejnegeri non-cyclic NVDFDSESPRKPEIQNEIVDLHNSLRRSVNPTASNMLRMEWYPEAADNAERWAYRCIESHSSYESRVIEGIKCGENIYMSPYPMKWTDIIHAWHDEYKDFKYGVGADPPNAVTGHYTQIVWYKSYRIGCAAAYCPSSPYSYFFVCQYCPAGNFIGKTATPYTSGTPCGDCPSDCDNGLCTNPCTRENKFTNCNTMVQQSSCQDNYMKTNCPASCFCQNKII >1a7q_L IGG1-KAPPA D1.3 FV (LIGHT CHAIN) Mus musculus non-cyclic DIVLTQSPASLSASVGETVTITCRAGGNTHNYLAWYQQKQGKSPQLLVYYTTTLAAGVPSRFSGSGSGTQYSLKINSLQPDDFGSYYCQHFWSTPRSFGGGTKLEI >2vo4_A 2,4-D INDUCIBLE GLUTATHIONE S-TRANSFERASE Glycine max non-cyclic MQDEVVLLDFWPSPFGMRVRIALAEKGIKYEYKEEDLRNKSPLLLQMNPVHKKIPVLIHNGKPICESLIAVQYIEEVWNDRNPLLPSDPYQRAQTRFWADYVDKKIYDLGRKIWTSKGEEKEAAKKEFIEALKLLEEQLGDKTYFGGDNLGFVDIALVPFYTWFKAYETFGTLNIESECPKFIAWAKRCLQKESVAKSLPDQQKVYEFIMDLRKKLGIE >1yal_A CHYMOPAPAIN Carica papaya non-cyclic YPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFKGFA >2h6m_A Picornain 3C Hepatitis a virus non-cyclic STLEIAGLVRKNLVQFGVGEKNGSVRWVMNALGVKDDWLLVPSHAYKFEKDYEMMEFYFNRGGTYYSISAGNVVIQSLDVGFQDVVLMKVPTIPKFRDITQHFIKKGDVPRALNRLATLVTTVNGTPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAWRGKGEGLPGMCGGALVSSNQSIQNAILGIHVAGGNSILVAKLVTQEMFQNI >3crt_A Glutathione S-transferase class-mu 26 kDa iso Schistosoma japonicum non-cyclic MSPILGYWKIKGLVQPTRLLLEYLEEKYEEHLYERDEGDKWRNKKFELGCEFPNLPYYIDGDVKLTQSMAIIRYIADKHNMLGGCPKERAEISMLEGAVLDIRYGVSRIAYSKDFETLKVDFLSKLPEMLKMFEDRLCHKTYLNGDHVTHPDFMLYDALDVVLYMDPMCLDAFPKLVCFKKRIEAIPQIDKYLKSSKYIAWPLQGWQATFGGGD >1bqu_A PROTEIN (GP130) Homo sapiens non-cyclic PGSSGLPPEKPKNLSCIVNEGKKMRCEWDGGRETHLETNFTLKSEWATHKFADCKAKRDTPTSCTVDYSTVYFVNIEVWVEAENALGKVTSDHINFDPVYKVKPNPPHNLSVINSEELSSILKLTWTNPSIKSVIILKYNIQYRTKDASTWSQIPPEDTASTRSSFTVQDLKPFTEYVFRIRCMKEDGKGYWSDWSEEASGITYEDRPSKEPSFW >1sur_A PAPS REDUCTASE Escherichia coli non-cyclic SKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTH >1amw_A HEAT SHOCK PROTEIN 90 Saccharomyces cerevisiae non-cyclic MASETFEFQAEITQLMSLIINTVYSNKEIFLRELISNASDALDKIRYKSLSDPKQLETEPDLFIRITPKPEQKVLEIRDSGIGMTKAELINNLGTIAKSGTKAFMEALSAGADVSMIGQFGVGFYSLFLVADRVQVISKSNDDEQYIWESNAGGSFTVTLDEVNERIGRGTILRLFLKDDQLEYLEEKRIKEVIKRHSEFVAYPIQLVVTKEVE >1io2_A RIBONUCLEASE HII Thermococcus kodakarensis non-cyclic MKIAGIDEAGRGPVIGPMVIAAVVVDENSLPKLEELKVRDSKKLTPKRREKLFNEILGVLDDYVILELPPDVIGSREGTLNEFEVENFAKALNSLKVKPDVIYADAADVDEERFARELGERLNFEAEVVAKHKADDIFPVVSAASILAKVTRDRAVEKLKEEYGEIGSGYPSDPRTRAFLENYYREHGEFPPIVRKGWKTLKKIAEKVESEKK >1lbu_A MURAMOYL-PENTAPEPTIDE CARBOXYPEPTIDASE Streptomyces albus non-cyclic DGCYTWSGTLSEGSSGEAVRQLQIRVAGYPGTGAQLAIDGQFGPATKAAVQRFQSAYGLAADGIAGPATFNKIYQLQDDDCTPVNFTYAELNRCNSDWSGGKVSAATARANALVTMWKLQAMRHAMGDKPITVNGGFRSVTCNSNVGGASNSRHMYGHAADLGAGSQGFCALAQAARNHGFTEILGPGYPGHNDHTHVAGGDGRFWSAPSCGI >2o72_A Epithelial-cadherin Homo sapiens non-cyclic DWVIPPISSPENEKGPFPKNLVQIKSNKDKEGKVFYSITGQGADTPPVGVFIIERETGWLKVTEPLDRERIATYTLFSHAVSSNGNAVEDPMEILITVTDQNDNKPEFTQEVFKGSVMEGALPGTSVMEVTATDADDDVNTYNAAIAYTILSQDPELPDKNMFTINRNTGVISVVTTGLDRESFPTYTLVVQAADLQGEGLSTTATAVITVTD >1jax_A conserved hypothetical protein Archaeoglobus fulgidus non-cyclic MRVALLGGTGNLGKGLALRLATLGHEIVVGSRREEKAEAKAAEYRRIAGDASITGMKNEDAAEACDIAVLTIPWEHAIDTARDLKNILREKIVVSPLVPVSRGAKGFTYSSERSAAEIVAEVLESEKVVSALHTIPAARFANLDEKFDWDVPVCGDDDESKKVVMSLISEIDGLRPLDAGPLSNSRLVESLTPLILNIMRFNGMGELGIKFL >3b5g_A AMYLOID LAMBDA 6 LIGHT CHAIN VARIABLE REGION Homo sapiens non-cyclic NFMLTQSHSVSESPGKTVTISCTRSSGSIASNYVQWYQQRPGSSPTTVIYEDNQRPSGVPDRFSGSIDSSSNSASLTISGLKTEDEADYYCQSYDSSNHVVFGGGTKLTVL >1bvb_A CYTOCHROME C-554 Nitrosomonas europaea non-cyclic ADAPFEGRKKCSSCHKAQAQSWKDTAHAKAMESLKPNVKKEAKQKAKLDPAKDYTQDKDCVGCHVDGFGQKGGYTIESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQDFHFEERCSACHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVKAMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK >1z6j_T Tissue factor Homo sapiens non-cyclic SGTTNTVAAYNLTWKSTNFKTILEWEPKPVNQVYTVQISTKSGDWKSKCFYTTDTECDLTDEIVKDVKQTYLARVFSYPAGNVESTGSAGEPLYENSPEFTPYLETNLGQPTIQSFEQVGTKVNVTVEDERTLVRRNNTFLSLRDVFGKDLIYTLYYWKSSSSGKKTAKTNTNEFLIDVDKGENYCFSVQAVIPSRTVNRKSTDSPVECMG >1lbk_A Glutathione S-transferase class pi chimaera ( Homo sapiens non-cyclic PPYTVVYFPVRGRCAALRMLLADQGQSWKEEVVTVETWQEGSLKASCLYGQLPKFQDGDLTLYQSNTILRHLGRTLGLYGKDQQEAALVDMVNDGVEDLRCKYISLIYTNYEAGKDDYVKALPGQLKPFETLLSQNQGGKTFIVGDQISFADYNLLDLLLIHEVLAPGCLDAFPLLSAYVGRLSARPKLKAFLASPEYVNLPMDEKSL >1pn9_A Glutathione S-transferase 1-6 Anopheles gambiae non-cyclic MDFYYLPGSAPCRAVQMTAAAVGVELNLKLTDLMKGEHMKPEFLKLNPQHCIPTLVDNGFALWESRAIQIYLAEKYGKDDKLYPKDPQKRAVVNQRLYFDMGTLYQRFADYHYPQIFAKQPANPENEKKMKDAVGFLNTFLEGQEYAAGNDLTIADLSLAATIATYEVAGFDFAPYPNVAAWFARCKANAPGYALNQAGADEFKAKFLS >1tu7_A Glutathione S-transferase 2 Onchocerca volvulus non-cyclic MSYKLTYFSIRGLAEPIRLFLVDQDIKFIDDRIAKDDFSSIKSQFQFGQLPCLYDGDQQIVQSGAILRHLARKYNLNGENEMETTYIDMFCEGVRDLHVKYTRMIYMAYETEKDPYIKSILPGELAKFEKLLATRGNGRNLILGDKISYADYALFEELDVHQILDPHCLDKFPLLKVFHQRMKDRPKLKEYCEKRDAAKVPVNGNGKQ >1tw9_A glutathione S-transferase 2 Heligmosomoides polygyrus non-cyclic MVHYKLTYFNGRGAGECARQVFALADQKYEDVRLTQETFVPLKATFPFGQVPVLEVDGQQLAQSQAICRYLAKTFGFAGATPFESALIDSLADAYTDYRAEMKTYYYTALGFMTGDVDKPKTDVLLPARTKFLGFITKFLKKNSSGFLVGDKISWVDLLVAEHVADMTNRVPEYIEGFPEVKAHMERIQQTPRIKKWIETRPETPF >1bf8_A CHAPERONE PROTEIN FIMC Escherichia coli non-cyclic GVALGATRVIYPAGQKQEQLAVTNNDENSTYLIQSWVENADGVKDGRFIVTPPLFAMKGKKENTLRILDATNNQLPQDRESLFWMNVKAIPSMDKSKLTENTLQLAIISRIKLYYRPAKLALPPDQAAEKLRFRRSANSLTLINPTPYYLTVTELNAGTRVLENALVPPMGESTVKLPSDAGSNITYRTINDYGALTPKMTGVME >2w7s_A SERINE PROTEASE SPLA Staphylococcus aureus non-cyclic EKNVKEITDATKEPYNSVVAFVGGTGVVVGKNTIVTNKHIAKSNDIFKNRVSAHHSSKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFADGAKVKDRISVIGYPKGAQTKYKMFESTGTINHISGTFMEFDAYAQPGNSGSPVLNSKHELIGILYAGSGKDESEKNFGVYFTPQLKEFIQNNIEK >2bj0_A ACETYLCHOLINE-BINDING PROTEIN Bulinus truncatus non-cyclic QIRWTLLNQITGESDVIPLSNNTPLNVSLNFKLMNIVEADTEKDQVEVVLWTQASWKVPYYSSLLSSSSLDQVSLPVSKMWTPDLSFYNAIAAPELLSADRVVVSKDGSVIYVPSQRVRFTCDLINVDTEPGATCRIKVGSWTHDNKQFALITGEEGVVNIAEYFDSPKFDLLSATQSLNRKKYSCCENMYDDIEITFAFRKK >1f5j_A BETA-1,4-XYLANASE Dictyoglomus thermophilum non-cyclic ALTSNASGTFDGYYYELWKDTGNTTMTVYTQGRFSCQWSNINNALFRTGKKYNQNWQSLGTIRITYSATYNPNGNSYLCIYGWSTNPLVEFYIVESWGNWRPPGATSLGQVTIDGGTYDIYRTTRVNQPSIVGTATFDQYWSVRTSKRTSGTVTVTDHFRAWANRGLNLGTIDQITLCVEGYQSSGSANITQNTFSQSS >1nxm_A dTDP-6-deoxy-D-xylo-4-hexulose 3,5-epimerase Streptococcus suis non-cyclic MTENFFGKTLAARPVEAIPGMLEFDIPVHGDNRGWFKENFQKEKMLPLGFPESFFAEGKLQNNVSFSRKNVLRGLHAEPWDKYISVADGGKVLGTWVDLREGETFGNTYQTVIDASKSIFVPRGVANGFQVLSDFVAYSYLVNDYWALELKPKYAFVNYADPSLDIKWENLEEAEVSEADENHPFLKDVKPLRKEDL >1ioo_A SF11-RNASE Nicotiana alata non-cyclic DFEYLQLVLTWPASFCYANHCERIAPNNFTIHGLWPDNVKTRLHNCKPKPTYSYFTGKMLNDLDKHWMQLKFEQDYGRTEQPSWKYQYIKHGSCCQKRYNQNTYFGLALRLKDKFDLLRTLQTHRIIPGSSYTFQDIFDAIKTVSQENPDIKCAEVTKGTPELYEIGICFTPNADSMFRCPQSDTCDKTAKVLFRR >2ck2_A HUMAN FIBRONECTIN Homo sapiens non-cyclic VSDVPRDIEVVAVTPTSALISWDAPAVTIRYIRLTYGETGGNSPVQEITLPGSKSTYTISGLKPGTDYTVTLYSVTGRGDSPASSKPASINFRTEI >1pvx_A PROTEIN (ENDO-1,4-BETA-XYLANASE) Paecilomyces variotii non-cyclic GTTPNSEGWHDGYYYSWWSDGGGDSTYTNNSGGTYEITWGNGGNLVGGKGWNPGLNARAIHFTGVYQPNGTSYLSVYGWTRNPLVEYYIVENFGSSNPSSGSTDLGTVSCDGSTYTLGQSTRYNAPSIDGTQTFNQYWSVRQDKRSSGTVQTGCHFDAWASAGLNVTGDHYYQIVATEGYFSSGYARITVADVG >3lio_A iron superoxide dismutase Pseudoalteromonas haloplanktis non-cyclic AFELPSLPYAIDALEPHISKETLEFHHGKHHNTYVVKLNGLIPGTKFENKSLEEIVCSSDGGVFNNAAQIWNHTFYWNSLSPNGGGAPTGAVADAINAKWGSFDAFKEALNDKAVNNFGSSWTWLVKLADGSLDIVNTSNAATPLTDDGVTPILTVDLWEHAYYIDYRNVRPDYLKGFWSLVNWEFANANFA >2pth_A PEPTIDYL-TRNA HYDROLASE Escherichia coli non-cyclic TIKLIVGLANPGAEYAATRHNAGAWFVDLLAERLRAPLREEAKFFGYTSRVTLGGEDVRLLVPTTFMNLSGKAVAAMASFFRINPDEILVAHDELDLPPGVAKFKLGGGHGGHNGLKDIISKLGNNPNFHRLRIGIGHPGDKNKVVGFVLGKPPVSEQKLIDEAIDEAARCTEMWFTDGLTKATNRLHAFKAQ >1huw_A HUMAN GROWTH HORMONE Homo sapiens non-cyclic FPTIPLSRLADNAWLRADRLNQLAFDTYQEFEEAYIPKEQIHSFWWNPQTSLCPSESIPTPSNKEETQQKSNLELLRISLLLIQSWLEPVQFLRSVFANSLVYGASDSNVYDLLKDLEEGIQTLMGRLEDGSPRTGQIFKQTYSKFDTNSHNDDALLKNYGLLYCFNKDMSKVSTYLRTVQCRSVEGSCGF >1z6o_M Ferritin heavy chain Trichoplusia ni non-cyclic TQCNVNPVQIPKDWITMHRSCRNSMRQQIQMEVGASLQYLAMGAHFSKDVVNRPGFAQLFFDAASEEREHAMKLIEYLLMRGELTNDVSSLLQVRPPTRSSWKGGVEALEHALSMESDVTKSIRNVIKACEDDSEFNDYHLVDYLTGDFLEEQYKGQRDLAGKASTLKKLMDRHEALGEFIFDKKLLGIDV >1fbt_A FRUCTOSE-2,6-BISPHOSPHATASE Rattus norvegicus non-cyclic RSIYLCRHGESELNLRGRIGGDSGLSARGKQYAYALANFIRSQGISSLKVWTSHMKRTIQTAEALGVPYEQWKALNEIDAGVCEEMTYEEIQEHYPEEFALRDQDKYRYRYPKGESYEDLVQRLEPVIMELERQENVLVICHQAVMRCLLAYFLDKSSDELPYLKCPLHTVLKLTPVAYGCRVESIYLNV >2qtr_A Nicotinate (Nicotinamide) nucleotide adenylyl Bacillus anthracis non-cyclic MRKIGIIGGTFDPPHYGHLLIANEVYHALNLEEVWFLPNQIPPHKQGRNITSVESRLQMLELATEAEEHFSICLEELSRKGPSYTYDTMLQLTKKYPDVQFHFIIGGDMVEYLPKWYNIEALLDLVTFVGVARPGYKLRTPYPITTVEIPEFAVSSSLLRERYKEKKTCKYLLPEKVQVYIERNGLYES >1u4h_A Heme-based Methyl-accepting Chemotaxis Protei Thermoanaerobacter tengcongensis non-cyclic MKGTIVGTWIKTLRDLYGNDVVDESLKSVGWEPDRVITPLEDIDDDEVRRIFAKVSEKTGKNVNEIWREVGRQNIKTFSEWFPSYFAGRRLVNFLMMMDEVHLQLTKMIKGATPPRLIAKPVAKDAIEMEYVSKRKMYDYFLGLIEGSSKFFKEEISVEEVERGEKDGFSRLKVRIKFKNPVFEYKKN >1r8n_A Kunitz trypsin inhibitor Delonix regia non-cyclic SDAEKVYDIEGYPVFLGSEYYIVSAIIGAGGGGVRPGRTRGSMCPMSIIQEQSDLQMGLPVRFSSPEEKQGKIYTDTELEIEFVEKPDCAESSKWVIVKDSGEARVAIGGSEDHPQGELVRGFFKIEKLGSLAYKLVFCPKSDSGSCSDIGINYEGRRSLVLKSSDDVPFRVVFVKPRSGSETES >2ic7_A Maltose transacetylase Geobacillus kaustophilus non-cyclic MKSEKEKMLAGHLYNPADLELVKERERARRLVRLYNETLETEYDKRTGLLKELFGSTGERLFIEPNFRCDYGYNIHVGENFFMNFDGVILDVCEVRIGDHCFIGPGVHIYTATHPLDPHERNSGLEYGKPVVIGHNVWIGGRAVINPGVTIGDNAVIASGAVVTKDVPANAVVGGNPAKVIKWLK >1rtv_A dTDP-4-dehydrorhamnose 3,5-epimerase Pseudomonas aeruginosa non-cyclic SMAMKATRLAIPDVILFEPRVFGDDRGFFFESYNQRAFEEACGHPVSFVQDNHSRSARGVLRGLHYQIRQAQGKLVRATLGEVFDVAVDLRRGSPTFGQWVGERLSAENKRQMWIPAGFAHGFVVLSEYAEFLYKTTDFWAPEHERCIVWNDPELKIDWPLQDAPLLSEKDRQGKAFADADCFP >3bm1_A Protein ydjA Escherichia coli non-cyclic MDALELLINRRSASRLAEPAPTGEQLQNILRAGMRAPDHKSMQPWHFFVIEGEGRERFSAVLEQGAIAAGSDDKAIDKARNAPFRAPLIITVVAKCEENHKVPRWEQEMSAGCAVMAMQMAAVAQGFGGIWRSGALTESPVVREAFGCREQDKIVGFLYLGTPQLKASTSINVPDPTPFVTYF >2yvm_A MutT/nudix family protein Thermus thermophilus non-cyclic MSPWERILLEEILSEPVRLVKERVRTHTGRELTYVYRPGPVAASFVLPVTERGTALLVRQYRHPTGKFLLEVPAGKVDEGETPEAAARRELREEVGAEAETLIPLPSFHPQPSFTAVVFHPFLALKARVVTPPTLEEGELLESLELPLTEVYALLAKGEIQDASTALTLFYAEPHLKRLGLL >1lem_A LECTIN Lens culinaris non-cyclic TETTSFSITKFSPDQQNLIFQGDGYTTKGKLTLTKAVKSTVGRALYSTPIHIWDRDTGNVANFVTSFTFVIDAPSSYNVADGFTFFIAPVDTKPQTGGGYLGVFNSKEYDKTSQTVAVEFDTFYNAAWDPSNKERHIGIDVNSIKSVNTKSWNLQNGERANVVIAFNAATNVLTVTLTYPN >1i4u_A CRUSTACYANIN null non-cyclic DKIPDFVVPGKCASVDRNKLWAEQTPNRNSYAGVWYQFALTNNPYQLIEKCVRNEYSFDGKQFVIESTGIAYDGNLLKRNGKLYPNPFGEPHLSIDYENSFAAPLVILETDYSNYACLYSCIDYNFGYHSDFSFIFSRSANLADQYVKKCEAAFKNINVDTTRFVKTVQGSSCPYDTQKTL >1n71_A aac(6')-Ii Enterococcus faecium non-cyclic MIISEFDRNNPVLKDQLSDLLRLTWPEEYGDSSAEEVEEMMNPERIAVAAVDQDELVGFIGAIPQYGITGWELHPLVVESSRRKNQIGTRLVNYLEKEVASRGGITIYLGTDDLDHGTTLSQTDLYEHTFDKVASIQNLREHPYEFYEKLGYKIVGVLPNANGWDKPDIWMAKTIIPRPD >1cdy_A T-CELL SURFACE GLYCOPROTEIN CD4 Homo sapiens non-cyclic KKVVLGKKGDTVELTCTASQKKSIQFHWKNSNQIKILGNQGSFLTKSPSKLNDRADSRRSLWDQGNFPLIIKNLKIEDSDTYICEVEDQKEEVQLLVFGLTANSDTHLLQGQSLTLTLESPPGSSPSVQCRSPRGKNIQGGKTLSVSQLELQDSGTWTCTVLQNQKKVEFKIDIVVLA >1eb6_A NEUTRAL PROTEASE II Aspergillus oryzae non-cyclic TEVTDCKGDAESSLTTALSNAAKLANQAAEAAESGDESKFEEYFKTTDQQTRTTVAERLRAVAKEAGSTSGGSTTYHCNDPYGYCEPNVLAYTLPSKNEIANCDIYYSELPPLAQKCHAQDQATTTLHEFTHAPGVYQPGTEDLGYGYDAATQLSAQDALNNADSYALYANAIELKC >3h05_A uncharacterized protein VPA0413 Vibrio parahaemolyticus non-cyclic MKKIAIFGSAFNPPSLGHKSVIESLSHFDLVLLEPSIAHAWGKNMLDYPIRCKLVDAFIKDMGLSNVQRSDLEQALYQPGQSVTTYALLEKIQEIYPTADITFVIGPDNFFKFAKFYKAEEITERWTVMACPEKVKIRSTDIRNALIEGKDISTYTTPTVSELLLNEGLYRETLSGK >3g7x_A Female-specific histamine-binding protein 2 Rhipicephalus appendiculatus non-cyclic NQPDWADEAANGAHQDAWKSLKARVENVYYMVKATYKNDPVWGNDFTCVGVMANDVNEDEKSIQAEFLFMNNADTNMQFATEKVTAVKMYGYNRENAFRYETEDGQVFTDVIAYSDDNCDVIYVPGTDGNEEGYELWTTDYDNIPANCLNKFNEYAVGRETRDVFTSACLE >2wq9_A RETINOL-BINDING PROTEIN 4 Homo sapiens non-cyclic ERDCRVSSFRVKENFDKARFSGTWYAMAKKDPEGLFLQDNIVAEFSVDETGQMSATAKGRVRLLNNWDVCADMVGTFTDTEDPAKFKMKYWGVASFLQKGNDDHWIVDTDYDTYAVQYSCRLLNLDGTCADSYSFVFSRDPNGLPPEAQKIVRQRQEELCLARQYRLIVHNGYC >1fpo_A CHAPERONE PROTEIN HSCB Escherichia coli non-cyclic MDYFTLFGLPARYQLDTQALSLRFQDLQRQYHPDKFASGSQAEQLAAVQQSATINQAWQTLRHPLMRAEYLLSLHGFDLASEQHTVRDTAFLMEQLELREELDEIEQAKDEARLESFIKRVKKMFDTRHQLMVEQLDNETWDAAADTCRKLRFLDKLRSSAEQLEEKLLDF >1aa9_A C-HA-RAS Homo sapiens non-cyclic MTEYKLVVVGAGGVGKSALTIQLIQNHFVDEYDPTIEDSYRKQVVIDGETCLLDILDTAGQEEYSAMRDQYMRTGEGFLCVFAINNTKSFEDIHQYREQIKRVKDSDDVPMVLVGNKCDLAARTVESRQAQDLARSYGIPYIETSAKTRQGVEDAFYTLVREIRQHKLRKL >1f8a_B PEPTIDYL-PROLYL CIS-TRANS ISOMERASE NIMA-INTE Homo sapiens non-cyclic GSHGMADEEKLPPGWEKRMSRSSGRVYYFNHITNASQWERPSGNSSSGGKNGQGEPARVRCSHLLVKHSQSRRPSSWRQEKITRTKEEALELINGYIQKIKSGEEDFESLASQFSDCSSAKARGDLGAFSRGQMQKPFEDASFALRTGEMSGPVFTDSGIHIILRTE >3mmh_A Methionine-R-sulfoxide reductase Neisseria meningitidis non-cyclic MHALHFSASDKAALYREVLPQIESVVADETDWVANLANTAAVLKEAFGWFWVGFYLVDTRSDELVLAPFQGPLACTRIPFGRGVCGQAWAKGGTVVVGDVDAHPDHIACSSLSRSEIVVPLFSDGRCIGVLDADSEHLAQFDETDALYLGELAKILEKRFEASRQAV >1fw9_A CHORISMATE LYASE Escherichia coli non-cyclic SHPALTQLRALRYSKEIPALDPQLLDWLLLEDSMTKRFEQQGKTVSVTMIREGFVEQNEIPEELPLLPKESRYWLREILLSADGEPWLAGRTVVPVSTLSGPELALQKLGKTPLGRYLFTSSTLTRDFIEIGRDAGLWGRRSRLRLSGKPLLLTELFLPASPLY >2go2_A Kunitz-type serine protease inhibitor BbKI Bauhinia bauhinioides non-cyclic SSVVVDTNGQPVSNGADAYYLVPVSHGHAGLALAKIGNEAEPRAVVLDPHHRPGLPVRFESPLRINIIKESYFLNIKFGPSSSDSGVWDVIQQDPIGLAVKVTDTKSLLGPFKVEKEGEGYKIVYYPERGQTGLDIGLVHRNDKYYLAVKDGEPCVFKIRKAT >1t6s_A conserved hypothetical protein Chlorobium tepidum tls non-cyclic MQEQRQQLLRSLEALIFSSEEPVNLQTLSQITAHKFTPSELQEAVDELNRDYEATGRTFRIHAIAGGYRFLTEPEFADLVRQLLAPVIQRRLSRSMLEVLAVVAWHQPVTKGEIQQIRGASPDYSIDRLLARGLIEVRGRADSPGRPLQYGTTEVFLDLFHL >2d16_A hypothetical protein PH1918 Pyrococcus horikoshii non-cyclic MVRIEVIDIEKPEGVEVIIGQGNFSIFTVDDLARALLTAVPGIKFGIAMNEAKPQLTRYTGNDPELEALAAKNAVKIGAGHVFVILMKNAYPINVLNTIKNHPAVAMIYGASENPFQVIVAETELGRAVIGVVDGKAANKIETDEQKKERRELVEKIGYKID >1b0o_A BETA-LACTOGLOBULIN Bos taurus non-cyclic LIVTQTMKGLDIQKVAGTWYSLAMAASDISLLDAQSAPLRVYVEELKPTPEGDLEILLQKWENGECAQKKIIAEKTKIPAVFKIDALNENKVLVLDTDYKKYLLFCMENSAEPEQSLACQCLVRTPEVDDEALEKFDKALKALPMHIRLSFNPTQLEEQCHI >2i6i_A Sulfolobus solfataricus protein tyrosine phos Sulfolobus solfataricus non-cyclic MYWVRRKTIGGSGLPYTENEILEWRKEGVKRVLVLPEDWEIEESWGDKDYYLSILKKNGLQPLHIPIPDGGVPSDSQFLTIMKWLLSEKEGNLVHCVGGIGRTGTILASYLILTEGLEVESAIDEVRLVRPGAVQTYEQEMFLLRVEGMRKSWLKNIYSNS >1rya_A GDP-mannose mannosyl hydrolase Escherichia coli non-cyclic MMFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPGGRVQKDETLEAAFERLTMAELGLRLPITAGQFYGVWQHFYDDNFSGTDFTTHYVVLGFRFRVSEEELLLPDEQHDDYRWLTSDALLASDNVHANSRAYFLAEKRTGVPGL >1g85_A ODORANT-BINDING PROTEIN Bos taurus non-cyclic AQEEEAEQNLSELSGPWRTVYIGSTNPEKIQENGPFRTYFRELVFDDEKGTVDFYFSVKRDGKWKNVHVKATKQDDGTYVADYEGQNVFKIVSLSRTHLVAHNINVDKHGQTTELTGLFVKLNVEDEDLEKFWKLTEDKGIDKKNVVNFLENENHPHPE >1sjy_A MutT/nudix family protein Deinococcus radiodurans non-cyclic MEHDERTHVPVELRAAGVVLLNERGDILLVQEKGIPGHPEKAGLWHIPSGAVEDGENPQDAAVREACEETGLRVRPVKFLGAYLGRFPDGVLILRHVWLAEPEPGQTLAPAFTDEIAEASFVSREDFAQLYAAGQIRMYQTKLFYADALREKGFPALPV >1h0a_A EPSIN Rattus norvegicus non-cyclic MSTSSLRRQMKNIVHNYSEAEIKVREATSNDPWGPSSSLMSEIADLTYNVVAFSEIMSMIWKRLNDHGKNWRHVYKAMTLMEYLIKTGSERVSQQCKENMYAVQTLKDFQYVDRDGKDQGVNVREKAKQLVALLRDEDRLREERAHALKTKEKLAQTA >2fkz_A Bacterioferritin Azotobacter vinelandii non-cyclic MKGDKIVIQHLNKILGNELIAINQYFLHARMYEDWGLEKLGKHEYHESIDEMKHADKLIKRILFLEGLPNLQELGKLLIGEHTKEMLECDLKLEQAGLPDLKAAIAYCESVGDYASRELLEDILESEEDHIDWLETQLDLIDKIGLENYLQSQMD >1e7l_A RECOMBINATION ENDONUCLEASE VII Bacteriophage t4 non-cyclic MLLTGKLYKEEKQKFYDAQNGKCLICQRELNPDVQANHLDHDHELNGPKAGKVRGLLCNLCDAAEGQMKHKFNRSGLKGQGVDYLEWLENLLTYLKSDYTQNNIHPNFVGDKSKEFSRLGKEEMMAEMLQRGFEYNESDTKTQLIASFKKQLRKSLK >2iu7_A CYANATE HYDRATASE Escherichia coli non-cyclic MIQSQINRNIRLDLADAILLSKAKKDLSFAEIADGTGLAEAFVTAALLGQQALPADAARLVGAKLDLDEDSILLLQMIPLRGCIDDRIPTDPTMFRFYEMLQVYGTTLKALVHEKFGDGIISAINFKLDVKKVADPEGGERAVITLDGKYLPTKPF >1icx_A PROTEIN LLR18A Lupinus luteus non-cyclic GIFAFENEQSSTVAPAKLYKALTKDSDEIVPKVIEPIQSVEIVEGNGGPGTIKKIIAIHDGHTSFVLHKLDAIDEANLTYNYSIIGGEGLDESLEKISYESKILPGPDGGSIGKINVKFHTKGDVLSETVRDQAKFKGLGLFKAIEGYVLAHPDY >2b06_A MutT/nudix family protein Streptococcus pneumoniae non-cyclic MSRSQLTILTNICLIEDLETQRVVMQYRAPENNRWSGYAFPGGHVENDEAFAESVIREIYEETGLTIQNPQLVGIKNWPLDTGGRYIVICYKATEFSGTLQSSEEGEVSWVQKDQIPNLNLAYDMLPLMEMMEAPDKSEFFYPRRTEDDWEKKIF >1keo_A cation-dependent mannose-6-phosphate receptor Bos taurus non-cyclic TEEKTCDLVGEKGKESEKELALLKRLTPLFQKSFESTVGQSPDMYSYVFRVCREAGQHSSGAGLVQIQKSNGKETVVGRFNETQIFQGSNWIMLIYKGGDEYDNHCGREQRRAVVMISCNRHTLADNFNPVSEERGKVQDCFYLFEMDSSLACS >104m_A MYOGLOBIN Physeter catodon non-cyclic VLSEGEWQLVLHVWAKVEADVAGHGQDILIRLFKSHPETLEKFDRFKHLKTEAEMKASEDLKKHGVTVLTALGAILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISEAIIHVLHSRHPGDFGADAQGAMNKALELFRKDIAAKYKELGYQG >1xxu_A Hypothetical protein Rv2238c/MT2298 Mycobacterium tuberculosis non-cyclic MLNVGATAPDFTLRDQNQQLVTLRGYRGAKNVLLVFFPLAFTGICQGELDQLRDHLPEFENDDSAALAISVGPPPTHKIWATQSGFTFPLLSDFWPHGAVSQAYGVFNEQAGIANRGTFVVDRSGIIRFAEMKQPGEVRDQRLWTDALAALTA >2bk9_A CG9734-PA Drosophila melanogaster non-cyclic MNSDEVQLIKKTWEIPVATPTDSGAAILTQFFNRFPSNLEKFPFRDVPLEELSGNARFRAHAGRIIRVFDESIQVLGQDGDLEKLDEIWTKIAVSHIPRTVSKESYNQLKGVILDVLTAASSLDESQAATWAKLVDHVYAIIFKAIDDDGNAK >1oj6_A NEUROGLOBIN Homo sapiens non-cyclic MERPEPELIRQSWRAVSRSPLEHGTVLFARLFALEPDLLPLFQYNGRQFSSPEDSLSSPEFLDHIRKVMLVIDAAVTNVEDLSSLEEYLASLGRKHRAVGVKLSSFSTVGESLLYMLEKSLGPAFTPATRAAWSQLYGAVVQAMSRGWDGE >1k66_A Phytochrome Response Regulator RcpB Tolypothrix sp. pcc 7601 non-cyclic AVGNATQPLLVVEDSDEDFSTFQRLLQREGVVNPIYRCITGDQALDFLYQTGSYCNPDIAPRPAVILLDLNLPGTDGREVLQEIKQDEVLKKIPVVIMTTSSNPKDIEICYSYSISSYIVKPLEIDRLTETVQTFIKYWLDIVVLPEMG >2ecr_A flavin reductase component (HpaC) of 4-hydrox Thermus thermophilus non-cyclic MKEAFKEALARFASGVTVVAARLGEEERGMTATAFMSLSLEPPLVALAVSERAKLLPVLEGAGAFTVSLLREGQEAVSEHFAGRPKEGIALEEGRVKGALAVLRCRLHALYPGGDHRIVVGLVEEVELGEGGPPLVYFQRGYRRLVWPS >2wcv_A L-FUCOSE MUTAROTASE Escherichia coli non-cyclic MLKTISPLISPELLKVLAEMGHGDEIIFSDAHFPAHSMGPQVIRADGLLVSDLLQAIIPLFELDSYAPPLVMMAAVEGDTLDPEVERRYRNALSLQAPCPDIIRINRFAFYERAQKAFAIVITGERAKYGNILLKKGVTP >1exr_A CALMODULIN Paramecium tetraurelia non-cyclic AEQLTEEQIAEFKEAFALFDKDGDGTITTKELGTVMRSLGQNPTEAELQDMINEVDADGNGTIDFPEFLSLMARKMKEQDSEEELIEAFKVFDRDGNGLISAAELRHVMTNLGEKLTDDEVDEMIREADIDGDGHINYEEFVRMMVSK >1jf3_A monomer hemoglobin component III Glycera dibranchiata non-cyclic GLSAAQRQVVASTWKDIAGADNGAGVGKECLSKFISAHPEMAAVFGFSGASDPGVAELGAKVLAQIGVAVSHLGDEGKMVAEMKAVGVRHKGYGNKHIKAEYFEPLGASLLSAMEHRIGGKMNAAAKDAWAAAYGDISGALISGLQS >2p5d_A UPF0310 protein MJECL36 Methanocaldococcus jannaschii non-cyclic MDLMAYWLCITNEDNWKVIKEKKIWGVAERYKNTINKVKVGDKLIIYEIQRSGKDYKPPYIRGVYEVVSEVYKDSSKIFKPTPRNPNEKFPYRVKLKEIKVFEPPINFKELIPKLKFITNKKRWSGHLMGKAMREIPEEDYKLIVGN >1a00_B HEMOGLOBIN (BETA CHAIN) Homo sapiens non-cyclic MHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPYTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH >2zfo_D Extracellular giant hemoglobin major globin s Oligobrachia mashikoi non-cyclic ECCSRGDAEVVISEWDQVFNAAMAGSSESAIGVAIFDVFFTSSGVSPSMFPGGGDSSSAEFLAQVSRVISGADIAINSLTNRATCDSLLSHLNAQHKAISGVTGAAVTHLSEAISSVVAQVLPSAHIDAWGYCMAYIAAGIGAGL >1j3a_A 50S ribosomal protein L13P Pyrococcus horikoshii non-cyclic MRIINADGLILGRLASRVAKMLLEGEEVVIVNAEKAVITGNREVIFSKYKQRTGLRTLTNPRRGPFYPKRSDEIVRRTIRGMLPWKTDRGRKAFRRLKVYVGIPKEFQDKQLETIVEAHVSRLSRPKYVTVGEVAKFLGGKF >3b8f_A Putative Blasticidin S deaminase Bacillus anthracis non-cyclic LNIEQQLYDVVKQLIEQRYPNDWGGAAAIRVEDGTIYTSVAPDVINASTELCMETGAILEAHKFQKKVTHSICLARENEHSELKVLSPCGVCQERLFYWGPEVQCAITNAKQDIIFKPLKELQPYHWTEAYHDEMVKEWSTR >1a00_A HEMOGLOBIN (ALPHA CHAIN) Homo sapiens non-cyclic VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR >1q2y_A similar to hypothetical proteins Bacillus subtilis non-cyclic MKAVIAKNEEQLKDAFYVREEVFVKEQNVPAEEEIDELENESEHIVVYDGEKPVGAGRWRMKDGYGKLERICVLKSHRSAGVGGIIMKALEKAAADGGASGFILNAQTQAVPFYKKHGYRVLSEKEFLDAGIPHLQMMKD >3m97_X Cytochrome c-552 Paracoccus denitrificans non-cyclic MGHGAEGEEHAQAYTYPVESAGGAEGEAVDEGPDFATVLASADPAAGEKVFGKCKACHKLDGNDGVGPHLNGVVGRTVAGVDGFNYSDPMKAHGGDWTPEALQEFLTNPKAVVKGTKMAFAGLPKIEDRANLIAYLEGQQ >2of1_A Staphylococcal thermonuclease Staphylococcus aureus non-cyclic ATSTKKLHKEPATLIKAIDGDTVKLMYKGQPMTFRLLLVDTPEFNEKYGPEASAFTKAMVENAKKIEVEFDKGQRTDKYGRGLAYWYADGAMVNEALVRQGLAAVAYVYKGNNTHEQLLRAAEAQAKKEKLNIWSEDN >1dqe_A PHEROMONE-BINDING PROTEIN Bombyx mori non-cyclic SQEVMKNLSLNFGKALDECKKEMTLTDAINEDFYNFWKEGYEIKNRETGCAIMCLSTKLNMLDPEGNLHHGNAMEFAKKHGADETMAQQLIDIVHGCEKSTPANDDKCIWTLGVATCFKAEIHKLNWAPSMDVAVGE >1wjg_A probable ATP binding protein Thermus thermophilus non-cyclic MFKTILLAYDGSEHARRAAEVAKAEAEAHGARLIVVHAYEPVPDYLGEPFFEEALRRRLERAEGVLEEARALTGVPKEDALLLEGVPAEAILQAARAEKADLIVMGTRGLGALGSLFLGSQSQRVVAEAPCPVLLVR >1kx3_A histone H3 Homo sapiens non-cyclic ARTKQTARKSTGGKAPRKQLATKAARKSAPATGGVKKPHRYRPGTVALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVMALQEASEAYLVALFEDTNLCAIHAKRVTIMPKDIQLARRIRGERA >1a78_A GALECTIN-1 Bufo arenarum non-cyclic ASAGVAVTNLNLKPGHCVEIKGSIPPDCKGFAVNLGEDASNFLLHFNARFDLHGDVNKIVCNSKEADAWGSEQREEVFPFQQGAEVMVCFEYQTQKIIIKFSSGDQFSFPVRKVLPSIPFLSLEGLAFKSITTE >1dyt_A EOSINOPHIL CATIONIC PROTEIN Homo sapiens non-cyclic RPPQFTRAQWFAIQHISLNPPRCTIAMRAINNYRWRCKNQNTFLRTTFANVVNVCGNQSIRCPHNRTLNNCHRSRFRVPLLHCDLINPGAQNISNCRYADRPGRRFYVVACDNRDPRDSPRYPVVPVHLDTTI >1lpj_A Retinol-binding protein IV, cellular Homo sapiens non-cyclic PADLSGTWTLLSSDNFEGYMLALGIDFATRKIAKLLKPQKVIEQNGDSFTIHTNSSLRNYFVKFKVGEEFDEDNRGLDNRKCKSLVIWDNDRLTCIQKGEKKNRGWTHWIEGDKLHLEMFCEGQVCKQTFQRA >1bbh_A CYTOCHROME C' Allochromatium vinosum non-cyclic AGLSPEEQIETRQAGYEFMGWNMGKIKANLEGEYNAAQVEAAANVIAAIANSGMGALYGPGTDKNVGDVKTRVKPEFFQNMEDVGKIAREFVGAANTLAEVAATGEAEAVKTAFGDVGAACKSCHEKYRAK >1ogc_A HIGH AFFINITY RIBOSE TRANSPORT PROTEIN RBSD Bacillus subtilis non-cyclic MKKHGILNSHLAKILADLGHTDKIVIADAGLPVPDGVLKIDLSLKPGLPAFQDTAAVLAEEMAVEKVIAAAEIKASNQENAKFLENLFSEQEIEYLSHEEFKLLTKDAKAVIRTGEFTPYANCILQAGVLF >1cpq_A CYTOCHROME C' Rhodobacter capsulatus non-cyclic ADTKEVLEAREAYFKSLGGSMKAMTGVAKAFDAEAAKVEAAKLEKILATDVAPLFPAGTSSTDLPGQTEAKAAIWANMDDFGAKGKAMHEAGGAVIAAANAGDGAAFGAALQKLGGTCKACHDDYREED >1nep_A Epididymal secretory protein E1 Bos taurus non-cyclic EPVKFKDCGSWVGVIKEVNVSPCPTQPCKLHRGQSYSVNVTFTSNTQSQSSKAVVHGIVMGIPVPFPIPESDGCKSGIRCPIEKDKTYNYVNKLPVKNEYPSIKVVVEWELTDDKNQRFFCWQIPIEVEA >2hy5_A Putative sulfurtransferase dsrE Allochromatium vinosum non-cyclic MKFALQINEGPYQHQASDSAYQFAKAALEKGHEIFRVFFYHDGVNNSTRLTTPPQDDRHIVNRWAELAEQYELDMVVCVAAAQRRGIVDEGEASRNGKDATNIHPKFRISGLGQLVEAAIQADRLVVFGD >1zav_U 50S ribosomal protein L7/L12 Thermotoga maritima non-cyclic MTIDEIIEAIEKLTVSELAELVKKLEDKFG >1kx3_C histone H2A.1 Homo sapiens non-cyclic SGRGKQGGKTRAKAKTRSSRAGLQFPVGRVHRLLRKGNYAERVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAVRNDEELNKLLGRVTIAQGGVLPNIQSVLLPKKTESSKSKSK >1n0q_A 3 ankyrin repeats null non-cyclic NGRTPLHLAARNGHLEVVKLLLEAGADVNAKDKNGRTPLHLAARNGHLEVVKLLLEAGADVNAKDKNGRTPLHLAARNGHLEVVKLLLEAGAY >2ji2_A DESULFOFERRODOXIN Desulfovibrio baarsii non-cyclic MPERLQVYKCEVCGNIVEVLNGGIGELVCCNQDMKLMSENTVDAAKEKHVPVIEKIDGGYKVKVGAVAHPMEEKHYIQWIELLADDKCYTQFLKPGQAPEAVFLIEAAKVVARAYCNIHGHWKAEN >1a4y_B ANGIOGENIN Homo sapiens non-cyclic QDNSRYTHFLTQHYDAKPQGRDDRYCESIMRRRGLTSPCKDINTFIHGNKRSIKAICENKNGNPHRENLRISKSSFQVTTCKLHGGSPWPPCQYRATAGFRNVVVACENGLPVHLDQSIFRRP >1kou_A PHOTOACTIVE YELLOW PROTEIN Halorhodospira halophila non-cyclic MEHVAFGSEDIENTLAKMDDGQLDGLAFGAIQLDGDGNILQYNAAEGDITGRDPKQVIGKNFFKDVAPCTDSPEFYGKFKEGVASGNLNTMFEYTFDYQMTPTKVKVHMKKALSGDSYWVFVKRV >11ba_A PROTEIN (RIBONUCLEASE, SEMINAL) Bos taurus non-cyclic KESAAAKFERQHMDSGNSPSSSSNYCNLMMCCRKMTQGKCKPVNTFVHESLADVKAVCSQKKVTCKNGQTNCYQSKSTMRITDCRETGSSKYPNCAYKTTQVEKHIIVACGGKPSVPVHFDASV >2ux6_A PSEUDOAZURIN Achromobacter cycloclastes non-cyclic ADFEVHMLNKGKDGAMVFEPASLKVAPGDTVTFIPTDKGHNVETIKGMIPDGAEAFKSKINENYKVTFTAPGVYGVKCTPHPFMVGVVQVGDAPANLEAVKGAKNPKKAQERLDAALAALGN >1e8e_A CYTOCHROME C'' Methylophilus methylotrophus non-cyclic DVTNAEKLVYKYTNIAHSANPMYEAPSITDGKIFFNRKFKTPSGKEAACASCHTNNPANVGKNIVTGKEIPPLAPRVNTKRFTDIDKVEDEFTKHCNDILGADCSPSEKANFIAYLLTETKPTK >1faz_A PHOSPHOLIPASE A2 Streptomyces violaceoruber non-cyclic APADKPQVLASFTQTSASSQNAWLAANRNQSAWAAYEFDWSTDLCTQAPDNPFGFPFNTACARHDFGYRNYKAAGSFDANKSRIDSAFYEDMKRVCTGYTGEKNTACNSTAWTYYQAVKIFG >1d9c_A INTERFERON-GAMMA Bos taurus non-cyclic QGQFFREIENLKEYFNASSPDVAKGGPLFSEILKNWKDESDKKIIQSQIVSFYFKLFENLKDNQVIQRSMDIIKQDMFQKFLNGSSEKLEDFKKLIQIPVDDLQIQRKAINELIKVMNDLS >1u6t_A SH3 domain-binding glutamic acid-rich-like pr Homo sapiens non-cyclic VIRVYIASSSGSTAIKKKQQDVLGFLEANKIGFEEKDIAANEENRKWMRENVPENSRPATGYPLPPQIFNESQYRGDYDAFFEARENNAVYAFLGLTAPPGSKEAEVQAKQQALEHHHHHH >3f6p_A Transcriptional regulatory protein yycF Bacillus subtilis non-cyclic MDKKILVVDDEKPIADILEFNLRKEGYEVHCAHDGNEAVEMVEELQPDLILLDIMLPNKDGVEVCREVRKKYDMPIIMLTAKDSEIDKVIGLEIGADDYVTKPFSTRELLARVKANLRRQ >2zpo_A Ribonuclease Chelonia mydas non-cyclic ETRYEKFLRQHVDYPRTAAPDTRTYCNQMMQRRGMTLPVCKFTNTFVHASAASITTICGPGGAPAGGNLRDSTASFALTTCRLQGGSQRPPCNYNGGTSTQRIRIACDGGLPVHYDRAI >3n1s_A HIT-like protein hinT Escherichia coli non-cyclic MAEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQALGRMITVAAKIAEQEGIAEDGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL >1w7o_A CYTOCHROME C3 Desulfomicrobium baculatus non-cyclic ADAPGDDYVISAPEGMKAKPKGDKPGALQKTVPFPHSKHATVECAQCHHTLEADGGAVKKCTTSGCHDSLEFRDKANAKDIKLVENAYHTQCIDCHKALKKDKKPTGPTACGKCHTTN >1dlw_A HEMOGLOBIN Paramecium caudatum non-cyclic SLFEQLGGQAAVQAVTAQFYANIQADATVATFFNGIDMPNQTNKTAAFLCAALGGPNAWTGRNLKEVHANMGVSNAQFTTVIGHLRSALTGAGVAAALVEQTVAVAETVRGDVVTV >2cbo_A NEOCARZINOSTATIN Streptomyces carzinostaticus non-cyclic AAPTATVTPSSGLSDGTVVKVAGAGLQAGTAYWVAQWARVDTGVWAYNPADNSSVTADANGSASTSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGPEGVAISFNHH >1uqx_A LECTIN Ralstonia solanacearum non-cyclic AQQGVFTLPANTSFGVTAFANAANTQTIQVLVDNVVKATFTGSGTSDKLLGSQVLNSGSGAIKIQVSVNGKPSDLVSNQTILANKLNFAMVGSEDGTDNDYNDGIAVLNWPLG >1dw0_A CYTOCHROME C Rhodobacter sphaeroides non-cyclic GDTSPAQLIAGYEAAAGAPADAERGRALFLSTQTGGKPDTPSCTTCHGADVTRAGQTRTGKEIAPLAPSATPDRFTDSARVEKWLGRNCNSVIGRDCTPGEKADLLAWLAAQ >2qhl_A Novel immune-type receptor 10 Ictalurus punctatus non-cyclic MDIKELHVKTVKRGENVTMECSMSKVKDKDKLAWYRQSFGKVPQYFVRYYSSNSGYKFAEGFKDSRFSMTVNDQKFDLNIIGTREDDGGEYFCGEVEGNTIKFTSGTRLQF >1m1f_A Kid toxin protein Escherichia coli non-cyclic MERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSGGNFARTAGFAVSLDGVGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT >1b8r_A PROTEIN (PARVALBUMIN) Cyprinus carpio non-cyclic AFAGVLNDADIAAALEACKAADSFNHKAFFAKVGLTSKSADDVKKAFAIIDQDKSGFIEEDELKLFLQNFKADARALTDGETKTFLKAGDSDGDGKIGVDEWTALVKA >1gmx_A THIOSULFATE SULFURTRANSFERASE GLPE Escherichia coli non-cyclic MDQFECINVADAHQKLQEKEAVLVDIRDPQSFAMGHAVQAFHLTNDTLGAFMRDNDFDTPVMVMCYHGNSSKGAAQYLLQQGYDVVYSIDGGFEAWQRQFPAEVAYGA >2ox5_Z SoxZ protein Paracoccus denitrificans non-cyclic ADDAKPRVKVPSSAKAGETVTVKALISHKMESGQRKDADGKLIPRSIINRFTCELNGVNVVDVAIDPAVSTNPYFEFDAKVDAAGEFKFTWYDDDGSVYEDVKPIAVA >1bkf_A FK506 BINDING PROTEIN Homo sapiens non-cyclic GVQVETISPGDGRTFPKRGQTCVVHYTGMLEDGKKFDSSRDKNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYGATGVPGIIPPHATLVFDVELLKLE >2cdv_A CYTOCHROME C3 Desulfovibrio vulgaris non-cyclic APKAPADGLKMDKTKQPVVFNHSTHKAVKCGDCHHPVNGKENYQKCATAGCHDNMDKKDKSAKGYYHAMHDKGTKFKSCVGCHLETAGADAAKKKELTGCKGSKCHS >1ew4_A CYAY PROTEIN Escherichia coli non-cyclic MNDSEFHRLADQLWLTIEERLDDWDGDSDIDCEINGGVLTITFENGSKIIINRQEPLHQVWLATKQGGYHFDLKGDEWICDRSGETFWDLLEQAATQQAGETVSFR >2ebe_A Hypothetical protein TTHA0061 Thermus thermophilus non-cyclic MQAVRLFQGYMWHPRALALDLKALLPGEVAGARLLWDEVPPPTPFFEDGTPTHTQRFYQLTLLVLTEEPPEALKPLAEEAAEALGEVLEGLPPEVGWLLLEDLRPL >1aiu_A THIOREDOXIN Homo sapiens non-cyclic MVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMIKPFFHSLSEKYSNVIFLEVDVNDCQDVASECEVKCMPTFQFFKKGQKVGEFSGANKEKLEATINELV >1v70_A probable antibiotics synthesis protein Thermus thermophilus non-cyclic MEIKDLKRLARYNPEKMAKIPVFQSERMLYDLYALLPGQAQKVHVHEGSDKVYYALEGEVVVRVGEEEALLAPGMAAFAPAGAPHGVRNESASPALLLVVTAPRP >1m42_A Copper resistance protein C Pseudomonas syringae non-cyclic HPKLVSSTPAEGSEGAAPAKIELHFSENLVTQFSGAKLVMTAMPGMEHSPMAVKAAVSGGGDPKTMVITPASPLTAGTYKVDWRAVSSDTHPITGSVTFKVK >1chp_D CHOLERA TOXIN B PENTAMER Vibrio cholerae non-cyclic TPQNITDLCAEYHNTQIHTLNDKIFSYTESLADKREMAIITFKNGATFQVEVPGSQHIDSQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAISMAN >3nzn_A Glutaredoxin Methanosarcina mazei non-cyclic SNAVNLFGQKDRGNHVSGVDRGKVIMYGLSTCVWCKKTKKLLTDLGVDFDYVYVDRLEGKEEEEAVEEVRRFNPSVSFPTTIINDEKAIVGFKEKEIRESLGF >1a5k_B UREASE (BETA SUBUNIT) Klebsiella aerogenes non-cyclic MIPGEYHVKPGQIALNTGRATCRVVVENHGDRPIQVGSHYHFAEVNPALKFDRQQAAGYRLNIPAGTAVRFEPGQKREVELVAFAGHRAVFGFRGEVMGPL >1vub_A CCDB Escherichia coli non-cyclic MQFKVYTYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSRELYPVVHIGDESWRMMTTDMASVPVSVIGEEVADLSHRENDIKNAINLMFWGI >3fpr_A Evasin-1 Rhipicephalus sanguineus non-cyclic EDDEDYGDLGGCPFLVAENKTGYPTIVACKQDCNGTTETAPNGTRCFSIGDEGLRRMTANLPYDCPLGQCSNGDCIPKETYEVCYRRNWRDKKNHHHHHH >1zgx_A Guanyl-specific ribonuclease Sa Streptomyces aureofaciens non-cyclic DVSGTVCLSALPPEATDTLNLIASDGPFPYSQDGVVFQNRESVLPTQSYGYYHEYTVITPGAR >3k6d_A T-cadherin Xenopus laevis non-cyclic SIVAPSISIPENQRIPFPKIVGRVVVSDRIPGSKIKLYGKGVDQEPKGIFKINENSGEVSVTKALDREAIPSYQLQVETTDENGKTIEGPVDLEILVID >1a70_A FERREDOXIN Spinacia oleracea non-cyclic AAYKVTLVTPTGNVEFQCPDDVYILDAAEEEGIDLPYSCRAGSCSSCAGKLKTGSLNQDDQSFLDDDQIDEGWVLTCAAYPVSDVTIETHKKEELTA >2acy_A ACYLPHOSPHATASE Bos taurus non-cyclic AEGDTLISVDYEIFGKVQGVFFRKYTQAEGKKLGLVGWVQNTDQGTVQGQLQGPASKVRHMQEWLETKGSPKSHIDRASFHNEKVIVKLDYTDFQIVK >1r8o_A Kunitz trypsin inhibitor Copaifera langsdorffii non-cyclic RLVDTDGKPIENDGAEYYILPSVRGKGGGLVLAKSGGEKCPLSVVQSPSELSNGLPVRFKASPRSKYISVGMLLGIEVIESPECAPKPSMWSVKSG >2wnd_A PROTEIN S100-A7 Homo sapiens non-cyclic SNTQAERSIIGMIDMFHKYTRRDDKIDKPSLLTMMKENFPNFLSACDKKGTNYLAGVFEKKDKNEDKKIDFSEFLSLMGDIATDYHKKSHGAAPCS >1s12_A hypothetical protein TM1457 Thermotoga maritima non-cyclic MIKVTVTNSFFEVTGHAPDKTLCASVSLLTQHVANFLKAEKKAKIKKESGYLKVKFEELENCEVKVLAAMVRSLKELEQKFPSQIRVEVIDNGS >1bc7_C PROTEIN (ETS-DOMAIN PROTEIN) Homo sapiens non-cyclic MDSAITLWQFLLQLLQKPQNKHMICWTSNDGQFKLLQAEEVARLWGIRKNKPNMNYDKLSRALRYYYVKNIIKKVNGQKFVYKFVSYPEILNM >3lhr_A Zinc finger protein 24 Homo sapiens non-cyclic GSPDPEIFRQRFRQFGYQDSPGPREAVSQLRELCRLWLRPETHTKEQILELVVLEQFVAILPKELQTWVRDHHPENGEEAVTVLEDLESELDD >1ay7_B BARSTAR Streptomyces aureofaciens non-cyclic KKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDCLTGWVEYPLVLEWRQFEQSKQLTENGAESVLQVFREAKAEGCDITIILS >2eh1_A Stage V sporulation protein S (SpoVS) related Thermus thermophilus non-cyclic METLRVSSKSRPNSVAGAIAAMLRTKGEVEVQAIGPQAVNQAVKAIAIARGYIAPDNLDLVVKPAFVKLELENEERTALKFSIKAHPLET >1eyt_A HIGH-POTENTIAL IRON-SULFUR PROTEIN Thermochromatium tepidum non-cyclic AAPANAVTADDPTAIALKYNQDATKSERVAAARPGLPPEEQHCANCQFMQANVGEGDWKGCQLFPGKLINVNGWCASWTLKAG >1aaz_A GLUTAREDOXIN Enterobacteria phage t4 non-cyclic MFKVYGYDSNIHKCVYCDNAKRLLTVKKQPFEFINIMPEKGVFDDEKIAELLTKLGRDTQIGLTMPQVFAPDGSHIGGFDQLREYFK >1eue_A CYTOCHROME B5 Rattus norvegicus non-cyclic DPAVTYYRLEEVAKRNTAEETWMVIHGRVYDITRFLSEHPGGEEILLEQAGADATESFEDIGHSPDAREMLKQYYIGDVHPNDLKP >1t0p_B Intercellular adhesion molecule-3 Homo sapiens non-cyclic QEFLLRVEPQNPVLSAGGSLFVNCSTDCPSSEKIALETSLSKELVASGMGWAAFNLSNVTGNSRILCSVYCNGSQITGSSNITVYG >1cm2_A HISTIDINE-CONTAINING PROTEIN Escherichia coli non-cyclic MFQQEVTITAPNGLDTRPAAQFVKEAKGFTSEITVTSNGKSASAKSLFKLQTLGLTQGTVVTISAEGEDEQKAVEHLVKLMAELE >1w53_A PHOSPHOSERINE PHOSPHATASE RSBU Bacillus subtilis non-cyclic MDFREVIEQRYHQLLSRYIAELTETSLYQAQKFSRKTIEHQIPPEEIISIHRKVLKELYPSLPEDVFHSLDFLIEVMIGYGMAY >1xd3_B UBC protein Homo sapiens non-cyclic MQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRG >2exv_A Cytochrome c-551 Pseudomonas aeruginosa non-cyclic EDPEVLAKNKGCVACHAIDTKMVGPAYKDVAAKFAGQAGAEAELAQRIKNGSQGVWGPIPMPPNAVSDDEAQTLAKWVLSQK >1iqz_A Ferredoxin Bacillus thermoproteolyticus non-cyclic PKYTIVDKETCIACGACGAAAPDIYDYDEDGIAYVTLDDNQGIVEVPDILIDDMMDAFEGCPTDSIKVADEPFDGDPNKFE >1mg8_A Parkin Mus musculus non-cyclic MGMIVFVRFNSSYGFPVEVDSDTSILQLKEVVAKRQGVPADQLRVIFAGKELPNHLTVQNCDLEQQSIVHIVQRPRRR >2q5w_D Molybdopterin converting factor, subunit 1 Staphylococcus aureus non-cyclic MKVLYFAEIKDILQKAQEDIVLEQALTVQQFEDLLFERYPQINNKKFQVAVNEEFVQKSDFIQPNDTVALIPPVSGG >1gyj_A HYPOTHETICAL PROTEIN YDCE Escherichia coli non-cyclic PHIDIKCFPRELDEQQKAALAADITDVIIRHLNSKDSSISIALQQIQPESWQAIWDAEIAPQMEALIKKPGYSMNA >1hlq_A HIGH-POTENTIAL IRON-SULFUR PROTEIN Rhodoferax fermentans non-cyclic AAPLVAETDANAKSLGYVADTTKADKTKYPKHTKDQSCSTCALYQGKTAPQGACPLFAGKEVVAKGWCSAWAKKA >1bvn_T PROTEIN (TENDAMISTAT) Sus scrofa non-cyclic DTTVSEPAPSCVTLYQSWRYSQADNGCAETVTVKVVYEDDTEGLCYAVAPGQITTVGDGYIGSHGHARYLARCL >1c9o_A COLD-SHOCK PROTEIN Bacillus caldolyticus non-cyclic MQRGKVKWFNNEKGYGFIEVEGGSDVFVHFTAIQGEGFKTLEEGQEVSFEIVQGNRGPQAANVVKL >3cjs_B 50S ribosomal protein L11 Thermus thermophilus non-cyclic MKKVVAVVKLQLPAGKATPAPPVGPALGQHGANIMEFVKAFNAATANMGDAIVPVEITIYADRSFTFVTKTP >1r4p_B shiga-like toxin type II B subunit Escherichia coli non-cyclic ADCAKGKIEFSKYNEDDTFTVKVDGKEYWTSRWNLQPLLQSAQLTGMTVTIKSSTCESGSGFAEVQFNND >2j7z_A STROMAL CELL-DERIVED FACTOR 1 ALPHA Homo sapiens non-cyclic KPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIVARLKNNNRQVCIDPKLKWIQEYLEKALNK >1n89_A lipid transfer protein Triticum turgidum subsp. durum non-cyclic ACQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQYIRSPHARDTLTSCGLAVPHC >2orm_A Probable tautomerase HP0924 Helicobacter pylori non-cyclic PFINIKLVPENGGPTNEQKQQLIEGVSDLMVKVLNKNKASIVVIIDEVDSNNYGLGGESVHHLRQKN >1aho_A TOXIN II Androctonus australis non-cyclic VKDGYIVDDVNCTYFCGRNAYCNEECTKLKGESGYCQWASPYGNACYCYKLPDHVRTKGPGRCH >2q8r_E CCL14 Homo sapiens non-cyclic GPYHPSECCFTYTTYKIPRQRIMDYYETNSQCSKPGIVFITKRGHSVCTNPSDKWVQDYIKDMKEN >1em7_A PROTEIN G Streptococcus sp. non-cyclic TTYKLILNGKTLKGETTTEAVDAETAERVFKEYAKKNGVDGEWTYDDATKTFTVTE >1b7d_A PROTEIN (NEUROTOXIN TS1) Tityus serrulatus non-cyclic KEGYLMDHEGCKLSCFIRPSGYCGRECGIKKGSSGYCAWPACYCYGLPNWVKVWDRATNKC >1eq7_A OUTER MEMBRANE LIPOPROTEIN Escherichia coli non-cyclic SSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYR >2og0_A Excisionase Enterobacteria phage lambda non-cyclic MYLTLQEWNARQRRPRSLETVRRWVRESRIFPPPVKDGREYLFHESAVKVDL >3n3f_A Collagen alpha-1(XV) chain Homo sapiens non-cyclic NLVTAFSNMDDMLQKAHLVIEGTFIYLRDSTEFFIRVRDGWKKLQLGELIPIPA >1fe6_A TETRABRACHION Staphylothermus marinus non-cyclic GSIINETADDIVYRLTVIIDDRYESLKNLITLRADRLEMIINDNVSTILASG >1zv8_A E2 glycoprotein Sars coronavirus non-cyclic NQKQIANQFNKAISQIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSS >1ofs_B PEA LECTIN BETA CHAIN Pisum sativum non-cyclic VTSYTLSDVVSLKDVVPEWVRIGFSATTGAEYAAHEVLSWSFHSELSG >1an1_I TRYPTASE INHIBITOR Sus scrofa non-cyclic KKVCACPKILKPVCGSDGRTYANSCIARCNGVSIKSEGSCPTGILN >2oqq_A Transcription factor HY5 Arabidopsis thaliana non-cyclic GSAYLSELENRVKDLENKNSELEERLSTLQNENQMLRHILKN >1t6f_A Geminin null non-cyclic TLYEALKENEKLHKEIEQKDNEIARLKKENKELAEVA >1ce9_A PROTEIN (GCN4-PMSE) null non-cyclic MSVKELEDKVEELLSKNYHLENEVARLKKLVGER >1g2z_A HEPATOCYTE NUCLEAR FACTOR 1-ALPHA null non-cyclic MVSKLSQLQTELMAALLESGLSKEALIQALGE >3mhp_C TIC62_peptide Pisum sativum non-cyclic KTEQPLSPYTAYDDLKPPSSPSPTKP >2wfv_B PROBABLE INSULIN-LIKE PEPTIDE 5 B CHAIN Drosophila melanogaster non-cyclic NSLRACGPALMDMLRVACPNGFN >1apm_I PEPTIDE INHIBITOR PKI(5-24) Mus musculus non-cyclic TTYADFIASGRTGRRNAIHD >2cny_B PUTATIVE OUTER MEMBRANE PROTEIN Salmonella typhimurium non-cyclic GSFLPNSEQQKSVDAVFSSP >1f47_A CELL DIVISION PROTEIN FTSZ Escherichia coli non-cyclic KEPDYLDIPAFLRKQAD >3gof_C Nitric oxide synthase, inducible Gallus gallus non-cyclic RRREIRFRVLVKVVFF >3fp7_I Pancreatic trypsin inhibitor Rattus norvegicus non-cyclic RPDFCLEPPYTGPCK >1xh3_C aa 4-17 (LPAVVGLSPGEQEY) of alternative readi Homo sapiens non-cyclic LPAVVGLSPGEQEY >1zsd_C BZLF1 trans-activator protein Homo sapiens non-cyclic EPLPQGQLTAY >2buo_T INHIBITOR OF CAPSID ASSEMBLY Human immunodeficiency virus 1 non-cyclic ITFEDLLDYYGP >2pv2_E C-peptide Escherichia coli non-cyclic NFTLKFWDIFRK >3d32_C K1 peptide Homo sapiens non-cyclic DATYTWEHLAWP >3ggw_E PEPTIDE B1 Mus musculus non-cyclic YLEDWIKYNNQK >3jzq_P pDIQ peptide (12mer) Homo sapiens non-cyclic ETFEHWWSQLLS >1n12_B Peptide corresponding to the N-terminal exten Escherichia coli non-cyclic SDVAFRGNLLD >1zuk_C Proline-rich protein LAS17 Saccharomyces cerevisiae non-cyclic RGPAPPPPPHR >2fyy_C 11-mer peptide from Epstein-Barr nuclear anti Homo sapiens non-cyclic HPVGEADYFEY >3ln5_C 11-mer peptide from S-methyl-5'-thioadenosine Homo sapiens non-cyclic HEEAVSVDRVL